
milvus: error while loading shared libraries: libaio.so.1:

Posted: 2025-05-07 09:13:07 · Views: 48
### Fix for Milvus missing the `libaio.so.1` shared library at runtime

When you encounter the error `error while loading shared libraries: libaio.so.1: cannot open shared object file: No such file or directory`[^2], it means the shared library file `libaio.so.1` is missing from the system. This usually happens when running an application that depends on it (such as Milvus) on Linux.

The problem can be solved in the following ways.

#### Method 1: Install the `libaio` package

On most Debian- or Ubuntu-based distributions, install the required library with:

```bash
sudo apt-get update && sudo apt-get install -y libaio1
```

On Red Hat- or CentOS-based systems, use:

```bash
sudo yum install -y libaio
```

These commands download `libaio` and install its associated `.so` files into the system's standard library paths, which resolves the loading failure.

#### Method 2: Point the dynamic linker at the library manually

If `libaio` cannot be installed through the package manager (for example, because of missing privileges), you can instead set the `LD_LIBRARY_PATH` environment variable to a directory that already contains `libaio.so.1`:

```bash
export LD_LIBRARY_PATH=/path/to/lib:$LD_LIBRARY_PATH
```

where `/path/to/lib` is the directory that actually holds `libaio.so.1`. Restart Milvus afterwards to verify the result.

#### Method 3: Check that the configuration file exists

Although this error is about a missing shared library, other problems in the environment, such as a missing server configuration file, can also disrupt the normal startup flow. Check whether you also see a warning like "Config file not exist: /var/lib/milvus/conf/server_config.yaml"[^3]; if so, fix the basic environment configuration first, then continue troubleshooting the library issue.

---

### Sample code to verify that the required library can be loaded

The short C++ program below checks whether the current environment can find and load the target shared object (`libaio`):

```cpp
#include <cstdlib>   // for EXIT_SUCCESS / EXIT_FAILURE
#include <iostream>
#include <dlfcn.h>

int main() {
    void* handle = dlopen("libaio.so.1", RTLD_LAZY);
    if (!handle) {
        std::cerr << "Cannot load library: " << dlerror() << '\n';
        return EXIT_FAILURE;
    }
    std::cout << "Library loaded successfully." << std::endl;
    dlclose(handle);
    return EXIT_SUCCESS;
}
```

Compiling and running this program is a quick way to confirm whether the methods above have taken effect.

---
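As a concrete way to run this check, the sketch below first asks the dynamic linker cache whether `libaio.so.1` is already registered, then builds and runs the `dlopen` probe above. The file name `check_libaio.cpp` is only an assumed name for a file containing the C++ code listed above, and `-ldl` is required on older glibc toolchains (it is harmless on newer ones where `dlopen` lives in libc).

```bash
# Check whether libaio is already known to the dynamic linker cache
ldconfig -p | grep libaio

# Build and run the dlopen probe from the section above.
# check_libaio.cpp is an assumed file name holding that C++ test program;
# -ldl links the dlopen/dlclose symbols (needed on older glibc).
g++ -o check_libaio check_libaio.cpp -ldl
./check_libaio
```

If `ldconfig -p` already lists `libaio.so.1`, the probe should report success without any further changes.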
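For Method 2 above, you first need to locate a directory that already holds a copy of `libaio.so.1`. A minimal sketch, assuming the copy lives under one of the usual library prefixes; the `/opt/vendor/lib` directory in the `export` line is purely an illustrative placeholder and should be replaced with whatever path the `find` command actually reports:

```bash
# Search common library prefixes for an existing copy of libaio.so.1
find /usr/lib /usr/lib64 /usr/local/lib /opt -name 'libaio.so.1*' 2>/dev/null

# Illustrative placeholder path: if a copy turned up under /opt/vendor/lib,
# expose that directory to the dynamic linker for this shell session,
# then restart Milvus to verify
export LD_LIBRARY_PATH=/opt/vendor/lib:$LD_LIBRARY_PATH
```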

