refactor!: make the RAG system fully asynchronous and remove the LangChain ParentDocumentRetriever dependency
Some checks failed
Build and deploy AI Agent service / deploy (push) Failing after 6m34s
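- Rewrite rag_core/vector_store.py: fully async implementations of aadd_documents and asimilarity_search
- Rewrite app/rag/retriever.py: async hybrid retrieval; remove the synchronous compatibility code
- Update rag_indexer/index_builder.py: async calls along the whole chain
- Delete rag_core/retriever_factory.py: LangChain's ParentDocumentRetriever is no longer used
- Clean up redundant imports and code: remove the model_services compatibility code and unneeded exception imports
- Update rag_indexer/README.md to reflect the new architecture

Core improvements:
- Fully async: index building and retrieval use async/await end to end
- Custom implementation: no longer depends on LangChain's ParentDocumentRetriever
- Dual-vector support: child documents store both dense and sparse vectors in Qdrant
- Clear architecture: rag_core holds shared components, rag_indexer handles indexing, app/rag handles retrieval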
@@ -39,8 +39,9 @@ from .config import (
# Re-export commonly used components from rag_core
from backend.rag_core import (
    LlamaCppEmbedder,
    QdrantVectorStore,
    get_embeddings,
    get_embedding_dimension,
    QdrantHybridStore,
    PostgresDocStore,
    create_docstore,
)
@@ -52,14 +53,14 @@ __all__ = [
    "IndexBuilder",
    "IndexBuilderConfig",
    "DocstoreConfig",

    # Loaders
    "DocumentLoader",

    # Splitting
    "SplitterType",
    "get_splitter",

    # Configuration
    "QDRANT_URL",
    "QDRANT_API_KEY",
@@ -69,11 +70,12 @@ __all__ = [
    "DOCSTORE_URI",
    "RAG_OCR_LANGUAGES",
    "RAG_DOC_LANGUAGES",

    # Embeddings and vector stores
    "LlamaCppEmbedder",
    "QdrantVectorStore",

    "get_embeddings",
    "get_embedding_dimension",
    "QdrantHybridStore",

    # Document stores
    "PostgresDocStore",
    "create_docstore",
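Because this change touches both the re-export import and `__all__`, the two lists can drift apart. A small check like the one below could catch a name that is listed in `__all__` but never imported. This is only a sketch: `missing_exports` and the `demo_pkg` module are hypothetical and not part of this commit.

```python
import types


def missing_exports(module: types.ModuleType) -> list[str]:
    """Return names listed in module.__all__ that the module does not define."""
    return [
        name
        for name in getattr(module, "__all__", [])
        if not hasattr(module, name)
    ]


# Hypothetical stand-in for a package __init__ that forgot one import.
pkg = types.ModuleType("demo_pkg")
pkg.QdrantHybridStore = object()
pkg.create_docstore = lambda: None
pkg.__all__ = ["QdrantHybridStore", "create_docstore", "PostgresDocStore"]

missing = missing_exports(pkg)  # "PostgresDocStore" is listed but not defined
```

Running the same helper against the real package in a unit test would fail fast when an export is renamed or removed without updating `__all__`.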