重排,多路查询
All checks were successful
构建并部署 AI Agent 服务 / deploy (push) Successful in 35m37s

This commit is contained in:
2026-04-20 01:10:18 +08:00
parent 933d418d77
commit 3c906e91d9
21 changed files with 728 additions and 635 deletions

View File

@@ -2,52 +2,69 @@
RAG 检索与生成模块
提供在线检索与生成功能,包括:
- 基础向量检索
- 重排序
- RAG-Fusion
- Agentic RAG
- 基础向量检索(稠密向量 / 混合检索)
- 重排序Cross-Encoder
- 多路查询改写Multi-Query
- RRF 融合Reciprocal Rank Fusion
- 完整的 RAG 流水线
- Agent 工具封装
固定流水线:
用户查询 → 多路改写 → 并行检索 → RRF 融合 → 重排序 → 返回父文档
示例用法:
>>> from app.rag import RAGPipeline, search_knowledge_base
>>> from rag_core import LlamaCppEmbedder
>>>
>>> embeddings = LlamaCppEmbedder()
>>> pipeline = RAGPipeline(embeddings=embeddings)
>>>
>>> documents = pipeline.retrieve("戏耍貂蝉美女")
>>> context = pipeline.format_context(documents)
>>> from app.rag import RAGPipeline, create_rag_tool
>>> from rag_indexer.builder import IndexBuilder, IndexBuilderConfig
>>> from langchain_openai import ChatOpenAI
>>>
>>> # 获取基础检索器(如父子块检索器)
>>> config = IndexBuilderConfig(collection_name="my_docs")
>>> builder = IndexBuilder(config)
>>> retriever = builder.retriever
>>>
>>> # 创建 LLM 和流水线
>>> llm = ChatOpenAI(model="gpt-3.5-turbo")
>>> pipeline = RAGPipeline(retriever=retriever, llm=llm)
>>>
>>> # 检索
>>> docs = await pipeline.aretrieve("什么是 RAG")
>>> context = pipeline.format_context(docs)
>>>
>>> # 创建 Agent 工具
>>> rag_tool = create_rag_tool(retriever=retriever, llm=llm)
"""
from .retriever import (
create_base_retriever,
create_hybrid_retriever,
# create_ensemble_retriever,
create_qdrant_client,
)
from .reranker import CrossEncoderReranker
from .query_transform import MultiQueryTransformer
from .pipeline import RAGPipeline, RAGLevel
from .tools import search_knowledge_base, search_knowledge_base_sync
from .query_transform import MultiQueryGenerator
from .fusion import reciprocal_rank_fusion
from .pipeline import RAGPipeline
from .tools import create_rag_tool, create_rag_tool_sync
__all__ = [
# 检索器
# 检索器工厂函数
"create_base_retriever",
"create_hybrid_retriever",
# "create_ensemble_retriever",
"create_qdrant_client",
# 重排序器
"CrossEncoderReranker",
# 查询转换
"MultiQueryTransformer",
# 查询改写生成
"MultiQueryGenerator",
# 流水线
# 融合算法
"reciprocal_rank_fusion",
# 主流水线
"RAGPipeline",
"RAGLevel",
# 工具
"search_knowledge_base",
"search_knowledge_base_sync",
]
# 工具创建(供 Agent 使用)
"create_rag_tool",
"create_rag_tool_sync",
]