fix: 修复本地llm服务不可用问题 + 统一模型缓存目录位置
Some checks failed
构建并部署 AI Agent 服务 / deploy (push) Has been cancelled
Some checks failed
构建并部署 AI Agent 服务 / deploy (push) Has been cancelled
- 修复 config.py 添加 LOCAL_MODEL_NAME 配置项 - 修复 chat_services.py 本地模型检测时API路径重复问题(/v1/models -> /models) - 更新 .gitignore,移除模型目录跟踪 - 统一模型缓存到 docker/models/fastembed_cache,避免重复 - 更新 Dockerfile,正确复制预下载的BM25模型缓存
This commit is contained in:
@@ -63,6 +63,9 @@ SPARSE_MODEL_PATH = _get_str("SPARSE_MODEL_PATH") or "./models/sparse"
|
||||
SPARSE_MODEL_NAME = _get_str("SPARSE_MODEL_NAME") or "Qdrant/bm25"
|
||||
FASTEMBED_CACHE_PATH = _get_str("FASTEMBED_CACHE_PATH") or "./models/fastembed_cache"
|
||||
|
||||
# ========== 本地模型配置 ==========
|
||||
LOCAL_MODEL_NAME = _get_str("LOCAL_MODEL_NAME") or "gemma-4-E4B-it"
|
||||
|
||||
# ========== llama.cpp 服务配置(URL + API密钥 配对) ==========
|
||||
# 主 LLM 服务
|
||||
VLLM_BASE_URL = _get_str("VLLM_BASE_URL")
|
||||
|
||||
Reference in New Issue
Block a user