← 返回
开发者工具 中文

RAG

Build, optimize, and debug RAG pipelines with chunking strategies, retrieval tuning, evaluation metrics, and production monitoring.
构建、优化和调试 RAG 管道,包括分块策略、检索调优、评估指标以及生产监控。
ivangdavila
开发者工具 clawhub v1.0.0 1 版本 99559.9 Key: 无需
★ 3
Stars
📥 3,333
下载
💾 51
安装
1
版本
#latest

概述

When to Use

User wants to implement, improve, or troubleshoot Retrieval-Augmented Generation systems.

Quick Reference

TopicFile
-------------
Pipeline components & architecturearchitecture.md
Implementation patterns & codeimplementation.md
Evaluation metrics & debuggingevaluation.md
Security & compliancesecurity.md

Core Capabilities

  1. Architecture design — Select embedding models, vector DBs, and chunking strategies based on requirements
  2. Implementation — Write ingestion pipelines, query handlers, and update logic
  3. Retrieval optimization — Tune top-k, reranking, hybrid search parameters
  4. Evaluation — Build test datasets, measure recall/precision, diagnose failures
  5. Production ops — Monitor quality drift, set up alerts, debug degradation
  6. Security — PII detection, access control, compliance requirements

Decision Checklist

Before recommending architecture, ask:

  • [ ] What document types and volume?
  • [ ] Latency requirements (real-time chat vs batch)?
  • [ ] Update frequency (how often do docs change)?
  • [ ] Access control needs (who can see what)?
  • [ ] Compliance constraints (GDPR, HIPAA, SOC2)?
  • [ ] Budget (managed vs self-hosted, embedding costs)?

Critical Rules

  • Never skip access control — Filter at retrieval time, not after
  • Always overlap chunks — 10-20% prevents context loss at boundaries
  • Evaluate before optimizing — Build eval dataset first, then tune
  • Same embedding model — Query and documents must use identical model
  • Monitor similarity scores — Dropping averages signal drift or issues
  • Plan for deletion — GDPR erasure requires re-embedding capability

Common Failure Patterns

SymptomLikely CauseFix
----------------------------
Wrong docs retrievedQuery too vague, poor chunksQuery expansion, smaller chunks
Relevant doc missedNot indexed, low similarityCheck ingestion, hybrid search
Hallucinated answersContext too shortIncrease top-k, better reranking
Slow responsesLarge chunks, no cachingOptimize chunk size, cache embeddings
Inconsistent resultsNon-deterministic rerankingSet seeds, use stable sorting

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-28 22:45 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

developer-tools

Gog

steipete
Google Workspace 命令行工具,支持 Gmail、日历、云端硬盘、通讯录、表格和文档。
★ 921 📥 185,739
developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 668 📥 323,844
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,350 📥 317,745