Wang-Shuo/A-Guide-to-Retrieval-Augmented-LLM
an intro to retrieval augmented large language model
Combines retrieval-based techniques with LLMs to address hallucinations, outdated knowledge, and long-tail information gaps by dynamically fetching relevant external context during inference. The system integrates information retrieval (vector embeddings, semantic search) with prompt augmentation, allowing LLMs to ground responses in curated data sources while improving explainability and cost efficiency over simply expanding context windows. Covers architectural components including data ingestion/chunking pipelines, retrieval indexing strategies, and generation mechanisms—with analysis of frameworks like vector databases and embedding models commonly deployed in production RAG applications.
306 stars. No commits in the last 6 months.
Stars
306
Forks
24
Language
—
License
—
Category
Last pushed
Sep 09, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/Wang-Shuo/A-Guide-to-Retrieval-Augmented-LLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Renumics/renumics-rag
Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚
VectorInstitute/retrieval-augmented-generation
Reference Implementations for the RAG bootcamp
naver/bergen
Benchmarking library for RAG
KalyanKS-NLP/rag-zero-to-hero-guide
Comprehensive guide to learn RAG from basics to advanced.
alan-turing-institute/t0-1
Application of Retrieval-Augmented Reasoning on a domain-specific body of knowledge