matt-bentley/LLM-RAG-Architecture

Production-grade Retrieval Augmented Generation (RAG) architecture using Open Source components

42
/ 100
Emerging

Implements hybrid search combining dense embeddings (BAAI/bge-small-en-v1.5) with BM25 sparse vectors through Reciprocal Rank Fusion in Qdrant, plus cross-encoder reranking for result quality. Built on .NET with Semantic Kernel orchestration, integrating FastAPI Python services for embeddings and reranking, with support for multiple LLM backends (Azure OpenAI, OpenAI, Ollama) and PdfPig-based document extraction strategies.

No Package No Dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 9 / 25
Community 16 / 25

How are scores calculated?

Stars

27

Forks

7

Language

C#

License

MIT

Category

dotnet-azure-rag

Last pushed

Jan 12, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/matt-bentley/LLM-RAG-Architecture"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.