rag_api and rag-forge

These are complements: the first provides a production RAG API implementation while the second provides systematic benchmarking to optimize the chunking, embedding, and retrieval parameters that such an API would use.

rag_api
67
Established
rag-forge
22
Experimental
Maintenance 16/25
Adoption 10/25
Maturity 16/25
Community 25/25
Maintenance 13/25
Adoption 0/25
Maturity 9/25
Community 0/25
Stars: 772
Forks: 344
Downloads:
Commits (30d): 4
Language: Python
License: MIT
Stars:
Forks:
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No Package No Dependents
No Package No Dependents

About rag_api

danny-avila/rag_api

ID-based RAG FastAPI: Integration with Langchain and PostgreSQL/pgvector

Organizes embeddings by `file_id` to enable targeted, file-level vector retrieval with metadata filtering—particularly useful for multi-document RAG scenarios. Supports multiple embedding providers (OpenAI, Azure, Hugging Face, Bedrock, Ollama, Google) and vector backends beyond pgvector, with configurable chunking, batching, and async processing for scalability. Designed as a pluggable service for LibreChat but works as a standalone ID-based document indexing API with optional JWT authentication.

About rag-forge

Dyinu/rag-forge

Benchmark multiple chunking, embedding, and retrieval combinations for RAG pipelines to find the most effective setup without manual testing.

Scores updated daily from GitHub, PyPI, and npm data. How scores work