LeonardoCofone/cofone
Cofone is a small python library to manage RAG
Combines BM25 and FAISS semantic search for retrieval, with support for 19 LLM providers (OpenAI, Anthropic, Gemini, Groq, Ollama, etc.) and 10 embedding options including fully local inference. Handles diverse sources—files, PDFs, URLs, Wikipedia, YouTube transcripts—with automatic document chunking, chat memory, streaming, and Pydantic-based structured output via a fluent Python API.
Available on PyPI.
Stars
9
Forks
—
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Mar 05, 2026
Commits (30d)
0
Dependencies
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/LeonardoCofone/cofone"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
OpenBMB/UltraRAG
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Quansight/ragna
RAG orchestration framework ⛵️
microsoft/rag-time
RAG Time: A 5-week Learning Journey to Mastering RAG
microsoft/rag-experiment-accelerator
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the...
AnkitNayak-eth/EpsteinFiles-RAG
A RAG pipeline implementation built on the 'Epstein Files 20K' dataset from Hugging Face (Teyler).