FareedKhan-dev/best-llm-finder-pipeline
Agentic RAG, Multi-Agent Systems, and Vision Reasoning are three pipelines to find the perfect LLM
Implements role-specific LLM evaluation by running models through three production-grade pipelines—agentic RAG with recursive document navigation, multi-agent systems with parallel ideation and critique loops, and vision-based form processing with OCR refinement—measuring performance via task-specific metrics (faithfulness, latency, cost) rather than isolated benchmarks. Uses OpenAI-compatible APIs to support multiple model providers (Nebius, Ollama, Together AI), enabling cost-optimized routing by deploying smaller models for fast classification and larger models for synthesis and reasoning tasks.
132 stars. No commits in the last 6 months.
Stars
132
Forks
26
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Aug 20, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/FareedKhan-dev/best-llm-finder-pipeline"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
pipeshub-ai/pipeshub-ai
PipesHub is a fully extensible and explainable workplace AI platform for enterprise search and...
xerrors/Yuxi
结合知识库管理的 Agent Harness 平台。 An agent harness that integrates a LightRAG knowledge base and...
xerrors/Yuxi-Know
结合LightRAG 知识库的知识图谱智能体平台。 An agent platform that integrates a LightRAG knowledge base and...
daxa-ai/pebblo
Pebblo enables developers to safely load data and promote their Gen AI app to deployment
graphlit/graphlit-client-python
Python client library for Graphlit Platform