overcastbulb/llm-eval-framework
Evaluation framework for LLMs and RAG pipelines LLM-as-a-judge scoring, hallucination detection, semantic similarity, BLEU/ROUGE, and a live dashboard.
Stars
—
Forks
—
Language
Python
License
MIT
Category
Last pushed
Mar 11, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/overcastbulb/llm-eval-framework"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
artitw/text2text
Text2Text Language Modeling Toolkit
Azure-Samples/azure-ai-document-processing-samples
A collection of samples demonstrating techniques for processing documents with Azure AI...
build-on-aws/langchain-embeddings
This repository demonstrates the construction of a state-of-the-art multimodal search engine,...
aiplanethub/beyondllm
Build, evaluate and observe LLM apps
cofin/mogemma
🔥 Python / Mojo Interface for Google Gemma 3