mlchrzan/pairadigm

Concept-Guided Chain-of-Thought (CGCoT) pairwise annotation tool for systematic text evaluation using LLMs. Generate breakdowns, compare items, compute scores, and validate against human judgments. Supports Ollama, Hugging Face, Google Gemini, OpenAI, and Anthropic models.
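The description above mentions computing scores from pairwise comparisons but does not say how pairadigm does it. One common way to turn "A beat B" judgments into per-item scores is a Bradley-Terry model; the sketch below is an illustration under that assumption, not pairadigm's API. The function name `bradley_terry` and the `(winner, loser)` input format are hypothetical.

```python
from collections import defaultdict


def bradley_terry(comparisons, n_iter=200, tol=1e-9):
    """Estimate Bradley-Terry strengths from (winner, loser) pairs.

    Hypothetical helper for illustration; not part of pairadigm.
    Uses the standard minorize-maximize update and rescales the
    strengths after each pass so that they average 1.0.
    """
    items = sorted({x for pair in comparisons for x in pair})
    wins = defaultdict(float)          # total wins per item
    pair_counts = defaultdict(float)   # number of comparisons per unordered pair
    for winner, loser in comparisons:
        wins[winner] += 1.0
        pair_counts[frozenset((winner, loser))] += 1.0

    strengths = {i: 1.0 for i in items}
    for _ in range(n_iter):
        updated = {}
        for i in items:
            denom = 0.0
            for j in items:
                key = frozenset((i, j))
                if j != i and key in pair_counts:
                    denom += pair_counts[key] / (strengths[i] + strengths[j])
            # An item that never wins is driven to strength 0.
            updated[i] = wins[i] / denom if denom > 0 else strengths[i]
        total = sum(updated.values())
        updated = {i: v * len(items) / total for i, v in updated.items()}
        delta = max(abs(updated[i] - strengths[i]) for i in items)
        strengths = updated
        if delta < tol:
            break
    return strengths


# Toy data: "a" usually beats "b", and "b" and "c" split their comparisons.
toy_comparisons = [
    ("a", "b"), ("a", "b"), ("b", "a"),
    ("b", "c"), ("c", "b"),
    ("a", "c"),
]
scores = bradley_terry(toy_comparisons)
```

On this toy data the ranking comes out as a, then b, then c. In practice the comparisons would come from LLM or human judgments rather than a hand-written list.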

/ 100

Emerging

Available on PyPI.

Maintenance 13 / 25

Adoption 9 / 25

Maturity 18 / 25

Community 0 / 25


Stars

Forks

—

Language: Jupyter Notebook

License: Apache-2.0

Category: evaluation-frameworks-metrics

Last pushed: Mar 09, 2026

Monthly downloads: 247

Commits (30d)

Dependencies


Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/mlchrzan/pairadigm"

Open to everyone: 100 requests/day with no key needed, or 1,000/day with a free key.
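The same request can be made from Python with the standard library. This is a minimal sketch that assumes the endpoint returns JSON, which the listing does not confirm; the helper names `build_url` and `fetch_quality` are hypothetical, and only the URL pattern is taken from the curl command above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is owner/repo.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def build_url(owner, repo):
    """Build the quality endpoint URL for an owner/repo pair."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch the quality record (assumption: the response body is JSON)."""
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("mlchrzan", "pairadigm")` would issue the same GET request as the curl command. The response fields are not documented here, so inspect the returned object before relying on any particular key.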

Featured in

You're Shipping AI You Can't Measure

Higher-rated alternatives

EvolvingLMMs-Lab/lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

open-compass/VLMEvalKit

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

EuroEval/EuroEval

The robust European language model benchmark.

vibrantlabsai/ragas

Supercharge Your LLM Application Evaluations 🚀

evalplus/evalplus

Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
