cognitivefactory/llm-bias-analysis
Benchmark tool aimed at evaluating biases of large language models
No commits in the last 6 months.
Stars: —
Forks: —
Language: Jupyter Notebook
License: —
Category:
Last pushed: Feb 13, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/cognitivefactory/llm-bias-analysis"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
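For scripted use, the curl request above can be reproduced with Python's standard library. This is a minimal sketch, not a documented client. The names `build_quality_url`, `fetch_quality`, and the parameter name `collection` (for the `transformers` path segment) are hypothetical labels chosen for illustration. The response format is not stated on this page, so the code assumes JSON and falls back to the raw text if parsing fails.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(collection: str, owner: str, repo: str) -> str:
    """Append the three path segments seen in the curl example to BASE_URL.

    `collection` is a hypothetical name for the first segment
    ("transformers" in the example); its real meaning is not documented here.
    """
    segments = (collection, owner, repo)
    # Percent-encode each segment so unusual characters cannot alter the path.
    return BASE_URL + "/" + "/".join(quote(s, safe="") for s in segments)


def fetch_quality(collection: str, owner: str, repo: str, timeout: float = 10.0):
    """Perform an unauthenticated GET and return the decoded response.

    Assumption: the endpoint returns JSON. If the body is not valid JSON,
    the raw text is returned unchanged.
    """
    url = build_quality_url(collection, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body
```

Calling `fetch_quality("transformers", "cognitivefactory", "llm-bias-analysis")` issues the same request as the curl command. Unauthenticated calls count against the 100 requests/day limit, so results are worth caching when polling several repositories.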
Higher-rated alternatives
google-deepmind/long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper...
gnai-creator/aletheion-llm-v2
Decoder-only LLM with integrated epistemic tomography. Knows what it doesn't know.
sandylaker/ib-edl
Calibrating LLMs with Information-Theoretic Evidential Deep Learning (ICLR 2025)
nightdessert/Retrieval_Head
Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"
MLD3/steerability
An open-source evaluation framework for measuring LLM steerability.