MilaNLProc/honest
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
Evaluates binary-gender bias across six languages (English, Italian, French, Portuguese, Romanian, Spanish) and LGBTQIA+ stereotypes in English, using a template- and lexicon-based methodology. Integrates with HuggingFace's `transformers` library to score masked and autoregressive language models (e.g., BERT, GPT-2) by comparing their top-k completions against curated lexicons of hurtful words. The package provides structured templates and an `HonestEvaluator` class that computes an aggregate bias score from model predictions on stereotype-laden sentence fragments.
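A minimal usage sketch for the masked-LM case, adapted from the usage pattern in the project's README; the `templates(data_set=...)` and `honest(...)` method names follow that example and should be checked against the installed release:

```python
from transformers import AutoTokenizer, pipeline
from honest import honest

k = 5  # score the top-k completions per template
model_name = "bert-base-uncased"

# Load the English evaluator and its binary-gender templates.
evaluator = honest.HonestEvaluator("en")
masked_templates = evaluator.templates(data_set="binary")

# Fill each template's [M] slot with the model's top-k predictions.
tokenizer = AutoTokenizer.from_pretrained(model_name)
nlp_fill = pipeline("fill-mask", model=model_name, top_k=k)
filled_templates = [
    [fill["token_str"].strip()
     for fill in nlp_fill(sentence.replace("[M]", tokenizer.mask_token))]
    for sentence in masked_templates.keys()
]

# Aggregate HONEST score: the fraction of completions that match the hurtful lexicon.
honest_score = evaluator.honest(filled_templates)
print(model_name, k, honest_score)
```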
No commits in the last 6 months. Available on PyPI.
Stars: 21
Forks: 4
Language: Python
License: MIT
Last pushed: Apr 08, 2025
Monthly downloads: 51
Commits (30d): 0
Dependencies: 3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/MilaNLProc/honest"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
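For scripted access, the same endpoint can be queried from Python; this sketch assumes only that the endpoint returns JSON, as the curl example suggests:

```python
import requests

URL = "https://pt-edge.onrender.com/api/v1/quality/embeddings/MilaNLProc/honest"

# Anonymous access is rate-limited to 100 requests/day; a free key raises this to 1,000/day.
resp = requests.get(URL, timeout=10)
resp.raise_for_status()
print(resp.json())
```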
Higher-rated alternatives
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
yannvgn/laserembeddings
LASER multilingual sentence embeddings as a pip package
harmonydata/harmony
The Harmony Python library: a research tool for psychologists to harmonise data and...
embeddings-benchmark/results
Data for the MTEB leaderboard
fresh-stack/freshstack
This repository helps you evaluate your models on the FreshStack benchmark!