appeler/ethnicolr

Predict Race and Ethnicity Based on the Sequence of Characters in a Name

65
/ 100
Established

Provides multiple prediction models trained on US Census, Florida voter registration, and Wikipedia data, with varying ethnic granularity depending on source. Uses TensorFlow 2.x neural networks to infer probabilities across demographic categories from first and/or last names, supporting both lookup tables for exact census matches and learned models for names absent from training data. Offers a modern Click-based CLI for batch processing, model management, and confidence interval estimation via Monte Carlo sampling, alongside a Python API for pandas DataFrames.

249 stars and 60,141 monthly downloads. Available on PyPI.

No License
Maintenance 6 / 25
Adoption 20 / 25
Maturity 17 / 25
Community 22 / 25

How are scores calculated?

Stars

249

Forks

62

Language

Jupyter Notebook

License

Last pushed

Dec 28, 2025

Monthly downloads

60,141

Commits (30d)

0

Dependencies

3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/appeler/ethnicolr"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.