pemistahl/lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Combines rule-based and statistical Naive Bayes classification without neural networks or external dependencies, enabling fully offline detection. Backed by compiled Rust bindings for performance and minimal memory footprint while supporting 75 languages with specialized accuracy on short text. Suitable for NLP preprocessing pipelines including text classification, email routing, and spell checking applications.
1,659 stars. Actively maintained with 4 commits in the last 30 days.
Stars
1,659
Forks
57
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 09, 2026
Commits (30d)
4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/pemistahl/lingua-py"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
indix/whatthelang
Lightning Fast Language Prediction 🚀
nickdavidhaynes/spacy-cld
Language detection extension for spaCy 2.0+
mbanon/fastspell
Targetted language identifier, based on FastText and Hunspell.
nitotm/efficient-language-detector-js
Fast and accurate natural language detection. Detector written in Javascript. Nito-ELD, ELD.
nitotm/efficient-language-detector
Fast and accurate natural language detection. Detector written in PHP. Nito-ELD, ELD.