bjascob/LemmInflect
A python module for English lemmatization and inflection.
Combines dictionary lookup with neural networks to handle out-of-vocabulary words, drawing morphology rules from the NIH's SPECIALIST Lexicon and using unigram frequencies for disambiguation. Works as a standalone module or integrates directly into spaCy via custom token extension methods (`lemma()` and `inflect()`). Accepts Universal Dependencies or Penn Treebank tags to specify target word forms, achieving 95.6% accuracy on 119K inflected words with minimal dependencies (only NumPy).
275 stars and 94,489 monthly downloads. Used by 3 other packages. No commits in the last 6 months. Available on PyPI.
Stars
275
Forks
25
Language
Python
License
MIT
Category
Last pushed
Sep 14, 2023
Monthly downloads
94,489
Commits (30d)
0
Dependencies
1
Reverse dependents
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/bjascob/LemmInflect"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
natasha/yargy
Rule-based facts extraction for Russian language
nert-nlp/streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
bretttolbert/verbecc
Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French, Italian, Portuguese and...
Ars-Linguistica/mlconjug3
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and...
google-research/turkish-morphology
A two-level morphological analyzer for Turkish.