HLasse/TextDescriptives
A Python library for calculating a large variety of metrics from text
Computes 40+ linguistic metrics including readability indices (Flesch-Kincaid, SMOG), dependency syntax statistics, POS tag proportions, and semantic coherence between sentences. Integrates as modular spaCy v3 pipeline components that attach results to Doc objects via custom extensions, or can be used standalone via the `extract_metrics()` API that automatically downloads appropriate language models. Outputs metrics as Pandas DataFrames or dictionaries for easy downstream analysis.
361 stars and 4,028 monthly downloads. Used by 1 other package. Available on PyPI.
Stars
361
Forks
25
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 30, 2026
Monthly downloads
4,028
Commits (30d)
0
Dependencies
6
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/HLasse/TextDescriptives"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
ryanjgallagher/shifterator
Interpretable data visualizations for understanding how texts differ at the word level
jboynyc/textnets
Text analysis with networks.
StatguyUser/TextFeatureSelection
Python library for feature selection for text features. It has filter method, genetic algorithm...
kasnerz/reffix
A tool for fixing a BibTeX reference list using DBLP API
harrisonpim/bookworm
:books: social networks from novels