jinhangjiang/textregress
TextRegress is a Python package designed to help researchers perform advanced regression analysis on long-form text data.
Researchers often need to predict numerical outcomes based on long text documents, like sentiment scores from reviews or risk levels from reports. This project helps by taking your text data and any additional numerical features, processing them, and then outputting precise numerical predictions along with explanations of which parts of the text or features contributed most. It's designed for quantitative researchers, data scientists, and analysts working with rich, unstructured text.
No commits in the last 6 months. Available on PyPI.
Use this if you need to build robust regression models that can accurately predict continuous values from extensive text documents, potentially combined with other structured data.
Not ideal if your primary goal is text classification (categorizing text) rather than predicting a numerical outcome, or if you only have short, simple text snippets.
Stars
7
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
Jul 06, 2025
Commits (30d)
0
Dependencies
9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jinhangjiang/textregress"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
chakki-works/seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
Hironsan/anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
jbesomi/texthero
Text preprocessing, representation and visualization from zero to hero.
hamelsmu/ktext
Utilities for preprocessing text for deep learning with Keras
asahi417/tner
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...