jinhangjiang/textregress

TextRegress is a Python package designed to help researchers perform advanced regression analysis on long-form text data.

Researchers often need to predict numerical outcomes from long text documents, such as sentiment scores from reviews or risk levels from reports. TextRegress takes your text data and any additional numerical features, processes them, and outputs numerical predictions along with explanations of which parts of the text or which features contributed most. It is designed for quantitative researchers, data scientists, and analysts working with rich, unstructured text.

No commits in the last 6 months. Available on PyPI.

Use this if you need to build robust regression models that can accurately predict continuous values from extensive text documents, potentially combined with other structured data.

Not ideal if your primary goal is text classification (categorizing text) rather than predicting a numerical outcome, or if you only have short, simple text snippets.

quantitative-research text-analytics predictive-modeling data-science sentiment-analysis

Maintenance 2 / 25

Adoption 4 / 25

Maturity 25 / 25

Community 9 / 25

Language: Python

License: Apache-2.0

Higher-rated alternatives

chakki-works/seqeval

A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.).

Hironsan/anago

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

jbesomi/texthero

Text preprocessing, representation and visualization from zero to hero.

hamelsmu/ktext

Utilities for preprocessing text for deep learning with Keras

asahi417/tner

Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...
