jbesomi/texthero
Text preprocessing, representation and visualization from zero to hero.
Provides composable preprocessing functions, vector representation methods (TF-IDF, embeddings), and clustering and dimensionality reduction for exploratory analysis, all integrated with pandas DataFrames for tabular workflows. Built as a wrapper around Gensim, NLTK, spaCy, and scikit-learn, it hides the work of coordinating multiple NLP libraries behind a unified pipeline API.
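To make the idea of composable preprocessing concrete, here is a minimal standard-library sketch of small text-cleaning steps chained into one pipeline. It illustrates the pattern only and is not texthero's API: the function names, the `clean` helper, and the sample strings are all assumptions made for this example.

```python
import re
import string
from functools import reduce
from typing import Callable, Iterable, List

# A preprocessing step is any function that maps a string to a string.
Step = Callable[[str], str]

# Translation table that turns every ASCII punctuation mark into a space.
_PUNCT_TO_SPACE = str.maketrans(string.punctuation, " " * len(string.punctuation))


def lowercase(text: str) -> str:
    return text.lower()


def remove_digits(text: str) -> str:
    # Replace each run of digits with a single space.
    return re.sub(r"\d+", " ", text)


def remove_punctuation(text: str) -> str:
    return text.translate(_PUNCT_TO_SPACE)


def remove_whitespace(text: str) -> str:
    # Collapse runs of whitespace and trim both ends.
    return " ".join(text.split())


def clean(texts: Iterable[str], pipeline: List[Step]) -> List[str]:
    """Apply each step of the (hypothetical) pipeline, in order, to every text."""
    return [reduce(lambda acc, step: step(acc), pipeline, t) for t in texts]


docs = ["Hello, World! 123", "  Text   PREPROCESSING in 3 steps. "]
cleaned = clean(docs, [lowercase, remove_digits, remove_punctuation, remove_whitespace])
print(cleaned)
```

Because each step has the same string-to-string shape, steps can be reordered, dropped, or swapped without touching the others, which is the property a unified pipeline API relies on.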
2,911 stars and 1,470 monthly downloads. No commits in the last 6 months. Available on PyPI.
Stars: 2,911
Forks: 237
Language: Python
License: MIT
Category:
Last pushed: Aug 29, 2023
Monthly downloads: 1,470
Commits (30d): 0
Dependencies: 11
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jbesomi/texthero"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
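The same request can be sketched in Python with only the standard library. Two things here are assumptions rather than documented behavior: that the path follows a category/owner/repo pattern (generalized from the single URL shown above), and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL; the category/owner/repo layout is an assumption."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository, assuming a JSON response body."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network call and counts against the daily request limit.
    print(fetch_quality("nlp", "jbesomi", "texthero"))
```

Keeping URL construction separate from the network call lets the path logic be checked without spending any of the daily request quota.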
Related tools
chakki-works/seqeval
A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.)
asahi417/tner
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...
Hironsan/anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
hamelsmu/ktext
Utilities for preprocessing text for deep learning with Keras
shibing624/nerpy
🌈 NERpy: Implementation of Named Entity Recognition using Python....