yohasebe/wp2txt
A command-line tool to extract plain text from Wikipedia dumps with category and section filtering
57 / 100
Established
191 stars.
No Package
No Dependents
Maintenance: 10 / 25
Adoption: 10 / 25
Maturity: 16 / 25
Community: 21 / 25
Stars: 191
Forks: 37
Language: Ruby
License: MIT
Last pushed: Feb 23, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yohasebe/wp2txt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
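The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It rests on assumptions not confirmed by this page: the helper names `quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` URL pattern is inferred from the single curl example above, and the response body is assumed to be JSON.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repository such as 'yohasebe/wp2txt'.

    Assumption: other repositories follow the same category/owner/repo
    pattern as the one example shown on this page.
    """
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and decode it.

    Assumption: the endpoint returns a JSON body; its fields are not
    documented here, so the decoded value is returned unchanged.
    """
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# record = fetch_quality("nlp", "yohasebe/wp2txt")
# print(record)
```

With the keyless limit of 100 requests/day stated above, a script that polls many repositories would likely need to cache results or use a key.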
Related tools
Tiiiger/bert_score
BERT score for text generation
71
DerwenAI/pytextrank
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
69
BrikerMan/Kashgari
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for...
64
asyml/texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. ...
63
neuralmind-ai/portuguese-bert
Portuguese pre-trained BERT models
49