deanmalmgren/textract
extract text from any document. no muss. no fuss.
4,482 stars and 388,531 monthly downloads. Used by 1 other package. Available on PyPI.
Stars: 4,482
Forks: 665
Language: HTML
License: MIT
Category:
Last pushed: Feb 04, 2026
Monthly downloads: 388,531
Commits (30d): 0
Dependencies: 10
Reverse dependents: 1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/deanmalmgren/textract"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
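The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client. The helper names `quality_url` and `fetch_quality` are hypothetical, the `nlp` path segment is copied from the curl example above, and the response is assumed to be JSON because the page does not document its format. How to pass an API key is also not documented here, so the sketch makes only the keyless request.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is built per request.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo given as 'owner/name' (hypothetical helper)."""
    owner, name = repo.split("/", 1)
    return f"{BASE_URL}/{quote(category)}/{quote(owner)}/{quote(name)}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the record for a repo.

    Assumption: the endpoint returns a JSON body. The page does not
    describe the response schema, so the parsed value is returned as-is.
    """
    request = urllib.request.Request(
        quality_url(category, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "deanmalmgren/textract")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it may be worth caching the result locally rather than calling the endpoint repeatedly.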
Related tools
deepdoctection/deepdoctection
A Repo For Document AI
eikek/docspell
Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources...
zzzDavid/ICDAR-2019-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic...
axa-group/Parsr
Transforms PDF, Documents and Images into Enriched Structured Data