rahulpunia29/extractous-go
Fast, multi-format document extraction library for Go. Includes streaming API for large files and OCR for scanned documents via Tesseract.
28
/ 100
Experimental
No Package
No Dependents
Maintenance
6 / 25
Adoption
8 / 25
Maturity
9 / 25
Community
5 / 25
Stars
55
Forks
2
Language
Go
License
Apache-2.0
Category
Last pushed
Oct 25, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/rahulpunia29/extractous-go"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jdkato/prose
:book: A Golang library for text processing, including tokenization, part-of-speech tagging, and...
44
ikawaha/kagome-dict
Dictionary Library for Kagome v2
44
aaaton/golem
A lemmatizer implemented in Go
39
codingpot/kiwigo
https://github.com/bab2min/Kiwi for go
39
habeanf/yap
Yet Another (natural language) Parser
38