GiftMungmeeprued/document-parsers-list
A comprehensive list of document parsers, covering PDF-to-text conversion and layout extraction. Each tested for support of tables, equations, handwriting, two-column layouts, and multi-column layouts.
177 stars. No commits in the last 6 months.
Stars
177
Forks
3
Language
—
License
—
Category
Last pushed
Jul 14, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/GiftMungmeeprued/document-parsers-list"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
google/langextract
A Python library for extracting structured information from unstructured text using LLMs with...
Extralit/extralit
Fast and accurate systemic data extraction with LLM assistance
oidlabs-com/Lexoid
Multimodal document parser for high quality data understanding and extraction
Keyvanhardani/german-ocr
German-OCR is specifically trained to extract text from German documents including invoices,...
parsee-ai/parsee-core
Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and...