parsee-ai/parsee-core
Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular data extraction and multimodal queries.
Stars
83
Forks
2
Language
Python
License
MIT
Category
Last pushed
Jan 07, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/parsee-ai/parsee-core"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
google/langextract
A Python library for extracting structured information from unstructured text using LLMs with...
Extralit/extralit
Fast and accurate systemic data extraction with LLM assistance
oidlabs-com/Lexoid
Multimodal document parser for high quality data understanding and extraction
Keyvanhardani/german-ocr
German-OCR is specifically trained to extract text from German documents including invoices,...
davendw49/sciparser
PDF parsing toolkit for preparing academic text corpus