marieai/marie-ai
A framework for complex data extraction and orchestration, designed for processing unstructured documents. It integrates AI-powered document pipelines (GenAI, LLM, VLLM) into your applications and supports tasks such as document cleanup, optical character recognition (OCR), classification, splitting, named entity recognition, and form processing.
Stars: 87
Forks: 11
Language: Python
License: MIT
Last pushed: Apr 08, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/document-ai/marieai/marie-ai"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
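The same request can be made from Python. The following is a minimal sketch, not an official client: it assumes the endpoint returns JSON and that the path follows a category/owner/repo pattern, which is inferred only from the single URL shown above. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality endpoint URL for one repository.

    Assumption: the path is <category>/<owner>/<repo>, as in the
    single documented example; other categories are not confirmed.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record without an API key (100 requests/day tier).

    Assumption: the response body is JSON; its schema is not
    documented here, so the parsed value is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("document-ai", "marieai", "marie-ai")` issues the same GET request as the curl command. Because the keyless tier is limited to 100 requests per day, it may be worth caching the result rather than fetching it on every run.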
Related tools
opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
mehmet-kozan/pdf-parse
Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs....
HIllya51/LunaTranslator
Visual Novel Translator
ShareX/ShareX
ShareX is a free and open-source application that enables users to capture or record any area of...
btwld/docling-sdk
A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem and JavaScript/TypeScript.