RapidAI/RapidDoc
A high-performance, open-source PDF data extraction tool. 一站式开源高性能数据提取工具,将复杂 PDF 文档转换为 Markdown 和 JSON 格式,使用onnx模型。
146 stars.
Stars
146
Forks
28
Language
Python
License
Apache-2.0
Last pushed
Apr 10, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/document-ai/RapidAI/RapidDoc"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
mehmet-kozan/pdf-parse
Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs....
HIllya51/LunaTranslator
视觉小说翻译器 / Visual Novel Translator
ShareX/ShareX
ShareX is a free and open-source application that enables users to capture or record any area of...
btwld/docling-sdk
A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem and JavaScript/TypeScript.