Uncategorized Document AI Tools

There are 154 uncategorized tools tracked. 4 score above 70 (verified tier). The highest-rated is opendatalab/MinerU at 80/100 with 59,166 stars. 8 of the top 10 are actively maintained.

Get all 154 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=document-ai&subcategory=uncategorized&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Tool	Score	Tier	Stars	Language
1	opendatalab/MinerU Transforms complex documents like PDFs into LLM-ready markdown/JSON for your...	80	Verified	59,166	Python
2	mehmet-kozan/pdf-parse Pure TypeScript, cross-platform module for extracting text, images, and...	76	Verified	173	TypeScript
3	HIllya51/LunaTranslator 视觉小说翻译器 / Visual Novel Translator	71	Verified	11,129	C++
4	ShareX/ShareX ShareX is a free and open-source application that enables users to capture...	71	Verified	36,124	C#
5	btwld/docling-sdk A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem...	69	Established	25	TypeScript
6	STranslate/STranslate A ready-to-go translation ocr tool developed with WPF/WPF 开发的一款即用即走的翻译、OCR工具	69	Established	6,348	C#
7	tisfeng/Easydict 一个简洁优雅的词典翻译 macOS App。开箱即用，支持离线 OCR 识别，支持有道词典，🍎 苹果系统词典，🍎...	68	Established	12,798	Swift
8	zclucas/RMT RMT (RuoMengTu) is a free, open-source macro tool built on AHKv2. Let the...	68	Established	939	AutoHotkey
9	readur/readur Quick, painless, intuitive OCR platform written in Rust and TypeScript....	68	Established	725	Rust
10	pymupdf/PyMuPDF PyMuPDF is a high performance Python library for data extraction, analysis,...	68	Established	9,422	Python
11	run-llama/llama-cloud-py Python SDK for OCR and document parsing in the cloud with LlamaParse	67	Established	20	Python
12	TheJoeFin/Text-Grab Use OCR in Windows quickly and easily with Text Grab. With optional...	67	Established	4,695	C#
13	docling-project/docling Get your documents ready for gen AI	67	Established	57,530	Python
14	ocrmypdf/OCRmyPDF OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched	67	Established	33,179	Python
15	RapidAI/RapidOCR 📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime,...	66	Established	6,282	Python
16	bpwhelan/GameSentenceMiner An immersion toolkit for learning Languages through games and other visual media.	65	Established	559	Python
17	datalab-to/chandra OCR model that handles complex tables, forms, handwriting with full layout.	65	Established	8,500	Python
18	xushengfeng/eSearch 截屏离线OCR 搜索翻译以图搜图贴图录屏万向滚动截屏屏幕翻译 Screenshot Offline OCR Search ...	65	Established	6,344	TypeScript
19	run-llama/liteparse A fast, helpful, and open-source document parser	64	Established	4,104	TypeScript
20	zai-org/GLM-OCR GLM-OCR: Accurate × Fast × Comprehensive	64	Established	5,782	Python
21	pytr-org/pytr Use TradeRepublic in terminal and mass download all documents	64	Established	710	Python
22	CCExtractor/ccextractor CCExtractor - Official version maintained by the core team	64	Established	881	C
23	felipeall/resumeio-to-pdf Download your resume from resume.io as PDF	63	Established	795	Python
24	mittagessen/kraken OCR engine for all the languages	62	Established	977	Python
25	seanghay/sone Declarative Canvas layout engine for JavaScript with advanced rich text support.	60	Established	90	TypeScript
26	ballerine-io/ballerine Open-source infrastructure and data orchestration platform for risk decisioning	60	Established	2,377	TypeScript
27	thanhkeke97/RSTGameTranslation 🎮 Real-time Game Translation Tool \| OCR + AI Translation \| Windows Gaming \|...	59	Established	478	C#
28	hankei6km/gas-gocr2notion Google Drive で OCR を行い、結果を Notion データベースへ送信する Google Apps Script ライブラリー。	59	Established	9	TypeScript
29	formkiq/formkiq-core Open-source document management platform leveraging AWS managed services....	58	Established	154	Java
30	Achno/gowall A tool to convert a Wallpaper's color scheme / palette, OCR with VLM's...	58	Established	2,061	Go
31	RapidAI/RapidDoc A high-performance, open-source PDF data extraction tool. ...	58	Established	146	Python
32	ArtifexSoftware/mupdf.js JavaScript bindings for MuPDF	58	Established	597	—
33	meangrinch/MangaTranslator Manga translation app powered by AI	57	Established	170	Python
34	oomol-lab/pdf-craft PDF craft can convert PDF files into various other formats. This project...	57	Established	5,319	Python
35	TareHimself/manga-translator A manga translator built with python	56	Established	104	Python
36	shibing624/imgocr Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB),...	56	Established	129	Python
37	zibo-chen/rust-paddle-ocr 高性能OCR识别库，支持上百种语言，提供命令行、图形界面及C API多种调用方式，使用便捷高效。 High-performance OCR...	56	Established	210	Rust
38	dynobo/normcap OCR powered screen-capture tool to capture information instead of images	55	Established	2,571	Python
39	wikimedia/wikimedia-ocr API wrapper enabling Wikisources to submit images for optical character recognition.	55	Established	16	PHP
40	ispras/dedoc Dedoc is a library (service) for automate documents parsing and bringing to...	55	Established	656	Python
41	uptonking/note4yaoo daily notes	54	Established	78	HTML
42	scribeocr/scribeocr Web interface for recognizing text, proofreading OCR, and creating...	53	Established	774	JavaScript
43	bzsanti/oxidizePdf a PDF library for rust	53	Established	165	Rust
44	UB-Mannheim/escriptorium Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim	53	Established	34	Python
45	arvindrajan92/DTrOCR A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical...	53	Established	203	Python
46	ecdye/macSubtitleOCR Convert bitmap subtitles into SubRip format using the macOS Vision framework	53	Established	39	Swift
47	rtr46/meikipop meikipop - universal japanese ocr popup dictionary for windows, linux and macos	51	Established	263	Python
48	mindee/mindee-api-nodejs Mindee API Helper Library for Node.js	51	Established	26	TypeScript
49	ocrbase-hq/ocrbase 📄 PDF ->.MD/.JSON API & SDK for PaddleOCR-VL with structured data...	51	Established	989	TypeScript
50	danpla/dpscreenocr A program to recognize text on the screen	51	Established	287	C++
51	marieai/marie-ai Complex data extraction and orchestration framework designed for processing...	51	Established	87	Python
52	GeiserX/paperless-telegram-bot Manage Paperless-NGX documents entirely through Telegram - upload, search,...	51	Established	6	Python
53	scribeocr/scribe.js JavaScript OCR and text extraction for images and PDFs.	51	Established	269	JavaScript
54	Roots-Automation/GutenOCR Open-source tools for training and evaluating Vision Language Models for OCR	50	Established	181	Python
55	R0Wi-DEV/workflow_ocr This is a Nextcloud Workflow App which enables you to process files via OCR...	50	Established	93	PHP
56	Purfview/InpaintDelogo Advanced delogo plugin for AviSynth+	50	Established	95	—
57	doo/scanbot-sdk-example-flutter Easy-to-use Flutter document scanner and data extraction plugin	50	Established	104	Dart
58	ZANdewanai/Genshin-Impact-Rich-Presence Genshin Impact Rich Presence	50	Established	37	Python
59	NitishKumar-ai/PersonalLearningPro PersonalLearningPro is an open‑source, AI-powered school learning platform...	50	Established	17	TypeScript
60	wxyhgk/retain-pdf 在保留版面、公式与结构的前提下进行 PDF 翻译，适用于科研与技术文档	50	Established	300	Python
61	wolfmanstout/screen-ocr Easily perform OCR on portions of the screen, choosing from a selection of backends.	50	Established	50	Python
62	OuterSpaceHobo/ScanLingua Free open source chrome extension for immersive japanese, chinese and korean...	49	Emerging	26	TypeScript
63	dills122/MTG-Card-Analyzer (🚧) Analyze images of MTG cards (Clean up in progress)	48	Emerging	12	JavaScript
64	SnapXL/SnapX SnapX is a free, open-source, cross-platform tool that lets you capture or...	48	Emerging	875	C#
65	workoss/boot 常用的工具或者服务	48	Emerging	6	Java
66	yomihon/yomihon Free and open source manga reader for Android - now with OCR (text recognition)	46	Emerging	87	Kotlin
67	monchin/tablers A blazingly fast PDF table extraction library with python API powered by Rust	46	Emerging	7	Rust
68	CarvingIT/smart-repository Institutional Repository of Knowledge	46	Emerging	7	JavaScript
69	yash2974/Zenpark 🔐 Zenpark – Smart Parking Management System Zenpark is an AI-powered smart...	45	Emerging	3	TypeScript
70	ZingYao/autogo_scriptengine AutoGo 脚本引擎扩展方案 - 为 AutoGo 提供 JavaScript 和 Lua 双引擎支持，包含 20+...	45	Emerging	13	Go
71	zai-org/GLM-skills Official skills for the GLM family of models.	45	Emerging	297	Python
72	run-llama/llama-cloud-ts Typescript SDK for OCR and document parsing in the cloud with LlamaParse	45	Emerging	10	TypeScript
73	thomaswantstobeaskeleton/BallonsTranslator-Pro BallonsTranslatorPro — Community fork of BallonsTranslator....	45	Emerging	15	Python
74	nordie92/AoE4BO Age of Empires 4 overlay to lern build orders	45	Emerging	51	C#
75	veryfi/veryfi-nodejs Node.js module for communicating with the Veryfi OCR API. \|\| read:...	45	Emerging	32	JavaScript
76	openva/rs-video-processor The video OCR processor for Richmond Sunlight.	44	Emerging	3	PHP
77	doo/scanbot-sdk-example-ios Easy-to-use iOS document scanner and data extraction library for native iOS apps	44	Emerging	49	Swift
78	mindee/mindee-api-ruby Mindee API Helper Library for Ruby	44	Emerging	14	Ruby
79	Ronin-CK/QuickSnip ⚡ Lightweight Wayland OCR & Google Lens utility built with Quickshell.	43	Emerging	156	QML
80	ieasybooks/tahweel.rb تحويل ملفات PDF إلى TXT و DOCX و JSON	43	Emerging	3	Ruby
81	jasperdevs/yoink Free, open-source screenshot tool. Capture, annotate, and share with a single hotkey.	42	Emerging	150	C#
82	firecrawl/pdf-inspector Fast Rust library for PDF inspection, classification, and text extraction....	42	Emerging	62	Rust
83	Cilda/UmaUmaChecker ウマ娘で選択肢のステータスを画像認識によって可視化する	42	Emerging	48	C++
84	NirKli/WattBot Smart OCR app for reading electricity meters from images	42	Emerging	2	TypeScript
85	simpledms/simpledms Document management for small businesses.	41	Emerging	132	Go
86	doo/scanbot-sdk-maui-example Easy-to-use .NET MAUI document scanner and data extraction library	41	Emerging	30	C#
87	regulaforensics/DocumentReader-web-js-client Regula Document Reader web API js client for the browser and node.js based on axios	41	Emerging	13	TypeScript
88	prabesh704/ImageOCR 🖼️ Build an offline image search engine that indexes images and text for...	41	Emerging	3	Java
89	SeseydOw/Captcha-Bypass-Tool pentesting tool to bypass captcha Steam, Gmail, Instagram, Facebook,...	40	Emerging	7	C#
90	Sixt/tensorlake-go Go SDK for the Tensorlake API: document intelligence, cloud sandboxes, PTY...	40	Emerging	2	Go
91	Agions/HardSubX A professional video hard subtitle extraction tool with OCR. Extract...	40	Emerging	14	Vue
92	veryfi/veryfi-lens-react-native-demo Example Demo App about how to use our react native wrapper	39	Emerging	9	TypeScript
93	Akronae/windows_media_ocr_cli 🔎 OCR CLI that outputs structured data with bounding rects using local...	38	Emerging	11	C#
94	PELock/Dekoder-AZTEC-2D-JavaScript Dekoder Kodu AZTEC 2D z Dowodu Rejestracyjnego dla JavaScript i Node.js (Web...	37	Emerging	3	JavaScript
95	1003129155/jietuba A screenshot ocr and clipboard manager, available only on Windows 10+. ...	37	Emerging	26	Python
96	abishekgiri/boring-ai Self-hosted AI expense manager that turns receipts into structured data and...	36	Emerging	2	JavaScript
97	clark-labs-inc/pdfsink-rs Fast pure-Rust PDF extraction library and CLI by Clark Labs Inc. — 10–50x...	36	Emerging	2	Rust
98	narzaut/translator Translator Overlay	35	Emerging	6	Python
99	hcmhcs/screenTranslate Screen translation app for macOS — select any area, get instant translation....	35	Emerging	25	Swift
100	nonwill/nonwill.github.io CDN data of www.autoptr.top.	35	Emerging	21	HTML
101	dsebastien/obsidian-transcriber Transcriber converts images in your Obsidian vault to Markdown using Ollama...	35	Emerging	12	TypeScript
102	TroniePh/SmartMacroAI Advanced Windows & Web Automation Tool. Features: Record Clicks/Scrolls,...	35	Emerging	7	C#
103	hiimmuc/OCR-Handwritten-equations-solver Handwritten equation solver: OCR, DNN, Flask	35	Emerging	2	Python
104	neanes/byzantine-chant-ocr An OCR toolset for Byzantine chant notation	34	Emerging	9	Python
105	doo/scanbot-sdk-example-capacitor-ionic Easy-to-use Ionic and Capacitor document scanner and data extraction library...	34	Emerging	38	TypeScript
106	JimEverest/fastshot Fastshot is a GenAI powered screenshot and annotation tool designed to...	34	Emerging	9	Python
107	veryfi/veryfi-lens-receipts-android-demo Example codes about how to use Veryfi Lens SDKs	33	Emerging	8	Kotlin
108	Ganymede-Bio/gridgulp Automatically detect and extract tables from Excel, CSV, and text files.	33	Emerging	11	Python
109	veryfi/veryfi-lens-long-receipts-android-demo Example codes about how to use Veryfi Lens SDKs	33	Emerging	6	Kotlin
110	0pen-Sourcer/Complete-Utility-App A versatile desktop app offering wide range of tools for media downloading,...	33	Emerging	5	Python
111	abbasZaidi110/n8n-Parse-Invoices-Documents-with-Gemini-AI-OCR-and-Google-Sheets-Integration 📄 Streamline invoice processing by integrating n8n with Gemini AI OCR and...	32	Emerging	3	—
112	KyleDerZweite/spellbook Self-hosted TCG collection manager with mobile scanning, OCR recognition,...	32	Emerging	6	Svelte
113	rampaa/Tsukikage Hover-based output sender for OwOCR results	32	Emerging	16	C#
114	R0mb0/PDF_accessibility_fixer Client-side tool to check and fix PDF accessibility. Analyze PDFs for text...	32	Emerging	6	JavaScript
115	veryfi/veryfi-lens-headless-receipts-android-demo Example codes about how to use Veryfi Lens SDKs	31	Emerging	2	Kotlin
116	yelog/SnapTraTranslator 一款 macos 离线快速翻译软件	31	Emerging	56	Swift
117	doerfli/reeper Recipe management web application - parse recipes from image using AI based OCR	31	Emerging	2	Ruby
118	victorfu/snap-tray SnapTray is a tray-native screenshot and recording tool for macOS and...	31	Emerging	12	C++
119	veryfi/veryfi-lens-barcodes-android-demo Example about how to user Veryfi Lens for Barcodes	31	Emerging	2	Kotlin
120	cyanyux/pdf-ocr Self-hosted GPU-accelerated OCR web app — convert scanned PDFs to searchable...	30	Emerging	2	Python
121	iLejuxepWaduzd/structured-data-extractor 🛠️ Extract structured data from messy texts using Chain-of-Thought prompting...	30	Emerging	2	C#
122	nxoti1/POINTS-Reader-OCR 🖥️ Extract text from images easily with POINTS-Reader OCR, a high-accuracy...	30	Emerging	2	Python
123	rdantassilva/pdf2ocr A CLI tool to apply OCR on PDF files and export to multiple formats	30	Emerging	2	Python
124	DCC-BS/docling-glm-ocr A docling plugin to integrate a remote hosted GLM-OCR OCR model into docling	29	Experimental	9	Python
125	silenthillzeroq-code/clipnova Windows clipboard manager that turns clipboard history into notes,...	29	Experimental	14	TypeScript
126	TanyaMushonga/skymarshal-api Intelligent aerial traffic monitoring system featuring real-time vehicle...	28	Experimental	2	Python
127	Sabastincruzz/Tools_DeepSeekOCR 🖥️ Deploy DeepSeek-OCR for Optical Character Recognition directly from...	28	Experimental	2	Python
128	XUNRANA/LNU-LibSeat-Automation 🎯 辽宁大学图书馆自习室座位自动预约工具 \| GUI 双击即用 · 验证码 OCR · 多账号并发 · 精准卡点 · 邮件通知 \| Python + Selenium	28	Experimental	2	Python
129	r-uben/socr Multi-engine OCR with cascading fallback, quality audit, and figure extraction	28	Experimental	2	Python
130	dkorbelainen/sniptext screen text extractor with OCR and spell correction	27	Experimental	4	Python
131	sw-willie-wu/MediaTranX AI-powered local multimedia toolkit — speech-to-text, translation,...	27	Experimental	3	Python
132	nikazzio/universal-iiif-studio Modular tool for Digital Humanities: IIIF downloader + Studio environment....	27	Experimental	3	Python
133	JordanCoin/openfoia Local-first FOIA automation with AI-powered document analysis. Your data...	27	Experimental	4	Python
134	hyperpolymath/presswerk High-assurance local print router/server — Dioxus 0.7 mobile app with...	27	Experimental	3	Rust
135	Ajatt-Tools/lancet OCR application for reading manga in Japanese, made for AJATTers 🇯🇵 .	27	Experimental	3	Python
136	ieasybooks/tahweel-tauri تحويل ملفات PDF إلى TXT و DOCX و JSON	27	Experimental	3	TypeScript
137	dimitar-radenkov/SnippingTool Lightweight WPF screen capture, annotation, OCR and screen recorder for...	27	Experimental	4	C#
138	MonDevHub/monocr The MonOCR Platform: Academic-grade OCR for the Mon language....	26	Experimental	5	Kotlin
139	lzhgus/Capso Open-source screenshot and screen recording for macOS. The free, native...	26	Experimental	5	Swift
140	arikusi/sahaf Local PDF & EPUB to Markdown converter with OCR — runs on your hardware, no...	26	Experimental	2	Python
141	qyinm/duckdocs macOS app that parses PDF and Word documents into linked markdown packages using AI.	26	Experimental	2	Rust
142	run-llama/ParseBench ParseBench - A Document Parsing Benchmark for AI Agents	26	Experimental	5	Python
143	PaoloESAN/LN-Translator-Mobile An Android app to translate Japanese Light Novels directly from your screen...	26	Experimental	2	Kotlin
144	XMuli/QuickUtilitiesSuite Curated window utilities to boost your workflow (Quick ColorPicke, Quick...	25	Experimental	5	—
145	misraj-ai/kawn-python The official Python SDK for kawn.ai by Misraj AI. High-performance Arabic...	25	Experimental	3	Python
146	Lianye-Scythe/OCRTranslator Portable Windows OCR / AI desktop tool with screenshot, selected-text, and...	25	Experimental	3	Python
147	Alan5168/fapiao-clipper 发票夹子 - 本地大模型驱动的发票自动识别与报销管理工具（适配中国发票）	25	Experimental	3	Python
148	oskarasadullin/speechma-api Free, unlimited text-to-speech API with 486+ AI voices — unofficial Python...	25	Experimental	3	Python
149	AntoC-dev/Recipedia 📱 A React Native recipe management app with OCR scanning, shopping lists,...	24	Experimental	3	TypeScript
150	veryfi/veryfi-lens-ocr-android-demo Veryfi Lens OCR to read codes, numbers and short text	23	Experimental	2	Kotlin
151	duck-ai-yy/ex-ai 前任AI — 前任.skill 的零门槛替代。上传微信聊天记录，生成前任AI数字分身 Prompt。不用安装，打开网页就能用。	23	Experimental	1	JavaScript
152	TimLChan/steakcam steakcam - Get notified when there is a 72oz steak challenge happening at...	22	Experimental	2	Python
153	EdgeTypE/OldTurkicOCR Pure-Rust Old Turkic (Gokturk) OCR engine powered by ResNet	18	Experimental	5	Rust
154	DevadattaP/math_to_latex Converting handwritten mathematical expressions to LaTeX using...	18	Experimental	2	Jupyter Notebook