Uncategorized Document AI Tools
There are 154 uncategorized tools tracked. 4 score above 70 (verified tier). The highest-rated is opendatalab/MinerU at 80/100 with 59,166 stars. 8 of the top 10 are actively maintained.
Get all 154 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=document-ai&subcategory=uncategorized&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your... |
|
Verified |
| 2 |
mehmet-kozan/pdf-parse
Pure TypeScript, cross-platform module for extracting text, images, and... |
|
Verified |
| 3 |
HIllya51/LunaTranslator
视觉小说翻译器 / Visual Novel Translator |
|
Verified |
| 4 |
ShareX/ShareX
ShareX is a free and open-source application that enables users to capture... |
|
Verified |
| 5 |
btwld/docling-sdk
A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem... |
|
Established |
| 6 |
STranslate/STranslate
A ready-to-go translation ocr tool developed with WPF/WPF 开发的一款即用即走的翻译、OCR工具 |
|
Established |
| 7 |
tisfeng/Easydict
一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎... |
|
Established |
| 8 |
zclucas/RMT
RMT (RuoMengTu) is a free, open-source macro tool built on AHKv2. Let the... |
|
Established |
| 9 |
readur/readur
Quick, painless, intuitive OCR platform written in Rust and TypeScript.... |
|
Established |
| 10 |
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis,... |
|
Established |
| 11 |
run-llama/llama-cloud-py
Python SDK for OCR and document parsing in the cloud with LlamaParse |
|
Established |
| 12 |
TheJoeFin/Text-Grab
Use OCR in Windows quickly and easily with Text Grab. With optional... |
|
Established |
| 13 |
docling-project/docling
Get your documents ready for gen AI |
|
Established |
| 14 |
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched |
|
Established |
| 15 |
RapidAI/RapidOCR
📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime,... |
|
Established |
| 16 |
bpwhelan/GameSentenceMiner
An immersion toolkit for learning Languages through games and other visual media. |
|
Established |
| 17 |
datalab-to/chandra
OCR model that handles complex tables, forms, handwriting with full layout. |
|
Established |
| 18 |
xushengfeng/eSearch
截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search ... |
|
Established |
| 19 |
run-llama/liteparse
A fast, helpful, and open-source document parser |
|
Established |
| 20 |
zai-org/GLM-OCR
GLM-OCR: Accurate × Fast × Comprehensive |
|
Established |
| 21 |
pytr-org/pytr
Use TradeRepublic in terminal and mass download all documents |
|
Established |
| 22 |
CCExtractor/ccextractor
CCExtractor - Official version maintained by the core team |
|
Established |
| 23 |
felipeall/resumeio-to-pdf
Download your resume from resume.io as PDF |
|
Established |
| 24 |
mittagessen/kraken
OCR engine for all the languages |
|
Established |
| 25 |
seanghay/sone
Declarative Canvas layout engine for JavaScript with advanced rich text support. |
|
Established |
| 26 |
ballerine-io/ballerine
Open-source infrastructure and data orchestration platform for risk decisioning |
|
Established |
| 27 |
thanhkeke97/RSTGameTranslation
🎮 Real-time Game Translation Tool | OCR + AI Translation | Windows Gaming |... |
|
Established |
| 28 |
hankei6km/gas-gocr2notion
Google Drive で OCR を行い、結果を Notion データベースへ送信する Google Apps Script ライブラリー。 |
|
Established |
| 29 |
formkiq/formkiq-core
Open-source document management platform leveraging AWS managed services.... |
|
Established |
| 30 |
Achno/gowall
A tool to convert a Wallpaper's color scheme / palette, OCR with VLM's... |
|
Established |
| 31 |
RapidAI/RapidDoc
A high-performance, open-source PDF data extraction tool. ... |
|
Established |
| 32 |
ArtifexSoftware/mupdf.js
JavaScript bindings for MuPDF |
|
Established |
| 33 |
meangrinch/MangaTranslator
Manga translation app powered by AI |
|
Established |
| 34 |
oomol-lab/pdf-craft
PDF craft can convert PDF files into various other formats. This project... |
|
Established |
| 35 |
TareHimself/manga-translator
A manga translator built with python |
|
Established |
| 36 |
shibing624/imgocr
Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB),... |
|
Established |
| 37 |
zibo-chen/rust-paddle-ocr
高性能OCR识别库,支持上百种语言,提供命令行、图形界面及C API多种调用方式,使用便捷高效。 High-performance OCR... |
|
Established |
| 38 |
dynobo/normcap
OCR powered screen-capture tool to capture information instead of images |
|
Established |
| 39 |
wikimedia/wikimedia-ocr
API wrapper enabling Wikisources to submit images for optical character recognition. |
|
Established |
| 40 |
ispras/dedoc
Dedoc is a library (service) for automate documents parsing and bringing to... |
|
Established |
| 41 |
uptonking/note4yaoo
daily notes |
|
Established |
| 42 |
scribeocr/scribeocr
Web interface for recognizing text, proofreading OCR, and creating... |
|
Established |
| 43 |
bzsanti/oxidizePdf
a PDF library for rust |
|
Established |
| 44 |
UB-Mannheim/escriptorium
Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim |
|
Established |
| 45 |
arvindrajan92/DTrOCR
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical... |
|
Established |
| 46 |
ecdye/macSubtitleOCR
Convert bitmap subtitles into SubRip format using the macOS Vision framework |
|
Established |
| 47 |
rtr46/meikipop
meikipop - universal japanese ocr popup dictionary for windows, linux and macos |
|
Established |
| 48 |
mindee/mindee-api-nodejs
Mindee API Helper Library for Node.js |
|
Established |
| 49 |
ocrbase-hq/ocrbase
📄 PDF ->.MD/.JSON API & SDK for PaddleOCR-VL with structured data... |
|
Established |
| 50 |
danpla/dpscreenocr
A program to recognize text on the screen |
|
Established |
| 51 |
marieai/marie-ai
Complex data extraction and orchestration framework designed for processing... |
|
Established |
| 52 |
GeiserX/paperless-telegram-bot
Manage Paperless-NGX documents entirely through Telegram - upload, search,... |
|
Established |
| 53 |
scribeocr/scribe.js
JavaScript OCR and text extraction for images and PDFs. |
|
Established |
| 54 |
Roots-Automation/GutenOCR
Open-source tools for training and evaluating Vision Language Models for OCR |
|
Established |
| 55 |
R0Wi-DEV/workflow_ocr
This is a Nextcloud Workflow App which enables you to process files via OCR... |
|
Established |
| 56 |
Purfview/InpaintDelogo
Advanced delogo plugin for AviSynth+ |
|
Established |
| 57 |
doo/scanbot-sdk-example-flutter
Easy-to-use Flutter document scanner and data extraction plugin |
|
Established |
| 58 |
ZANdewanai/Genshin-Impact-Rich-Presence
Genshin Impact Rich Presence |
|
Established |
| 59 |
NitishKumar-ai/PersonalLearningPro
PersonalLearningPro is an open‑source, AI-powered school learning platform... |
|
Established |
| 60 |
wxyhgk/retain-pdf
在保留版面、公式与结构的前提下进行 PDF 翻译,适用于科研与技术文档 |
|
Established |
| 61 |
wolfmanstout/screen-ocr
Easily perform OCR on portions of the screen, choosing from a selection of backends. |
|
Established |
| 62 |
OuterSpaceHobo/ScanLingua
Free open source chrome extension for immersive japanese, chinese and korean... |
|
Emerging |
| 63 |
dills122/MTG-Card-Analyzer
(🚧) Analyze images of MTG cards (Clean up in progress) |
|
Emerging |
| 64 |
SnapXL/SnapX
SnapX is a free, open-source, cross-platform tool that lets you capture or... |
|
Emerging |
| 65 |
workoss/boot
常用的工具或者服务 |
|
Emerging |
| 66 |
yomihon/yomihon
Free and open source manga reader for Android - now with OCR (text recognition) |
|
Emerging |
| 67 |
monchin/tablers
A blazingly fast PDF table extraction library with python API powered by Rust |
|
Emerging |
| 68 |
CarvingIT/smart-repository
Institutional Repository of Knowledge |
|
Emerging |
| 69 |
yash2974/Zenpark
🔐 Zenpark – Smart Parking Management System Zenpark is an AI-powered smart... |
|
Emerging |
| 70 |
ZingYao/autogo_scriptengine
AutoGo 脚本引擎扩展方案 - 为 AutoGo 提供 JavaScript 和 Lua 双引擎支持,包含 20+... |
|
Emerging |
| 71 |
zai-org/GLM-skills
Official skills for the GLM family of models. |
|
Emerging |
| 72 |
run-llama/llama-cloud-ts
Typescript SDK for OCR and document parsing in the cloud with LlamaParse |
|
Emerging |
| 73 |
thomaswantstobeaskeleton/BallonsTranslator-Pro
BallonsTranslatorPro — Community fork of BallonsTranslator.... |
|
Emerging |
| 74 |
nordie92/AoE4BO
Age of Empires 4 overlay to lern build orders |
|
Emerging |
| 75 |
veryfi/veryfi-nodejs
Node.js module for communicating with the Veryfi OCR API. || read:... |
|
Emerging |
| 76 |
openva/rs-video-processor
The video OCR processor for Richmond Sunlight. |
|
Emerging |
| 77 |
doo/scanbot-sdk-example-ios
Easy-to-use iOS document scanner and data extraction library for native iOS apps |
|
Emerging |
| 78 |
mindee/mindee-api-ruby
Mindee API Helper Library for Ruby |
|
Emerging |
| 79 |
Ronin-CK/QuickSnip
⚡ Lightweight Wayland OCR & Google Lens utility built with Quickshell. |
|
Emerging |
| 80 |
ieasybooks/tahweel.rb
تحويل ملفات PDF إلى TXT و DOCX و JSON |
|
Emerging |
| 81 |
jasperdevs/yoink
Free, open-source screenshot tool. Capture, annotate, and share with a single hotkey. |
|
Emerging |
| 82 |
firecrawl/pdf-inspector
Fast Rust library for PDF inspection, classification, and text extraction.... |
|
Emerging |
| 83 |
Cilda/UmaUmaChecker
ウマ娘で選択肢のステータスを画像認識によって可視化する |
|
Emerging |
| 84 |
NirKli/WattBot
Smart OCR app for reading electricity meters from images |
|
Emerging |
| 85 |
simpledms/simpledms
Document management for small businesses. |
|
Emerging |
| 86 |
doo/scanbot-sdk-maui-example
Easy-to-use .NET MAUI document scanner and data extraction library |
|
Emerging |
| 87 |
regulaforensics/DocumentReader-web-js-client
Regula Document Reader web API js client for the browser and node.js based on axios |
|
Emerging |
| 88 |
prabesh704/ImageOCR
🖼️ Build an offline image search engine that indexes images and text for... |
|
Emerging |
| 89 |
SeseydOw/Captcha-Bypass-Tool
pentesting tool to bypass captcha Steam, Gmail, Instagram, Facebook,... |
|
Emerging |
| 90 |
Sixt/tensorlake-go
Go SDK for the Tensorlake API: document intelligence, cloud sandboxes, PTY... |
|
Emerging |
| 91 |
Agions/HardSubX
A professional video hard subtitle extraction tool with OCR. Extract... |
|
Emerging |
| 92 |
veryfi/veryfi-lens-react-native-demo
Example Demo App about how to use our react native wrapper |
|
Emerging |
| 93 |
Akronae/windows_media_ocr_cli
🔎 OCR CLI that outputs structured data with bounding rects using local... |
|
Emerging |
| 94 |
PELock/Dekoder-AZTEC-2D-JavaScript
Dekoder Kodu AZTEC 2D z Dowodu Rejestracyjnego dla JavaScript i Node.js (Web... |
|
Emerging |
| 95 |
1003129155/jietuba
A screenshot ocr and clipboard manager, available only on Windows 10+. ... |
|
Emerging |
| 96 |
abishekgiri/boring-ai
Self-hosted AI expense manager that turns receipts into structured data and... |
|
Emerging |
| 97 |
clark-labs-inc/pdfsink-rs
Fast pure-Rust PDF extraction library and CLI by Clark Labs Inc. — 10–50x... |
|
Emerging |
| 98 |
narzaut/translator
Translator Overlay |
|
Emerging |
| 99 |
hcmhcs/screenTranslate
Screen translation app for macOS — select any area, get instant translation.... |
|
Emerging |
| 100 |
nonwill/nonwill.github.io
CDN data of www.autoptr.top. |
|
Emerging |
| 101 |
dsebastien/obsidian-transcriber
Transcriber converts images in your Obsidian vault to Markdown using Ollama... |
|
Emerging |
| 102 |
TroniePh/SmartMacroAI
Advanced Windows & Web Automation Tool. Features: Record Clicks/Scrolls,... |
|
Emerging |
| 103 |
hiimmuc/OCR-Handwritten-equations-solver
Handwritten equation solver: OCR, DNN, Flask |
|
Emerging |
| 104 |
neanes/byzantine-chant-ocr
An OCR toolset for Byzantine chant notation |
|
Emerging |
| 105 |
doo/scanbot-sdk-example-capacitor-ionic
Easy-to-use Ionic and Capacitor document scanner and data extraction library... |
|
Emerging |
| 106 |
JimEverest/fastshot
Fastshot is a GenAI powered screenshot and annotation tool designed to... |
|
Emerging |
| 107 |
veryfi/veryfi-lens-receipts-android-demo
Example codes about how to use Veryfi Lens SDKs |
|
Emerging |
| 108 |
Ganymede-Bio/gridgulp
Automatically detect and extract tables from Excel, CSV, and text files. |
|
Emerging |
| 109 |
veryfi/veryfi-lens-long-receipts-android-demo
Example codes about how to use Veryfi Lens SDKs |
|
Emerging |
| 110 |
0pen-Sourcer/Complete-Utility-App
A versatile desktop app offering wide range of tools for media downloading,... |
|
Emerging |
| 111 |
abbasZaidi110/n8n-Parse-Invoices-Documents-with-Gemini-AI-OCR-and-Google-Sheets-Integration
📄 Streamline invoice processing by integrating n8n with Gemini AI OCR and... |
|
Emerging |
| 112 |
KyleDerZweite/spellbook
Self-hosted TCG collection manager with mobile scanning, OCR recognition,... |
|
Emerging |
| 113 |
rampaa/Tsukikage
Hover-based output sender for OwOCR results |
|
Emerging |
| 114 |
R0mb0/PDF_accessibility_fixer
Client-side tool to check and fix PDF accessibility. Analyze PDFs for text... |
|
Emerging |
| 115 |
veryfi/veryfi-lens-headless-receipts-android-demo
Example codes about how to use Veryfi Lens SDKs |
|
Emerging |
| 116 |
yelog/SnapTraTranslator
一款 macos 离线快速翻译软件 |
|
Emerging |
| 117 |
doerfli/reeper
Recipe management web application - parse recipes from image using AI based OCR |
|
Emerging |
| 118 |
victorfu/snap-tray
SnapTray is a tray-native screenshot and recording tool for macOS and... |
|
Emerging |
| 119 |
veryfi/veryfi-lens-barcodes-android-demo
Example about how to user Veryfi Lens for Barcodes |
|
Emerging |
| 120 |
cyanyux/pdf-ocr
Self-hosted GPU-accelerated OCR web app — convert scanned PDFs to searchable... |
|
Emerging |
| 121 |
iLejuxepWaduzd/structured-data-extractor
🛠️ Extract structured data from messy texts using Chain-of-Thought prompting... |
|
Emerging |
| 122 |
nxoti1/POINTS-Reader-OCR
🖥️ Extract text from images easily with POINTS-Reader OCR, a high-accuracy... |
|
Emerging |
| 123 |
rdantassilva/pdf2ocr
A CLI tool to apply OCR on PDF files and export to multiple formats |
|
Emerging |
| 124 |
DCC-BS/docling-glm-ocr
A docling plugin to integrate a remote hosted GLM-OCR OCR model into docling |
|
Experimental |
| 125 |
silenthillzeroq-code/clipnova
Windows clipboard manager that turns clipboard history into notes,... |
|
Experimental |
| 126 |
TanyaMushonga/skymarshal-api
Intelligent aerial traffic monitoring system featuring real-time vehicle... |
|
Experimental |
| 127 |
Sabastincruzz/Tools_DeepSeekOCR
🖥️ Deploy DeepSeek-OCR for Optical Character Recognition directly from... |
|
Experimental |
| 128 |
XUNRANA/LNU-LibSeat-Automation
🎯 辽宁大学图书馆自习室座位自动预约工具 | GUI 双击即用 · 验证码 OCR · 多账号并发 · 精准卡点 · 邮件通知 | Python + Selenium |
|
Experimental |
| 129 |
r-uben/socr
Multi-engine OCR with cascading fallback, quality audit, and figure extraction |
|
Experimental |
| 130 |
dkorbelainen/sniptext
screen text extractor with OCR and spell correction |
|
Experimental |
| 131 |
sw-willie-wu/MediaTranX
AI-powered local multimedia toolkit — speech-to-text, translation,... |
|
Experimental |
| 132 |
nikazzio/universal-iiif-studio
Modular tool for Digital Humanities: IIIF downloader + Studio environment.... |
|
Experimental |
| 133 |
JordanCoin/openfoia
Local-first FOIA automation with AI-powered document analysis. Your data... |
|
Experimental |
| 134 |
hyperpolymath/presswerk
High-assurance local print router/server — Dioxus 0.7 mobile app with... |
|
Experimental |
| 135 |
Ajatt-Tools/lancet
OCR application for reading manga in Japanese, made for AJATTers 🇯🇵 . |
|
Experimental |
| 136 |
ieasybooks/tahweel-tauri
تحويل ملفات PDF إلى TXT و DOCX و JSON |
|
Experimental |
| 137 |
dimitar-radenkov/SnippingTool
Lightweight WPF screen capture, annotation, OCR and screen recorder for... |
|
Experimental |
| 138 |
MonDevHub/monocr
The MonOCR Platform: Academic-grade OCR for the Mon language.... |
|
Experimental |
| 139 |
lzhgus/Capso
Open-source screenshot and screen recording for macOS. The free, native... |
|
Experimental |
| 140 |
arikusi/sahaf
Local PDF & EPUB to Markdown converter with OCR — runs on your hardware, no... |
|
Experimental |
| 141 |
qyinm/duckdocs
macOS app that parses PDF and Word documents into linked markdown packages using AI. |
|
Experimental |
| 142 |
run-llama/ParseBench
ParseBench - A Document Parsing Benchmark for AI Agents |
|
Experimental |
| 143 |
PaoloESAN/LN-Translator-Mobile
An Android app to translate Japanese Light Novels directly from your screen... |
|
Experimental |
| 144 |
XMuli/QuickUtilitiesSuite
Curated window utilities to boost your workflow (Quick ColorPicke, Quick... |
|
Experimental |
| 145 |
misraj-ai/kawn-python
The official Python SDK for kawn.ai by Misraj AI. High-performance Arabic... |
|
Experimental |
| 146 |
Lianye-Scythe/OCRTranslator
Portable Windows OCR / AI desktop tool with screenshot, selected-text, and... |
|
Experimental |
| 147 |
Alan5168/fapiao-clipper
发票夹子 - 本地大模型驱动的发票自动识别与报销管理工具(适配中国发票) |
|
Experimental |
| 148 |
oskarasadullin/speechma-api
Free, unlimited text-to-speech API with 486+ AI voices — unofficial Python... |
|
Experimental |
| 149 |
AntoC-dev/Recipedia
📱 A React Native recipe management app with OCR scanning, shopping lists,... |
|
Experimental |
| 150 |
veryfi/veryfi-lens-ocr-android-demo
Veryfi Lens OCR to read codes, numbers and short text |
|
Experimental |
| 151 |
duck-ai-yy/ex-ai
前任AI — 前任.skill 的零门槛替代。上传微信聊天记录,生成前任AI数字分身 Prompt。不用安装,打开网页就能用。 |
|
Experimental |
| 152 |
TimLChan/steakcam
steakcam - Get notified when there is a 72oz steak challenge happening at... |
|
Experimental |
| 153 |
EdgeTypE/OldTurkicOCR
Pure-Rust Old Turkic (Gokturk) OCR engine powered by ResNet |
|
Experimental |
| 154 |
DevadattaP/math_to_latex
Converting handwritten mathematical expressions to LaTeX using... |
|
Experimental |