OCR Document Extraction Transformer Models
Tools for extracting text and structured data from images, PDFs, and documents using transformer-based OCR models. Does NOT include general document analysis, LLM-based summarization, or post-extraction processing (summarization/Q&A).
There are 43 ocr document extraction models tracked. 4 score above 50 (established tier). The highest-rated is kha-white/manga-ocr at 64/100 with 2,582 stars and 17,983 monthly downloads. 1 of the top 10 are actively maintained.
Get all 43 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=ocr-document-extraction&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
kha-white/manga-ocr
Optical character recognition for Japanese text, with the main focus being... |
|
Established |
| 2 |
clusterzx/paperless-ai
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama,... |
|
Established |
| 3 |
bytefer/ollama-ocr
Implementing OCR with a local visual model run by ollama. |
|
Established |
| 4 |
alephpi/Texo-web
The web application for Texo, a minimalist SOTA LaTeX OCR model which... |
|
Established |
| 5 |
alephpi/Texo
A minimalist SOTA LaTeX OCR model with only 20M parameters, running in... |
|
Emerging |
| 6 |
Dartvauder/NeuroSandboxWebUI
(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image,... |
|
Emerging |
| 7 |
FreeOCR-AI/layoutreader
A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. |
|
Emerging |
| 8 |
samestrin/llm-pdf-ocr-api
A Python-based REST API for PDF OCR using AI models with PyTorch and... |
|
Emerging |
| 9 |
neosantara-xyz/glm-ocr-inference
Fast and lightweight GLM-OCR inference on Modal with an OpenAI-compatible... |
|
Emerging |
| 10 |
JonSnow1807/Medical-Prescription-OCR
OCR system for handwritten medical prescriptions using Donut transformer and... |
|
Emerging |
| 11 |
CYFARE/PDXTRACT
Extract From PDF's Using Ollama Local LLM |
|
Experimental |
| 12 |
lucky-verma/SaastIE
Document understanding system using Donut transformer architecture |
|
Experimental |
| 13 |
sitammeur/gliner-litserve
Leverage ModernGLiNER's capabilities using LitServe. |
|
Experimental |
| 14 |
Dartvauder/NeuroTrainerWebUI
(Windows/Linux) Local WebUI for finetuning, evaluation and generation of... |
|
Experimental |
| 15 |
resetpaid/lumina
Perform passive domain reconnaissance using public data sources without... |
|
Experimental |
| 16 |
cristi4nhdz/osint-threat-intel-pipeline
Multi-source OSINT pipeline that ingests threat feeds, enriches entities... |
|
Experimental |
| 17 |
extractable-hoodedsheldrake431/deepseek_ocr_app
🖼️ Streamline your document processing with DeepSeek OCR, a modern app... |
|
Experimental |
| 18 |
Metedout-biographer66/dots.ocr-fix-demo
🖼️ Upload images to experience accurate multilingual OCR results with the... |
|
Experimental |
| 19 |
AleNard89/py-pytorch-invoice
Automated invoice data extraction using LayoutLMv3 (PyTorch) with PyQt6... |
|
Experimental |
| 20 |
muhammad-fiaz/EMSUGI
EMSUGI is a future prediction & analysis project on various factor like... |
|
Experimental |
| 21 |
Quotify-Bot/quotify-frontend
AI-powered inspirational quote generator |
|
Experimental |
| 22 |
zainafxal/DisasterInsight-AI
DisasterInsight AI: A multimodal AI platform orchestrating 5 models (Vision,... |
|
Experimental |
| 23 |
KadirCanCelik/Handwriting-to-digital
Handwriting to text conversion using line segmentation and OCR techniques |
|
Experimental |
| 24 |
SemanticWave-Hoyeon/NavtexRecovery
AI-powered restoration system for damaged NAVTEX (NAVigational TEleX)... |
|
Experimental |
| 25 |
bcastelino/ocr-text-vision-pro
AI-powered OCR application using Free OpenRouter Vision Models for advanced... |
|
Experimental |
| 26 |
Kovelja009/handwriting-recognition
Benchmark of different network architectures for handwritten text recognition. |
|
Experimental |
| 27 |
inuwamobarak/nougat
Nougat is a Meta AI's revolutionary OCR model designed to transcribe... |
|
Experimental |
| 28 |
koesan/Manga_Comic_Colorization_and_Translation_v1
AI-powered manga and comic translator using EasyOCR and Hugging Face... |
|
Experimental |
| 29 |
PRITHIVSAKTHIUR/dots.ocr-fix-demo
This Gradio application demonstrates the capabilities of the "dots.ocr"... |
|
Experimental |
| 30 |
ToluClassics/LowResourceOCR
This work is an adaptation of CNN+Transformer architecture to training text... |
|
Experimental |
| 31 |
kalimx03/intelli-credit
AI-powered corporate credit decisioning engine for Indian banking. Ingests... |
|
Experimental |
| 32 |
arora-r/gradio-example
This repository is an example of dockerizing a Gradio application which uses... |
|
Experimental |
| 33 |
Eduardo-PRg/NLM2Img
🖼️ Combine multi-page PDFs into a seamless image and add custom stamps, all... |
|
Experimental |
| 34 |
Mustapha-AJEGHRIR/arabic_calligraphy
This is a repo containing our code for Arabic calligraphy style detection... |
|
Experimental |
| 35 |
sorcero/ingestum
Read-only mirror of https://gitlab.com/sorcero/community/ingestum |
|
Experimental |
| 36 |
UP2040499/auto-osint-v
An automated tool for Validating OSINT. This forms part of the final step of... |
|
Experimental |
| 37 |
ramyadjoshi/IntelliDoc-AI-Powered-Intelligent-Document-Analysis-System
IntelliDoc is an intelligent document understanding system that helps users... |
|
Experimental |
| 38 |
SD7Campeon/Gemma3_OCR_Text_Extractor_LLM
Gemma-3 OCR exemplifies the confluence of abstruse computer vision and... |
|
Experimental |
| 39 |
Mohammed20201991/OCR_HU_Tra2022
HTR Transformer for Hungarian Language |
|
Experimental |
| 40 |
Parth844/AI_pdf_to_Epub
AI-powered PDF to EPUB conversion engine with LLM-based chapter detection... |
|
Experimental |
| 41 |
heyxalok/Optiv-AI-Project
AI-powered cybersecurity automation system that cleanses sensitive data from... |
|
Experimental |
| 42 |
stelaras36/OCRfixer
Web & CLI tool to fix noisy OCR text using a fine-tuned T5 model |
|
Experimental |
| 43 |
sitammeur/readerlm-litserve
Leverage Reader-LM's capabilities using LitServe. |
|
Experimental |