OCR Document Extraction Transformer Models

Tools for extracting text and structured data from images, PDFs, and documents using transformer-based OCR models. Does NOT include general document analysis, LLM-based summarization, or post-extraction processing (summarization/Q&A).

There are 43 ocr document extraction models tracked. 4 score above 50 (established tier). The highest-rated is kha-white/manga-ocr at 64/100 with 2,582 stars and 17,983 monthly downloads. 1 of the top 10 are actively maintained.

Get all 43 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=ocr-document-extraction&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 kha-white/manga-ocr

Optical character recognition for Japanese text, with the main focus being...

64
Established
2 clusterzx/paperless-ai

An automated document analyzer for Paperless-ngx using OpenAI API, Ollama,...

57
Established
3 bytefer/ollama-ocr

Implementing OCR with a local visual model run by ollama.

54
Established
4 alephpi/Texo-web

The web application for Texo, a minimalist SOTA LaTeX OCR model which...

51
Established
5 alephpi/Texo

A minimalist SOTA LaTeX OCR model with only 20M parameters, running in...

49
Emerging
6 Dartvauder/NeuroSandboxWebUI

(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image,...

42
Emerging
7 FreeOCR-AI/layoutreader

A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.

42
Emerging
8 samestrin/llm-pdf-ocr-api

A Python-based REST API for PDF OCR using AI models with PyTorch and...

40
Emerging
9 neosantara-xyz/glm-ocr-inference

Fast and lightweight GLM-OCR inference on Modal with an OpenAI-compatible...

34
Emerging
10 JonSnow1807/Medical-Prescription-OCR

OCR system for handwritten medical prescriptions using Donut transformer and...

31
Emerging
11 CYFARE/PDXTRACT

Extract From PDF's Using Ollama Local LLM

26
Experimental
12 lucky-verma/SaastIE

Document understanding system using Donut transformer architecture

23
Experimental
13 sitammeur/gliner-litserve

Leverage ModernGLiNER's capabilities using LitServe.

23
Experimental
14 Dartvauder/NeuroTrainerWebUI

(Windows/Linux) Local WebUI for finetuning, evaluation and generation of...

22
Experimental
15 resetpaid/lumina

Perform passive domain reconnaissance using public data sources without...

22
Experimental
16 cristi4nhdz/osint-threat-intel-pipeline

Multi-source OSINT pipeline that ingests threat feeds, enriches entities...

22
Experimental
17 extractable-hoodedsheldrake431/deepseek_ocr_app

🖼️ Streamline your document processing with DeepSeek OCR, a modern app...

22
Experimental
18 Metedout-biographer66/dots.ocr-fix-demo

🖼️ Upload images to experience accurate multilingual OCR results with the...

22
Experimental
19 AleNard89/py-pytorch-invoice

Automated invoice data extraction using LayoutLMv3 (PyTorch) with PyQt6...

22
Experimental
20 muhammad-fiaz/EMSUGI

EMSUGI is a future prediction & analysis project on various factor like...

20
Experimental
21 Quotify-Bot/quotify-frontend

AI-powered inspirational quote generator

20
Experimental
22 zainafxal/DisasterInsight-AI

DisasterInsight AI: A multimodal AI platform orchestrating 5 models (Vision,...

19
Experimental
23 KadirCanCelik/Handwriting-to-digital

Handwriting to text conversion using line segmentation and OCR techniques

19
Experimental
24 SemanticWave-Hoyeon/NavtexRecovery

AI-powered restoration system for damaged NAVTEX (NAVigational TEleX)...

19
Experimental
25 bcastelino/ocr-text-vision-pro

AI-powered OCR application using Free OpenRouter Vision Models for advanced...

19
Experimental
26 Kovelja009/handwriting-recognition

Benchmark of different network architectures for handwritten text recognition.

18
Experimental
27 inuwamobarak/nougat

Nougat is a Meta AI's revolutionary OCR model designed to transcribe...

18
Experimental
28 koesan/Manga_Comic_Colorization_and_Translation_v1

AI-powered manga and comic translator using EasyOCR and Hugging Face...

18
Experimental
29 PRITHIVSAKTHIUR/dots.ocr-fix-demo

This Gradio application demonstrates the capabilities of the "dots.ocr"...

17
Experimental
30 ToluClassics/LowResourceOCR

This work is an adaptation of CNN+Transformer architecture to training text...

17
Experimental
31 kalimx03/intelli-credit

AI-powered corporate credit decisioning engine for Indian banking. Ingests...

15
Experimental
32 arora-r/gradio-example

This repository is an example of dockerizing a Gradio application which uses...

14
Experimental
33 Eduardo-PRg/NLM2Img

🖼️ Combine multi-page PDFs into a seamless image and add custom stamps, all...

14
Experimental
34 Mustapha-AJEGHRIR/arabic_calligraphy

This is a repo containing our code for Arabic calligraphy style detection...

13
Experimental
35 sorcero/ingestum

Read-only mirror of https://gitlab.com/sorcero/community/ingestum

13
Experimental
36 UP2040499/auto-osint-v

An automated tool for Validating OSINT. This forms part of the final step of...

13
Experimental
37 ramyadjoshi/IntelliDoc-AI-Powered-Intelligent-Document-Analysis-System

IntelliDoc is an intelligent document understanding system that helps users...

12
Experimental
38 SD7Campeon/Gemma3_OCR_Text_Extractor_LLM

Gemma-3 OCR exemplifies the confluence of abstruse computer vision and...

12
Experimental
39 Mohammed20201991/OCR_HU_Tra2022

HTR Transformer for Hungarian Language

12
Experimental
40 Parth844/AI_pdf_to_Epub

AI-powered PDF to EPUB conversion engine with LLM-based chapter detection...

11
Experimental
41 heyxalok/Optiv-AI-Project

AI-powered cybersecurity automation system that cleanses sensitive data from...

11
Experimental
42 stelaras36/OCRfixer

Web & CLI tool to fix noisy OCR text using a fine-tuned T5 model

11
Experimental
43 sitammeur/readerlm-litserve

Leverage Reader-LM's capabilities using LitServe.

11
Experimental