LaTeX OCR Tools ML Frameworks

Tools and models for converting images of mathematical equations, formulas, and technical diagrams into LaTeX code or markup. Includes handwritten and printed formula recognition, circuit diagram conversion, and speech-to-LaTeX translation. Does NOT include general document OCR, non-mathematical content recognition, or LaTeX editing/compilation tools.

There are 54 latex ocr tools frameworks tracked. 1 score above 70 (verified tier). The highest-rated is naptha/tesseract.js at 78/100 with 37,920 stars and 3,951,624 monthly downloads. 4 of the top 10 are actively maintained.

Get all 54 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=latex-ocr-tools&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

78
Verified
2 open-mmlab/mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

66
Established
3 lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

63
Established
4 zyddnys/manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

62
Established
5 tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

61
Established
6 mayocream/koharu

ML-powered manga translator, written in Rust.

58
Established
7 dmMaze/BallonsTranslator

深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga...

57
Established
8 mindspore-lab/mindocr

A toolbox of ocr models and algorithms based on MindSpore

56
Established
9 ogkalu2/comic-translate

Desktop app for automatically translating comics - BDs, Manga, Manhwa,...

50
Established
10 VoxelCubes/PanelCleaner

An AI-powered tool to clean manga panels.

47
Emerging
11 microsoft/OCR-Form-Tools

A set of tools to use in Microsoft Azure Form Recognizer and OCR services.

46
Emerging
12 SakuraMathcraft/LaTeXSnipper

A powerful LaTeX formula recognition tool powered by pix2tex and pix2text. ...

45
Emerging
13 LinXueyuanStdio/LaTeX_OCR_PRO

:art: 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro,...

43
Emerging
14 LinXueyuanStdio/LaTeX_OCR

:gem: 数学公式识别 Math Formula OCR

43
Emerging
15 kingyiusuen/image-to-latex

Convert images of LaTex math equations into LaTex code.

41
Emerging
16 AlibabaResearch/AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced...

41
Emerging
17 dmMaze/comic-text-detector

Manga&Comic text detection

40
Emerging
18 KUR-creative/SickZil-Machine

Manga/Comics Translation Helper Tool

40
Emerging
19 RQLuo/MixTeX-Latex-OCR

MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient...

38
Emerging
20 untrix/im2latex

Solution to im2latex request for research of openai

37
Emerging
21 YongWookHa/swin-transformer-ocr

swin-transformer custom for OCR

37
Emerging
22 VikParuchuri/texify

Math OCR model that outputs LaTeX and markdown

37
Emerging
23 bensonruan/Tesseract-OCR

Tesseract.js OCR

37
Emerging
24 fh2019ustc/Awesome-Document-Image-Rectification

A comprehensive list of awesome document image rectification papers.

36
Emerging
25 jtl1207/comic-translation

基于深度学习的漫画翻译辅助工具,包含翻译、朗读、图像去字、自动嵌字功能。 目的是帮助非专业汉化人员完成更简单,快速的翻译任务。

35
Emerging
26 juvian/Manga-Text-Segmentation

Segmentation of text in manga images

35
Emerging
27 ritheshkumar95/im2latex-tensorflow

Tensorflow implementation of the HarvardNLP paper - What You Get Is What You...

34
Emerging
28 XJF2332/GOT-OCR-2-GUI

GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能

33
Emerging
29 stacksapien/react-tesseract-ocr

Tesseract OCR implementation in React JS

33
Emerging
30 Aeonss/BubbleBlaster

Bubble Blaster removes text from speech bubbles in mangas/manhwas, made for...

33
Emerging
31 vkit-x/vkit

Boosting Document Intelligence

32
Emerging
32 sitammeur/TextSnap

TextSnap: Demo for Florence 2 model used in OCR tasks to extract and...

28
Experimental
33 JeffersonQin/YuzuMarker

🍋 [WIP] Manga Translation Tool

27
Experimental
34 lizhaoliu-Lec/CCSE

Instance Segmentation for Chinese Character Stroke Extraction, Datasets and...

26
Experimental
35 CatUnderTheLeaf/musicScanner

Optical Music Recognition using Deep Learning

24
Experimental
36 jackvial/tuatara

Tuatara: Deep Learning OCR Engine

24
Experimental
37 jasmine-dragons/VoTeX

Speech to LaTeX translator. LA Hacks 3rd place overall and Best Hack Using...

24
Experimental
38 tuanio/image2latex

Image to Latex using Encoder-Decoder architecture

24
Experimental
39 CrispenGari/CR

📸📷 Character Recognition (CR) is an AI tool for performing optic character...

23
Experimental
40 nakamura196/koten-ocr-ios

KotenOCR — iOS app for OCR of classical and modern Japanese texts using NDL...

23
Experimental
41 endx707/tesseract

🖥️ Perform optical character recognition with Tesseract, an open-source tool...

22
Experimental
42 Muiz20/macula

Detect and correct OCR errors directly in the browser using a lightweight,...

22
Experimental
43 tony-xlh/SynthMRZ

Code for generating synthetic MRZ images

21
Experimental
44 abhijoshi03/system-ocr

This repository offers a simple OCR library that leverages system APIs like...

20
Experimental
45 rn-snehapriya/Automatic-Note-Taking-From-Video-Using-Tesseract-OCR

Text from the video is extracted and saved into a .docx file in the form of notes.

20
Experimental
46 chencxt/MoreMTQE

更多更易用的机器翻译质量评估(Machine Translation Quality Estimation)方案

19
Experimental
47 AdelRizq/Orchestra

Orchestra is a sheet music reader (optical music recognition (OMR) system)...

19
Experimental
48 olibridge01/TeXOCR

Optical Character Recognition (OCR) model for Image-to-LaTeX conversion

17
Experimental
49 Makena123456/Paper-Comicizer

📚 Transform academic PDFs into engaging Doraemon comics for easier...

15
Experimental
50 RQLuo/MixTeX-DataHub

LaTeXDataHub is an open-source platform dedicated to the sharing and...

14
Experimental
51 SlavaKuzkinHackathon/ScanTire-AI-Architecture

Technical architecture and ML pipeline overview for the ScanTire.com OCR &...

14
Experimental
52 gnurt2041/MangaOCR

A lightweight OCR model for Japanese text, especially in Manga

14
Experimental
53 Aadv1k/OctetOCR

Octet is an exploratory OCR or text recognition library to prepare and train...

12
Experimental
54 Carath/TeXdrawer

Small tool for handwritten LaTeX symbols recognition.

11
Experimental