LaTeX OCR Tools ML Frameworks
Tools and models for converting images of mathematical equations, formulas, and technical diagrams into LaTeX code or markup. Includes handwritten and printed formula recognition, circuit diagram conversion, and speech-to-LaTeX translation. Does NOT include general document OCR, non-mathematical content recognition, or LaTeX editing/compilation tools.
There are 54 latex ocr tools frameworks tracked. 1 score above 70 (verified tier). The highest-rated is naptha/tesseract.js at 78/100 with 37,920 stars and 3,951,624 monthly downloads. 4 of the top 10 are actively maintained.
Get all 54 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=latex-ocr-tools&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥 |
|
Verified |
| 2 |
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox |
|
Established |
| 3 |
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code. |
|
Established |
| 4 |
zyddnys/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working) |
|
Established |
| 5 |
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository) |
|
Established |
| 6 |
mayocream/koharu
ML-powered manga translator, written in Rust. |
|
Established |
| 7 |
dmMaze/BallonsTranslator
深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga... |
|
Established |
| 8 |
mindspore-lab/mindocr
A toolbox of ocr models and algorithms based on MindSpore |
|
Established |
| 9 |
ogkalu2/comic-translate
Desktop app for automatically translating comics - BDs, Manga, Manhwa,... |
|
Established |
| 10 |
VoxelCubes/PanelCleaner
An AI-powered tool to clean manga panels. |
|
Emerging |
| 11 |
microsoft/OCR-Form-Tools
A set of tools to use in Microsoft Azure Form Recognizer and OCR services. |
|
Emerging |
| 12 |
SakuraMathcraft/LaTeXSnipper
A powerful LaTeX formula recognition tool powered by pix2tex and pix2text. ... |
|
Emerging |
| 13 |
LinXueyuanStdio/LaTeX_OCR_PRO
:art: 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro,... |
|
Emerging |
| 14 |
LinXueyuanStdio/LaTeX_OCR
:gem: 数学公式识别 Math Formula OCR |
|
Emerging |
| 15 |
kingyiusuen/image-to-latex
Convert images of LaTex math equations into LaTex code. |
|
Emerging |
| 16 |
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced... |
|
Emerging |
| 17 |
dmMaze/comic-text-detector
Manga&Comic text detection |
|
Emerging |
| 18 |
KUR-creative/SickZil-Machine
Manga/Comics Translation Helper Tool |
|
Emerging |
| 19 |
RQLuo/MixTeX-Latex-OCR
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient... |
|
Emerging |
| 20 |
untrix/im2latex
Solution to im2latex request for research of openai |
|
Emerging |
| 21 |
YongWookHa/swin-transformer-ocr
swin-transformer custom for OCR |
|
Emerging |
| 22 |
VikParuchuri/texify
Math OCR model that outputs LaTeX and markdown |
|
Emerging |
| 23 |
bensonruan/Tesseract-OCR
Tesseract.js OCR |
|
Emerging |
| 24 |
fh2019ustc/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers. |
|
Emerging |
| 25 |
jtl1207/comic-translation
基于深度学习的漫画翻译辅助工具,包含翻译、朗读、图像去字、自动嵌字功能。 目的是帮助非专业汉化人员完成更简单,快速的翻译任务。 |
|
Emerging |
| 26 |
juvian/Manga-Text-Segmentation
Segmentation of text in manga images |
|
Emerging |
| 27 |
ritheshkumar95/im2latex-tensorflow
Tensorflow implementation of the HarvardNLP paper - What You Get Is What You... |
|
Emerging |
| 28 |
XJF2332/GOT-OCR-2-GUI
GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能 |
|
Emerging |
| 29 |
stacksapien/react-tesseract-ocr
Tesseract OCR implementation in React JS |
|
Emerging |
| 30 |
Aeonss/BubbleBlaster
Bubble Blaster removes text from speech bubbles in mangas/manhwas, made for... |
|
Emerging |
| 31 |
vkit-x/vkit
Boosting Document Intelligence |
|
Emerging |
| 32 |
sitammeur/TextSnap
TextSnap: Demo for Florence 2 model used in OCR tasks to extract and... |
|
Experimental |
| 33 |
JeffersonQin/YuzuMarker
🍋 [WIP] Manga Translation Tool |
|
Experimental |
| 34 |
lizhaoliu-Lec/CCSE
Instance Segmentation for Chinese Character Stroke Extraction, Datasets and... |
|
Experimental |
| 35 |
CatUnderTheLeaf/musicScanner
Optical Music Recognition using Deep Learning |
|
Experimental |
| 36 |
jackvial/tuatara
Tuatara: Deep Learning OCR Engine |
|
Experimental |
| 37 |
jasmine-dragons/VoTeX
Speech to LaTeX translator. LA Hacks 3rd place overall and Best Hack Using... |
|
Experimental |
| 38 |
tuanio/image2latex
Image to Latex using Encoder-Decoder architecture |
|
Experimental |
| 39 |
CrispenGari/CR
📸📷 Character Recognition (CR) is an AI tool for performing optic character... |
|
Experimental |
| 40 |
nakamura196/koten-ocr-ios
KotenOCR — iOS app for OCR of classical and modern Japanese texts using NDL... |
|
Experimental |
| 41 |
endx707/tesseract
🖥️ Perform optical character recognition with Tesseract, an open-source tool... |
|
Experimental |
| 42 |
Muiz20/macula
Detect and correct OCR errors directly in the browser using a lightweight,... |
|
Experimental |
| 43 |
tony-xlh/SynthMRZ
Code for generating synthetic MRZ images |
|
Experimental |
| 44 |
abhijoshi03/system-ocr
This repository offers a simple OCR library that leverages system APIs like... |
|
Experimental |
| 45 |
rn-snehapriya/Automatic-Note-Taking-From-Video-Using-Tesseract-OCR
Text from the video is extracted and saved into a .docx file in the form of notes. |
|
Experimental |
| 46 |
chencxt/MoreMTQE
更多更易用的机器翻译质量评估(Machine Translation Quality Estimation)方案 |
|
Experimental |
| 47 |
AdelRizq/Orchestra
Orchestra is a sheet music reader (optical music recognition (OMR) system)... |
|
Experimental |
| 48 |
olibridge01/TeXOCR
Optical Character Recognition (OCR) model for Image-to-LaTeX conversion |
|
Experimental |
| 49 |
Makena123456/Paper-Comicizer
📚 Transform academic PDFs into engaging Doraemon comics for easier... |
|
Experimental |
| 50 |
RQLuo/MixTeX-DataHub
LaTeXDataHub is an open-source platform dedicated to the sharing and... |
|
Experimental |
| 51 |
SlavaKuzkinHackathon/ScanTire-AI-Architecture
Technical architecture and ML pipeline overview for the ScanTire.com OCR &... |
|
Experimental |
| 52 |
gnurt2041/MangaOCR
A lightweight OCR model for Japanese text, especially in Manga |
|
Experimental |
| 53 |
Aadv1k/OctetOCR
Octet is an exploratory OCR or text recognition library to prepare and train... |
|
Experimental |
| 54 |
Carath/TeXdrawer
Small tool for handwritten LaTeX symbols recognition. |
|
Experimental |