lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Combines a Vision Transformer encoder with a Transformer decoder to predict LaTeX from equation images, incorporating an auxiliary resolution-prediction network that automatically preprocesses input to match training data characteristics. Supports multiple deployment modes including command-line, desktop GUI with screenshot capture, REST API, and direct Python integration, with MathJax rendering for real-time validation. Includes end-to-end training infrastructure with custom tokenizer generation and dataset preparation from rendered LaTeX via XeLaTeX and ImageMagick.
16,245 stars and 10,850 monthly downloads. No commits in the last 6 months. Available on PyPI.
Stars
16,245
Forks
1,289
Language
Python
License
MIT
Category
Last pushed
Jan 18, 2025
Monthly downloads
10,850
Commits (30d)
0
Dependencies
16
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/lukas-blecher/LaTeX-OCR"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
mayocream/koharu
ML-powered manga translator, written in Rust.
mindspore-lab/mindocr
A toolbox of ocr models and algorithms based on MindSpore
zyddnys/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)