lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

/ 100

Established

Combines a Vision Transformer encoder with a Transformer decoder to predict LaTeX from equation images, incorporating an auxiliary resolution-prediction network that automatically preprocesses input to match training data characteristics. Supports multiple deployment modes including command-line, desktop GUI with screenshot capture, REST API, and direct Python integration, with MathJax rendering for real-time validation. Includes end-to-end training infrastructure with custom tokenizer generation and dataset preparation from rendered LaTeX via XeLaTeX and ImageMagick.

16,245 stars and 10,850 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 19 / 25

Maturity 25 / 25

Community 19 / 25

How are scores calculated?

Stars

16,245

Forks

1,289

Language

Python

License

MIT

Related frameworks

naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

open-mmlab/mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

mayocream/koharu

ML-powered manga translator, written in Rust.

mindspore-lab/mindocr

A toolbox of ocr models and algorithms based on MindSpore

zyddnys/manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

Explore ML Frameworks

All categories Trending ML Framework directory Insights