lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

63
/ 100
Established

Combines a Vision Transformer encoder with a Transformer decoder to predict LaTeX from equation images, incorporating an auxiliary resolution-prediction network that automatically preprocesses input to match training data characteristics. Supports multiple deployment modes including command-line, desktop GUI with screenshot capture, REST API, and direct Python integration, with MathJax rendering for real-time validation. Includes end-to-end training infrastructure with custom tokenizer generation and dataset preparation from rendered LaTeX via XeLaTeX and ImageMagick.

16,245 stars and 10,850 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m
Maintenance 0 / 25
Adoption 19 / 25
Maturity 25 / 25
Community 19 / 25

How are scores calculated?

Stars

16,245

Forks

1,289

Language

Python

License

MIT

Category

latex-ocr-tools

Last pushed

Jan 18, 2025

Monthly downloads

10,850

Commits (30d)

0

Dependencies

16

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/lukas-blecher/LaTeX-OCR"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.