kha-white/manga-ocr

Optical character recognition for Japanese text, with the main focus being Japanese manga

64
/ 100
Established

Built on Transformers' Vision Encoder Decoder architecture, it handles multi-line text recognition in a single forward pass—enabling entire manga speech bubbles to be processed without line splitting. The model is specifically trained to robustly handle manga-specific challenges including vertical/horizontal text, furigana annotations, image overlays, and low-quality scans. Integrates with clipboard and directory monitoring for background processing, enabling workflows with screenshot tools (ShareX, Flameshot) and dictionary lookup applications like Yomitan.

2,582 stars and 17,983 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m
Maintenance 2 / 25
Adoption 20 / 25
Maturity 25 / 25
Community 17 / 25

How are scores calculated?

Stars

2,582

Forks

127

Language

Python

License

Apache-2.0

Last pushed

Jun 14, 2025

Monthly downloads

17,983

Commits (30d)

0

Dependencies

10

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/kha-white/manga-ocr"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.