readbeyond/aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

63
/ 100
Established

Leverages FFmpeg for audio processing and eSpeak for speech recognition to compute frame-level alignments, outputting results in 10+ formats including SMIL for EPUB 3, WebVTT for captions, and research formats like ELAN and TextGrid. Designed as both a Python library and command-line tool with batch job processing capabilities via ZIP containers, supporting multiple languages and text input types (plain text, HTML with ID markers, structured formats).

2,811 stars and 4,158 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m No Dependents
Maintenance 0 / 25
Adoption 18 / 25
Maturity 25 / 25
Community 20 / 25

How are scores calculated?

Stars

2,811

Forks

270

Language

Python

License

AGPL-3.0

Last pushed

Jun 22, 2024

Monthly downloads

4,158

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/readbeyond/aeneas"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.