saurabhshri/CCAligner

🔮 Word-by-word audio-subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.

Score: 39 / 100 (Emerging)

Leverages PocketSphinx for speech recognition and CMU language modeling tools to perform forced alignment between audio and subtitles, generating per-word timing data in XML/JSON formats. Operates as both a standalone CLI tool and an embeddable C++ library, supporting multi-platform builds (Linux, macOS, Windows) with optional neural grapheme-to-phoneme conversion via TensorFlow for improved accuracy. Requires 16-bit mono PCM audio at 16kHz and clean SRT subtitle files as input.
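The audio requirement above (16-bit mono PCM at 16kHz) is easy to check before running an alignment. The following is a minimal sketch using Python's standard `wave` module. It is not part of CCAligner, and the function name `check_wav_format` is hypothetical.

```python
import wave

# Input format stated in the description above.
REQUIRED_CHANNELS = 1        # mono
REQUIRED_SAMPLE_WIDTH = 2    # bytes per sample, i.e. 16-bit
REQUIRED_SAMPLE_RATE = 16000 # Hz


def check_wav_format(path):
    """Return a list of format problems; an empty list means the file matches.

    Hypothetical helper, not a CCAligner API. The wave module raises
    wave.Error for files it cannot parse as PCM WAV, and that exception
    is left to propagate to the caller.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != REQUIRED_CHANNELS:
            problems.append(f"expected mono, found {wav.getnchannels()} channels")
        if wav.getsampwidth() != REQUIRED_SAMPLE_WIDTH:
            problems.append(f"expected 16-bit samples, found {wav.getsampwidth() * 8}-bit")
        if wav.getframerate() != REQUIRED_SAMPLE_RATE:
            problems.append(f"expected 16000 Hz, found {wav.getframerate()} Hz")
    return problems
```

Calling `check_wav_format("episode.wav")` on a hypothetical file returns an empty list when the file already meets the stated requirements. Otherwise each entry names one mismatch to fix by resampling or downmixing first.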

172 stars. No commits in the last 6 months.

No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 21 / 25


Stars: 172
Forks: 34
Language: C++
License: none
Last pushed: Oct 27, 2019
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/saurabhshri/CCAligner"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
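The same request can be made from a script. The sketch below uses only the Python standard library and makes two assumptions that the listing does not confirm: that the path follows a `category/owner/repo` pattern (inferred from the single URL shown above), and that the response body is JSON. The function names are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL, assuming a category/owner/repo path pattern."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record and decode it, assuming the body is JSON.

    The response schema is not documented in this listing, so the decoded
    object is returned as-is without picking out any fields.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

For this project the call would be `fetch_quality("voice-ai", "saurabhshri", "CCAligner")`. With the keyless limit of 100 requests/day, caching the result locally is a sensible default.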