saurabhshri/CCAligner

🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.

/ 100

Emerging

Leverages PocketSphinx for speech recognition and CMU language modeling tools to perform forced alignment between audio and subtitles, generating per-word timing data in XML/JSON formats. Operates as both a standalone CLI tool and an embeddable C++ library, supporting multi-platform builds (Linux, macOS, Windows) with optional neural grapheme-to-phoneme conversion via TensorFlow for improved accuracy. Requires 16-bit mono PCM audio at 16kHz and clean SRT subtitle files as input.

172 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 21 / 25

How are scores calculated?

Stars

172

Forks

Language

C++

License

—

Higher-rated alternatives

ieasybooks/tafrigh

تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.

botbahlul/autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech...

feldberlin/timething

Timething is a library for aligning text transcripts with their audio recordings.

botbahlul/PyAutoSRT

PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free Google Speech...

abhirooptalasila/AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

Explore Voice AI Tools

All categories Trending Voice AI directory Insights