saurabhshri/CCAligner
🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Leverages PocketSphinx for speech recognition and CMU language modeling tools to perform forced alignment between audio and subtitles, generating per-word timing data in XML/JSON formats. Operates as both a standalone CLI tool and an embeddable C++ library, supporting multi-platform builds (Linux, macOS, Windows) with optional neural grapheme-to-phoneme conversion via TensorFlow for improved accuracy. Requires 16-bit mono PCM audio sampled at 16 kHz and clean SRT subtitle files as input.
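The audio requirement is easy to get wrong, so it can help to check a WAV file before aligning it. The following is a minimal sketch using Python's standard wave module; it is not part of CCAligner, and the function name check_wav_format is hypothetical.

```python
import wave

def check_wav_format(path):
    """Return a list of problems; an empty list means the file is
    16-bit mono PCM at 16 kHz. Hypothetical helper, not a CCAligner API."""
    problems = []
    # wave.open raises wave.Error for files it cannot parse as PCM WAV.
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1:
            problems.append(f"expected mono, found {wav.getnchannels()} channels")
        if wav.getsampwidth() != 2:  # sample width is reported in bytes
            problems.append(f"expected 16-bit samples, found {wav.getsampwidth() * 8}-bit")
        if wav.getframerate() != 16000:
            problems.append(f"expected 16000 Hz, found {wav.getframerate()} Hz")
    return problems

if __name__ == "__main__":
    import sys
    for issue in check_wav_format(sys.argv[1]):
        print(issue)
```

An empty result means the file meets the stated input format; anything else lists what needs to be converted first.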
172 stars. No commits in the last 6 months.
Stars: 172
Forks: 34
Language: C++
License: —
Category:
Last pushed: Oct 27, 2019
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/saurabhshri/CCAligner"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
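The same request can be made from a script. The sketch below uses only Python's standard library and rests on two assumptions not confirmed by this page: that the path follows a category/owner/repo pattern (inferred from the single URL above) and that the response body is JSON. The function names are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"

def build_quality_url(category, owner, repo):
    # Assumed pattern, inferred from the one example URL on this page.
    return f"{BASE_URL}/{category}/{owner}/{repo}"

def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the quality record for a repository.
    Assumes a JSON response; the schema is not documented here."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)

if __name__ == "__main__":
    data = fetch_quality("voice-ai", "saurabhshri", "CCAligner")
    print(json.dumps(data, indent=2))
```

With the keyless limit of 100 requests/day, it is worth caching results rather than calling this in a loop.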
Higher-rated alternatives
ieasybooks/tafrigh
Transcribe audio to text and generate SRT and VTT files using Whisper models and wit.ai technology.
botbahlul/autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech...
feldberlin/timething
Timething is a library for aligning text transcripts with their audio recordings.
botbahlul/PyAutoSRT
PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free Google Speech...
abhirooptalasila/AutoSub
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui