cpfair/quran-align
Word-accurate timestamps for Qur'anic audio.
Performs automatic speech recognition on Qur'anic audio using CMU Sphinx with speaker-specific acoustic models and Qur'an-specific language models, then aligns recognized words to reference text via edit-distance matching and MFCC-based refinement to syllable boundaries. Outputs JSON timing data compatible with EveryAyah recordings, with pre-generated datasets available for multiple qaris that achieve 98.5-99.9% word segmentation accuracy and <73ms average timing deviation validated against independent implementations.
241 stars. No commits in the last 6 months.
Stars
241
Forks
43
Language
C++
License
MIT
Category
Last pushed
Jan 26, 2017
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/cpfair/quran-align"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ieasybooks/tafrigh
تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.
botbahlul/autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech...
feldberlin/timething
Timething is a library for aligning text transcripts with their audio recordings.
botbahlul/PyAutoSRT
PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free Google Speech...
abhirooptalasila/AutoSub
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui