feldberlin/timething

Timething is a library for aligning text transcripts with their audio recordings.

/ 100

Established

Leverages Wav2Vec CTC models from Hugging Face for forced alignment, producing character and word-level timestamps with confidence scores via the PyTorch forced alignment approach. Supports both long-form media (podcasts, audiobooks) and machine learning workflows, with optional audio re-cutting based on alignment results. Handles multiple audio formats and 12+ languages, running efficiently on CPUs or GPUs with configurable batch processing.

130 stars and 152 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 15 / 25

Maturity 25 / 25

Community 13 / 25

How are scores calculated?

Stars

130

Forks

Language

Jupyter Notebook

License

MIT

Related tools

ieasybooks/tafrigh

تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.

botbahlul/autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech...

botbahlul/PyAutoSRT

PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free Google Speech...

abhirooptalasila/AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

cpfair/quran-align

Word-accurate timestamps for Qur'anic audio.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights