jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
Optimizes layer caching and parallel builds to efficiently manage 175 pre-built Docker images (~10GB each) on GitHub's free runners with weekly CI updates. Provides 40+ pre-baked model variants across languages (tiny to large-v3) alongside a `no_model` tag for custom model selection, with GPU acceleration support via NVIDIA Container Toolkit.
422 stars.
Stars
422
Forks
49
Language
Dockerfile
License
MIT
Category
Last pushed
Mar 15, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jim60105/docker-whisperX"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...
linto-ai/linto-stt
An automatic speech recognition API
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
ringger/transcribe-critic
Multi-source transcript merging inspired by textual criticism — LLM adjudicates multiple...