pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX
Provides modular speech processing services, including transcription, speaker diarization, and transcript alignment, through individual endpoints, with async SQLAlchemy task persistence on SQLite or PostgreSQL backends. Configurable Whisper model selection and compute precision (float16/int8) enable deployment across CUDA and CPU environments. Includes Kubernetes-ready health probes and Swagger UI documentation for integration into broader audio/video processing pipelines.
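As a rough illustration of the float16/int8 choice described above, the sketch below picks a compute precision from the target device. The helper name `pick_compute_type`, the `settings` keys, and the device-to-precision mapping are assumptions made for this example; they are not taken from the whisperX-FastAPI codebase.

```python
# Hypothetical helper: illustrates the CUDA/CPU precision trade-off only.
# Neither the function name nor the settings keys come from whisperX-FastAPI.
def pick_compute_type(device: str) -> str:
    """Return a compute precision commonly paired with the given device."""
    normalized = device.strip().lower()
    if normalized.startswith("cuda"):
        # Assumption: half precision on GPU.
        return "float16"
    if normalized == "cpu":
        # Assumption: 8-bit integer quantization on CPU.
        return "int8"
    raise ValueError(f"unsupported device: {device!r}")


# Example settings for a CPU-only deployment ("small" is one of the
# standard Whisper model sizes; the key names are illustrative).
settings = {"whisper_model": "small", "device": "cpu"}
settings["compute_type"] = pick_compute_type(settings["device"])
print(settings)
```

In an actual deployment these values would typically come from environment variables or a configuration file; the project's own documentation lists the setting names it supports.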
Stars: 174
Forks: 58
Language: Python
License: MIT
Category
Last pushed: Mar 17, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/pavelzbornik/whisperX-FastAPI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
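The same request can be sketched in Python using only the standard library. The path layout (category, then owner, then repository) is inferred from the single URL shown above, and the assumption that the response body is JSON is not confirmed by this page; the function names are illustrative.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the segment order is inferred from the curl example."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (assumes a JSON response body)."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Only the URL is built here; call fetch_quality(...) to perform the request.
url = build_quality_url("voice-ai", "pavelzbornik", "whisperX-FastAPI")
print(url)
```

Given the 100 requests/day limit for keyless access, callers may want to cache responses rather than refetch on every run.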
Related tools
Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
kurianbenoy/whisper_normalizer
A Python package for the Whisper normalizer
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI