pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

61 / 100 (Established)

Provides modular speech processing services (transcription, speaker diarization, and transcript alignment) through individual endpoints, with async SQLAlchemy task persistence backed by either SQLite or PostgreSQL. Configurable Whisper model selection and compute precision (float16/int8) enable deployment across CUDA and CPU environments. Includes Kubernetes-ready health probes and Swagger UI documentation for integration into broader audio/video processing pipelines.
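As a rough illustration of the precision/device pairing described above, the sketch below maps a device string to a compute precision. The helper name `pick_compute_type` and the specific mapping (float16 on CUDA, int8 on CPU) are assumptions made for this example; the project's actual configuration keys and defaults are not shown in this listing.

```python
def pick_compute_type(device: str) -> str:
    """Return a compute precision for a device string.

    Hypothetical helper, not part of whisperX-FastAPI. It assumes
    float16 is used on CUDA devices and int8 on CPU.
    """
    normalized = device.strip().lower()
    if normalized.startswith("cuda"):
        # Covers "cuda" as well as indexed devices such as "cuda:0".
        return "float16"
    if normalized == "cpu":
        return "int8"
    raise ValueError(f"unsupported device: {device!r}")


if __name__ == "__main__":
    for name in ("cuda", "cuda:0", "cpu"):
        print(name, "->", pick_compute_type(name))
```

A deployment would typically read the device from its environment and pass the result to the model loader; how this service wires that up should be checked against its own documentation.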

No package, no dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

Stars: 174
Forks: 58
Language: Python
License: MIT
Last pushed: Mar 17, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/pavelzbornik/whisperX-FastAPI"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
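The same request can be made from Python. The sketch below uses only the standard library and rebuilds the URL shown in the curl command. Two things are assumptions here: that the path follows a `category/owner/repo` pattern (inferred from the single URL above) and that the endpoint returns JSON. The function names are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the category/owner/repo layout
# is inferred from that one URL and may not hold for other entries.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-score URL for a repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    data = fetch_quality("voice-ai", "pavelzbornik", "whisperX-FastAPI")
    print(json.dumps(data, indent=2))
```

Given the 100 requests/day limit for keyless access, caching the response locally is a sensible default for scripts that run repeatedly.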