whisper-asr-webservice and whisperX-FastAPI
These are ecosystem siblings, as they both offer an API wrapper around different versions of theWhisper speech-to-text model, with `ahmetoner/whisper-asr-webservice` using OpenAI's original Whisper and `pavelzbornik/whisperX-FastAPI` utilizing WhisperX, an enhanced version of Whisper.
About whisper-asr-webservice
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
Provides pluggable ASR engine selection (OpenAI Whisper, Faster Whisper, WhisperX) with support for speaker diarization, word-level timestamps, and voice activity detection. Containerized with Docker for CPU/GPU deployment via environment variables, integrates FFmpeg for diverse audio/video codec handling, and exposes a REST API with Swagger documentation for transcription, translation, and language identification tasks.
About whisperX-FastAPI
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX
Provides modular speech processing services including transcription, speaker diarization, and transcript alignment via individual endpoints, with async SQLAlchemy task persistence supporting SQLite or PostgreSQL backends. Configurable Whisper model selection and compute precision (float16/int8) enables deployment across CUDA and CPU environments. Includes Kubernetes-ready health probes and Swagger UI documentation for integration into broader audio/video processing pipelines.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work