saharmor/whisper-playground
Build real-time speech-to-text web apps using OpenAI's Whisper (https://openai.com/blog/whisper/)
Combines faster-whisper for optimized inference with Diart and Pyannote for speaker diarization, enabling multi-language transcription with speaker identification. Uses a Python backend (configurable CPU/GPU compute) paired with a React frontend, offering both real-time streaming and sequential transcription modes with adjustable beam size and timeout parameters for accuracy-latency tradeoffs.
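As a rough illustration of the accuracy-latency tradeoff described above, the sketch below picks a beam size and compute type and passes them to faster-whisper. It is not taken from the whisper-playground code: the `TranscribeSettings` class, the `settings_for` and `transcribe` helpers, the default model size, and the specific beam sizes (1 and 5) are assumptions made for the example. Only `WhisperModel(...)` and `model.transcribe(..., beam_size=...)` are faster-whisper calls.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TranscribeSettings:
    """Hypothetical settings bundle; not part of whisper-playground."""
    model_size: str = "small"
    device: str = "cpu"          # "cpu" or "cuda"
    compute_type: str = "int8"   # quantization used by CTranslate2
    beam_size: int = 1           # larger beams are slower but usually more accurate


def settings_for(prefer_accuracy: bool, use_gpu: bool) -> TranscribeSettings:
    """Choose illustrative defaults: wide beam for accuracy, greedy for latency."""
    return TranscribeSettings(
        device="cuda" if use_gpu else "cpu",
        compute_type="float16" if use_gpu else "int8",
        beam_size=5 if prefer_accuracy else 1,
    )


def transcribe(audio_path: str, settings: TranscribeSettings) -> list[tuple[float, float, str]]:
    """Transcribe one file and return (start, end, text) tuples per segment."""
    # Imported lazily so the settings helpers work without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(
        settings.model_size,
        device=settings.device,
        compute_type=settings.compute_type,
    )
    # transcribe() returns a lazy generator of segments plus detected-language info.
    segments, _info = model.transcribe(audio_path, beam_size=settings.beam_size)
    return [(seg.start, seg.end, seg.text.strip()) for seg in segments]
```

Calling `transcribe("sample.wav", settings_for(prefer_accuracy=False, use_gpu=False))` (with a hypothetical `sample.wav`) would favor latency on CPU. Speaker diarization with Diart and Pyannote is a separate step and is not shown here.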
833 stars. No commits in the last 6 months.
Stars: 833
Forks: 145
Language: Python
License: MIT
Category:
Last pushed: Sep 12, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/saharmor/whisper-playground"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
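The same request can be made from Python with the standard library. This is a minimal sketch under two assumptions that the page does not confirm: that the path segments after `/quality/` are category, owner, and repository in that order, and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are made up for the example.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the segment order is inferred from the curl example."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request and counts against the daily quota.
    print(fetch_quality("voice-ai", "saharmor", "whisper-playground"))
```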
Related tools
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
oseiskar/autosubsync
Automatically synchronize subtitles with audio using machine learning
FL33TW00D/whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️
machinelearningZH/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
shhossain/BanglaSpeech2Text
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...