saharmor/whisper-playground

Build real-time speech-to-text web apps using OpenAI's Whisper: https://openai.com/blog/whisper/

/ 100

Established

Combines faster-whisper for optimized inference with Diart and Pyannote for speaker diarization, enabling multi-language transcription with speaker identification. Uses a Python backend (configurable CPU/GPU compute) paired with a React frontend, offering both real-time streaming and sequential transcription modes with adjustable beam size and timeout parameters for accuracy-latency tradeoffs.
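The compute and beam-size settings described above can be illustrated with a minimal sketch that calls faster-whisper directly. This is not the project's own code: `TranscriptionConfig` and `transcribe_file` are hypothetical names, the `int8` and `float16` compute types are assumed defaults, and speaker diarization with Diart and Pyannote is left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TranscriptionConfig:
    """Hypothetical settings bundle; names are not taken from the repository."""
    model_size: str = "small"
    use_gpu: bool = False
    beam_size: int = 1  # smaller beam: lower latency, usually lower accuracy

    def model_kwargs(self) -> dict:
        # Assumed defaults: float16 on GPU, int8 on CPU.
        if self.use_gpu:
            return {"device": "cuda", "compute_type": "float16"}
        return {"device": "cpu", "compute_type": "int8"}


def transcribe_file(path: str, config: TranscriptionConfig) -> list[tuple[float, float, str]]:
    """Transcribe one audio file and return (start, end, text) tuples."""
    # Imported lazily so the config logic works without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(config.model_size, **config.model_kwargs())
    # transcribe() returns a lazy iterator of segments plus run metadata.
    segments, _info = model.transcribe(path, beam_size=config.beam_size)
    return [(seg.start, seg.end, seg.text.strip()) for seg in segments]
```

Raising `beam_size` (for example to 5) generally trades latency for accuracy, which is the kind of tradeoff the description refers to; calling `transcribe_file("audio.wav", TranscriptionConfig(beam_size=5))` would require faster-whisper to be installed and a local audio file at that path.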

833 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25

Stars: 833

Forks: 145

Language: Python

License: MIT

Related tools

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

oseiskar/autosubsync

Automatically synchronize subtitles with audio using machine learning

FL33TW00D/whisper-turbo

Cross-Platform, GPU Accelerated Whisper 🏎️

machinelearningZH/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.

shhossain/BanglaSpeech2Text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...
