CheshireCC/faster-whisper-GUI
faster_whisper GUI with PySide6
Wraps faster-whisper and WhisperX with configurable VAD/model parameters, supporting batch processing and multiple output formats (SRT, VTT, LRC, SMI). Integrates optional audio source separation via Demucs for improved transcription accuracy, and enables word-level timestamp generation for karaoke-style lyrics. Built on PySide6 Fluent Widgets with model conversion utilities and Silero VAD support.
2,911 stars. No commits in the last 6 months.
Stars
2,911
Forks
168
Language
Python
License
AGPL-3.0
Category
Last pushed
Dec 08, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/CheshireCC/faster-whisper-GUI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
kurianbenoy/whisper_normalizer
A python package for whisper normalizer
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API