mutablelogic/go-whisper
Speech-to-Text in golang
Provides a unified API for multiple speech-to-text backends—local whisper.cpp models with GPU acceleration (CUDA, Vulkan, Metal) plus commercial providers (OpenAI, ElevenLabs)—with optional speaker diarization and real-time streaming. Deployed as an HTTP server with a CLI client, supporting transcription, translation, and output in JSON, SRT, VTT formats. Binds directly to whisper.cpp C++ library while abstracting provider differences through a manager layer for flexible model selection.
178 stars.
Stars
178
Forks
21
Language
Go
License
Apache-2.0
Category
Last pushed
Mar 02, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mutablelogic/go-whisper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
kurianbenoy/whisper_normalizer
A python package for whisper normalizer
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX