mutablelogic/go-whisper

Speech-to-Text in golang

51
/ 100
Established

Provides a unified API for multiple speech-to-text backends—local whisper.cpp models with GPU acceleration (CUDA, Vulkan, Metal) plus commercial providers (OpenAI, ElevenLabs)—with optional speaker diarization and real-time streaming. Deployed as an HTTP server with a CLI client, supporting transcription, translation, and output in JSON, SRT, VTT formats. Binds directly to whisper.cpp C++ library while abstracting provider differences through a manager layer for flexible model selection.

178 stars.

No Package No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 15 / 25

How are scores calculated?

Stars

178

Forks

21

Language

Go

License

Apache-2.0

Last pushed

Mar 02, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mutablelogic/go-whisper"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.