GetcharZp/go-speech
go-speech 基于 Golang + ONNX 构建的轻量语音库,支持 TTS(文本转语音)与 ASR(语音转文字)。已集成 MeloTTS、Piper、达摩院 Paraformer 架构模型、Whisper 模型。
Leverages ONNX Runtime for cross-platform model inference without external dependencies, enabling local deployment of speech models directly in Go applications. Provides unified APIs across multiple engine backends (MeloTTS for multilingual synthesis, Paraformer for Mandarin ASR, Whisper for multilingual recognition), with configurable model paths and direct WAV byte output for seamless integration into audio pipelines.
Stars
46
Forks
7
Language
Go
License
MIT
Category
Last pushed
Feb 23, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/GetcharZp/go-speech"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
deepgram/deepgram-go-sdk
Official Go SDK for Deepgram.
hegedustibor/htgo-tts
Text to speech package for Golang.
Kardbord/hfapigo
Unofficial (Golang) Go bindings for the Hugging Face Inference API
asticode/go-astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library
asticode/go-astibob
Golang framework to build an AI that can understand and speak back to you, and everything else you want