k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
Exclusively supports end-to-end transducer and CTC models with both C++ and Python APIs, emphasizing inference deployment rather than training. Provides PyTorch-based inference with complementary ONNX and NCNN variants for edge/mobile deployment. Includes browser-based testing via Hugging Face Spaces without local installation.
896 stars. Actively maintained with 9 commits in the last 30 days.
Stars
896
Forks
146
Language
C++
License
Apache-2.0
Category
Last pushed
Mar 18, 2026
Commits (30d)
9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/k2-fsa/sherpa"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
yeyupiaoling/YeAudio
Python的音频工具
zaigie/FunSpeech
开箱即用的本地私有化部署语音服务,快速搭建FunASR与CosyVoice2/3后端
manyeyes/ManySpeech
AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment Extraction, Audio...