k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages
Built on ONNX Runtime with Kaldi-style FST-based decoding, the project enables both streaming and non-streaming inference across diverse architectures including specialized NPU accelerators (Rockchip, Qualcomm, Ascend, Axera) for hardware-optimized performance. Beyond core speech tasks, it includes specialized modules like keyword spotting, audio tagging, spoken language identification, and punctuation restoration—accessible through a unified C++/ONNX backend with language bindings spanning Python, Go, Java, Rust, and WebAssembly. Pre-trained models are available on Hugging Face with browser-based demos, supporting multi-language inference (Chinese, English, Cantonese, Japanese, Korean, Thai) on everything from Raspberry Pi to NVIDIA Jetson edge devices.
10,885 stars and 181,098 monthly downloads. Actively maintained with 138 commits in the last 30 days. Available on PyPI.
Stars
10,885
Forks
1,235
Language
C++
License
Apache-2.0
Category
Last pushed
Mar 18, 2026
Monthly downloads
181,098
Commits (30d)
138
Dependencies
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/k2-fsa/sherpa-onnx"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Related tools
ccoreilly/vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
alphacep/vosk
VOSK Speech Recognition Toolkit
alphacep/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
solyarisoftware/voskJs
Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.
alphacep/vosk-asterisk
Speech Recognition in Asterisk with Vosk Server