k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

/ 100

Established

Exclusively supports end-to-end transducer and CTC models with both C++ and Python APIs, emphasizing inference deployment rather than training. Provides PyTorch-based inference with complementary ONNX and NCNN variants for edge/mobile deployment. Includes browser-based testing via Hugging Face Spaces without local installation.

896 stars. Actively maintained with 9 commits in the last 30 days.

No Package No Dependents

Maintenance 20 / 25

Adoption 10 / 25

Maturity 9 / 25

Community 23 / 25

How are scores calculated?

Stars

896

Forks

146

Language

C++

License

Apache-2.0

Related tools

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

yeyupiaoling/YeAudio

Python的音频工具

zaigie/FunSpeech

开箱即用的本地私有化部署语音服务，快速搭建FunASR与CosyVoice2/3后端

manyeyes/ManySpeech

AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment Extraction, Audio...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights