chenkui164/FastASR

这是一个用C++实现ASR推理的项目，它依赖很少，安装也很简单，推理速度很快，在树莓派4B等ARM平台也可以流畅的运行。支持的模型是由Google的Transformer模型中优化而来，数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时)，所以识别效果也很好，可以媲美许多商用的ASR软件。

/ 100

Emerging

Implements four distinct model architectures—three non-streaming (Paraformer, K2-RNNT2, Conformer) and one streaming (Conformer Online)—with VAD-based long-form audio support, using memory-mapped binary model files for efficient parameter loading. Built entirely in C++ without deep learning framework dependencies, relying only on libfftw3 and libopenblas, with pointer-based algorithms that minimize reshape/permute operations to maximize CPU utilization across x86, ARM, and other platforms. Provides both a static C++ library and Python bindings across CPython 3.6-3.11, supporting Linux, macOS, and Windows with pre-built wheels.

547 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 9 / 25

Community 20 / 25

How are scores calculated?

Stars

547

Forks

Language

License

Apache-2.0

Higher-rated alternatives

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...

k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

yeyupiaoling/YeAudio

Python的音频工具

zaigie/FunSpeech

开箱即用的本地私有化部署语音服务，快速搭建FunASR与CosyVoice2/3后端

Explore Voice AI Tools

All categories Trending Voice AI directory Insights