chenkui164/FastASR
这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。
Implements four distinct model architectures—three non-streaming (Paraformer, K2-RNNT2, Conformer) and one streaming (Conformer Online)—with VAD-based long-form audio support, using memory-mapped binary model files for efficient parameter loading. Built entirely in C++ without deep learning framework dependencies, relying only on libfftw3 and libopenblas, with pointer-based algorithms that minimize reshape/permute operations to maximize CPU utilization across x86, ARM, and other platforms. Provides both a static C++ library and Python bindings across CPython 3.6-3.11, supporting Linux, macOS, and Windows with pre-built wheels.
547 stars. No commits in the last 6 months.
Stars
547
Forks
81
Language
C
License
Apache-2.0
Category
Last pushed
Mar 19, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/chenkui164/FastASR"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
yeyupiaoling/YeAudio
Python的音频工具
zaigie/FunSpeech
开箱即用的本地私有化部署语音服务,快速搭建FunASR与CosyVoice2/3后端