FunASR Speech Recognition Voice AI Tools
Speech recognition APIs and clients built on or wrapping FunASR and similar open-source ASR frameworks. Includes deployment servers, language bindings, and integration layers. Does NOT include text-to-speech, voice assistants, or end-user applications using ASR as a component.
There are 46 funasr speech recognition tools tracked. 1 score above 70 (verified tier). The highest-rated is PaddlePaddle/PaddleSpeech at 82/100 with 12,556 stars and 3,580 monthly downloads. 3 of the top 10 are actively maintained.
Get all 46 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=funasr-speech-recognition&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model,... |
|
Verified |
| 2 |
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi |
|
Established |
| 3 |
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning |
|
Established |
| 4 |
yeyupiaoling/YeAudio
Python的音频工具 |
|
Established |
| 5 |
zaigie/FunSpeech
开箱即用的本地私有化部署语音服务,快速搭建FunASR与CosyVoice2/3后端 |
|
Emerging |
| 6 |
manyeyes/ManySpeech
AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment... |
|
Emerging |
| 7 |
atomiechen/FunASR-Client
Really easy-to-use Python client for FunASR runtime server. |
|
Emerging |
| 8 |
Picovoice/leopard
On-device speech-to-text engine powered by deep learning |
|
Emerging |
| 9 |
lukeewin/FunASR_API
这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech... |
|
Emerging |
| 10 |
Quantatirsk/funasr-api
Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52... |
|
Emerging |
| 11 |
sipeed/Maix-Speech
Maix Speech AI lib, a fast and small speech lib running on embedded devices,... |
|
Emerging |
| 12 |
cvqluu/simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to... |
|
Emerging |
| 13 |
zhangzijie-pro/Speaker-Verification
Dual-model speech AI toolkit for speaker verification and speaker-aware... |
|
Emerging |
| 14 |
chenkui164/FastASR
这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。... |
|
Emerging |
| 15 |
RapidAI/RapidASR
📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR... |
|
Emerging |
| 16 |
bgArray/ZhiYin
知音 - AI音频听觉功能集成软件。提供声乐技术识别分析、伴奏分离等伴奏多种工具。 |
|
Emerging |
| 17 |
PhuocElec/zipformer-asr-api
REST-API implementation of ZipFormer for automatic speech recognition (ASR)... |
|
Emerging |
| 18 |
kroko-ai/kroko-onnx
Kroko ASR - Speech-to-text |
|
Emerging |
| 19 |
xhuvom/omnilingual-ASR-Web-Dashboard
Meta Omnilingual ASR web based dashboard for testing and API based... |
|
Emerging |
| 20 |
jianchang512/fireredasr-ui
一个中文语音转文字项目,封装自FireRedASR |
|
Emerging |
| 21 |
tsengia/JSGFKit_Plus_Plus
A C++ library for parsing and manipulating JSGF grammar files. |
|
Experimental |
| 22 |
qkl9527/voice-assistant
基于Funasr的[实时]AI语音助手 |
|
Experimental |
| 23 |
jaganadhg/nemoexamples
Experiments with NVIDIA NeMo |
|
Experimental |
| 24 |
Ikaros-521/FunASR_WS
基于FunASR官方Demo修改的WS服务端,配合FastAPI提供HTTP服务,可以在浏览器中进行实时ASR测试 |
|
Experimental |
| 25 |
taeyoun811/Whisfusion
Whisfusion: Parallel ASR Decoding via a Diffusion Transformer |
|
Experimental |
| 26 |
vahnxu/doubao-asr
Agent Skill: Transcribe audio files via ByteDance Volcengine Seed-ASR 2.0... |
|
Experimental |
| 27 |
yuhanwang14/ASR-Pipeline
Local GPU-accelerated speech transcription pipeline with speaker diarization... |
|
Experimental |
| 28 |
huakunyang/SummerAsr
SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese... |
|
Experimental |
| 29 |
binglel/asr_baidu_web_server
asr web server based on flask |
|
Experimental |
| 30 |
SzLeaves/asr-webapp
ASR Web APP 中文语音识别实验室APP,使用Django构建,包含中文语音转文字与中文语音聊天机器人模块 |
|
Experimental |
| 31 |
Anwarvic/Web-Interface-for-NVIDIA-NeMo
This repository contains an attempt to utilize the NeMo toolkit created by NVIDIA |
|
Experimental |
| 32 |
HsiangNianian/funasr-api
FunASR API is a FastAPI-based inference gateway that wraps multiple FunASR... |
|
Experimental |
| 33 |
Kaljurand/Grammars
Grammatical Framework based speech recognition grammars for Estonian,... |
|
Experimental |
| 34 |
wq2012/VB_diarization
VB Diarization with Eigenvoice and HMM Priors, refactored |
|
Experimental |
| 35 |
terry-yip/speech-to-text
Speaker diarization and speech to text |
|
Experimental |
| 36 |
ArenAcikgoz/Whisper-Alignment
Forced alignment decoder for Whisper. |
|
Experimental |
| 37 |
atomiechen/funasr-client-ts
Really easy-to-use Typescript client for FunASR runtime server. |
|
Experimental |
| 38 |
DDDeeeee/Teasr
Microphone-free speech recognition and text polishing for vibe coding. |
|
Experimental |
| 39 |
SunPCSolutions/DiarASR
Enterprise-Grade Secure ASR Diarization Pipeline - HIPAA-compliant speech... |
|
Experimental |
| 40 |
moziarnj07-sys/doubaoime-asr
🎤 Enable voice recognition for the Doubao input method using Python; ideal... |
|
Experimental |
| 41 |
aidayang/FunASR-OneClick
FunASR实时语音识别版,识别麦克风和电脑内播放的声音,电脑语音打字软件 |
|
Experimental |
| 42 |
adamelkholyy/hpc-nemo
Fork for running Whisper transcriptions with Nemo diarization on University... |
|
Experimental |
| 43 |
kaka-lin/multi-asr-toolkit
A flexible speech recognition toolkit supporting multiple backends... |
|
Experimental |
| 44 |
adityajn105/google_speech_diarization_demo
A demo to show Speech Diarization (seperating audio of different speaker)... |
|
Experimental |
| 45 |
jaycollett/hass_nemo
Simple Python Docker exposing an API using Nemo to perform text... |
|
Experimental |
| 46 |
aaaastark/NeMo-WeightsBiases-TTS
Training and Tunning a Text to speech model with Nvidia NeMo and Weights and Biases |
|
Experimental |