FunASR Speech Recognition Voice AI Tools

Speech recognition APIs and clients built on or wrapping FunASR and similar open-source ASR frameworks. Includes deployment servers, language bindings, and integration layers. Does NOT include text-to-speech, voice assistants, or end-user applications using ASR as a component.

There are 46 funasr speech recognition tools tracked. 1 score above 70 (verified tier). The highest-rated is PaddlePaddle/PaddleSpeech at 82/100 with 12,556 stars and 3,580 monthly downloads. 3 of the top 10 are actively maintained.

Get all 46 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=funasr-speech-recognition&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model,...

82
Verified
2 k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

62
Established
3 Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

58
Established
4 yeyupiaoling/YeAudio

Python的音频工具

51
Established
5 zaigie/FunSpeech

开箱即用的本地私有化部署语音服务,快速搭建FunASR与CosyVoice2/3后端

46
Emerging
6 manyeyes/ManySpeech

AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment...

46
Emerging
7 atomiechen/FunASR-Client

Really easy-to-use Python client for FunASR runtime server.

46
Emerging
8 Picovoice/leopard

On-device speech-to-text engine powered by deep learning

45
Emerging
9 lukeewin/FunASR_API

这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech...

42
Emerging
10 Quantatirsk/funasr-api

Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52...

42
Emerging
11 sipeed/Maix-Speech

Maix Speech AI lib, a fast and small speech lib running on embedded devices,...

39
Emerging
12 cvqluu/simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to...

39
Emerging
13 zhangzijie-pro/Speaker-Verification

Dual-model speech AI toolkit for speaker verification and speaker-aware...

39
Emerging
14 chenkui164/FastASR

这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。...

39
Emerging
15 RapidAI/RapidASR

📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR...

38
Emerging
16 bgArray/ZhiYin

知音 - AI音频听觉功能集成软件。提供声乐技术识别分析、伴奏分离等伴奏多种工具。

36
Emerging
17 PhuocElec/zipformer-asr-api

REST-API implementation of ZipFormer for automatic speech recognition (ASR)...

36
Emerging
18 kroko-ai/kroko-onnx

Kroko ASR - Speech-to-text

36
Emerging
19 xhuvom/omnilingual-ASR-Web-Dashboard

Meta Omnilingual ASR web based dashboard for testing and API based...

34
Emerging
20 jianchang512/fireredasr-ui

一个中文语音转文字项目,封装自FireRedASR

34
Emerging
21 tsengia/JSGFKit_Plus_Plus

A C++ library for parsing and manipulating JSGF grammar files.

29
Experimental
22 qkl9527/voice-assistant

基于Funasr的[实时]AI语音助手

29
Experimental
23 jaganadhg/nemoexamples

Experiments with NVIDIA NeMo

28
Experimental
24 Ikaros-521/FunASR_WS

基于FunASR官方Demo修改的WS服务端,配合FastAPI提供HTTP服务,可以在浏览器中进行实时ASR测试

28
Experimental
25 taeyoun811/Whisfusion

Whisfusion: Parallel ASR Decoding via a Diffusion Transformer

28
Experimental
26 vahnxu/doubao-asr

Agent Skill: Transcribe audio files via ByteDance Volcengine Seed-ASR 2.0...

25
Experimental
27 yuhanwang14/ASR-Pipeline

Local GPU-accelerated speech transcription pipeline with speaker diarization...

24
Experimental
28 huakunyang/SummerAsr

SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese...

24
Experimental
29 binglel/asr_baidu_web_server

asr web server based on flask

24
Experimental
30 SzLeaves/asr-webapp

ASR Web APP 中文语音识别实验室APP,使用Django构建,包含中文语音转文字与中文语音聊天机器人模块

23
Experimental
31 Anwarvic/Web-Interface-for-NVIDIA-NeMo

This repository contains an attempt to utilize the NeMo toolkit created by NVIDIA

22
Experimental
32 HsiangNianian/funasr-api

FunASR API is a FastAPI-based inference gateway that wraps multiple FunASR...

22
Experimental
33 Kaljurand/Grammars

Grammatical Framework based speech recognition grammars for Estonian,...

21
Experimental
34 wq2012/VB_diarization

VB Diarization with Eigenvoice and HMM Priors, refactored

21
Experimental
35 terry-yip/speech-to-text

Speaker diarization and speech to text

20
Experimental
36 ArenAcikgoz/Whisper-Alignment

Forced alignment decoder for Whisper.

20
Experimental
37 atomiechen/funasr-client-ts

Really easy-to-use Typescript client for FunASR runtime server.

18
Experimental
38 DDDeeeee/Teasr

Microphone-free speech recognition and text polishing for vibe coding.

17
Experimental
39 SunPCSolutions/DiarASR

Enterprise-Grade Secure ASR Diarization Pipeline - HIPAA-compliant speech...

16
Experimental
40 moziarnj07-sys/doubaoime-asr

🎤 Enable voice recognition for the Doubao input method using Python; ideal...

15
Experimental
41 aidayang/FunASR-OneClick

FunASR实时语音识别版,识别麦克风和电脑内播放的声音,电脑语音打字软件

14
Experimental
42 adamelkholyy/hpc-nemo

Fork for running Whisper transcriptions with Nemo diarization on University...

13
Experimental
43 kaka-lin/multi-asr-toolkit

A flexible speech recognition toolkit supporting multiple backends...

12
Experimental
44 adityajn105/google_speech_diarization_demo

A demo to show Speech Diarization (seperating audio of different speaker)...

12
Experimental
45 jaycollett/hass_nemo

Simple Python Docker exposing an API using Nemo to perform text...

11
Experimental
46 aaaastark/NeMo-WeightsBiases-TTS

Training and Tunning a Text to speech model with Nvidia NeMo and Weights and Biases

10
Experimental