Quantatirsk/funasr-api

Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52 languages, compatible with OpenAI API and Alibaba Cloud Speech API. 基于 FunASR 与 Qwen3-ASR 的语音识别 API 服务，支持 52 种语言，兼容 OpenAI API 与阿里云语音 API。

/ 100

Emerging

Provides containerized local deployment with multi-model support (Qwen3-ASR and Paraformer), automatic speaker diarization via CAM++, and VAD-based audio segmentation for handling long recordings. Exposes dual API compatibility through OpenAI `/v1/audio/transcriptions` and Alibaba Cloud REST/WebSocket protocols, enabling zero-code client integration. Includes GPU batch processing, far-field noise filtering, and environment-variable configuration for flexible model selection and offline deployment scenarios.

191 stars.

No License No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 5 / 25

Community 18 / 25

How are scores calculated?

Stars

191

Forks

Language

Python

License

—

Higher-rated alternatives

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...

k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

Picovoice/leopard

On-device speech-to-text engine powered by deep learning

zaigie/FunSpeech

开箱即用的本地私有化部署语音服务，快速搭建FunASR与CosyVoice2/3后端

Explore Voice AI Tools

All categories Trending Voice AI directory Insights