lukeewin/FunASR_API

这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.

/ 100

Emerging

Exposes multiple HTTP endpoints (file upload, URL-based, async retrieval) with speaker-attributed transcription segments including timestamps, leveraging FastAPI with MySQL persistence. Supports CUDA acceleration via NVIDIA GPUs and handles multi-format audio normalization through FFmpeg integration. Language-agnostic API design enables consumption from Java, C++, Go, JavaScript and other HTTP clients across Linux, macOS, and Windows platforms.

No Package No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 9 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

HTML

License

MIT

Higher-rated alternatives

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...

k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

yeyupiaoling/YeAudio

Python的音频工具

zaigie/FunSpeech

开箱即用的本地私有化部署语音服务，快速搭建FunASR与CosyVoice2/3后端

Explore Voice AI Tools

All categories Trending Voice AI directory Insights