lukeewin/FunASR_API
这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.
Exposes multiple HTTP endpoints (file upload, URL-based, async retrieval) with speaker-attributed transcription segments including timestamps, leveraging FastAPI with MySQL persistence. Supports CUDA acceleration via NVIDIA GPUs and handles multi-format audio normalization through FFmpeg integration. Language-agnostic API design enables consumption from Java, C++, Go, JavaScript and other HTTP clients across Linux, macOS, and Windows platforms.
Stars
23
Forks
8
Language
HTML
License
MIT
Category
Last pushed
Feb 12, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lukeewin/FunASR_API"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
yeyupiaoling/YeAudio
Python的音频工具
zaigie/FunSpeech
开箱即用的本地私有化部署语音服务,快速搭建FunASR与CosyVoice2/3后端