FireRedTeam/FireRedASR2S
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.
Built on encoder-adapter-LLM and attention-based encoder-decoder architectures, FireRedASR2S performs end-to-end speech processing with VAD pre-filtering, language identification, ASR decoding, and punctuation restoration in a unified pipeline. The system supports both streaming and non-streaming modes, with TensorRT-LLM acceleration delivering 12.7x speedup on GPU inference. Integration with vLLM and availability on both Hugging Face and ModelScope enables flexible deployment across research and production environments.
365 stars.
Stars
365
Forks
20
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/FireRedTeam/FireRedASR2S"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
meizhong986/WhisperJAV
ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV
BryceWG/BiBi-Keyboard
说点啥(BiBi Keyboard):一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR voice input method...
DevEmperor/Dictate
A powerful Whisper AI keyboard for reliable speech transcription
vivekuppal/transcribe
Transcribe is a real time transcription, conversation, Language learning platform. It provides...
sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI