FireRedTeam/FireRedASR2S

A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.

/ 100

Emerging

Built on encoder-adapter-LLM and attention-based encoder-decoder architectures, FireRedASR2S performs end-to-end speech processing with VAD pre-filtering, language identification, ASR decoding, and punctuation restoration in a unified pipeline. The system supports both streaming and non-streaming modes, with TensorRT-LLM acceleration delivering 12.7x speedup on GPU inference. Integration with vLLM and availability on both Hugging Face and ModelScope enables flexible deployment across research and production environments.

365 stars.

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 11 / 25

Community 12 / 25

How are scores calculated?

Stars

365

Forks

Language

Python

License

Apache-2.0

Compare

FireRedASR2S and FireRedASR

Higher-rated alternatives

meizhong986/WhisperJAV

ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV

BryceWG/BiBi-Keyboard

说点啥（BiBi Keyboard）:一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR voice input method...

DevEmperor/Dictate

A powerful Whisper AI keyboard for reliable speech transcription

vivekuppal/transcribe

Transcribe is a real time transcription, conversation, Language learning platform. It provides...

sindresorhus/awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

Explore Voice AI Tools

All categories Trending Voice AI directory Insights