FireRedTeam/FireRedASR2S

A state-of-the-art, industrial-grade, all-in-one ASR system with automatic speech recognition (ASR), voice activity detection (VAD), language identification (LID), and punctuation (Punc) modules. FireRedASR2 supports Chinese (Mandarin plus 20+ dialects and accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech, singing, and music in 100+ languages. FireRedLID supports 100+ languages and 20+ Chinese dialects. FireRedPunc supports Chinese and English.

Score: 46 / 100 (Emerging)

Built on encoder-adapter-LLM and attention-based encoder-decoder architectures, FireRedASR2S performs end-to-end speech processing in a unified pipeline: VAD pre-filtering, language identification, ASR decoding, and punctuation restoration. The system supports both streaming and non-streaming modes, and TensorRT-LLM acceleration delivers a 12.7x speedup on GPU inference. Integration with vLLM and availability on both Hugging Face and ModelScope enable flexible deployment across research and production environments.
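The stage order described above (VAD, then LID, then ASR, then punctuation) can be illustrated with a minimal sketch. Every name here (`run_pipeline`, `Transcript`, the `toy_*` stages) is hypothetical and chosen for illustration only; none of it is FireRedASR2S's actual API, and the toy stages return fixed values instead of running models.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Tuple

Segment = Tuple[float, float]  # (start_sec, end_sec) of a detected region


@dataclass
class Transcript:
    segment: Segment
    language: str
    text: str


def run_pipeline(
    audio: Any,
    vad: Callable[[Any], List[Segment]],
    lid: Callable[[Any, Segment], str],
    asr: Callable[[Any, Segment, str], str],
    punc: Callable[[str, str], str],
) -> List[Transcript]:
    """Chain the four stages in order; each stage is a plain callable."""
    results = []
    for seg in vad(audio):            # 1. keep only voiced segments
        lang = lid(audio, seg)        # 2. identify the segment's language
        raw = asr(audio, seg, lang)   # 3. decode unpunctuated text
        results.append(Transcript(seg, lang, punc(raw, lang)))  # 4. punctuate
    return results


# Toy stand-ins for the real models (hypothetical, fixed outputs).
def toy_vad(audio: Any) -> List[Segment]:
    return [(0.0, 1.5), (2.0, 3.0)]


def toy_lid(audio: Any, seg: Segment) -> str:
    return "en"


def toy_asr(audio: Any, seg: Segment, lang: str) -> str:
    return "hello world"


def toy_punc(text: str, lang: str) -> str:
    return text.capitalize() + "."
```

Calling `run_pipeline(None, toy_vad, toy_lid, toy_asr, toy_punc)` yields one `Transcript` per segment returned by the VAD stage. Passing the stages as callables keeps the sketch independent of any particular model backend.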

365 stars.

No package, no dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 11 / 25
Community 12 / 25

Stars: 365
Forks: 20
Language: Python
License: Apache-2.0
Last pushed: Mar 13, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/FireRedTeam/FireRedASR2S"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
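The same request can be made from Python with only the standard library. This is a minimal sketch under two assumptions that the page does not confirm: that the URL path follows a `category/owner/repo` pattern (inferred from the single curl example), and that the endpoint returns JSON. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and no API-key handling is shown because the page does not say how a key is passed.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the path layout is inferred from the curl example."""
    parts = (quote(p, safe="") for p in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the quality record and parse it, assuming a JSON response body."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "FireRedTeam", "FireRedASR2S")` issues the same GET request as the curl command. The response schema is not documented on this page, so inspect the returned object before relying on any field.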