FireRedTeam/FireRedASR
Open-source, industrial-grade automatic speech recognition (ASR) models for Mandarin, Chinese dialects, and English. They set a new state of the art (SOTA) on public Mandarin ASR benchmarks and also perform strongly on singing lyrics recognition.
FireRedASR is built on two complementary architectures: an Encoder-Adapter-LLM framework (FireRedASR-LLM) aimed at peak performance and end-to-end speech interaction, and an Attention-based Encoder-Decoder (FireRedASR-AED) aimed at efficiency, which can also serve as a speech representation module in LLM-based systems. The LLM variant integrates with Qwen2, and decoding uses batch beam search with configurable parameters (beam size, length penalties, temperature). Models are distributed via Hugging Face with Python and CLI interfaces. Input is 16 kHz audio, up to 60 s for the AED variant or 30 s for the LLM variant.
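The input limits in the description can be checked before a file is sent to either model. The following is a minimal sketch using only Python's standard `wave` module; it is not part of FireRedASR, and the function name `check_wav` and the variant keys `"aed"` and `"llm"` are hypothetical names chosen for illustration. It assumes the input is an uncompressed WAV file.

```python
import wave

# Limits taken from the description; the dictionary keys are illustrative names.
MAX_SECONDS = {"aed": 60.0, "llm": 30.0}
EXPECTED_SAMPLE_RATE = 16_000  # 16 kHz


def check_wav(path, variant):
    """Return (ok, reason) for a WAV file against the stated input limits."""
    if variant not in MAX_SECONDS:
        raise ValueError(f"unknown variant: {variant!r}")

    # Read only the header fields needed to compute the duration.
    with wave.open(path, "rb") as wav:
        sample_rate = wav.getframerate()
        duration = wav.getnframes() / sample_rate

    if sample_rate != EXPECTED_SAMPLE_RATE:
        return False, f"sample rate is {sample_rate} Hz, expected {EXPECTED_SAMPLE_RATE} Hz"
    if duration > MAX_SECONDS[variant]:
        return False, f"duration {duration:.1f}s exceeds the {MAX_SECONDS[variant]:.0f}s limit"
    return True, "ok"
```

A 45-second file at 16 kHz would pass for `"aed"` and fail for `"llm"`, so longer recordings would need to be split or routed to the AED variant.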
Stars: 1,796
Forks: 159
Language: Python
License: Apache-2.0
Category
Last pushed: Feb 25, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/FireRedTeam/FireRedASR"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
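The same request can be made from Python. This is a minimal sketch using only the standard library. The `category/owner/repo` path layout is inferred from the single example URL above, and parsing the response as JSON is an assumption, since the response format is not documented here. The helper names `quality_url` and `fetch_quality` are hypothetical. Passing an API key is omitted because the page does not say how a key is supplied.

```python
import json
import urllib.request
from urllib.parse import quote

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL for one repository (path layout is inferred)."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{API_BASE}/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """GET the endpoint and parse the body as JSON (JSON is an assumption)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily quota):
# data = fetch_quality("voice-ai", "FireRedTeam", "FireRedASR")
```

With a limit of 100 unauthenticated requests per day, a caller that polls many repositories would likely want to cache responses locally.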
Related tools
meizhong986/WhisperJAV
ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV
BryceWG/BiBi-Keyboard
说点啥 (BiBi Keyboard): a Kotlin-based LLM and ASR voice input method keyboard app for Android. An LLM ASR voice input method...
DevEmperor/Dictate
A powerful Whisper AI keyboard for reliable speech transcription
vivekuppal/transcribe
Transcribe is a real time transcription, conversation, Language learning platform. It provides...
sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI