fishaudio/fish-speech
SOTA Open Source TTS
Implements a Dual-Autoregressive architecture combining a 4B-parameter slow decoder with a 400M fast decoder for semantic and acoustic codebook generation, trained on 10M+ hours across 80+ languages. Supports sub-word prosody and emotion control via inline natural language tags (e.g., `[whisper]`, `[excited]`), enabling multi-speaker conversations with reinforcement learning alignment for instruction adherence and naturalness.
26,613 stars. Actively maintained with 26 commits in the last 30 days.
Stars
26,613
Forks
2,237
Language
Python
License
—
Category
Last pushed
Mar 13, 2026
Commits (30d)
26
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/fishaudio/fish-speech"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's...
lenML/Speech-AI-Forge
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server...
mlalma/kokoro-ios
Kokoro TTS for iOS and macOSX
sidharthrajaram/StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
mlalma/KokoroTestApp
Test application for Kokoro TTS model