fishaudio/fish-speech

SOTA Open Source TTS

68
/ 100
Established

Implements a Dual-Autoregressive architecture combining a 4B-parameter slow decoder with a 400M fast decoder for semantic and acoustic codebook generation, trained on 10M+ hours across 80+ languages. Supports sub-word prosody and emotion control via inline natural language tags (e.g., `[whisper]`, `[excited]`), enabling multi-speaker conversations with reinforcement learning alignment for instruction adherence and naturalness.

26,613 stars. Actively maintained with 26 commits in the last 30 days.

No Package No Dependents
Maintenance 23 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

26,613

Forks

2,237

Language

Python

License

Last pushed

Mar 13, 2026

Commits (30d)

26

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/fishaudio/fish-speech"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.