RaduBolbo/F5-TTS-Emotional-CFG

Zero-shot voice-cloning text-to-speech (TTS) with explicit emotion-class conditioning, built on F5-TTS

Score: 45 / 100 (Emerging)

Extends F5-TTS with multi-term classifier-free guidance for explicit emotion conditioning across five emotion classes (Neutral, Happy, Sad, Angry, Surprised), fine-tuned on the ESD dataset. The approach enables independent control over emotion intensity via a separate CFG strength parameter while preserving zero-shot voice cloning capabilities. Provides CLI inference with tunable emotion guidance strength to balance synthesis naturalness against emotion expressiveness.
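As a rough illustration of what "multi-term classifier-free guidance" can look like, the sketch below combines three model predictions with two independent strengths. This is a generic formulation, not necessarily the exact rule this repository uses: the function name `multi_term_cfg`, its parameter names, and the choice to measure the emotion term relative to the text-and-voice-conditioned prediction are all assumptions made for the example.

```python
def multi_term_cfg(v_uncond, v_cond, v_cond_emo, cfg_strength, emo_cfg_strength):
    """Combine three predictions elementwise (hypothetical helper, generic sketch).

    v_uncond    : prediction with all conditioning dropped
    v_cond      : prediction conditioned on text and reference voice only
    v_cond_emo  : prediction conditioned on text, reference voice, and emotion class
    cfg_strength      : weight on the text/voice guidance term
    emo_cfg_strength  : separate weight on the emotion guidance term
    """
    return [
        u + cfg_strength * (c - u) + emo_cfg_strength * (e - c)
        for u, c, e in zip(v_uncond, v_cond, v_cond_emo)
    ]


# Toy two-element "predictions" standing in for model outputs.
v_uncond = [0.0, 1.0]
v_cond = [1.0, 1.0]
v_cond_emo = [2.0, 0.0]

# With the emotion strength at zero, only the text/voice term contributes.
no_emotion = multi_term_cfg(v_uncond, v_cond, v_cond_emo, 2.0, 0.0)

# Raising the emotion strength moves the result along the emotion direction.
with_emotion = multi_term_cfg(v_uncond, v_cond, v_cond_emo, 2.0, 0.5)
```

Keeping the two strengths separate is what allows emotion intensity to be adjusted without changing how strongly the output follows the text and reference voice.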

No Package · No Dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 15 / 25
Community 13 / 25


Stars: 30
Forks: 5
Language: Python
License: MIT
Last pushed: Mar 03, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/RaduBolbo/F5-TTS-Emotional-CFG"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
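The same request can be made from Python using only the standard library. This is a minimal sketch: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path layout is inferred from the single URL shown above, and the response is returned as raw text because its format is not documented here.

```python
import urllib.request

# Base path taken from the curl example; the segment layout below is inferred, not documented.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Send an unauthenticated GET request and return the response body as text."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example call (requires network access):
# print(fetch_quality("voice-ai", "RaduBolbo", "F5-TTS-Emotional-CFG"))
```

How an API key is supplied is not stated on this page, so the sketch covers only the keyless request.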