01Zhangbw/Awesome-Expressive-speech-synthesis
This is a summary of Expressive speech synthesis papers. Now update: 13 May.
This is a curated list of research papers on expressive speech synthesis, including related work in audio and music generation. It provides a comprehensive overview of recent advancements in creating realistic and emotionally nuanced synthesized speech and sound. The resource is ideal for researchers and practitioners who want to stay updated on the state-of-the-art in generating human-like speech, singing, and even accompanying visual gestures from text or other inputs.
No commits in the last 6 months.
Use this if you are a researcher or developer working on advanced speech synthesis and want to find academic papers on topics like emotional voice generation, multi-party dialogue synthesis, or creating talking head avatars.
Not ideal if you are looking for an off-the-shelf tool or software to implement expressive speech synthesis without diving into academic literature.
Stars
8
Forks
—
Language
—
License
—
Category
Last pushed
May 13, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/01Zhangbw/Awesome-Expressive-speech-synthesis"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System