01Zhangbw/Awesome-Expressive-speech-synthesis

This is a summary of Expressive speech synthesis papers. Now update: 13 May.

/ 100

Experimental

This is a curated list of research papers on expressive speech synthesis, including related work in audio and music generation. It provides a comprehensive overview of recent advancements in creating realistic and emotionally nuanced synthesized speech and sound. The resource is ideal for researchers and practitioners who want to stay updated on the state-of-the-art in generating human-like speech, singing, and even accompanying visual gestures from text or other inputs.

No commits in the last 6 months.

Use this if you are a researcher or developer working on advanced speech synthesis and want to find academic papers on topics like emotional voice generation, multi-party dialogue synthesis, or creating talking head avatars.

Not ideal if you are looking for an off-the-shelf tool or software to implement expressive speech synthesis without diving into academic literature.

Speech Synthesis Audio Generation Expressive AI Voices Digital Narratives Computational Linguistics

No License Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

—

License

—

Higher-rated alternatives

index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Explore Voice AI Tools

All categories Trending Voice AI directory Insights