keonlee9420/Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

46
/ 100
Emerging

Non-autoregressive architecture enabling fast inference while conditioning on categorical or continuous emotion descriptors and conversational context through separate branch implementations. Includes annotated datasets (IEMOCAP for English, AIHub Multimodal for Korean) and language-specific text processing pipelines with Montreal Forced Aligner integration for adapting to new languages. Provides multi-speaker synthesis with emotion/conversation-aware prosody control as a PyTorch framework extending FastSpeech2's base architecture.

318 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

318

Forks

48

Language

Python

License

Last pushed

Aug 25, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/keonlee9420/Expressive-FastSpeech2"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.