keonlee9420/Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

/ 100

Emerging

A non-autoregressive architecture that enables fast inference while conditioning on categorical emotion descriptors, continuous emotion descriptors, or conversational context, each through a separate branch implementation. Works with annotated datasets (IEMOCAP for English, AIHub Multimodal for Korean) and provides language-specific text processing pipelines with Montreal Forced Aligner integration for adapting to new languages. Offers multi-speaker synthesis with emotion- and conversation-aware prosody control as a PyTorch framework extending FastSpeech2's base architecture.
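
As a rough illustration of what conditioning on categorical or continuous emotion descriptors can look like, the sketch below adds an emotion vector to every position of an encoder output. It is not taken from the repository: the label set, the hidden size, the two continuous inputs (called arousal and valence here purely for illustration), and all function names are assumptions. Plain Python lists stand in for tensors so the example has no dependencies; in PyTorch the lookup table would typically be a `torch.nn.Embedding` and the projection a `torch.nn.Linear`.

```python
import random

HIDDEN = 4                                        # hypothetical hidden size
EMOTIONS = ["neutral", "happy", "sad", "angry"]   # hypothetical label set


def make_table(rows, cols, seed):
    """Return a rows x cols table of small random weights (stand-in for learned parameters)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


# Categorical branch: one vector per emotion label, looked up by index.
emotion_table = make_table(len(EMOTIONS), HIDDEN, seed=0)

# Continuous branch: a bias-free linear projection from 2 scalars to HIDDEN dims.
proj_weight = make_table(2, HIDDEN, seed=1)


def categorical_condition(label):
    """Look up the embedding vector for a discrete emotion label."""
    return emotion_table[EMOTIONS.index(label)]


def continuous_condition(arousal, valence):
    """Project two continuous descriptors into the hidden space."""
    return [
        arousal * proj_weight[0][j] + valence * proj_weight[1][j]
        for j in range(HIDDEN)
    ]


def add_condition(encoder_out, cond):
    """Add the same conditioning vector to every time step (broadcast over time)."""
    return [[h + c for h, c in zip(frame, cond)] for frame in encoder_out]


# Stand-in encoder output for a 3-phoneme utterance.
encoder_out = make_table(3, HIDDEN, seed=2)

conditioned_cat = add_condition(encoder_out, categorical_condition("happy"))
conditioned_cont = add_condition(encoder_out, continuous_condition(0.8, 0.3))
```

Either branch yields a sequence with the same shape as the encoder output, so the downstream modules do not need to know which kind of descriptor was used.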

318 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

Stars: 318

Forks:

Language: Python

License: —

Higher-rated alternatives

TensorSpeech/TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...

lucasnewman/nanospeech

A simple, hackable text-to-speech system in PyTorch and MLX

Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...

jxzhanggg/nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC

keonlee9420/STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...
