rishikksh20/FastSpeech2

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

48
/ 100
Emerging

Builds on ESPnet's FastSpeech architecture with explicit duration, pitch, and energy prediction modules for fine-grained prosody control. Integrates NVIDIA's Tacotron 2 preprocessing pipeline with MelGAN vocoding, and supports Montreal Forced Aligner for dataset phoneme alignment without manual text-audio synchronization. Includes TorchScript export capability and pre-aligned LJSpeech filelists for immediate training.

233 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

How are scores calculated?

Stars

233

Forks

52

Language

Jupyter Notebook

License

Apache-2.0

Last pushed

Jun 22, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rishikksh20/FastSpeech2"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.