r9y9/deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

51
/ 100
Established

Implements guided attention loss and dilated convolutions to improve alignment convergence, supporting both single-speaker and multi-speaker models trained on LJSpeech, VCTK, and JSUT datasets. Uses convolutional sequence-to-sequence architecture with attention mechanisms and language-dependent text processors for English and Japanese. Includes preprocessors compatible with standard TTS datasets and provides preset hyperparameters for reproducible training across different model variants.

1,982 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

1,982

Forks

482

Language

Python

License

Last pushed

Dec 19, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/r9y9/deepvoice3_pytorch"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.