Kyubyong/dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

/ 100

Established

Implements the DC-TTS architecture with a two-stage pipeline: Text2Mel generates mel-spectrograms from text using guided attention for monotonic alignment, while SSRN (speaker-dependent vocoder) converts spectrograms to waveforms. Supports multilingual training across English and Korean datasets, with practical modifications including layer normalization and learning rate decay to improve convergence over the original paper's approach.

1,159 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

How are scores calculated?

Stars

1,159

Forks

360

Language

Python

License

Apache-2.0

Compare

dc_tts and dctts-pytorch

Related tools

bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Rayhane-mamah/Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

DemisEom/SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

vlomme/Multi-Tacotron-Voice-Cloning

Phoneme multilingual(Russian-English) voice cloning based on

Explore Voice AI Tools

All categories Trending Voice AI directory Insights