soobinseo/Tacotron-pytorch

Pytorch implementation of Tacotron

/ 100

Emerging

Implements the full Tacotron architecture with encoder-decoder attention, CBHG modules, and mel-spectrogram generation for end-to-end text-to-speech synthesis. Preprocesses text into phoneme indices and audio into spectrograms, supporting the LJSpeech dataset pipeline. Includes separate training and inference scripts for model optimization and TTS sample generation.

206 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

206

Forks

Language

Python

License

Apache-2.0

Compare

Tacotron-pytorch and Tacotron Tacotron-pytorch and Tacotron-2 Tacotron-pytorch and tacotron

Higher-rated alternatives

bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Rayhane-mamah/Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

DemisEom/SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Kyubyong/dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Explore Voice AI Tools

All categories Trending Voice AI directory Insights