Tacotron and Tacotron-pytorch
These projects are alternatives rather than collaborators: each is an independent PyTorch implementation of the Tacotron text-to-speech architecture, offering a separate codebase for the same synthesis task with no technical dependency between them.
About Tacotron
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Implements location-relative attention with dynamic convolution to improve alignment robustness in text-to-mel-spectrogram synthesis, enabling stable training on single GPUs with mixed precision. Integrates with the UniversalVocoder for end-to-end audio generation from text via CMUDict phoneme conversion. Provides pretrained LJSpeech weights and preprocessing utilities for dataset training, with architectural optimizations including gradient clipping and modified learning schedules for efficient single-GPU convergence.
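The core idea behind the location-relative mechanism is dynamic convolution attention: the next alignment is computed from the previous one using both fixed location filters and per-step filters predicted from the decoder state. Below is a minimal sketch of that mechanism under stated assumptions; the class name, hyperparameters, and layer structure are illustrative and not taken from the bshall/Tacotron source.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DynamicConvolutionAttention(nn.Module):
    """Sketch of location-relative dynamic convolution attention.
    Hyperparameter values here are illustrative assumptions."""
    def __init__(self, query_dim=256, static_channels=8, static_kernel=21,
                 dynamic_channels=8, dynamic_kernel=21, attn_dim=128):
        super().__init__()
        self.dynamic_channels = dynamic_channels
        self.dynamic_kernel = dynamic_kernel
        # static location filters applied to the previous alignment
        self.static_conv = nn.Conv1d(1, static_channels, static_kernel,
                                     padding=static_kernel // 2, bias=False)
        # predicts dynamic filter taps from the current decoder query
        self.dynamic_fc = nn.Linear(query_dim, dynamic_channels * dynamic_kernel)
        self.static_proj = nn.Linear(static_channels, attn_dim, bias=False)
        self.dynamic_proj = nn.Linear(dynamic_channels, attn_dim, bias=False)
        self.v = nn.Linear(attn_dim, 1, bias=False)

    def forward(self, query, prev_alignment):
        # query: (B, query_dim); prev_alignment: (B, T)
        B, T = prev_alignment.shape
        # static location features: (B, T, static_channels)
        s = self.static_conv(prev_alignment.unsqueeze(1)).transpose(1, 2)
        # per-example dynamic filters, applied via a grouped convolution
        filters = self.dynamic_fc(query).view(
            B * self.dynamic_channels, 1, self.dynamic_kernel)
        d = F.conv1d(prev_alignment.view(1, B, T), filters,
                     padding=self.dynamic_kernel // 2, groups=B)
        d = d.view(B, self.dynamic_channels, T).transpose(1, 2)
        # energies depend only on the previous alignment, not on content
        energies = self.v(torch.tanh(self.static_proj(s) +
                                     self.dynamic_proj(d))).squeeze(-1)
        return torch.softmax(energies, dim=-1)  # new alignment, (B, T)
```

Because the energies are computed purely from the previous alignment rather than from encoder content, attention can only move relative to its last position, which is what makes this family of mechanisms robust on long utterances.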
About Tacotron-pytorch
soobinseo/Tacotron-pytorch
Pytorch implementation of Tacotron
Implements the full Tacotron architecture with encoder-decoder attention, CBHG modules, and mel-spectrogram generation for end-to-end text-to-speech synthesis. Preprocesses text into phoneme indices and audio into spectrograms, supporting the LJSpeech dataset pipeline. Includes separate training and inference scripts for model optimization and TTS sample generation.
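The CBHG module mentioned above is Tacotron's sequence encoder: a bank of 1-D convolutions with kernel sizes 1 through K, max-pooling, projection convolutions with a residual connection, highway layers, and a bidirectional GRU. A minimal sketch follows; the dimensions and layer counts are illustrative assumptions, not the values used in soobinseo/Tacotron-pytorch.

```python
import torch
import torch.nn as nn

class CBHG(nn.Module):
    """Sketch of Tacotron's CBHG module (conv bank + highway net + GRU).
    Dimensions are illustrative assumptions."""
    def __init__(self, dim=128, K=8, highway_layers=4):
        super().__init__()
        # bank of 1-D convolutions with kernel sizes 1..K
        self.conv_bank = nn.ModuleList(
            nn.Conv1d(dim, dim, k, padding=k // 2) for k in range(1, K + 1))
        self.pool = nn.MaxPool1d(2, stride=1, padding=1)
        # projections back down to the input width, with a residual add
        self.proj1 = nn.Conv1d(K * dim, dim, 3, padding=1)
        self.proj2 = nn.Conv1d(dim, dim, 3, padding=1)
        # each highway layer emits a transform half and a gate half
        self.highways = nn.ModuleList(
            nn.Linear(dim, 2 * dim) for _ in range(highway_layers))
        self.gru = nn.GRU(dim, dim, batch_first=True, bidirectional=True)

    def forward(self, x):  # x: (B, T, dim)
        T = x.size(1)
        y = x.transpose(1, 2)  # (B, dim, T)
        # even kernels pad to T+1, so trim every branch back to T
        bank = torch.cat([conv(y)[:, :, :T] for conv in self.conv_bank], dim=1)
        y = self.pool(torch.relu(bank))[:, :, :T]
        y = self.proj2(torch.relu(self.proj1(y))).transpose(1, 2) + x
        for hw in self.highways:
            h, g = hw(y).chunk(2, dim=-1)
            g = torch.sigmoid(g)
            y = g * torch.relu(h) + (1 - g) * y  # gated highway update
        out, _ = self.gru(y)  # (B, T, 2*dim) from the bidirectional GRU
        return out
```

The multi-kernel bank lets the encoder capture n-gram-like patterns of varying width in one pass, while the highway layers and bidirectional GRU mix local and long-range context before attention is applied.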