atomicoo/tacotron2-mandarin
Tensorflow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on Tacotron-2 model.
Implements the seq2seq encoder-decoder architecture from DeepMind's Tacotron-2 paper, predicting mel-spectrograms from Chinese text input with Griffin-Lim vocoding for waveform synthesis. Supports multiple open datasets (BIAOBEI, THCHS-30) with preprocessing pipelines for audio normalization and mel-spectrogram extraction. Includes pretrained model checkpoints and evaluation utilities for monitoring synthesis quality during training.
131 stars. No commits in the last 6 months.
Stars
131
Forks
45
Language
Python
License
MIT
Category
Last pushed
Jul 06, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/atomicoo/tacotron2-mandarin"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...