tacotron and GST-Tacotron
About tacotron
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
This project helps you transform written text into natural-sounding spoken audio. You input text, and it produces an audio file of that text being read aloud, much like an audiobook. This is ideal for content creators, educators, or anyone needing to generate speech from text for various applications.
About GST-Tacotron
KinglittleQ/GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
This project helps creators and developers generate natural-sounding speech from Chinese text, giving them control over the style and emotion of the spoken output. You input Chinese text and it synthesizes high-quality audio that can express different 'styles' (like happy, sad, or formal) even if those styles weren't explicitly labeled in the training data. This is useful for anyone creating audio content, such as voiceovers for videos, audiobooks, or interactive voice assistants.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work