AlexandaJerry/vits-mandarin-biaobei
Application of VITS to Mandarin TTS
Implements a conditional variational autoencoder (VAE) with adversarial learning and normalizing flows for end-to-end Mandarin speech synthesis. Decoding is parallel, and a stochastic duration predictor captures natural variation in pitch and rhythm. The model is trained on the Biaobei dataset, with text converted to phonemes by g2p and aligned to audio by monotonic alignment search. Single-speaker and multi-speaker configurations are supported through separate training pipelines, both set up for 22050 Hz audio.
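The monotonic alignment search mentioned above can be illustrated with a small dynamic program. This is a minimal NumPy sketch, not the repository's implementation: the function name `monotonic_alignment_search`, the `(tokens, frames)` layout of `log_p`, and the toy scores are assumptions made for illustration. It finds the highest-scoring alignment in which every frame maps to exactly one token, tokens appear in order, and no token is skipped.

```python
import numpy as np


def monotonic_alignment_search(log_p: np.ndarray) -> np.ndarray:
    """Return a 0/1 alignment of shape (n_tokens, n_frames).

    log_p[i, j] is an assumed log-likelihood of frame j under token i.
    """
    n_tokens, n_frames = log_p.shape
    if n_tokens > n_frames:
        raise ValueError("need at least one frame per token")

    # q[i, j] = best cumulative score of a path ending at (token i, frame j).
    q = np.full((n_tokens, n_frames), -np.inf)
    q[0, 0] = log_p[0, 0]
    for j in range(1, n_frames):
        # Token i cannot be reached before frame i, hence the upper bound.
        for i in range(min(j + 1, n_tokens)):
            stay = q[i, j - 1]                               # same token as previous frame
            advance = q[i - 1, j - 1] if i > 0 else -np.inf  # moved on from previous token
            q[i, j] = log_p[i, j] + max(stay, advance)

    # Backtrack from the last token at the last frame.
    path = np.zeros((n_tokens, n_frames), dtype=np.int64)
    i = n_tokens - 1
    for j in range(n_frames - 1, -1, -1):
        path[i, j] = 1
        if i > 0 and j > 0 and q[i - 1, j - 1] > q[i, j - 1]:
            i -= 1
    return path


# Toy scores for 3 tokens and 4 frames (made-up values).
log_p = np.array([
    [0.0, -0.5, -5.0, -5.0],
    [-5.0, -1.0, 0.0, -5.0],
    [-5.0, -5.0, -1.0, 0.0],
])
path = monotonic_alignment_search(log_p)
durations = path.sum(axis=1)  # frames per token: [2, 1, 1]
```

Summing the path along the frame axis gives per-token durations, which is the kind of target a duration predictor is trained on.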
121 stars. No commits in the last 6 months.
Stars: 121
Forks: 104
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: May 11, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/AlexandaJerry/vits-mandarin-biaobei"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
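The same request can be made from Python with only the standard library. This is a rough sketch under stated assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the reading of the path as category/owner/repository is inferred from the single URL above, and the response is assumed to be JSON, since its format is not documented here.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (segment meaning is an assumption)."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


url = quality_url("voice-ai", "AlexandaJerry", "vits-mandarin-biaobei")
# data = fetch_quality("voice-ai", "AlexandaJerry", "vits-mandarin-biaobei")
```

The network call is left commented out so the snippet runs offline. The keyless tier is limited to 100 requests per day, so repeated lookups are worth caching.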
Related tools
chinokikiss/GSV-TTS-Lite
A high-performance inference engine specifically designed for the GPT-SoVITS...
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
High-Logic/Genie-TTS
GPT-SoVITS ONNX Inference Engine & Model Converter
FENRlR/MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
Artrajz/vits-simple-api
A simple VITS HTTP API, developed by extending Moegoe with additional features.