PlayVoice/vits_chinese
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
Combines BERT-derived prosody embeddings with VITS architecture to capture natural pauses from linguistic structure, while incorporating NaturalSpeech's inference loss to reduce audio artifacts. Supports both non-streaming ONNX export and chunked streaming inference via encoder-decoder decomposition, with optional knowledge distillation enabling 3× speedup on a 53M student model. Targets Chinese TTS with multi-speaker capability and includes text preprocessing pipelines for phoneme conversion and numerical text normalization.
1,227 stars. No commits in the last 6 months.
Stars
1,227
Forks
178
Language
Python
License
MIT
Category
Last pushed
Feb 05, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/PlayVoice/vits_chinese"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
chinokikiss/GSV-TTS-Lite
GSV-TTS-Lite A high-performance inference engine specifically designed for the GPT-SoVITS...
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
High-Logic/Genie-TTS
GPT-SoVITS ONNX Inference Engine & Model Converter
FENRlR/MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
AlexandaJerry/vits-mandarin-biaobei
application of vits on mandarin tts