FENRlR/MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
Combines multi-band inverse Short-Time Fourier Transform (MB-iSTFT) vocoding with VITS2's end-to-end text-to-speech architecture, enabling subband-wise synthesis for improved audio quality. Supports multiple alignment backends including Triton-accelerated Super Monotonic Align, eliminating Cython compilation requirements. Offers variants ranging from full MB-iSTFT-VITS2 to lightweight Mini configurations, with single and multi-speaker training pipelines.
134 stars.
Stars
134
Forks
31
Language
Python
License
MIT
Category
Last pushed
Dec 29, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/FENRlR/MB-iSTFT-VITS2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
chinokikiss/GSV-TTS-Lite
GSV-TTS-Lite A high-performance inference engine specifically designed for the GPT-SoVITS...
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
High-Logic/Genie-TTS
GPT-SoVITS ONNX Inference Engine & Model Converter
AlexandaJerry/vits-mandarin-biaobei
application of vits on mandarin tts
Artrajz/vits-simple-api
A simple VITS HTTP API, developed by extending Moegoe with additional features.