FENRlR/MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

/ 100

Established

Combines multi-band inverse Short-Time Fourier Transform (MB-iSTFT) vocoding with VITS2's end-to-end text-to-speech architecture, enabling subband-wise synthesis for improved audio quality. Supports multiple alignment backends including Triton-accelerated Super Monotonic Align, eliminating Cython compilation requirements. Offers variants ranging from full MB-iSTFT-VITS2 to lightweight Mini configurations, with single and multi-speaker training pipelines.

134 stars.

No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

134

Forks

Language

Python

License

MIT

Compare

MB-iSTFT-VITS2 and MB-iSTFT-VITS-with-AutoVocoder

Related tools

chinokikiss/GSV-TTS-Lite

GSV-TTS-Lite A high-performance inference engine specifically designed for the GPT-SoVITS...

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

High-Logic/Genie-TTS

GPT-SoVITS ONNX Inference Engine & Model Converter

AlexandaJerry/vits-mandarin-biaobei

application of vits on mandarin tts

Artrajz/vits-simple-api

A simple VITS HTTP API, developed by extending Moegoe with additional features.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights