unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
Built on Transformer architectures with TTS-based pretraining, it enables prosody-preserving voice conversion through multiple approaches: non-autoregressive seq2seq with automatic alignment search (AAS), ground-truth-free foreign accent conversion, and the original Voice Transformer Network (VTN). Recipes follow ESPnet/Kaldi conventions and support both parallel and non-parallel training paradigms across datasets such as CMU ARCTIC and LJSpeech.
Stars: 108
Forks: 15
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Mar 15, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/unilight/seq2seq-vc"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
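The same request can be made from a script. The following is a minimal sketch using only the Python standard library. It assumes the endpoint returns JSON, which the page does not state, and that the path follows the category/owner/repo shape seen in the curl example. The names `build_url` and `fetch_quality` are hypothetical helpers, not part of any published client, and no API key handling is shown because the page does not say how a key is supplied.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL in the same shape as the curl example.

    The category/owner/repo layout is inferred from that single URL.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the response body is JSON; the schema is not documented here.
    """
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network call, which counts against the daily quota.
    data = fetch_quality("voice-ai", "unilight", "seq2seq-vc")
    print(json.dumps(data, indent=2))
```

Since the keyless tier is limited to 100 requests/day, a script that polls this endpoint would likely want to cache the result locally rather than call it on every run.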
Related tools
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
Edresson/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone