Spr-Aachen/Easy-Voice-Toolkit
A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.
Integrates established models like OpenAI Whisper for transcription and GPT-SoVITS for voice conversion, with a modular pipeline enabling sequential processing from raw audio through dataset creation and model training. Built on PyTorch with a desktop GUI (Windows) and Colab-compatible backend, supporting both standalone and cloud-based workflows. Audio preprocessing leverages intelligent slicing and voice activity detection to prepare data for downstream voice synthesis tasks.
875 stars. Actively maintained with 3 commits in the last 30 days.
Stars
875
Forks
123
Language
Python
License
GPL-3.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Spr-Aachen/Easy-Voice-Toolkit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
ftyers/commonvoice-utils
Linguistic processing for Common Voice
alphacep/awesome-russian-speech
Russian speech technology links
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
microsoft/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
PrzemyslawSwiderski/python-gradle-plugin
Gradle plugin to run Python projects.