Spr-Aachen/Easy-Voice-Toolkit

A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion, etc.

Established

Integrates established models such as OpenAI Whisper for transcription and GPT-SoVITS for voice conversion in a modular pipeline that processes raw audio sequentially through dataset creation and model training. Built on PyTorch, it provides a Windows desktop GUI and a Colab-compatible backend, so it supports both standalone and cloud-based workflows. Audio preprocessing uses slicing and voice activity detection to prepare data for downstream voice synthesis tasks.
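To make the slicing and voice activity detection step concrete, here is a minimal sketch of energy-based slicing in plain Python. It is not taken from the toolkit, whose actual slicer and VAD may work quite differently. The function names (`synth_tone`, `frame_rms`, `slice_on_silence`) and the default frame length, threshold, and silence duration are all assumptions chosen for illustration.

```python
import math


def synth_tone(n_samples, freq=440.0, rate=16000, amp=0.5):
    """Generate a sine tone as a list of floats (illustrative test input)."""
    return [amp * math.sin(2 * math.pi * freq * n / rate) for n in range(n_samples)]


def frame_rms(samples, frame_len):
    """Return the RMS energy of each non-overlapping frame."""
    return [
        math.sqrt(sum(x * x for x in samples[i:i + frame_len]) / frame_len)
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def slice_on_silence(samples, frame_len=160, threshold=0.02, min_silence_frames=5):
    """Return (start, end) sample ranges of voiced segments.

    A frame counts as voiced when its RMS is at or above `threshold`.
    A segment is closed once `min_silence_frames` consecutive quiet
    frames follow it; shorter pauses stay inside the segment.
    """
    rms = frame_rms(samples, frame_len)
    segments = []
    start = None      # frame index where the current segment began
    silent_run = 0    # consecutive quiet frames seen inside a segment
    for idx, energy in enumerate(rms):
        if energy >= threshold:
            if start is None:
                start = idx
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                end = idx - silent_run + 1  # first quiet frame after the speech
                segments.append((start * frame_len, end * frame_len))
                start = None
                silent_run = 0
    if start is not None:
        # Audio ended mid-segment: trim any trailing quiet frames.
        end = len(rms) - silent_run
        segments.append((start * frame_len, end * frame_len))
    return segments
```

For a 16 kHz signal made of 1600 silent samples, a 3200-sample tone, 1600 silent samples, a 1600-sample tone, and 800 silent samples, `slice_on_silence` returns `[(1600, 4800), (6400, 8000)]`. A production pipeline would more likely rely on a trained VAD model than on a fixed energy threshold, which is sensitive to background noise.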

875 stars. Actively maintained with 3 commits in the last 30 days.

No Package, No Dependents

Maintenance 16 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

Stars: 875

Forks: 123

Language: Python

License: GPL-3.0

Related tools

ftyers/commonvoice-utils

Linguistic processing for Common Voice

alphacep/awesome-russian-speech

Russian speech technology links

microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

PrzemyslawSwiderski/python-gradle-plugin

Gradle plugin to run Python projects.
