dunky11/voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

/ 100

Emerging

Built on a two-stage pipeline combining modified DelightfulTTS and UnivNet architectures pretrained on 5000 speakers, it enables fine-tuning for single and multispeaker TTS without coding. Includes automatic text normalization and dataset preprocessing tools, with GPU acceleration via CUDA and containerized training through Docker. Targets Windows and Linux with a desktop installer, supporting inference on both custom and emotional speech datasets.

229 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

229

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

herimor/voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

EveryVoiceTTS/EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language

kadirnar/VoiceHub

VoiceHub: A Unified Inference Interface for TTS Models

NeonGeckoCom/neon-tts-plugin-coqui

Coqui AI TTS plugin

Atm4x/tts-with-rvc

TTS with RVC-module to generate .wav audios

Explore Voice AI Tools

All categories Trending Voice AI directory Insights