dunky11/voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

44
/ 100
Emerging

Built on a two-stage pipeline combining modified DelightfulTTS and UnivNet architectures pretrained on 5000 speakers, it enables fine-tuning for single and multispeaker TTS without coding. Includes automatic text normalization and dataset preprocessing tools, with GPU acceleration via CUDA and containerized training through Docker. Targets Windows and Linux with a desktop installer, supporting inference on both custom and emotional speech datasets.

229 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

229

Forks

33

Language

Python

License

Apache-2.0

Last pushed

Oct 10, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/dunky11/voicesmith"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.