jamiepine/voicebox
The open-source voice synthesis studio
Supports voice cloning from short audio samples and offers 5 interchangeable TTS engines covering 23 languages with paralinguistic expression tags. Built on Tauri (Rust) with a timeline editor for multi-voice composition, post-processing effects (pitch, reverb, compression, filters), and a REST API for integration. Runs entirely locally with hardware acceleration across macOS (Metal/MLX), Windows (CUDA), Linux, AMD ROCm, and Docker.
13,404 stars. Actively maintained with 244 commits in the last 30 days.
Stars: 13,404
Forks: 1,562
Language: TypeScript
License: MIT
Category:
Last pushed: Mar 18, 2026
Commits (30d): 244
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jamiepine/voicebox"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
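The same request can be made from a script. The sketch below is a minimal Python equivalent of the curl command, using only the standard library. It assumes the endpoint returns JSON and that the path follows a category/owner/repo pattern, which is inferred from the single URL shown above and not confirmed by any documentation here. The function names `build_quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is assumed
# to follow a <category>/<owner>/<repo> pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository without an API key.

    Assumes the response body is JSON; the field names in the result
    are not documented here, so the parsed value is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "jamiepine", "voicebox")` requests the same URL as the curl command. Unauthenticated calls count against the 100 requests/day limit, so caching the result locally is worth considering for repeated lookups.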
Related tools
devnen/Chatterbox-TTS-Server
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible...
daswer123/xtts-api-server
A simple FastAPI Server to run XTTSv2
jianchang512/ChatTTS-ui
A simple local web interface that uses ChatTTS to synthesize speech from text, and also exposes an API for external use.
Aivis-Project/AivisSpeech-Engine
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine
nari-labs/dia
A TTS model capable of generating ultra-realistic dialogue in one pass.