lenML/Speech-AI-Forge
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Supports multiple cutting-edge TTS models (ChatTTS, CosyVoice, F5-TTS, FishSpeech, GPT-SoVITS) with integrated ASR, voice cloning, and audio enhancement through a unified FastAPI backend. Features advanced capabilities like long-text batch processing, SSML-based podcast generation, custom voice builders with seed blending, and real-time voice style/speed/pitch adjustment. Deployable via standalone API server, Docker containers, or interactive Gradio WebUI with comprehensive voice management hub.
1,386 stars. Actively maintained with 9 commits in the last 30 days.
Stars
1,386
Forks
182
Language
Python
License
AGPL-3.0
Category
Last pushed
Mar 06, 2026
Commits (30d)
9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lenML/Speech-AI-Forge"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's...
fishaudio/fish-speech
SOTA Open Source TTS
mlalma/kokoro-ios
Kokoro TTS for iOS and macOSX
sidharthrajaram/StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
mlalma/KokoroTestApp
Test application for Kokoro TTS model