maum-ai/sane-tts
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
This tool helps content creators and businesses generate natural-sounding speech from text across multiple languages, even if the original speaker only spoke one language. You input written text and select a desired voice, and it outputs audio speech that sounds like the chosen speaker, but in the new language. It's designed for anyone needing high-quality, consistent voiceovers or audio content in various languages.
No commits in the last 6 months.
Use this if you need to generate realistic, consistent voice narration in multiple languages, while maintaining the same speaker's vocal characteristics.
Not ideal if you only need speech generation in a single language and don't require cross-lingual voice cloning.
Stars
11
Forks
—
Language
—
License
—
Category
Last pushed
Jun 30, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/maum-ai/sane-tts"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System