Text-to-Speech Conversion ML Frameworks
Tools and applications that convert written text into spoken audio output. Includes voice cloning, multi-language support, and audiobook generation. Does NOT include speech recognition, voice user interfaces, or general audio processing frameworks.
There are 39 text-to-speech conversion frameworks tracked. The highest-rated is asmith26/speech2caret at 37/100 with 3 stars and 634 monthly downloads.
Get all 39 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=text-to-speech-conversion&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
asmith26/speech2caret
Use your speech to write to the current caret position! |
|
Emerging |
| 2 |
pranayjoshi/Speech_recognition
Indo:- A mini Speech Recognizer |
|
Experimental |
| 3 |
DominicTWHV/LJSpeech_Dataset_Generator
LJSpeech dataset generator for TTS model training/fine tuning |
|
Experimental |
| 4 |
kristofferv98/openai-voicestream
OpenAI VoiceStream: Real-time text-to-speech library for processing text and... |
|
Experimental |
| 5 |
helanzhiyi/audio-annotation-platform
🎤 Manage audio transcription workflows efficiently with this open-source... |
|
Experimental |
| 6 |
Dineshkumar-Ponnusamy/maya-voice-ai
Maya Voice AI is an open-source project that demonstrates the Maya1 model,... |
|
Experimental |
| 7 |
lexust1/av2txtsum
Automatic speech recognition (ASR) |
|
Experimental |
| 8 |
ombharatiya/Speech-To-Text-Android
It is based on Google STT API. The app simply takes your audio as input... |
|
Experimental |
| 9 |
Chaitanya31612/SIH_VAKPRATYABHIJNA
Developed Sanskrit voice to text and interface in Google Search Engine... |
|
Experimental |
| 10 |
idsudd/tricahue
🦜 Tricahue: modelo de transcripción de voz especializado en español chileno |
|
Experimental |
| 11 |
hsahovic/Speech-to-maths
Speech-recognition for Latex generation |
|
Experimental |
| 12 |
Curovearth/Pi_Giving_Voice_to_Voiceless
Software and Hardware prototype for giving voice to those who are unable to... |
|
Experimental |
| 13 |
rottter4585/Llasa-GRPO
🎤 Fine-tune the Llasa TTS model with GRPO using Hugging Face tools to... |
|
Experimental |
| 14 |
srume/LLM-powered-TextToSpeech
In this repository, I have built different version of Text to speech models... |
|
Experimental |
| 15 |
nhcarrigan-mentorship/matanat-khalilova
VoiceBridge: Assistive technology designed to transform atypical speech... |
|
Experimental |
| 16 |
bv2518/text-to-speech
🔊 Convert text into MP3 audio files quickly, enhancing content,... |
|
Experimental |
| 17 |
achrafash/voice-studio
An audio labeling studio for ASR tasks (transcription, diarization, VAD, etc) |
|
Experimental |
| 18 |
babula-cpu/stt-server
Provide real-time speech-to-text conversion via WebSocket using pluggable... |
|
Experimental |
| 19 |
Reprompts/pyttsgen
pyttsgen is a lightweight, developer-friendly Python library for generating... |
|
Experimental |
| 20 |
zatomos/Speech-to-text_bot
A Discord bot for voice message transcription |
|
Experimental |
| 21 |
jalalzia1/kokoro-web
Deliver high-quality text-to-speech in the browser with 28 voices, no... |
|
Experimental |
| 22 |
SivaguruArumugam/speech-to-text-app
A real-time speech recognition application that converts spoken audio into... |
|
Experimental |
| 23 |
Vasanth2005kk/VoxLibri
VoxLibri: The Ultimate AI-Powered eBook to Audiobook Converter. 🎧📚 Transform... |
|
Experimental |
| 24 |
damasvasree/AI_Text-to-speech-chatbot
Conversational AI chatbot with speech-to-text and text-to-speech... |
|
Experimental |
| 25 |
c0dest3r/vocalcanvas-studio
🗣️ Transform text, images, and PDFs into expressive speech directly in your... |
|
Experimental |
| 26 |
Surya12nisha/Voxera
🔊 Enable real-time voice, video, and screen sharing with Voxera, a web-based... |
|
Experimental |
| 27 |
tjas/postgrad-ai-nlp2-voice-ui
A Voice User Interface tool for Text-to-Speech and Speech-to-Text, built... |
|
Experimental |
| 28 |
AchrafMoualem/Voice_ID
End-to-end speech intelligence pipeline — CNN speaker identification,... |
|
Experimental |
| 29 |
llegomark/synthesizer
This repository contains the source code for a Text-to-Speech (TTS) web... |
|
Experimental |
| 30 |
sitammeur/kokoro-litserve
Leverage Kokoro's TTS capabilities using LitServe. |
|
Experimental |
| 31 |
Himanshi-2519/Speech-To-Text-API
Capturing the Rhythm of your words. Real-time AI transcription with a... |
|
Experimental |
| 32 |
olympus-terminal/text-to-speech
Text-to-speech tools and utilities |
|
Experimental |
| 33 |
siddhali24/Speech-to-text-generator
This project will turn audio to text using hugging face transformers |
|
Experimental |
| 34 |
moazshorbagy/speak-code
An interface for programming using voice |
|
Experimental |
| 35 |
rennerdo30/murmura
Japanese alphabet learning application with dual support for Hiragana and... |
|
Experimental |
| 36 |
pratham-ak2004/PDF-Reader
python notebook to convert text from pdf and read it aloud |
|
Experimental |
| 37 |
asigalov61/voice4voice
Giving voice to the voiceless and helping voiceless to be heard |
|
Experimental |
| 38 |
zlatnaspirala/artificial-intelligence
Write as you speak and read as it is written. — Vuk Stefanović Karadžić |
|
Experimental |
| 39 |
Tushar-ml/Voice-Based-Speech-Enhancer
VoiceAI™ helps to analyze the sentiment of Voice and text engagement. The... |
|
Experimental |