Text-to-Speech Conversion ML Frameworks

Tools and applications that convert written text into spoken audio output. Includes voice cloning, multi-language support, and audiobook generation. Does NOT include speech recognition, voice user interfaces, or general audio processing frameworks.

There are 39 text-to-speech conversion frameworks tracked. The highest-rated is asmith26/speech2caret at 37/100 with 3 stars and 634 monthly downloads.

Get the projects as JSON (the sample request sets limit=20, fewer than the 39 tracked)

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=text-to-speech-conversion&limit=20"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
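
The same request can be made from a script. The sketch below uses only the Python standard library and rebuilds the URL from the curl example. The helper names build_url and fetch_projects are hypothetical, and the shape of the returned JSON is not documented on this page, so the code decodes the response without assuming any field names.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Endpoint and query parameters are copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="text-to-speech-conversion",
              limit=20):
    """Build the dataset URL (hypothetical helper, not part of any SDK)."""
    query = urlencode({"domain": domain,
                       "subcategory": subcategory,
                       "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Send a GET request and decode the JSON body.

    The response schema is an unknown here, so the decoded object is
    returned as-is for the caller to inspect.
    """
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


# Example usage (performs a network request and counts against the daily quota):
# data = fetch_projects(build_url())
# print(json.dumps(data, indent=2)[:1000])
```

Keeping the URL construction separate from the network call makes it easy to change limit or subcategory without touching the request logic, and to check the URL without spending any of the daily request allowance.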

#	Framework	Score (/100)	Tier	Stars	Language
1	asmith26/speech2caret Use your speech to write to the current caret position!	37	Emerging	3	Python
2	pranayjoshi/Speech_recognition Indo:- A mini Speech Recognizer	28	Experimental	8	Python
3	DominicTWHV/LJSpeech_Dataset_Generator LJSpeech dataset generator for TTS model training/fine tuning	25	Experimental	4	Python
4	kristofferv98/openai-voicestream OpenAI VoiceStream: Real-time text-to-speech library for processing text and...	23	Experimental	7	Python
5	helanzhiyi/audio-annotation-platform 🎤 Manage audio transcription workflows efficiently with this open-source...	22	Experimental	—	Python
6	Dineshkumar-Ponnusamy/maya-voice-ai Maya Voice AI is an open-source project that demonstrates the Maya1 model,...	22	Experimental	3	Python
7	lexust1/av2txtsum Automatic speech recognition (ASR)	22	Experimental	1	HTML
8	ombharatiya/Speech-To-Text-Android It is based on Google STT API. The app simply takes your audio as input...	21	Experimental	8	Java
9	Chaitanya31612/SIH_VAKPRATYABHIJNA Developed Sanskrit voice to text and interface in Google Search Engine...	19	Experimental	5	—
10	idsudd/tricahue 🦜 Tricahue: modelo de transcripción de voz especializado en español chileno	18	Experimental	4	—
11	hsahovic/Speech-to-maths Speech-recognition for Latex generation	18	Experimental	6	CSS
12	Curovearth/Pi_Giving_Voice_to_Voiceless Software and Hardware prototype for giving voice to those who are unable to...	18	Experimental	3	JavaScript
13	rottter4585/Llasa-GRPO 🎤 Fine-tune the Llasa TTS model with GRPO using Hugging Face tools to...	16	Experimental	2	Python
14	srume/LLM-powered-TextToSpeech In this repository, I have built different version of Text to speech models...	15	Experimental	—	Python
15	nhcarrigan-mentorship/matanat-khalilova VoiceBridge: Assistive technology designed to transform atypical speech...	15	Experimental	1	JavaScript
16	bv2518/text-to-speech 🔊 Convert text into MP3 audio files quickly, enhancing content,...	15	Experimental	1	—
17	achrafash/voice-studio An audio labeling studio for ASR tasks (transcription, diarization, VAD, etc)	14	Experimental	4	TypeScript
18	babula-cpu/stt-server Provide real-time speech-to-text conversion via WebSocket using pluggable...	14	Experimental	—	HTML
19	Reprompts/pyttsgen pyttsgen is a lightweight, developer-friendly Python library for generating...	14	Experimental	3	Python
20	zatomos/Speech-to-text_bot A Discord bot for voice message transcription	14	Experimental	1	Python
21	jalalzia1/kokoro-web Deliver high-quality text-to-speech in the browser with 28 voices, no...	14	Experimental	—	—
22	SivaguruArumugam/speech-to-text-app A real-time speech recognition application that converts spoken audio into...	14	Experimental	—	Python
23	Vasanth2005kk/VoxLibri VoxLibri: The Ultimate AI-Powered eBook to Audiobook Converter. 🎧📚 Transform...	14	Experimental	—	Python
24	damasvasree/AI_Text-to-speech-chatbot Conversational AI chatbot with speech-to-text and text-to-speech...	14	Experimental	—	JavaScript
25	c0dest3r/vocalcanvas-studio 🗣️ Transform text, images, and PDFs into expressive speech directly in your...	14	Experimental	—	JavaScript
26	Surya12nisha/Voxera 🔊 Enable real-time voice, video, and screen sharing with Voxera, a web-based...	14	Experimental	—	JavaScript
27	tjas/postgrad-ai-nlp2-voice-ui A Voice User Interface tool for Text-to-Speech and Speech-to-Text, built...	13	Experimental	5	JavaScript
28	AchrafMoualem/Voice_ID End-to-end speech intelligence pipeline — CNN speaker identification,...	13	Experimental	—	HTML
29	llegomark/synthesizer This repository contains the source code for a Text-to-Speech (TTS) web...	12	Experimental	4	Python
30	sitammeur/kokoro-litserve Leverage Kokoro's TTS capabilities using LitServe.	12	Experimental	—	Python
31	Himanshi-2519/Speech-To-Text-API Capturing the Rhythm of your words. Real-time AI transcription with a...	11	Experimental	—	JavaScript
32	olympus-terminal/text-to-speech Text-to-speech tools and utilities	11	Experimental	—	Python
33	siddhali24/Speech-to-text-generator This project will turn audio to text using hugging face transformers	11	Experimental	—	Jupyter Notebook
34	moazshorbagy/speak-code An interface for programming using voice	11	Experimental	—	JavaScript
35	rennerdo30/murmura Japanese alphabet learning application with dual support for Hiragana and...	11	Experimental	—	TypeScript
36	pratham-ak2004/PDF-Reader python notebook to convert text from pdf and read it aloud	10	Experimental	1	Jupyter Notebook
37	asigalov61/voice4voice Giving voice to the voiceless and helping voiceless to be heard	10	Experimental	1	—
38	zlatnaspirala/artificial-intelligence Write as you speak and read as it is written. — Vuk Stefanović Karadžić	10	Experimental	1	JavaScript
39	Tushar-ml/Voice-Based-Speech-Enhancer VoiceAI™ helps to analyze the sentiment of Voice and text engagement. The...	10	Experimental	1	CSS