Text-to-Speech Conversion ML Frameworks

Tools and applications that convert written text into spoken audio output. Includes voice cloning, multi-language support, and audiobook generation. Does NOT include speech recognition, voice user interfaces, or general audio processing frameworks.

There are 39 text-to-speech conversion frameworks tracked. The highest-rated is asmith26/speech2caret at 37/100 with 3 stars and 634 monthly downloads.

Get all 39 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=text-to-speech-conversion&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 asmith26/speech2caret

Use your speech to write to the current caret position!

37
Emerging
2 pranayjoshi/Speech_recognition

Indo:- A mini Speech Recognizer

28
Experimental
3 DominicTWHV/LJSpeech_Dataset_Generator

LJSpeech dataset generator for TTS model training/fine tuning

25
Experimental
4 kristofferv98/openai-voicestream

OpenAI VoiceStream: Real-time text-to-speech library for processing text and...

23
Experimental
5 helanzhiyi/audio-annotation-platform

🎤 Manage audio transcription workflows efficiently with this open-source...

22
Experimental
6 Dineshkumar-Ponnusamy/maya-voice-ai

Maya Voice AI is an open-source project that demonstrates the Maya1 model,...

22
Experimental
7 lexust1/av2txtsum

Automatic speech recognition (ASR)

22
Experimental
8 ombharatiya/Speech-To-Text-Android

It is based on Google STT API. The app simply takes your audio as input...

21
Experimental
9 Chaitanya31612/SIH_VAKPRATYABHIJNA

Developed Sanskrit voice to text and interface in Google Search Engine...

19
Experimental
10 idsudd/tricahue

🦜 Tricahue: modelo de transcripción de voz especializado en español chileno

18
Experimental
11 hsahovic/Speech-to-maths

Speech-recognition for Latex generation

18
Experimental
12 Curovearth/Pi_Giving_Voice_to_Voiceless

Software and Hardware prototype for giving voice to those who are unable to...

18
Experimental
13 rottter4585/Llasa-GRPO

🎤 Fine-tune the Llasa TTS model with GRPO using Hugging Face tools to...

16
Experimental
14 srume/LLM-powered-TextToSpeech

In this repository, I have built different version of Text to speech models...

15
Experimental
15 nhcarrigan-mentorship/matanat-khalilova

VoiceBridge: Assistive technology designed to transform atypical speech...

15
Experimental
16 bv2518/text-to-speech

🔊 Convert text into MP3 audio files quickly, enhancing content,...

15
Experimental
17 achrafash/voice-studio

An audio labeling studio for ASR tasks (transcription, diarization, VAD, etc)

14
Experimental
18 babula-cpu/stt-server

Provide real-time speech-to-text conversion via WebSocket using pluggable...

14
Experimental
19 Reprompts/pyttsgen

pyttsgen is a lightweight, developer-friendly Python library for generating...

14
Experimental
20 zatomos/Speech-to-text_bot

A Discord bot for voice message transcription

14
Experimental
21 jalalzia1/kokoro-web

Deliver high-quality text-to-speech in the browser with 28 voices, no...

14
Experimental
22 SivaguruArumugam/speech-to-text-app

A real-time speech recognition application that converts spoken audio into...

14
Experimental
23 Vasanth2005kk/VoxLibri

VoxLibri: The Ultimate AI-Powered eBook to Audiobook Converter. 🎧📚 Transform...

14
Experimental
24 damasvasree/AI_Text-to-speech-chatbot

Conversational AI chatbot with speech-to-text and text-to-speech...

14
Experimental
25 c0dest3r/vocalcanvas-studio

🗣️ Transform text, images, and PDFs into expressive speech directly in your...

14
Experimental
26 Surya12nisha/Voxera

🔊 Enable real-time voice, video, and screen sharing with Voxera, a web-based...

14
Experimental
27 tjas/postgrad-ai-nlp2-voice-ui

A Voice User Interface tool for Text-to-Speech and Speech-to-Text, built...

13
Experimental
28 AchrafMoualem/Voice_ID

End-to-end speech intelligence pipeline — CNN speaker identification,...

13
Experimental
29 llegomark/synthesizer

This repository contains the source code for a Text-to-Speech (TTS) web...

12
Experimental
30 sitammeur/kokoro-litserve

Leverage Kokoro's TTS capabilities using LitServe.

12
Experimental
31 Himanshi-2519/Speech-To-Text-API

Capturing the Rhythm of your words. Real-time AI transcription with a...

11
Experimental
32 olympus-terminal/text-to-speech

Text-to-speech tools and utilities

11
Experimental
33 siddhali24/Speech-to-text-generator

This project will turn audio to text using hugging face transformers

11
Experimental
34 moazshorbagy/speak-code

An interface for programming using voice

11
Experimental
35 rennerdo30/murmura

Japanese alphabet learning application with dual support for Hiragana and...

11
Experimental
36 pratham-ak2004/PDF-Reader

python notebook to convert text from pdf and read it aloud

10
Experimental
37 asigalov61/voice4voice

Giving voice to the voiceless and helping voiceless to be heard

10
Experimental
38 zlatnaspirala/artificial-intelligence

Write as you speak and read as it is written. — Vuk Stefanović Karadžić

10
Experimental
39 Tushar-ml/Voice-Based-Speech-Enhancer

VoiceAI™ helps to analyze the sentiment of Voice and text engagement. The...

10
Experimental