Text-to-Speech (TTS) Transformer Models

Tools for converting written text into spoken audio using transformer models and neural vocoding. Includes TTS engines, voice synthesis systems, and voice cloning capabilities. Does NOT include speech recognition, speech-to-text, audio classification, or general audio processing without text input.

35 text-to-speech (TTS) models are tracked. One scores above 50 (Established tier). The highest-rated is OpenVoiceOS/ovos-audio-transformer-plugin-ggwave at 52/100, with 2 stars and 3,682 monthly downloads.

Get the projects as JSON (the request below uses limit=20, so it may not return all 35)

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=text-to-speech-tts&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
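The same request can be made from a script. The following is a minimal Python sketch using only the standard library, not an official client. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the query parameters are copied from the curl example, and the response schema is not documented here, so the parsed JSON is returned as-is without assuming any field names. How an API key would be passed is also not stated, so the sketch sends none.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(domain="transformers",
                      subcategory="text-to-speech-tts",
                      limit=20):
    """Build the request URL; defaults mirror the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_quality(url, timeout=30):
    """Perform the GET request and return the decoded JSON payload.

    The payload structure is an unknown here, so nothing is extracted
    from it; callers should inspect the result before relying on it.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality(build_quality_url())` issues the same request as the curl command. Whether a larger `limit` is accepted is not stated, so treat any value other than 20 as untested.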

| # | Model | Description | Score | Tier |
|---|-------|-------------|-------|------|
| 1 | OpenVoiceOS/ovos-audio-transformer-plugin-ggwave | data over sound plugin | 52 | Established |
| 2 | edwko/OuteTTS | Interface for OuteTTS models. | 48 | Emerging |
| 3 | fluxions-ai/vui | 100M parameter lightweight conversational text-to-speech model with breaths,... | 46 | Emerging |
| 4 | Aratako/T5Gemma-TTS | Multilingual TTS model with voice cloning and duration control, based on... | 40 | Emerging |
| 5 | inboxpraveen/LLM-Minutes-of-Meeting | 🎤📄 An innovative tool that transforms audio or video files into text... | 39 | Emerging |
| 6 | mbzuai-oryx/LLMVoX | LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM | 39 | Emerging |
| 7 | maciekt07/Lecture-Note-Generator-POC | 📒 A proof-of-concept app that transcribes lecture recordings into text and... | 26 | Experimental |
| 8 | skjp/spout | Workspace Repo for Synergistic Plugins Optimizing Usability of Transformers (Spout) | 25 | Experimental |
| 9 | tahaabbas/dictator | Dictator – Supercharge Cursor Chat with voice-to-text, custom AI prompts,... | 24 | Experimental |
| 10 | OpenVoiceOS/ovos-audio-transformer-plugin-speechbrain-langdetect | speech language detection plugin | 24 | Experimental |
| 11 | mwasifanwar/VoiceClone-Pro | Advanced voice cloning and speech synthesis system that can mimic any voice... | 22 | Experimental |
| 12 | jaden3289/llasa-tts-8b-webui | 🎙️ Generate high-quality speech from text with Llasa-TTS-8B, featuring... | 22 | Experimental |
| 13 | arifulislamat/local-voice-cloning-app | Powered by ChatterboxTTS \| Transformer \| Llama \| Gradio | 22 | Experimental |
| 14 | fbotathome/butia_speech | This package provides some tools to make the robot DoRIS speak and listen.... | 20 | Experimental |
| 15 | AzkaQadir/MeetMind | AI-powered meeting intelligence system: upload any recording or transcript... | 19 | Experimental |
| 16 | neosun100/llasa-tts-8b-webui | 🎙️ High-quality Text-to-Speech system based on Llasa-8B with intelligent GPU... | 16 | Experimental |
| 17 | eray-yuztyurk/python-ai-text-to-speech | Multilingual text-to-speech and text summarization toolkit using... | 16 | Experimental |
| 18 | sreenathyadavk/AI-Meeting-Tracker | 🎙️ Self-hosted AI-powered meeting transcription and task extraction using... | 16 | Experimental |
| 19 | Swap98-Coder/mlx-audio | A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS)... | 15 | Experimental |
| 20 | PRITHIVSAKTHIUR/Orpheus-TTS-Edge | Play with Orpheus TTS, a Llama-based Speech-LLM designed for high-quality,... | 14 | Experimental |
| 21 | ItxMatti/tts | 🗣️ Deploy high-quality text-to-speech services with Gemini, OpenAI, and... | 14 | Experimental |
| 22 | yamanobora/Android-Offline-Meeting-Recorder | Android app for offline speech recognition and AI meeting summarization... | 14 | Experimental |
| 23 | fikriaf/EncoAI | 🤖 Enco AI, Based on Neural Network (PyTorch), Can Listen, Understand,... | 12 | Experimental |
| 24 | fahimakhalifa/ai-notes-api | Authenticated Notes API with Hugging Face summarization, sentiment analysis,... | 12 | Experimental |
| 25 | rk-vashista/TTS-Story_Generator | A versatile app that converts images into short stories and lifelike audio... | 11 | Experimental |
| 26 | vantix-code/VoiceSnap | AI-powered voice memo app that transforms recordings into bullet points,... | 11 | Experimental |
| 27 | nurgalive/nurgavoice | AI Transcription & Summarization Service built with open-source models. | 11 | Experimental |
| 28 | metacore-stack/Voice-to-Insights | Enterprise AI platform that transforms audio meetings into structured... | 11 | Experimental |
| 29 | thewh1teagle/sheen | LLM based TTS using Qwen and the SNAC audio codec | 11 | Experimental |
| 30 | metacore-stack/AuraVoice | Production-grade on-device AI meeting assistant featuring real-time... | 11 | Experimental |
| 31 | Afhrodite/Audio-LLM-Playground | A collection of audio transcription and summarization tools developed during... | 11 | Experimental |
| 32 | oscargullberg/tldwol | Web API that summarizes multimedia from various sources using modern AI tools. | 11 | Experimental |
| 33 | vijay0320/meeting-notes-cleaner | NLP pipeline fine-tuning flan-t5-small on meeting transcripts. 99.7%... | 11 | Experimental |
| 34 | IshaanLabs/Text-to-Speech-TTS | Open Source Text-to-Speech (TTS) repository | 11 | Experimental |
| 35 | arafat2020/cut_py | An R&D project to cut the best part from a video using AI and ffmpeg. | 10 | Experimental |
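As a quick cross-check of the summary figures, the scores and tiers in the listing above can be tallied. This is only an illustrative sketch: the `(score, tier)` pairs are transcribed by hand from the listing, and `summarize` is a hypothetical helper, not part of any API. The tier cut-offs themselves are not stated on this page, so the code counts the tier labels as given rather than deriving them from scores.

```python
from collections import Counter

# (score, tier) pairs transcribed from the listing, in rank order.
ROWS = (
    [(52, "Established")]
    + [(s, "Emerging") for s in (48, 46, 40, 39, 39)]
    + [(s, "Experimental") for s in (26, 25, 24, 24, 22, 22, 22, 20, 19,
                                     16, 16, 16, 15, 14, 14, 14, 12, 12)]
    + [(11, "Experimental")] * 10
    + [(10, "Experimental")]
)


def summarize(rows):
    """Count entries per tier and how many scores exceed 50."""
    tiers = Counter(tier for _, tier in rows)
    above_50 = sum(1 for score, _ in rows if score > 50)
    return {"total": len(rows), "above_50": above_50, "tiers": dict(tiers)}


print(summarize(ROWS))
# {'total': 35, 'above_50': 1, 'tiers': {'Established': 1, 'Emerging': 5, 'Experimental': 29}}
```

The tally agrees with the summary: 35 entries, one of which scores above 50.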