The Voice AI Directory
Quality-scored directory of 6,983 voice ai tools, updated daily. Every tool scored on maintenance, adoption, maturity, and community signals.
Voice AI covers text-to-speech synthesis, speech recognition, voice cloning, voice agents, and audio processing.
29
70–100
260
50–69
1,855
30–49
4,839
10–29
Top tools by quality score
| # | Tool | Score |
|---|---|---|
| 1 |
espnet/espnet
End-to-End Speech Processing Toolkit |
|
| 2 |
TalAter/annyang
💬 Speech recognition for your site |
|
| 3 |
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS)... |
|
| 4 |
elevenlabs/elevenlabs-python
The official Python SDK for the ElevenLabs API. |
|
| 5 |
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement,... |
|
| 6 |
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs,... |
|
| 7 |
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) |
|
| 8 |
jdepoix/youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a... |
|
| 9 |
DrewThomasson/ebook2audiobook
Generate audiobooks from e-books, voice cloning & 1158+ languages! |
|
| 10 |
KoljaB/RealtimeTTS
Converts text to speech in realtime |
|
| 11 |
cmusphinx/pocketsphinx
A small speech recognizer |
|
| 12 |
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model,... |
|
| 13 |
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers... |
|
| 14 |
OpenBMB/VoxCPM
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and... |
|
| 15 |
pndurette/gTTS
Python library and CLI tool to interface with Google Translate's text-to-speech API |
|
| 16 |
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT... |
|
| 17 |
nateshmbhat/pyttsx3
Offline Text To Speech synthesis for python |
|
| 18 |
denizsafak/abogen
Generate audiobooks from EPUBs, PDFs and text with synchronized captions. |
|
| 19 |
gradio-app/fastrtc
The python library for real-time communication |
|
| 20 |
salute-developers/GigaAM
Foundational Model for Speech Recognition Tasks |
|
Browse by category
.NET TTS Libraries
203 tools
General Purpose Voice Assistants
187 tools
Lightweight TTS Libraries
185 tools
Automatic Speech Recognition
161 tools
Web Speech API Libraries
149 tools
Web Speech API TTS
149 tools
Speech-To-Text Converters
147 tools
Android Speech Apps
113 tools
Keyword Speech Recognition
112 tools
End-to-End ASR Frameworks
109 tools
Local Voice Assistants
101 tools
iOS Speech Frameworks
99 tools
Self-Hosted TTS Servers
97 tools
Voice Agent Applications
88 tools
Discord TTS Bots
86 tools
Python Voice Assistants
82 tools
Voice Controlled Robotics
81 tools
Lightweight TTS Runtimes
79 tools
Speech Recognition APIs
78 tools
AI Video Generation
75 tools
Google TTS Libraries
75 tools
FastSpeech TTS Models
74 tools
Kokoro TTS Ecosystem
72 tools
Voice Cloning Tools
71 tools
OpenAI TTS Applications
71 tools
Neural Vocoder Implementations
71 tools
Coqui TTS Applications
70 tools
Tacotron TTS Models
70 tools
Voice Chatbot Applications
67 tools
Kaldi ASR Ecosystem
66 tools
CTC ASR Implementations
65 tools
Java TTS Libraries
65 tools
Voice Command Assistants
65 tools
Qwen3 TTS Applications
64 tools
Browser TTS Extensions
63 tools
Speech Corpora Datasets
63 tools
eBook to Audiobook Conversion
62 tools
Edge TTS Implementations
62 tools
Text To Speech Frameworks
62 tools
Local Voice Dictation
62 tools
Whisper Subtitle Generation
60 tools
AI Tutoring Platforms
59 tools
Go TTS Libraries
59 tools
Content-to-Podcast Converters
58 tools
Voice AI Learning Collections
57 tools
Educational Voice Apps
56 tools
AI Avatar Platforms
53 tools
Speech AI Coursework
53 tools
Voice ChatGPT Interfaces
53 tools
Multimodal Medical Assistants
53 tools
Android Voice Assistants
52 tools
TTS Model Fine-Tuning
52 tools
Assistive Vision AI
50 tools
Telegram Voice Transcription
49 tools
Meeting Transcription Summarizers
49 tools
Voice Controlled Desktop Automation
47 tools
FunASR Speech Recognition
46 tools
Wav2Vec2 ASR Models
46 tools
Speech Emotion Recognition
45 tools
Wake Word Detection
45 tools
Vue Speech Recognition
45 tools
Rust TTS Libraries
45 tools
Audio Transcription Apps
44 tools
eSpeak-NG Ecosystem
43 tools
Speech Translation Apps
43 tools
Zero-Shot Voice Synthesis
43 tools
Deepgram Starter Projects
43 tools
Vosk ASR Implementations
42 tools
Gradio TTS WebUIs
42 tools
Video Dubbing Tools
41 tools
Voice Cloning Synthesis
41 tools
ElevenLabs Integrations
40 tools
Video Transcription Extraction
39 tools
Real-Time Voice Translation
38 tools
Piper TTS Ecosystem
38 tools
Speaker Diarization Embedding
37 tools
Whisper Transcription Apps
36 tools
AWS Polly TTS
36 tools
Twitch Chat TTS
35 tools
Sign Language Translation
34 tools
AI-Powered eReaders
33 tools
React Native Voice Libraries
33 tools
TTS Dataset Creation
33 tools
VITS TTS Implementations
32 tools
PDF to Audio Conversion
32 tools
Audio Transcription Tools
31 tools
React Speech Recognition
31 tools
Voice Dictation Typing
30 tools
Parakeet ASR Implementations
30 tools
System TTS Wrappers
30 tools
SMS Voice Integrations
29 tools
Cross-Platform TTS Frameworks
29 tools
Conformer ASR Implementations
28 tools
Voice Enabled Coding Assistants
28 tools
Text To Speech Conversion
27 tools
Whisper Framework Ports
27 tools
Whisper Fine-Tuning
27 tools
Live Caption Generation
27 tools
ASR Evaluation Metrics
26 tools
ComfyUI TTS Nodes
26 tools
PHP TTS Libraries
26 tools
Audio Noise Reduction
25 tools
Live Meeting Translation
25 tools
Whisper Diarization
24 tools
Grapheme-to-Phoneme Conversion
24 tools
Rust Speech Recognition
24 tools
Streamlit TTS Apps
24 tools
Embedded TTS Systems
22 tools
Interactive AI Avatars
22 tools
Anki TTS Integration
22 tools
OpenClaw Voice Assistants
21 tools
Voice AI SDKs
21 tools
Yandex SpeechKit Tools
21 tools
Audio Source Separation
20 tools
News Audio Bulletins
19 tools
Web-Based TTS Apps
19 tools
AI Interview Simulators
19 tools
Image-to-Speech Synthesis
19 tools
Text Normalization Engines
17 tools
Home Assistant TTS
17 tools
Face Recognition Systems
17 tools
IBM Watson Speech
15 tools
Government Procurement Docs
15 tools
Clipboard Text-to-Speech
15 tools
Ukrainian Voice AI
13 tools
Text To Speech Tts
12 tools
Whisper Speech Transcription
12 tools
Voice Assistant Devices
12 tools
Persian Speech AI
12 tools
Audio Music Learning
11 tools
Multilingual Speech Datasets
11 tools
Speech To Text Transcription
10 tools
Voice Assistant Applications
9 tools
Voice Ai Assistants
8 tools
Voice Controlled Calculators
8 tools
Stt
8 tools
Ai Podcast Generation
7 tools
Conversational Chatbot Applications
7 tools
Lip Reading Synthesis
7 tools
Sign Language Recognition
7 tools
Voice Interactive Games
7 tools
Text To Speech
6 tools
Voice Assistant Frameworks
6 tools
Multimodal Vision Language
6 tools
Voice Assistant Projects
5 tools
Tts
5 tools
Wav2Vec2 Speech Recognition
4 tools
Data Annotation Tools
4 tools
Speech Synthesis Diffusion
4 tools
Text To Video Generation
4 tools
Virtual Assistants Nlp
4 tools
Bioacoustic Species Classification
4 tools
Audio Event Classification
4 tools
Voice Ai Agents
3 tools
Speech Recognition Datasets
3 tools
Unity Ml Inference
3 tools
Deepfake Detection Systems
3 tools
Personal Assistant Rag
3 tools
Conversational Rag Agents
3 tools
Image Caption Generation
3 tools
Facial Attribute Classification
3 tools
Joke Telling Apps
3 tools
Text To Speech Mcp
2 tools
Llm Scaling Architecture
2 tools
Comfyui Extensions
2 tools
Flutter Ai Chat Apps
2 tools
Multi Modal Ai Assistants
2 tools
Ai Virtual Companions
2 tools
Machine Translation Systems
2 tools
Ai Chatbot Interfaces
2 tools
Next Word Prediction
2 tools
Audio Classification Transformers
2 tools
Ai Image Generation Platforms
2 tools
Natural Language Task Scheduling
2 tools
Text Translation Tools
2 tools
Ai Workflow Automation
1 tools
Ai Assistant Platforms
1 tools
Text Embedding Runtimes
1 tools
Mediapipe Implementations
1 tools
Discord Ai Chatbots
1 tools
Vision Language Models
1 tools
Indic Language Translation
1 tools
Neural Machine Translation
1 tools
Gpt Implementation Tutorials
1 tools
Gemini Prompt Workbenches
1 tools
Speculative Decoding Algorithms
1 tools
Text Scanning Ocr
1 tools
Text Emotion Recognition
1 tools
Multi Agent Orchestration
1 tools
Llm Inference Serving
1 tools
Vibe Coding Frameworks
1 tools
Vietnamese Nlp Tools
1 tools
Respiratory Disease Detection
1 tools
Ai Terminal Agents
1 tools
Ai Note Taking Apps
1 tools
Document Qa Chatbots
1 tools
Ai Children Storytelling
1 tools
Nlp Task Libraries
1 tools
Llm Fine Tuning
1 tools
Chatbot Frameworks
1 tools
Talking Head Generation
1 tools
Gemini Api Applications
1 tools
Llm Docker Deployments
1 tools
Stress Detection Ml
1 tools
Nlp Dataset Collections
1 tools
Fullstack Ai Assistants
1 tools
Graph Database Rag
1 tools
Video Content Intelligence
1 tools
Temporal Expression Parsing
1 tools
Health App Development
1 tools
Clip Vision Language
1 tools
Ai Interview Coaching
1 tools
Hand Gesture Control
1 tools
Ml Benchmarking Frameworks
1 tools
Viral Clip Generation
1 tools
Model Compression Optimization
1 tools
Edge Camera Ml
1 tools
Ocr Document Extraction
1 tools
Go Ml Bindings
1 tools
Reading Comprehension Qa
1 tools
Tokenization Libraries
1 tools
Llm Translation Tools
1 tools
Ai Skill Integrations
1 tools
Facial Recognition Apps
1 tools
Federated Learning Frameworks
1 tools
Personal Knowledge Management
1 tools
Flashcard Generation
1 tools
Streamlit Chatbot Apps
1 tools
Ml Learning Resources
1 tools
Llm Sdk Packages
1 tools
Semantic Kernel Tools
1 tools
Embedding Model Tuning
1 tools
Llm Learning Resources
1 tools
Chatbot Nlp Frameworks
1 tools
Telegram Llm Bots
1 tools
Nlu Game Applications
1 tools
Diffusion Model Frameworks
1 tools
Image Classification Demos
1 tools