The Voice AI Directory

Quality-scored directory of 6,983 voice AI tools, updated daily. Every tool is scored on maintenance, adoption, maturity, and community signals.

Voice AI covers text-to-speech synthesis, speech recognition, voice cloning, voice agents, and audio processing.

Verified: 29 tools (score 70–100)
Established: 260 tools (score 50–69)
Emerging: 1,855 tools (score 30–49)
Experimental: 4,839 tools (score 10–29)
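
The score bands listed above assign each tool to exactly one tier. As a minimal sketch, assuming scores are integers and the band limits are inclusive, the lookup could be written as follows. The names `Tier`, `TIERS`, and `tier_for` are hypothetical and are not part of the directory; the listed bands do not cover scores below 10, so the sketch rejects them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    name: str
    low: int    # lowest score in the band (assumed inclusive)
    high: int   # highest score in the band (assumed inclusive)
    count: int  # number of tools the directory lists in this tier


# Bands and counts copied from the tier summary in the directory.
TIERS = [
    Tier("Verified", 70, 100, 29),
    Tier("Established", 50, 69, 260),
    Tier("Emerging", 30, 49, 1855),
    Tier("Experimental", 10, 29, 4839),
]


def tier_for(score: int) -> str:
    """Return the tier name whose band contains the given score."""
    for tier in TIERS:
        if tier.low <= score <= tier.high:
            return tier.name
    raise ValueError(f"score {score} is outside the listed bands (10-100)")


if __name__ == "__main__":
    print(tier_for(96))                 # espnet/espnet is listed with 96
    print(sum(t.count for t in TIERS))  # total tools across all tiers
```

Summing the four tier counts gives 6,983, which matches the total stated in the directory's introduction.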

Top tools by quality score

#. Tool (score): description

1. espnet/espnet (96): End-to-End Speech Processing Toolkit
2. TalAter/annyang (93): 💬 Speech recognition for your site
3. Blaizzy/mlx-audio (93): A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS)...
4. elevenlabs/elevenlabs-python (92): The official Python SDK for the ElevenLabs API.
5. k2-fsa/sherpa-onnx (91): Speech-to-text, text-to-speech, speaker diarization, speech enhancement,...
6. Uberi/speech_recognition (90): Speech recognition module for Python, supporting several engines and APIs,...
7. m-bain/whisperX (90): WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
8. jdepoix/youtube-transcript-api (86): This is a python API which allows you to get the transcript/subtitles for a...
9. DrewThomasson/ebook2audiobook (84): Generate audiobooks from e-books, voice cloning & 1158+ languages!
10. KoljaB/RealtimeTTS (84): Converts text to speech in realtime
11. cmusphinx/pocketsphinx (84): A small speech recognizer
12. PaddlePaddle/PaddleSpeech (82): Easy-to-use Speech Toolkit including Self-Supervised Learning model,...
13. alphacep/vosk-api (81): Offline speech recognition API for Android, iOS, Raspberry Pi and servers...
14. OpenBMB/VoxCPM (81): VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and...
15. pndurette/gTTS (78): Python library and CLI tool to interface with Google Translate's text-to-speech API
16. rany2/edge-tts (76): Use Microsoft Edge's online text-to-speech service from Python WITHOUT...
17. nateshmbhat/pyttsx3 (75): Offline Text To Speech synthesis for python
18. denizsafak/abogen (75): Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
19. gradio-app/fastrtc (75): The python library for real-time communication
20. salute-developers/GigaAM (74): Foundational Model for Speech Recognition Tasks

Browse by category

.NET TTS Libraries

203 tools

General Purpose Voice Assistants

187 tools

Lightweight TTS Libraries

185 tools

Automatic Speech Recognition

161 tools

Web Speech API Libraries

149 tools

Web Speech API TTS

149 tools

Speech-To-Text Converters

147 tools

Android Speech Apps

113 tools

Keyword Speech Recognition

112 tools

End-to-End ASR Frameworks

109 tools

Local Voice Assistants

101 tools

iOS Speech Frameworks

99 tools

Self-Hosted TTS Servers

97 tools

Voice Agent Applications

88 tools

Discord TTS Bots

86 tools

Python Voice Assistants

82 tools

Voice Controlled Robotics

81 tools

Lightweight TTS Runtimes

79 tools

Speech Recognition APIs

78 tools

AI Video Generation

75 tools

Google TTS Libraries

75 tools

FastSpeech TTS Models

74 tools

Kokoro TTS Ecosystem

72 tools

Voice Cloning Tools

71 tools

OpenAI TTS Applications

71 tools

Neural Vocoder Implementations

71 tools

Coqui TTS Applications

70 tools

Tacotron TTS Models

70 tools

Voice Chatbot Applications

67 tools

Kaldi ASR Ecosystem

66 tools

CTC ASR Implementations

65 tools

Java TTS Libraries

65 tools

Voice Command Assistants

65 tools

Qwen3 TTS Applications

64 tools

Browser TTS Extensions

63 tools

Speech Corpora Datasets

63 tools

eBook to Audiobook Conversion

62 tools

Edge TTS Implementations

62 tools

Text To Speech Frameworks

62 tools

Local Voice Dictation

62 tools

Whisper Subtitle Generation

60 tools

AI Tutoring Platforms

59 tools

Go TTS Libraries

59 tools

Content-to-Podcast Converters

58 tools

Voice AI Learning Collections

57 tools

Educational Voice Apps

56 tools

AI Avatar Platforms

53 tools

Speech AI Coursework

53 tools

Voice ChatGPT Interfaces

53 tools

Multimodal Medical Assistants

53 tools

Android Voice Assistants

52 tools

TTS Model Fine-Tuning

52 tools

Assistive Vision AI

50 tools

Telegram Voice Transcription

49 tools

Meeting Transcription Summarizers

49 tools

Voice Controlled Desktop Automation

47 tools

FunASR Speech Recognition

46 tools

Wav2Vec2 ASR Models

46 tools

Speech Emotion Recognition

45 tools

Wake Word Detection

45 tools

Vue Speech Recognition

45 tools

Rust TTS Libraries

45 tools

Audio Transcription Apps

44 tools

eSpeak-NG Ecosystem

43 tools

Speech Translation Apps

43 tools

Zero-Shot Voice Synthesis

43 tools

Deepgram Starter Projects

43 tools

Vosk ASR Implementations

42 tools

Gradio TTS WebUIs

42 tools

Video Dubbing Tools

41 tools

Voice Cloning Synthesis

41 tools

ElevenLabs Integrations

40 tools

Video Transcription Extraction

39 tools

Real-Time Voice Translation

38 tools

Piper TTS Ecosystem

38 tools

Speaker Diarization Embedding

37 tools

Whisper Transcription Apps

36 tools

AWS Polly TTS

36 tools

Twitch Chat TTS

35 tools

Sign Language Translation

34 tools

AI-Powered eReaders

33 tools

React Native Voice Libraries

33 tools

TTS Dataset Creation

33 tools

VITS TTS Implementations

32 tools

PDF to Audio Conversion

32 tools

Audio Transcription Tools

31 tools

React Speech Recognition

31 tools

Voice Dictation Typing

30 tools

Parakeet ASR Implementations

30 tools

System TTS Wrappers

30 tools

SMS Voice Integrations

29 tools

Cross-Platform TTS Frameworks

29 tools

Conformer ASR Implementations

28 tools

Voice Enabled Coding Assistants

28 tools

Text To Speech Conversion

27 tools

Whisper Framework Ports

27 tools

Whisper Fine-Tuning

27 tools

Live Caption Generation

27 tools

ASR Evaluation Metrics

26 tools

ComfyUI TTS Nodes

26 tools

PHP TTS Libraries

26 tools

Audio Noise Reduction

25 tools

Live Meeting Translation

25 tools

Whisper Diarization

24 tools

Grapheme-to-Phoneme Conversion

24 tools

Rust Speech Recognition

24 tools

Streamlit TTS Apps

24 tools

Embedded TTS Systems

22 tools

Interactive AI Avatars

22 tools

Anki TTS Integration

22 tools

OpenClaw Voice Assistants

21 tools

Voice AI SDKs

21 tools

Yandex SpeechKit Tools

21 tools

Audio Source Separation

20 tools

News Audio Bulletins

19 tools

Web-Based TTS Apps

19 tools

AI Interview Simulators

19 tools

Image-to-Speech Synthesis

19 tools

Text Normalization Engines

17 tools

Home Assistant TTS

17 tools

Face Recognition Systems

17 tools

IBM Watson Speech

15 tools

Government Procurement Docs

15 tools

Clipboard Text-to-Speech

15 tools

Ukrainian Voice AI

13 tools

Text To Speech TTS

12 tools

Whisper Speech Transcription

12 tools

Voice Assistant Devices

12 tools

Persian Speech AI

12 tools

Audio Music Learning

11 tools

Multilingual Speech Datasets

11 tools

Speech To Text Transcription

10 tools

Voice Assistant Applications

9 tools

Voice AI Assistants

8 tools

Voice Controlled Calculators

8 tools

STT

8 tools

AI Podcast Generation

7 tools

Conversational Chatbot Applications

7 tools

Lip Reading Synthesis

7 tools

Sign Language Recognition

7 tools

Voice Interactive Games

7 tools

Text To Speech

6 tools

Voice Assistant Frameworks

6 tools

Multimodal Vision Language

6 tools

Voice Assistant Projects

5 tools

TTS

5 tools

Wav2Vec2 Speech Recognition

4 tools

Data Annotation Tools

4 tools

Speech Synthesis Diffusion

4 tools

Text To Video Generation

4 tools

Virtual Assistants NLP

4 tools

Bioacoustic Species Classification

4 tools

Audio Event Classification

4 tools

Voice AI Agents

3 tools

Speech Recognition Datasets

3 tools

Unity ML Inference

3 tools

Deepfake Detection Systems

3 tools

Personal Assistant RAG

3 tools

Conversational RAG Agents

3 tools

Image Caption Generation

3 tools

Facial Attribute Classification

3 tools

Joke Telling Apps

3 tools

Text To Speech MCP

2 tools

LLM Scaling Architecture

2 tools

ComfyUI Extensions

2 tools

Flutter AI Chat Apps

2 tools

Multi Modal AI Assistants

2 tools

AI Virtual Companions

2 tools

Machine Translation Systems

2 tools

AI Chatbot Interfaces

2 tools

Next Word Prediction

2 tools

Audio Classification Transformers

2 tools

AI Image Generation Platforms

2 tools

Natural Language Task Scheduling

2 tools

Text Translation Tools

2 tools

AI Workflow Automation

1 tool

AI Assistant Platforms

1 tool

Text Embedding Runtimes

1 tool

MediaPipe Implementations

1 tool

Discord AI Chatbots

1 tool

Vision Language Models

1 tool

Indic Language Translation

1 tool

Neural Machine Translation

1 tool

GPT Implementation Tutorials

1 tool

Gemini Prompt Workbenches

1 tool

Speculative Decoding Algorithms

1 tool

Text Scanning OCR

1 tool

Text Emotion Recognition

1 tool

Multi Agent Orchestration

1 tool

LLM Inference Serving

1 tool

Vibe Coding Frameworks

1 tool

Vietnamese NLP Tools

1 tool

Respiratory Disease Detection

1 tool

AI Terminal Agents

1 tool

AI Note Taking Apps

1 tool

Document QA Chatbots

1 tool

AI Children Storytelling

1 tool

NLP Task Libraries

1 tool

LLM Fine Tuning

1 tool

Chatbot Frameworks

1 tool

Talking Head Generation

1 tool

Gemini API Applications

1 tool

LLM Docker Deployments

1 tool

Stress Detection ML

1 tool

NLP Dataset Collections

1 tool

Fullstack AI Assistants

1 tool

Graph Database RAG

1 tool

Video Content Intelligence

1 tool

Temporal Expression Parsing

1 tool

Health App Development

1 tool

CLIP Vision Language

1 tool

AI Interview Coaching

1 tool

Hand Gesture Control

1 tool

ML Benchmarking Frameworks

1 tool

Viral Clip Generation

1 tool

Model Compression Optimization

1 tool

Edge Camera ML

1 tool

OCR Document Extraction

1 tool

Go ML Bindings

1 tool

Reading Comprehension QA

1 tool

Tokenization Libraries

1 tool

LLM Translation Tools

1 tool

AI Skill Integrations

1 tool

Facial Recognition Apps

1 tool

Federated Learning Frameworks

1 tool

Personal Knowledge Management

1 tool

Flashcard Generation

1 tool

Streamlit Chatbot Apps

1 tool

ML Learning Resources

1 tool

LLM SDK Packages

1 tool

Semantic Kernel Tools

1 tool

Embedding Model Tuning

1 tool

LLM Learning Resources

1 tool

Chatbot NLP Frameworks

1 tool

Telegram LLM Bots

1 tool

NLU Game Applications

1 tool

Diffusion Model Frameworks

1 tool

Image Classification Demos

1 tool