All Voice AI Tools

6,983 tools ranked by quality score

Showing 1–100 of 6,983

#	Tool	Score	Tier	Category	Stars	Language
1	espnet/espnet End-to-End Speech Processing Toolkit	96	Verified	speaker-diarization-embedding	9,768	Python
2	TalAter/annyang 💬 Speech recognition for your site	93	Verified	web-speech-api-libraries	6,667	TypeScript
3	Blaizzy/mlx-audio A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS)...	93	Verified	text-to-speech-tts	6,227	Python
4	elevenlabs/elevenlabs-python The official Python SDK for the ElevenLabs API.	92	Verified	ai-workflow-automation	2,887	Python
5	k2-fsa/sherpa-onnx Speech-to-text, text-to-speech, speaker diarization, speech enhancement,...	91	Verified	vosk-asr-implementations	10,885	C++
6	Uberi/speech_recognition Speech recognition module for Python, supporting several engines and APIs,...	90	Verified	automatic-speech-recognition	8,959	Python
7	m-bain/whisperX WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)	90	Verified	whisper-diarization	20,758	Python
8	jdepoix/youtube-transcript-api This is a python API which allows you to get the transcript/subtitles for a...	86	Verified	video-transcription-extraction	7,078	Python
9	DrewThomasson/ebook2audiobook Generate audiobooks from e-books, voice cloning & 1158+ languages!	84	Verified	ebook-to-audiobook-conversion	18,503	Python
10	KoljaB/RealtimeTTS Converts text to speech in realtime	84	Verified	lightweight-tts-libraries	3,800	Python
11	cmusphinx/pocketsphinx A small speech recognizer	84	Verified	automatic-speech-recognition	4,278	C
12	PaddlePaddle/PaddleSpeech Easy-to-use Speech Toolkit including Self-Supervised Learning model,...	82	Verified	funasr-speech-recognition	12,556	Python
13	alphacep/vosk-api Offline speech recognition API for Android, iOS, Raspberry Pi and servers...	81	Verified	text-to-speech-conversion	14,377	Jupyter Notebook
14	OpenBMB/VoxCPM VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and...	81	Verified	voice-cloning-tools	6,143	Python
15	pndurette/gTTS Python library and CLI tool to interface with Google Translate's text-to-speech API	78	Verified	lightweight-tts-libraries	2,594	Python
16	rany2/edge-tts Use Microsoft Edge's online text-to-speech service from Python WITHOUT...	76	Verified	edge-tts-implementations	10,304	Python
17	nateshmbhat/pyttsx3 Offline Text To Speech synthesis for python	75	Verified	lightweight-tts-libraries	2,493	Python
18	denizsafak/abogen Generate audiobooks from EPUBs, PDFs and text with synchronized captions.	75	Verified	ai-podcast-generation	4,194	Python
19	gradio-app/fastrtc The python library for real-time communication	75	Verified	ai-assistant-platforms	4,547	JavaScript
20	salute-developers/GigaAM Foundational Model for Speech Recognition Tasks	74	Verified	speech-emotion-recognition	504	Python
21	espeak-ng/espeak-ng eSpeak NG is an open source speech synthesizer that supports more than...	73	Verified	espeak-ng-ecosystem	6,250	C
22	ggml-org/whisper.cpp Port of OpenAI's Whisper model in C/C++	72	Verified	whisper-framework-ports	47,665	C++
23	huggingface/speech-to-speech Build local voice agents with open-source models	72	Verified	text-to-speech-conversion	4,541	Python
24	descriptinc/descript-audio-codec State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz,...	72	Verified	audio-noise-reduction	1,732	Python
25	supertone-inc/supertonic Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.	71	Verified	lightweight-tts-runtimes	2,734	C++
26	Picovoice/porcupine On-device wake word detection powered by deep learning	70	Verified	wake-word-detection	4,743	Python
27	jianchang512/pyvideotrans Translate the video from one language to another and embed dubbing & subtitles.	70	Verified	video-dubbing-tools	16,496	Python
28	thewh1teagle/kokoro-onnx TTS with kokoro and onnx runtime	70	Verified	kokoro-tts-ecosystem	2,419	Python
29	santinic/audiblez Generate audiobooks from e-books	70	Verified	ebook-to-audiobook-conversion	5,920	Python
30	readest/readest Readest is a modern, feature-rich ebook reader designed for avid readers...	69	Established	ai-powered-ereaders	18,791	TypeScript
31	livekit/livekit End-to-end realtime stack for connecting humans and AI	69	Established	ai-avatar-platforms	17,671	Go
32	IAHispano/Applio A simple, high-quality voice conversion tool focused on ease of use and performance.	69	Established	voice-cloning-tools	3,070	Python
33	speechmatics/speechmatics-python Python library and CLI for Speechmatics	69	Established	speech-recognition-apis	75	Python
34	rapidaai/voice-ai Rapida is an open-source, end-to-end voice AI orchestration platform for...	69	Established	voice-agent-applications	686	Go
35	pnnbao97/VieNeu-TTS Vietnamese TTS with instant voice cloning • On-device • Real-time CPU...	69	Established	voice-cloning-synthesis	894	Python
36	coqui-ai/TTS 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research...	69	Established	text-to-speech-frameworks	44,801	Python
37	fishaudio/fish-speech SOTA Open Source TTS	68	Established	text-to-speech-tts	26,613	Python
38	linto-ai/whisper-timestamped Multilingual Automatic Speech Recognition with word-level timestamps and confidence	68	Established	whisper-speech-transcription	2,778	Python
39	collabora/WhisperLive A nearly-live implementation of OpenAI's Whisper.	68	Established	speech-to-text-converters	3,894	Python
40	foyoux/pygtrans 谷歌翻译, 支持 APIKEY 一口气翻译十万条	67	Established	speech-translation-apps	246	Python
41	jamiepine/voicebox The open-source voice synthesis studio	67	Established	self-hosted-tts-servers	13,404	TypeScript
42	compulim/web-speech-cognitive-services Polyfill Web Speech API with Cognitive Services for both speech-to-text and...	67	Established	dotnet-tts-libraries	70	JavaScript
43	Softcatala/whisper-ctranslate2 Whisper command line client compatible with original OpenAI client based on...	67	Established	speech-to-text-converters	1,255	Python
44	mozilla-ai/document-to-podcast Blueprint by Mozilla.ai for generating podcasts from documents using local AI	66	Established	content-to-podcast-converters	173	Python
45	istupakov/onnx-asr A lightweight Python package for Automatic Speech Recognition using ONNX models	66	Established	automatic-speech-recognition	281	Python
46	kxxt/aspeak A simple text-to-speech client for Azure TTS API.	66	Established	openai-tts-applications	500	Rust
47	ccoreilly/vosk-browser A speech recognition library running in the browser thanks to a WebAssembly...	66	Established	vosk-asr-implementations	507	JavaScript
48	met4citizen/TalkingHead Talking Head (3D): A JavaScript class for real-time lip-sync using full-body...	66	Established	ai-avatar-platforms	1,101	JavaScript
49	TensorSpeech/TensorFlowTTS :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art...	66	Established	fastspeech-tts-models	3,995	Python
50	playht/pyht PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API	65	Established	text-to-speech	220	Python
51	FluidInference/FluidAudio Frontier CoreML audio models in your apps — text-to-speech, speech-to-text,...	65	Established	ios-speech-frameworks	1,689	Swift
52	SYSTRAN/faster-whisper Faster Whisper transcription with CTranslate2	65	Established	whisper-transcription-apps	21,444	Python
53	CorentinJ/Real-Time-Voice-Cloning Clone a voice in 5 seconds to generate arbitrary speech in real-time	65	Established	voice-cloning-synthesis	59,518	Python
54	devnen/Chatterbox-TTS-Server Self-host the powerful Chatterbox TTS model. This server offers a...	64	Established	self-hosted-tts-servers	1,101	Python
55	fishaudio/Bert-VITS2 vits2 backbone with multilingual-bert	64	Established	voice-assistant-devices	8,707	Python
56	snakers4/silero-models Silero Models: pre-trained text-to-speech models made embarrassingly simple	64	Established	gradio-tts-webuis	5,822	Jupyter Notebook
57	ChetanXpro/nodejs-whisper NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as...	64	Established	whisper-framework-ports	201	TypeScript
58	k2-fsa/sherpa-ncnn Real-time speech recognition and voice activity detection (VAD) using...	64	Established	ios-speech-frameworks	1,648	C++
59	FunAudioLLM/CosyVoice Multi-lingual large voice generation model, providing inference, training...	64	Established	voice-assistant-devices	19,991	Python
60	Rei-x/discord-speech-recognition Speech to text extension for discord.js	64	Established	discord-tts-bots	62	TypeScript
61	nazdridoy/kokoro-tts A CLI text-to-speech tool using the Kokoro model, supporting multiple...	63	Established	kokoro-tts-ecosystem	1,296	Python
62	herimor/voxtream VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and...	63	Established	coqui-tts-applications	210	Python
63	lucidrains/HS-TasNet Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation...	63	Established	audio-source-separation	86	Python
64	travisvn/chatterbox-tts-api Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling...	63	Established	voice-assistant-devices	554	Python
65	fgnt/meeteval MeetEval - A meeting transcription evaluation toolkit	63	Established	asr-evaluation-metrics	149	Python
66	Picovoice/web-voice-processor A library for real-time voice processing in web browsers	63	Established	web-speech-api-libraries	239	TypeScript
67	index-tts/index-tts An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System	63	Established	zero-shot-voice-synthesis	19,454	Python
68	yeyupiaoling/MASR Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2...	63	Established	text-to-speech-frameworks	724	Python
69	rsxdalv/TTS-WebUI A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio,...	63	Established	text-to-speech	3,017	TypeScript
70	mbailey/voicemode Natural (2-way) voice conversations with Claude Code	63	Established	text-to-speech-mcp	885	Python
71	FelippeChemello/podcast-maker Fully automated video maker using motion graphics and text-to-speech...	63	Established	ai-video-generation	672	TypeScript
72	readbeyond/aeneas aeneas is a Python/C library and a set of tools to automagically synchronize...	63	Established	asr-evaluation-metrics	2,811	Python
73	analyticsinmotion/werpy 🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error...	63	Established	asr-evaluation-metrics	23	Python
74	yeyupiaoling/PPASR 基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Confor...	63	Established	speaker-diarization-embedding	875	Python
75	daswer123/xtts-api-server A simple FastAPI Server to run XTTSv2	63	Established	self-hosted-tts-servers	577	Python
76	jatinkrmalik/vocalinux Free, open-source, 100% offline voice dictation for Linux. Speak and type...	63	Established	voice-dictation-typing	188	Python
77	meizhong986/WhisperJAV ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD....	63	Established	audio-transcription-tools	1,216	HTML
78	EDCD/EDDI Companion application for Elite Dangerous	62	Established	voice-controlled-robotics	520	C#
79	tensorflow/lingvo Lingvo	62	Established	automatic-speech-recognition	2,857	Python
80	khanld/chunkformer ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription	62	Established	conformer-asr-implementations	78	Python
81	shibing624/parrots Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine....	62	Established	parakeet-asr-implementations	526	Python
82	tsmdt/whisply 💬 Fast, cross-platform CLI and GUI for batch transcription, translation,...	62	Established	whisper-diarization	108	Python
83	Ailln/cn2an 📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）	62	Established	lightweight-tts-runtimes	758	Python
84	thewh1teagle/sherpa-rs Rust bindings to https://github.com/k2-fsa/sherpa-onnx	62	Established	text-embedding-runtimes	302	Rust
85	kahne/fastwer A PyPI package for fast word/character error rate (WER/CER) calculation	62	Established	asr-evaluation-metrics	70	Python
86	TensorSpeech/TensorFlowASR :zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in...	62	Established	end-to-end-asr-frameworks	1,005	Python
87	thewh1teagle/phonikud Hebrew grapheme to phoneme (G2P)	62	Established	grapheme-to-phoneme-conversion	91	Python
88	k2-fsa/sherpa Speech-to-text server framework with next-gen Kaldi	62	Established	funasr-speech-recognition	896	C++
89	diodiogod/TTS-Audio-Suite A ComfyUI custom node integration for multi-engine multi-language...	62	Established	comfyui-tts-nodes	774	Python
90	modelscope/FunASR A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA...	62	Established	automatic-speech-recognition	15,283	Python
91	speechbrain/speechbrain A PyTorch-based Speech Toolkit	61	Established	wav2vec2-speech-recognition	11,311	Python
92	lenML/Speech-AI-Forge 🍦 Speech-AI-Forge is a project developed around TTS generation model,...	61	Established	text-to-speech-tts	1,386	Python
93	RHVoice/RHVoice a free and open source speech synthesizer for Russian and other languages	61	Established	espeak-ng-ecosystem	1,771	C++
94	alphacep/vosk VOSK Speech Recognition Toolkit	61	Established	vosk-asr-implementations	493	C
95	daanzu/kaldi-active-grammar Python Kaldi speech recognition with grammars that can be set...	61	Established	kaldi-asr-ecosystem	347	Python
96	morganney/tts-react Convert text to speech using React.	61	Established	aws-polly-tts	67	TypeScript
97	openctp/openctp openctp提供CTP股票期权、中泰证券XTP、华鑫证券奇点TORA、东方证券OST、东方财富证券EMT、盈透证券TWS、易盛TAP、量投QDP等各通道...	61	Established	system-tts-wrappers	2,715	C
98	argmaxinc/WhisperKit On-device Speech Recognition for Apple Silicon	61	Established	whisper-speech-transcription	5,775	Swift
99	EDDiscovery/EDDiscovery Captains log and 3d star map for Elite Dangerous	61	Established	voice-controlled-robotics	880	C#
100	pion/mediadevices Go implementation of the MediaDevices API.	60	Established	mediapipe-implementations	633	Go

1 2 3 … … 68 69 70 Next »