All Voice AI Tools

6,983 tools ranked by quality score · Page 3 of 70

Showing 201–300 of 6,983

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
201	rzru/nightingale Machine learning powered Karaoke app (with scores!)	53	Established	audio-music-learning	548	Rust
202	kaldi-asr/kaldi kaldi-asr/kaldi is the official location of the Kaldi project.	53	Established	kaldi-asr-ecosystem	15,346	Shell
203	asterics/Asterics-AAC Free, easy-to-use AAC app with offline support, flexible input options,...	53	Established	android-voice-assistants	106	JavaScript
204	pot-app/pot-desktop 🌈一个跨平台的划词翻译和OCR软件 \| A cross-platform software for text translation and recognition.	53	Established	ios-speech-frameworks	17,383	JavaScript
205	supertone-inc/supertonic-py Lightning-Fast, On-Device TTS — running natively via ONNX.	53	Established	lightweight-tts-runtimes	16	Python
206	jianchang512/ChatTTS-ui 一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface...	53	Established	self-hosted-tts-servers	7,521	Python
207	Vonage/vonage-ruby-sdk Vonage REST API client for Ruby. API support for SMS, Voice, Text-to-Speech,...	53	Established	sms-voice-integrations	220	Ruby
208	Saurav-Paul/AI-virtual-assistant-python Command line virtual assistant for competitive programming	53	Established	general-purpose-voice-assistants	118	Python
209	pilot51/voicenotify Android app that speaks notifications	52	Established	android-speech-apps	218	Kotlin
210	FunAudioLLM/SenseVoice Multilingual Voice Understanding Model	52	Established	voice-assistant-devices	7,691	Python
211	Enemyx-net/VibeVoice-ComfyUI A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech...	52	Established	comfyui-tts-nodes	1,391	Python
212	abus-aikorea/voice-pro Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,...	52	Established	gradio-tts-webuis	6,366	Python
213	p0n1/epub_to_audiobook EPUB to audiobook converter, optimized for Audiobookshelf, WebUI included	52	Established	ai-podcast-generation	1,921	Python
214	OpenVoiceOS/ovos-tts-plugin-espeakNG espeakNG plugin	52	Established	espeak-ng-ecosystem	2	Python
215	sooftware/conformer [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented...	52	Established	conformer-asr-implementations	1,109	Python
216	evancohen/sonus :speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword...	52	Established	web-speech-api-libraries	636	JavaScript
217	alphacep/vosk-unity-asr Automatic Speech Recognition in Unity using Vosk library	52	Established	dotnet-tts-libraries	118	C#
218	mybigday/whisper.rn React Native binding of whisper.cpp.	52	Established	whisper-framework-ports	749	C++
219	Femoon/tts-azure-web TTS Azure Web 是一个 Azure 文本转语音（TTS）网页应用，可以在本地或者云端使用你的 Azure Key 一键部署。TTS...	52	Established	dotnet-tts-libraries	479	TypeScript
220	arcosoph/nanowakeword A lightweight, open-source, and intelligent wake word detection engine....	52	Established	wake-word-detection	48	Python
221	HeyWillow/willow Open source, local, and self-hosted Amazon Echo/Google Home competitive...	52	Established	voice-assistant-applications	2,987	C
222	SahilAggarwal2004/react-text-to-speech An easy-to-use React.js library that leverages the Web Speech API to convert...	52	Established	vue-speech-recognition	81	TypeScript
223	mdiller/MangoByte A discord bot that provides the ability to play dota hero response clips, do...	52	Established	discord-tts-bots	93	Python
224	antirek/voicer AGI-server voice recognizer for #Asterisk	52	Established	web-speech-api-libraries	101	JavaScript
225	TrevorS/voxtral-mini-realtime-rs Streaming speech recognition running natively and in the browser. A pure...	52	Established	rust-speech-recognition	710	Rust
226	richardr1126/openreader An open-source read-along document reader server with high-quality TTS...	51	Established	ai-powered-ereaders	292	TypeScript
227	RageAgainstThePixel/ElevenLabs-DotNet A Non-Official ElevenLabs RESTful API Client for dotnet	51	Established	elevenlabs-integrations	89	C#
228	BoltzmannEntropy/MimikaStudio MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic...	51	Established	qwen3-tts-applications	357	Dart
229	thevickypedia/Jarvis Fully Functional Voice Based Natural Language UI	51	Established	python-voice-assistants	232	Python
230	janvarev/Irene-Voice-Assistant Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы...	51	Established	general-purpose-voice-assistants	1,113	Python
231	bshall/Tacotron A PyTorch implementation of Location-Relative Attention Mechanisms For...	51	Established	tacotron-tts-models	115	Python
232	canopyai/Orpheus-TTS Towards Human-Sounding Speech	51	Established	multimodal-vision-language	6,000	Python
233	yeyupiaoling/YeAudio Python的音频工具	51	Established	funasr-speech-recognition	16	Python
234	davidacm/NVDA-IBMTTS-Driver This project is aimed at developing and maintaining the NVDA IBMTTS driver....	51	Established	piper-tts-ecosystem	71	Python
235	vivekuppal/transcribe Transcribe is a real time transcription, conversation, Language learning...	51	Established	audio-transcription-tools	250	Python
236	fishaudio/fish-audio-python The official Python library for the Fish Audio API.	51	Established	openai-tts-applications	151	Python
237	marytts/marytts MARY TTS -- an open-source, multilingual text-to-speech synthesis system...	51	Established	java-tts-libraries	2,573	Java
238	dictation-toolbox/dragonfly Speech recognition framework allowing powerful Python-based scripting and...	51	Established	automatic-speech-recognition	411	Python
239	ttop32/MouseTooltipTranslator Mouseover Translate Any Language At Once - Chrome Extension: PDF Translator,...	51	Established	live-meeting-translation	1,140	JavaScript
240	mlalma/kokoro-ios Kokoro TTS for iOS and macOSX	51	Established	text-to-speech-tts	209	Swift
241	EveryVoiceTTS/EveryVoice The EveryVoice TTS Toolkit - Text To Speech for your language	51	Established	coqui-tts-applications	43	Python
242	gooofy/py-kaldi-asr Some simple wrappers around kaldi-asr intended to make using kaldi's...	51	Established	kaldi-asr-ecosystem	170	C++
243	keithito/tacotron A TensorFlow implementation of Google's Tacotron speech synthesis with...	51	Established	text-to-speech-frameworks	2,988	Python
244	lucasnewman/nanospeech A simple, hackable text-to-speech system in PyTorch and MLX	51	Established	fastspeech-tts-models	186	Python
245	stefantaubert/pinyin-to-ipa Command-line interface and Python library to transcribe pinyin to IPA. The...	51	Established	grapheme-to-phoneme-conversion	53	Python
246	jonatasgrosman/huggingsound HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools	51	Established	wav2vec2-speech-recognition	470	Python
247	xiangyuecn/Recorder html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式，支持pc和Android、iOS部分浏览器、Hybrid...	51	Established	web-speech-api-libraries	5,577	JavaScript
248	DevEmperor/Dictate A powerful Whisper AI keyboard for reliable speech transcription	51	Established	audio-transcription-tools	183	Java
249	DigitalPhonetics/IMS-Toucan Controllable and fast Text-to-Speech for over 7000 languages!	51	Established	text-to-speech-frameworks	2,190	Python
250	moonstar-x/discord-tts-bot A Text-to-Speech bot for Discord.	51	Established	discord-tts-bots	102	JavaScript
251	gabrielmittag/NISQA NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment	51	Established	text-to-speech-frameworks	917	Python
252	deepgram/deepgram-rust-sdk Community Rust SDK for Deepgram.	51	Established	deepgram-starter-projects	65	Rust
253	Blaizzy/mlx-audio-swift A modular Swift SDK for audio processing with MLX on Apple Silicon	50	Established	ios-speech-frameworks	446	Swift
254	YuanGongND/whisper-at Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT:...	50	Established	whisper-fine-tuning	412	Python
255	capacitor-community/text-to-speech ⚡️ Capacitor plugin for synthesizing speech from text.	50	Established	web-speech-api-libraries	123	Java
256	sfortis/openai_tts Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine...	50	Established	voice-assistant-devices	181	Python
257	dectalk/dectalk Modern builds for the 90s/00s DECtalk text-to-speech application.	50	Established	dotnet-tts-libraries	418	PostScript
258	robdmac/talkito TalkiTo lets developers interact with AI systems through speech across...	50	Established	text-to-speech-mcp	54	Python
259	ai-ng/swift Fast voice assistant powered by Groq, Cartesia, and Vercel.	50	Established	conversational-chatbot-applications	590	TypeScript
260	readium/speech 💬 A TypeScript library for implementing read aloud on the Web	50	Established	web-speech-api-tts	12	TypeScript
261	kadirnar/VoiceHub VoiceHub: A Unified Inference Interface for TTS Models	50	Established	coqui-tts-applications	69	Python
262	FirezTheGreat/1SHOT All my works - https://github.com/FirezTheGreat (latest music commands/djs...	50	Established	discord-tts-bots	84	JavaScript
263	jaywalnut310/vits VITS: Conditional Variational Autoencoder with Adversarial Learning for...	50	Established	text-to-speech-frameworks	7,837	Python
264	MasuRii/opencode-smart-voice-notify 🔊 Smart voice notification plugin for OpenCode with multiple TTS engines...	50	Established	edge-tts-implementations	43	TypeScript
265	svc-develop-team/so-vits-svc SoftVC VITS Singing Voice Conversion	50	Established	text-to-speech-frameworks	28,008	Python
266	shivammehta25/Neural-HMM Neural HMMs are all you need (for high-quality attention-free TTS)	50	Established	text-to-speech-frameworks	164	Jupyter Notebook
267	Gr122lyBr/voicetag Speaker identification powered by pyannote and resemblyzer	50	Established	speech-to-text-transcription	32	Python
268	Picovoice/speech-to-text-benchmark speech to text benchmark framework	50	Established	text-to-speech-conversion	683	Python
269	hkchengrex/MMAudio [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality...	50	Established	vision-language-models	2,115	Python
270	snakers4/silero-stress Silero Stress — pre-trained enterprise-grade automated stress and homograph...	50	Established	gradio-tts-webuis	125	Python
271	i3thuan5/tai5-uan5_gian5-gi2_kang1-ku7 臺灣言語工具	50	Established	lightweight-tts-runtimes	144	Python
272	WhisperSpeech/WhisperSpeech An Open Source text-to-speech system built by inverting Whisper.	50	Established	speech-to-text-converters	4,575	Jupyter Notebook
273	petercunha/tts :pencil: :sound: A simple text-to-speech tool. Converts your text to speech...	50	Established	aws-polly-tts	171	JavaScript
274	zzw922cn/Automatic_Speech_Recognition End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow	50	Established	speaker-diarization-embedding	2,839	Python
275	R3gm/SoniTranslate Synchronized Translation for Videos. Video dubbing	50	Established	video-dubbing-tools	1,341	Python
276	vox-serve/vox-serve A Streaming-Native Serving Engine for TTS/STS Models	50	Established	text-to-speech-conversion	59	Python
277	pykaldi/pykaldi A Python wrapper for Kaldi	50	Established	kaldi-asr-ecosystem	1,030	Python
278	alphacep/vosk-android-demo Offline speech recognition for Android with Vosk library.	50	Established	java-tts-libraries	1,023	Java
279	stepfun-ai/Step-Audio-EditX A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model...	50	Established	zero-shot-voice-synthesis	884	Python
280	midas-research/audino Open source audio annotation tool for humans	50	Established	data-annotation-tools	1,131	TypeScript
281	yeyupiaoling/PaddlePaddle-DeepSpeech 基于PaddlePaddle实现的语音识别，中文语音识别。项目完善，识别效果好。支持Windows，Linux下训练和预测，支持Nvidia Jetson开发板预测。	50	Established	speaker-diarization-embedding	758	Python
282	funnyzak/tts-now 跨平台基于云平台(阿里云、讯飞等)语音合成 API 的文字转语音助手。支持单文本快速合成和批量合成。支持windows、macOS、Linux。	50	Established	google-tts-libraries	317	TypeScript
283	linto-ai/linto-stt An automatic speech recognition API	50	Established	whisper-diarization	81	Python
284	Aivis-Project/AivisSpeech-Engine AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine	50	Established	self-hosted-tts-servers	150	Python
285	nari-labs/dia A TTS model capable of generating ultra-realistic dialogue in one pass.	50	Established	self-hosted-tts-servers	19,202	Python
286	mgonzs13/whisper_ros Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2	50	Established	whisper-framework-ports	91	C++
287	mathigatti/midi2voice Singing synthesis from MIDI file	50	Established	espeak-ng-ecosystem	284	Python
288	soniqo/speech-swift AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and...	50	Established	ios-speech-frameworks	417	Swift
289	jim60105/docker-whisperX Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level...	50	Established	whisper-diarization	422	Dockerfile
290	myshell-ai/OpenVoice Instant voice cloning by MIT and MyShell. Audio foundation model.	49	Emerging	voice-cloning-tools	36,111	Python
291	yeyupiaoling/Whisper-Finetune Fine-tune the Whisper speech recognition model to support training without...	49	Emerging	whisper-speech-transcription	1,200	C
292	analyticsinmotion/werx 🐍📦 Easy-to-use Python package for lightning-fast Word Error Rate (WER) analysis	49	Emerging	asr-evaluation-metrics	8	Python
293	High-Logic/Genie-TTS GPT-SoVITS ONNX Inference Engine & Model Converter	49	Emerging	vits-tts-implementations	1,433	Python
294	lobehub/lobe-tts 🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser	49	Emerging	edge-tts-implementations	779	TypeScript
295	NeonGeckoCom/neon-tts-plugin-coqui Coqui AI TTS plugin	49	Emerging	coqui-tts-applications	85	Python
296	jeroenterheerdt/pycsspeechtts Python (py) library to use Microsofts Cognitive Services Speech (csspeech)...	49	Emerging	lightweight-tts-libraries	5	Python
297	ThioJoe/Auto-Synced-Translated-Dubs Automatically translates the text of a video based on a subtitle file, and...	49	Emerging	video-dubbing-tools	1,715	Python
298	sindresorhus/awesome-whisper 🔊 Awesome list for Whisper — an open-source AI-powered speech recognition...	49	Emerging	audio-transcription-tools	2,219	—
299	rwth-i6/rasr The RWTH ASR Toolkit.	49	Emerging	automatic-speech-recognition	58	C++
300	Stypox/dicio-android Dicio assistant app for Android	49	Emerging	android-voice-assistants	1,295	Kotlin

« Prev 1 2 3 4 5 … 68 69 70 Next »