All Voice AI Tools

6,983 tools ranked by quality score · Page 5 of 70

Showing 401–500 of 6,983

#	Tool	Score	Tier	Category	Stars	Language
401	hehehai/voxt 🎙️Voice input and translation app for macOS. Press to talk, release to paste.	46	Emerging	local-voice-dictation	346	Swift
402	manyeyes/ManySpeech AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment...	46	Emerging	funasr-speech-recognition	71	C#
403	davidamacey/OpenTranscribe Self-hosted AI-powered transcription platform with speaker diarization,...	46	Emerging	audio-transcription-apps	32	Python
404	yanorei32/discord-tts TTS Discord Bot [VOICEROID, VOICEVOX, AivisSpeech, kttsproject, WinRT, and...	46	Emerging	discord-tts-bots	16	Rust
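405	Henry-23/VideoChat Real-time interactive digital human with customizable appearance and voice, voice cloning support, and conversation latency as low as 3s. Real-time voice interactive digital human,...	46	Emerging	ai-avatar-platforms	1,223	Python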
406	primepake/wav2lip_288x288 Wav2Lip version 288 and pipeline to train	46	Emerging	lip-reading-synthesis	642	Python
407	jpreprocess/jbonsai Voice synthesis library for Text-to-Speech applications (Currently HTS...	46	Emerging	rust-tts-libraries	13	Rust
408	common-voice/cv-dataset Metadata and versioning details for the Common Voice dataset	46	Emerging	speech-corpora-datasets	168	JavaScript
409	hetpandya/youtube_tts_data_generator A Python library to generate speech datasets from YouTube videos	46	Emerging	tts-dataset-creation	37	Python
410	aahl/qwen-asr2api 🎤 Qwen 3 ASR to OpenAI API, free STT speech recognition model	46	Emerging	qwen3-tts-applications	70	Python
411	IhorShevchuk/piper-app The original Piper, now on iOS and macOS	46	Emerging	piper-tts-ecosystem	35	Swift
412	hgneng/ekho Chinese text-to-speech engine	46	Emerging	lightweight-tts-runtimes	1,202	Lex
413	PaciStardust/HOSCY Companion for OSC and Communication	46	Emerging	dotnet-tts-libraries	37	C#
414	Macoron/whisper.unity Running speech to text model (whisper.cpp) in Unity3d on your local machine.	46	Emerging	whisper-framework-ports	704	C#
415	Notely-Voice/NotelyVoice A 100% private AI voice transcription app that converts speech to text in...	46	Emerging	local-voice-dictation	629	C++
416	mlalma/KokoroTestApp Test application for Kokoro TTS model	46	Emerging	text-to-speech-tts	35	Swift
417	solyarisoftware/voskJs Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.	46	Emerging	vosk-asr-implementations	56	JavaScript
418	emnikhil/Sign-Language-To-Text-Conversion Sign Language to Text Conversion is a real-time system that uses a camera to...	46	Emerging	sign-language-recognition	348	Python
419	jianchang512/clone-voice A sound cloning tool with a web interface, using your voice or any sound to...	46	Emerging	voice-cloning-tools	8,922	Python
420	Lex-au/Orpheus-FastAPI High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices,...	46	Emerging	text-to-speech-conversion	673	Python
421	FunAudioLLM/Fun-ASR Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.	46	Emerging	automatic-speech-recognition	946	Python
422	BolajiAyodeji/chat-with-siri 🤖 A text-to-speech chatbot built using Nextjs, OpenAI, and ElevenLabs.	46	Emerging	voice-command-assistants	25	TypeScript
423	pnlpal/dictionariez 📚 A customizable dictionary extension that supports double-click lookups in...	46	Emerging	ai-powered-ereaders	635	JavaScript
424	wxxxcxx/ms-ra-forwarder Free online text-to-speech API	46	Emerging	google-tts-libraries	1,030	TypeScript
425	atomiechen/FunASR-Client Really easy-to-use Python client for FunASR runtime server.	46	Emerging	funasr-speech-recognition	4	Python
426	PraaneshSelvaraj/speech_engine Speech Engine is a Python package that provides a simple interface for...	45	Emerging	lightweight-tts-libraries	3	Python
427	AIGC-Audio/AudioGPT AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head	45	Emerging	voice-chatgpt-interfaces	10,210	Python
428	ArdaGnsrn/elevenlabs-laravel This is an Open Source PHP Laravel package for ElevenLabs Text to Speech API.	45	Emerging	elevenlabs-integrations	21	PHP
429	PrzemyslawSwiderski/python-gradle-plugin Gradle plugin to run Python projects.	45	Emerging	voice-ai-learning-collections	22	Kotlin
430	gabriele-mastrapasqua/qwen3-tts Pure C inference engine for Qwen3-TTS text-to-speech. No Python, no PyTorch...	45	Emerging	qwen3-tts-applications	25	C
431	mgonzs13/audio_common A PortAudio based audio_common with text to speech for ROS 2	45	Emerging	lightweight-tts-libraries	32	C++
432	deepgram-devs/nextjs-text-to-speech Get started using Deepgram's Text-to-Speech with this Next.js demo app	45	Emerging	deepgram-starter-projects	24	TypeScript
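433	233stone/vocotype-cli VocoType is a privacy-safe voice input tool that runs locally on-device; a hotkey converts speech to text in real time and types it into the current application. Supports speech-to-text MCP, AI...	45	Emerging	local-voice-dictation	401	Python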
434	misyaguziya/VRCT VRCT(VRChat Chatbox Translator & Transcription)	45	Emerging	dotnet-tts-libraries	340	Python
435	artibex/piper-http Creates a docker image that runs the piper http service	45	Emerging	piper-tts-ecosystem	18	Python
436	Picovoice/leopard On-device speech-to-text engine powered by deep learning	45	Emerging	funasr-speech-recognition	474	Python
437	rhasspy/piper A fast, local neural text to speech system	45	Emerging	piper-tts-ecosystem	10,694	C++
438	vannu07/jarvis 🤖 Jarvis - AI Voice Assistant with Face Recognition | Hacktoberfest 2025...	45	Emerging	voice-assistant-projects	32	Python
439	createcandle/voco Privacy friendly voice control for the Candle Controller / WebThings...	45	Emerging	voice-assistant-frameworks	29	Python
440	Camb-ai/MARS5-TTS MARS5 speech model (TTS) from CAMB.AI	45	Emerging	voice-cloning-tools	2,814	Jupyter Notebook
441	alphacep/awesome-russian-speech Russian speech technology links	45	Emerging	voice-ai-learning-collections	370	—
442	asiff00/On-Device-Speech-to-Speech-Conversational-AI This is an on-CPU real-time conversational system for two-way speech...	45	Emerging	local-voice-assistants	242	Python
443	Weilbyte/tiktok-tts Generate TikTok Text-to-Speech voices in your browser	45	Emerging	telegram-voice-transcription	419	JavaScript
444	avinashvarna/sanskrit_tts Sanskrit text to speech	45	Emerging	lightweight-tts-libraries	33	Python
445	mlalma/MisakiSwift Swift port of Misaki G2P (grapheme-to-phoneme) library that can be used e.g....	45	Emerging	text-to-speech-tts	20	Swift
446	BuildWithAIs/voicekey Voice to text, one key to input.	45	Emerging	local-voice-dictation	142	TypeScript
447	rhasspy/rhasspy Offline private voice assistant for many human languages	45	Emerging	general-purpose-voice-assistants	2,725	Shell
448	gooofy/zerovox zero-shot realtime TTS system, fully offline, free and open source	45	Emerging	text-to-speech-frameworks	51	Python
449	shashank2122/Local-Voice A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local...	45	Emerging	voice-assistant-frameworks	34	Python
450	sanchit-gandhi/whisper-jax JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.	45	Emerging	whisper-transcription-apps	4,690	Jupyter Notebook
451	FENRlR/MB-iSTFT-VITS2 Application of MB-iSTFT-VITS components to vits2_pytorch	45	Emerging	vits-tts-implementations	134	Python
452	Purple-Horizons/openclaw-voice 🦞 Open-source browser-based voice chat for AI assistants. Self-hosted,...	45	Emerging	openclaw-voice-assistants	78	Python
453	Ashish-Patnaik/kokoclone Voice Cloning, Now Inside Kokoro. Generate natural multilingual speech and...	45	Emerging	kokoro-tts-ecosystem	62	Python
454	huggingface/distil-whisper Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller,...	45	Emerging	whisper-fine-tuning	4,056	Python
455	Thiagohgl/ai-pronunciation-trainer This tool uses AI to evaluate your pronunciation.	45	Emerging	ai-tutoring-platforms	452	Python
456	ceuk/speech-recognition-aws-polyfill Polyfill for the SpeechRecognition browser API using AWS Transcribe as a fallback	45	Emerging	web-speech-api-libraries	13	TypeScript
457	areebbeigh/winspeech Speech recognition and synthesis library for Windows - Python 2 and 3.	45	Emerging	lightweight-tts-libraries	12	Python
458	h5p/h5p-speak-the-words Create questions answered through speech	45	Emerging	web-speech-api-libraries	9	JavaScript
459	adrianlyjak/obsidian-aloud-tts Obsidian TTS Plugin	45	Emerging	edge-tts-implementations	80	TypeScript
460	OpenVoiceOS/ovos-tts-plugin-cotovia Galician TTS plugin for OVOS	45	Emerging	espeak-ng-ecosystem	3	Python
461	shhossain/BanglaTTS BanglaTTS is a text-to-speech (TTS) system for Bangla language that works in...	45	Emerging	tts-model-finetuning	23	Python
462	reazon-research/ReazonSpeech Massive open Japanese speech corpus	45	Emerging	speech-corpora-datasets	373	Python
463	thorstenMueller/Thorsten-Voice Thorsten-Voice: A free to use, offline working, high quality German TTS...	45	Emerging	coqui-tts-applications	705	Python
464	saharmor/whisper-playground Build real time speech2text web apps using OpenAI's Whisper...	45	Emerging	whisper-transcription-apps	833	Python
465	athena-team/athena An open-source implementation of a sequence-to-sequence based speech processing engine	44	Emerging	ctc-asr-implementations	970	C++
466	gotev/android-speech Android speech recognition and text to speech made easy	44	Emerging	android-speech-apps	535	Java
467	i4Ds/whisper-finetune This repository contains code for fine-tuning the Whisper speech-to-text model.	44	Emerging	speech-to-text-transcription	22	Jupyter Notebook
468	totalvoice/totalvoice-node Node.js client for the Totalvoice API	44	Emerging	sms-voice-integrations	61	JavaScript
469	thinhlpg/vixtts-demo A Vietnamese Voice Cloning Text-to-Speech Model ✨	44	Emerging	tts-model-finetuning	509	Jupyter Notebook
470	petermg/Chatterbox-TTS-Extended Modified version of Chatterbox that accepts text files as input and no...	44	Emerging	self-hosted-tts-servers	534	Python
471	zw76859420/ASR_Theory Speech recognition theory, papers, and slides	44	Emerging	automatic-speech-recognition	619	—
472	MycroftAI/adapt Adapt Intent Parser	44	Emerging	speech-ai-coursework	722	Python
473	cosin2077/easyVoice Open-source text-to-speech tool supporting very long texts and multi-character voicing	44	Emerging	edge-tts-implementations	1,992	TypeScript
474	gooofy/py-nltools A collection of basic python modules for spoken natural language processing	44	Emerging	speech-recognition-apis	55	Python
475	AutoArk/GPA [AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion...	44	Emerging	telegram-voice-transcription	97	Python
476	mutablelogic/go-whisper Speech-to-Text in golang	44	Emerging	speech-to-text-converters	178	Go
477	tover0314-w/opentypeless Talkmore with Opentypeless. Type with your voice. Anywhere. Talk -...	44	Emerging	audio-transcription-tools	40	TypeScript
478	speechio/chinese_text_normalization Chinese text normalization for speech processing	44	Emerging	text-normalization-engines	722	Python
479	react-native-voice/voice 🎤 React Native Voice Recognition library for iOS and Android...	44	Emerging	react-native-voice-libraries	2,153	TypeScript
480	rse/speechflow Speech Processing Flow Graph	44	Emerging	web-speech-api-libraries	5	TypeScript
481	lifeiteng/vall-e PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo...	44	Emerging	multimodal-vision-language	2,207	Python
482	r9y9/deepvoice3_pytorch PyTorch implementation of convolutional neural networks-based text-to-speech...	44	Emerging	text-to-speech-frameworks	1,982	Python
483	NVIDIA/OpenSeq2Seq Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP	44	Emerging	neural-machine-translation	1,560	Python
484	spring-media/TransformerTTS 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...	44	Emerging	text-to-speech-frameworks	1,161	Python
485	xcmyz/FastSpeech The Implementation of FastSpeech based on pytorch.	44	Emerging	text-to-speech-frameworks	880	Python
486	ggeop/Python-ai-assistant Python AI assistant 🧠	44	Emerging	virtual-assistants-nlp	998	Python
487	soobinseo/Transformer-TTS A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"	44	Emerging	text-to-speech-frameworks	690	Python
488	shhossain/BanglaSpeech2Text BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla...	44	Emerging	whisper-transcription-apps	121	Python
489	Azure-Samples/SpeechToText-WebSockets-Javascript SDK & Sample to do speech recognition using websockets in Javascript	44	Emerging	web-speech-api-libraries	222	TypeScript
490	google/uis-rnn This is the library for the Unbounded Interleaved-State Recurrent Neural...	44	Emerging	speaker-diarization-embedding	1,589	Python
491	pannous/tensorflow-speech-recognition 🎙Speech recognition using the tensorflow deep learning framework,...	44	Emerging	speaker-diarization-embedding	2,176	Python
492	Amey-Thakur/DEEPFAKE-AUDIO 🎙️ Deepfake Audio – A neural voice cloning studio powered by SV2TTS technology.	44	Emerging	voice-cloning-synthesis	65	Python
493	jaywalnut310/glow-tts A Generative Flow for Text-to-Speech via Monotonic Alignment Search	44	Emerging	text-to-speech-frameworks	704	Python
494	bambocher/pocketsphinx-python Python interface to CMU Sphinxbase and Pocketsphinx libraries	44	Emerging	automatic-speech-recognition	373	Python
495	whitphx/streamlit-stt-app Real time web based Speech-to-Text app with Streamlit	44	Emerging	streamlit-tts-apps	253	Python
496	fatchord/WaveRNN WaveRNN Vocoder + TTS	44	Emerging	neural-vocoder-implementations	2,179	Python
497	ArkanDash/Multi-Model-RVC-Inference RVC Inference with multiple model and huggingface support	44	Emerging	voice-cloning-tools	112	Python
498	alumae/kaldi-gstreamer-server Real-time full-duplex speech recognition server, based on the Kaldi toolkit...	44	Emerging	kaldi-asr-ecosystem	1,092	Python
499	symblai/getting-started-samples Code samples to Get started quickly with Symbl's Voice SDK and APIs:...	44	Emerging	deepgram-starter-projects	19	Shell
500	wildminder/ComfyUI-VibeVoice ComfyUI custom node for the VibeVoice TTS. Expressive, long-form,...	44	Emerging	comfyui-tts-nodes	563	Python
