All Voice AI Tools

6,981 tools ranked by quality score · Page 6 of 70

Showing 501–600 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
501	ai-bot-pro/achatbot An open source chat bot architecture for voice/vision (and multimodal)...	50	Established	voice-chatbot-applications	88	Python
502	Finrandojin/alexandria-audiobook AI-powered multi-voice audiobook generator — LLM script annotation, voice...	50	Established	ebook-to-audiobook-conversion	371	Python
503	goodatlas/zeroth Kaldi-based Korean ASR (한국어 음성인식) open-source project	50	Established	kaldi-asr-ecosystem	358	Shell
504	dmotz/thing-translator 📷 🗣 Point your camera at things to hear how to say them in a different language	50	Established	text-scanning-ocr	1,334	JavaScript
505	j3soon/whisper-to-input An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI...	50	Established	whisper-framework-ports	117	Kotlin
506	jackaduma/CycleGAN-VC2 Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2	50	Established	text-to-speech-frameworks	571	Python
507	moeru-ai/unspeech 🗣️🔊 Your Text-to-Speech Services, All-in-One.	50	Established	elevenlabs-integrations	85	Go
508	liuli-moe/to-the-stars 魔法少女小圆飞向星空中文翻译	50	Established	google-tts-libraries	78	JavaScript
509	hasscc/hass-edge-tts 🗣️ Microsoft Edge TTS for Home Assistant, no need for app_key	50	Established	edge-tts-implementations	476	Python
510	inevolin/DiscordEarsBot A speech-to-text framework and bot for Discord. Take control of your Discord...	50	Established	discord-tts-bots	78	JavaScript
511	woheller69/whoBIRD Identify bird sounds in real time with this Android version of BirdNET. Bird...	50	Established	bioacoustic-species-classification	784	Kotlin
512	sdsds222/Unitale 一个基于Indextts和Qwen3TTS的 AI 有声书制作工具。利用 LLM 自动拆解剧本与识别情绪，集成多角色 TTS...	50	Established	voice-ai-agents	89	HTML
513	NTT123/vietTTS Vietnamese Text to Speech library	50	Established	tts-model-finetuning	255	Python
514	Gr122lyBr/voicetag Speaker identification powered by pyannote and resemblyzer	50	Established	speech-to-text-transcription	32	Python
515	SamirPaulb/real-time-voice-translator A desktop application that uses AI to translate voice between languages in...	50	Established	audio-transcription-apps	396	Tcl
516	ekwek1/soprano-factory Soprano-Factory: Train your own 2000x realtime text-to-speech model	50	Established	tts-model-finetuning	212	Python
517	WhisperSpeech/WhisperSpeech An Open Source text-to-speech system built by inverting Whisper.	50	Established	speech-to-text-converters	4,575	Jupyter Notebook
518	Azure-Samples/Cognitive-Services-Voice-Assistant Welcome to the Microsoft Voice Assistant samples repository! Here you will...	50	Established	dotnet-tts-libraries	123	C++
519	ZDisket/TensorVox Desktop application for neural speech synthesis written in C++	50	Established	lightweight-tts-runtimes	212	C++
520	hirofumi0810/tensorflow_end2end_speech_recognition End-to-End speech recognition implementation base on TensorFlow (CTC,...	50	Established	ctc-asr-implementations	314	Python
521	israelg99/deepvoice Deep Voice: Real-time Neural Text-to-Speech	50	Established	text-to-speech-frameworks	364	Python
522	AlexandaJerry/vits-mandarin-biaobei application of vits on mandarin tts	50	Established	vits-tts-implementations	121	Jupyter Notebook
523	svc-develop-team/so-vits-svc SoftVC VITS Singing Voice Conversion	50	Established	text-to-speech-frameworks	28,008	Python
524	xkeyC/fl_caption Offline real-time captioning software written in Flutter and Rust, powered...	50	Established	flutter-ai-chat-apps	92	Dart
525	vlomme/Multi-Tacotron-Voice-Cloning Phoneme multilingual(Russian-English) voice cloning based on	50	Established	tacotron-tts-models	397	Python
526	FlashLabs-AI-Corp/FlashLabs-Chroma Worlds first open-source real-time end-to-end spoken dialogue model with...	50	Established	voice-cloning-tools	545	Jupyter Notebook
527	jiaqili3/DualCodec [Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural...	50	Established	speculative-decoding-algorithms	62	Jupyter Notebook
528	iMicknl/azure-podcast-generator Generate an engaging podcast based on your document using Azure OpenAI and...	49	Emerging	content-to-podcast-converters	42	Python
529	ddPn08/rvc-webui liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project	49	Emerging	voice-cloning-tools	519	Python
530	Gautham495/react-native-speech-recognition-kit React Native Turbo Module to access Speech Recognition in Android & iOS	49	Emerging	react-native-voice-libraries	3	TypeScript
531	litagin02/rvc-tts-webui Text-to-Speech Gradio webui using RVC and edge-tts	49	Emerging	self-hosted-tts-servers	336	Python
532	seungwonpark/melgan MelGAN vocoder (compatible with NVIDIA/tacotron2)	49	Emerging	neural-vocoder-implementations	650	Python
533	voice-cloning-app/Voice-Cloning-App A Python/Pytorch app for easily synthesising human voices	49	Emerging	voice-cloning-synthesis	1,443	Python
534	rakeshvar/rnn_ctc Recurrent Neural Network and Long Short Term Memory (LSTM) with...	49	Emerging	ctc-asr-implementations	221	Python
535	jonatasgrosman/asrecognition ASRecognition: just an easy-to-use library for Automatic Speech Recognition.	49	Emerging	automatic-speech-recognition	50	Python
536	mozilla/DeepSpeech DeepSpeech is an open source embedded (offline, on-device) speech-to-text...	49	Emerging	wake-word-detection	26,741	C++
537	metavoiceio/metavoice-src Foundational model for human-like, expressive TTS	49	Emerging	text-to-speech-frameworks	4,201	Python
538	Artrajz/vits-simple-api A simple VITS HTTP API, developed by extending Moegoe with additional features.	49	Emerging	vits-tts-implementations	1,045	Python
539	SlapBot/stephanie-va Stephanie is an open-source platform built specifically for voice-controlled...	49	Emerging	general-purpose-voice-assistants	798	Python
540	dessa-oss/fake-voice-detection Using temporal convolution to detect Audio Deepfakes	49	Emerging	deepfake-detection-systems	383	Python
541	DragonComputer/Dragonfire the open-source virtual assistant for Ubuntu based Linux distributions	49	Emerging	voice-assistant-applications	1,404	Python
542	santi-pdp/pase Problem Agnostic Speech Encoder	49	Emerging	speaker-diarization-embedding	447	Python
543	arghyasur1991/Spark-TTS-Unity Unity package for using Spark-TTS on-device models. This is a C# port of...	49	Emerging	unity-ml-inference	30	C#
544	nitaiaharoni1/whisper-speech-to-text Whisper Speech-to-Text is a JavaScript library for recording and...	49	Emerging	speech-to-text-converters	33	TypeScript
545	pedroetb/tts-api Text to speech REST API for multiple TTS engines	49	Emerging	self-hosted-tts-servers	34	JavaScript
546	jeroenterheerdt/pycsspeechtts Python (py) library to use Microsofts Cognitive Services Speech (csspeech)...	49	Emerging	lightweight-tts-libraries	5	Python
547	mpaepper/vibevoice Fast local speech-to-text for any app using faster-whisper	49	Emerging	whisper-transcription-apps	151	Python
548	p0p4k/vits2_pytorch unofficial vits2-TTS implementation in pytorch	49	Emerging	text-to-speech-frameworks	547	Python
549	jim-schwoebel/voicebook 🗣️ A book and repo to get you started programming voice computing...	49	Emerging	audio-transcription-apps	388	Python
550	analyticsinmotion/werx 🐍📦 Easy-to-use Python package for lightning-fast Word Error Rate (WER) analysis	49	Emerging	asr-evaluation-metrics	8	Python
551	woheller69/whisperIME Android Input Method Editor (IME) based on Whisper	49	Emerging	whisper-framework-ports	543	Java
552	gionanide/Speech_Signal_Processing_and_Classification Front-end speech processing aims at extracting proper features from short-...	49	Emerging	text-emotion-recognition	257	Python
553	junzew/HanTTS Chinese Text-to-Speech web service	49	Emerging	lightweight-tts-runtimes	313	Python
554	simonw/ospeak CLI tool for running text through OpenAI Text to speech	49	Emerging	openai-tts-applications	171	Python
555	C-Loftus/QuickPiperAudiobook With one command, create a natural-sounding audiobook from a variety of...	49	Emerging	ebook-to-audiobook-conversion	1,038	Go
556	modal-labs/quillman A voice chat app	49	Emerging	voice-agent-applications	1,198	Python
557	myshell-ai/OpenVoice Instant voice cloning by MIT and MyShell. Audio foundation model.	49	Emerging	voice-cloning-tools	36,111	Python
558	OpenVoiceOS/ovos-buildroot Open Voice Operating System - Buildroot edition is a minimalistic linux OS...	49	Emerging	multi-agent-orchestration	279	Python
559	vasistalodagala/whisper-finetune Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR)...	49	Emerging	whisper-speech-transcription	361	Python
560	thuhcsi/Crystal Crystal - C++ implementation of a unified framework for multilingual TTS...	49	Emerging	cross-platform-tts-frameworks	229	C++
561	juntaosun/ComeCut 「来剪」轻量级视频编辑器。网页版、桌面版等均可免费使用，功能灵感源自 CapCut 等编辑器。A Lightweight Video Editor....	49	Emerging	video-dubbing-tools	485	Batchfile
562	tugstugi/pytorch-dc-tts Text to Speech with PyTorch (English and Mongolian)	49	Emerging	text-to-speech-frameworks	187	Jupyter Notebook
563	revdotcom/fstalign An efficient OpenFST-based tool for calculating WER and aligning two...	49	Emerging	kaldi-asr-ecosystem	171	C++
564	Lex-au/Vocalis Speech-to-speech AI assistant with natural conversation flow, mid-speech...	49	Emerging	voice-assistant-applications	290	TypeScript
565	PriesiaMioShirakana/DragonianVoice 多个SVC/TTS的C++推理库	49	Emerging	lightweight-tts-runtimes	1,121	C
566	savbell/whisper-writer 💬📝 A small dictation app using OpenAI's Whisper speech recognition model.	49	Emerging	speech-to-text-converters	1,021	Python
567	jhuus/HawkEars1 ⚠️ HawkEars 1.0 (obsolete). See HawkEars 2.0 → https://github.com/jhuus/HawkEars	49	Emerging	bioacoustic-species-classification	32	Python
568	opendilab/CleanS2S High-quality and streaming Speech-to-Speech interactive agent in a single...	49	Emerging	text-to-speech-conversion	499	Python
569	dhruvapte26/B.E.N.J.I. B.E.N.J.I.- The Impossible Missions Force's digital assistant	49	Emerging	python-voice-assistants	89	Python
570	belambert/asr-evaluation Python module for evaluating ASR hypotheses (e.g. word error rate, word...	49	Emerging	asr-evaluation-metrics	283	Python
571	vannu07/jarvis 🤖 Jarvis - AI Voice Assistant with Face Recognition \| Hacktoberfest 2025...	49	Emerging	voice-assistant-projects	32	Python
572	Poeschl/Hassio-Addons The repository for my Home Assistant Supervisor Add-ons.	49	Emerging	home-assistant-tts	326	Dockerfile
573	Audio-WestlakeU/VINP Official PyTorch implementation of 'VINP: Variational Bayesian Inference...	49	Emerging	end-to-end-asr-frameworks	31	Python
574	robmsmt/KerasDeepSpeech A Keras CTC implementation of Baidu's DeepSpeech for model experimentation	49	Emerging	ctc-asr-implementations	243	Python
575	OpenBMB/UltraEval-Audio Your faithful, impartial partner for audio evaluation — know yourself, know...	49	Emerging	asr-evaluation-metrics	281	Python
576	eheikes/tts Tools to convert text to speech :books::speech_balloon:	49	Emerging	aws-polly-tts	93	JavaScript
577	google/tacotron Audio samples accompanying publications related to Tacotron, an end-to-end...	49	Emerging	text-to-speech-frameworks	539	HTML
578	sergenes/runandread-audiobook 🚀 Open-source project for creating high-quality AI TTS-narrated audiobooks...	49	Emerging	ebook-to-audiobook-conversion	57	Python
579	ARBML/klaam Arabic speech recognition, classification and text-to-speech.	49	Emerging	kaldi-asr-ecosystem	424	Jupyter Notebook
580	zzw922cn/awesome-speech-recognition-speech-synthesis-papers Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis,...	49	Emerging	speech-synthesis-diffusion	3,119	—
581	rishikksh20/iSTFTNet-pytorch iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating...	49	Emerging	neural-vocoder-implementations	274	Python
582	NevilPatel01/RVC-WebUI-MacOS Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs...	49	Emerging	text-to-speech-frameworks	31	Python
583	pnnbao97/sea-g2p Fast multilingual text-to-phoneme converter for South East Asian languages.	49	Emerging	grapheme-to-phoneme-conversion	64	Rust
584	deepgram/deepgram-go-sdk Official Go SDK for Deepgram.	49	Emerging	go-tts-libraries	78	Go
585	233stone/vocotype-cli VocoType 是一款运行在本地端侧的隐私安全语音输入工具，通过快捷键即可将语音实时转换为文字并自动输入到当前应用。支持语音转文字MCP、AI...	49	Emerging	local-voice-dictation	401	Python
586	scionoftech/DeepAsr Keras(Tensorflow) implementations of Automatic Speech Recognition	48	Emerging	ctc-asr-implementations	24	Jupyter Notebook
587	hehehai/voxt 🎙️Voice input and translation app for macOS. Press to talk, release to paste.	48	Emerging	local-voice-dictation	346	Swift
588	alumae/kaldi-offline-transcriber Offline transcription system for Estonian using Kaldi	48	Emerging	kaldi-asr-ecosystem	228	Python
589	mozilla/TTS :robot: :speech_balloon: Deep learning for Text to Speech (Discussion...	48	Emerging	text-to-speech-frameworks	10,123	Jupyter Notebook
590	lucoiso/UEAzSpeech This plugin integrates Azure Speech Cognitive Services in Unreal Engine.	48	Emerging	dotnet-tts-libraries	215	C++
591	hegedustibor/htgo-tts Text to speech package for Golang.	48	Emerging	go-tts-libraries	213	Go
592	ModelTC/LightTTS LightTTS is a lightweight TTS inference framework optimized for CosyVoice2...	48	Emerging	coqui-tts-applications	31	Python
593	haolinwang819-boop/ai-video-generation-workflow AI video generation workflow with script, slides, TTS, subtitles, and FFmpeg...	48	Emerging	ai-video-generation	179	TypeScript
594	Kaljurand/dictate.js A small Javascript library for browser-based real-time speech recognition,...	48	Emerging	web-speech-api-libraries	217	JavaScript
595	YaoFANGUK/video-subtitle-extractor 视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI...	48	Emerging	whisper-transcription-apps	8,505	Python
596	liangstein/Chinese-speech-to-text Chinese Speech To Text Using Wavenet	48	Emerging	wav2vec2-asr-models	163	Python
597	TheStageAI/TheWhisper Optimized Whisper models for streaming and on-device use	48	Emerging	whisper-framework-ports	821	Python
598	ivanvovk/durian-pytorch Implementation of "Duration Informed Attention Network for Multimodal...	48	Emerging	tacotron-tts-models	184	Python
599	upskyy/Squeezeformer PyTorch implementation of "Squeezeformer: An Efficient Transformer for...	48	Emerging	conformer-asr-implementations	148	Python
600	ActiveNick/HoloBot HoloBot is a reusable 3D interface that allows HoloLens & VR users to...	48	Emerging	voice-command-assistants	124	C#

« Prev 1 2 3 4 5 6 7 8 … 68 69 70 Next »