All Voice AI Tools

6,981 tools ranked by quality score · Page 20 of 70

Showing 1901–2000 of 6,981


#	Tool	Score	Tier	Category	Stars	Language
1901	Saganaki22/ComfyUI-KugelAudio 🗣️ ComfyUI nodes for KugelAudio - Open-source text-to-speech with voice...	34	Emerging	comfyui-tts-nodes	29	Python
1902	rohanprichard/fastrtc-demo A simple POC of FastRTC, a framework to use voice mode in Python!	34	Emerging	voice-command-assistants	36	TypeScript
1903	Aman22sharma/Python-AI-Virtual-Assistant This is a Python AI virtual assistant.	34	Emerging	general-purpose-voice-assistants	40	Python
1904	m1el/nemotron-asr.cpp Nemotron ASR rewrite to GGML	34	Emerging	kaldi-asr-ecosystem	10	C++
1905	dsfsi/dsfsi-datasets Official DSFSI Public Datasets Registry - Comprehensive catalog of 50+...	34	Emerging	speech-corpora-datasets	6	Jupyter Notebook
1906	pevers/parkiet Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)	34	Emerging	parakeet-asr-implementations	69	Python
1907	sandy1990418/ChineseTaiwaneseWhisper This repository focuses on leveraging OpenAI's Whisper model for speech...	34	Emerging	whisper-fine-tuning	70	Python
1908	jhermann/kopfkino Syntactic sugar sprinkled on top of MoviePy and AI components to allow...	34	Emerging	ai-video-generation	1	Python
1909	seven-io/node-red The official Node-RED collection by seven.	34	Emerging	sms-voice-integrations	2	HTML
1910	ontypehq/mlx-swift-asr On-device speech recognition for Apple Silicon, powered by MLX.	34	Emerging	ios-speech-frameworks	4	Swift
1911	slp-rl/HebTTS The official implementation of "A Language Modeling Approach to...	34	Emerging	grapheme-to-phoneme-conversion	108	Python
1912	A-Jacobson/tacotron2 pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf	34	Emerging	tacotron-tts-models	43	Jupyter Notebook
1913	kwebby/Qwen3-TTS-Voice-Studio A Text to Speech App for Qwen3-TTS Family Models to create custom voices,...	34	Emerging	qwen3-tts-applications	4	JavaScript
1914	rishiskhare/parrot A free, offline, private AI text-to-speech desktop app built on Rust 🦜	34	Emerging	parakeet-asr-implementations	50	Rust
1915	RoyNkem/SwiftUI-AI-Voice-Assistant A multi-platform app for voice-based interactions built using SwiftUI with...	34	Emerging	ios-speech-frameworks	32	Swift
1916	boochow/TFLite_Micro_MicroSpeech_M5Stack M5Stack (ESP32) port of TensorFlow Lite for Microcontrollers demo "Micro Speech"	34	Emerging	wake-word-detection	31	C++
1917	yufan-aslp/AliMeeting The project is associated with the recently-launched ICASSP 2022...	34	Emerging	meeting-transcription-summarizers	135	Python
1918	soheil-mp/Speech-Recognition End-to-End Speech Recognition using Neural Networks.	34	Emerging	ctc-asr-implementations	35	Jupyter Notebook
1919	khuangaf/ITRI-speech-recognition-dataset-generation Automatic Speech Recognition Dataset Generation	34	Emerging	speech-corpora-datasets	37	Jupyter Notebook
1920	rishikksh20/TalkNet2-pytorch TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for...	34	Emerging	tacotron-tts-models	89	Python
1921	totalvoice/totalvoice-php PHP client for the Totalvoice API	34	Emerging	sms-voice-integrations	29	PHP
1922	gladchinda/web-speech-demo Learn how to build a simple text-to-speech voice app for the web using the...	33	Emerging	web-speech-api-tts	22	JavaScript
1923	henryhale/ttspeech 🔊 A fully basic voice synthesizer in vanillaJS	33	Emerging	web-speech-api-tts	17	HTML
1924	TheDeathDragon/LiveTranslate Real-time audio translation overlay for Windows — captures system audio +...	33	Emerging	real-time-voice-translation	47	Python
1925	GuangChen2333/FindUrVoicesPJSK One-click tool for obtaining single-character voice datasets from Project Sekai: Colorful Stage | No manual labeling needed | Uncompressed WAV | A simple tool for obtaining...	33	Emerging	tts-dataset-creation	20	Python
1926	yousefkotp/Egyptian-Arabic-ASR-and-Diarization The official submission from Speech Squad team for the MTC-AIC 2 competition...	33	Emerging	voice-cloning-synthesis	17	Jupyter Notebook
1927	Allan-Nava/fakeyou.go A powerful Golang SDK library for easily interacting with the FakeYou API	33	Emerging	go-tts-libraries	2	Go
1928	deepgram-starters/csharp-voice-agent Get started using Deepgram's Voice Agent with this C# demo app	33	Emerging	deepgram-starter-projects	2	C#
1929	inboxpraveen/Speech-Annotation-Tool Review, correct, and export ASR transcripts at scale. Web-based ASR accuracy...	33	Emerging	whisper-speech-transcription	10	Python
1930	b7s/whisper-php State-of-the-art speech recognition to your PHP/Laravel applications	33	Emerging	speech-to-text-converters	21	PHP
1931	surfaceyu/edge-tts-go Use Microsoft Edge's online text-to-speech service from golang WITHOUT...	33	Emerging	edge-tts-implementations	50	Go
1932	wongfei/UEHMI Unreal Engine Human Machine Interface	33	Emerging	dotnet-tts-libraries	2	C++
1933	lucascamillomd/anki-tts A free, open-source app for Anki text-to-speech on macOS.	33	Emerging	anki-tts-integration	2	Python
1934	ethicalabs-ai/Kurtis-E1-MLX-Voice-Agent A lightweight voice companion, optimized for macOS.	33	Emerging	ios-speech-frameworks	9	Python
1935	VirtualZer0/StreamTalkerClient Cross-platform desktop app that reads Twitch and VK Play chat aloud using AI...	33	Emerging	dotnet-tts-libraries	2	C#
1936	minseok0809/robotic-process-automation File Management, School Automation, Text Automation, Web Crawler, Web...	33	Emerging	voice-ai-learning-collections	2	Jupyter Notebook
1937	ddlBoJack/MT4SSL [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL:...	33	Emerging	zero-shot-voice-synthesis	45	Python
1938	jindongwang/EasyEspnet Making ESPnet easier to use	33	Emerging	end-to-end-asr-frameworks	54	Python
1939	renaudjenny/swift-tts A straightforward package containing version for Swift modern concurrency,...	33	Emerging	ios-speech-frameworks	52	Swift
1940	Alex-Tremayne/LaTeXt Python package for converting LaTeX to text which can be read by text to...	33	Emerging	lightweight-tts-libraries	4	Python
1941	Madhur215/Chatbot-cum-voice-Assistant An AI chatbot with features like conversation through voice, fetching events...	33	Emerging	general-purpose-voice-assistants	37	Python
1942	second-state/gsv_tts Streaming TTS API server written in Rust	33	Emerging	voice-ai-assistants	19	HTML
1943	Alenkar/kairos-asr An adapted ASR pipeline for easy integration into other applications in...	33	Emerging	voice-cloning-synthesis	9	Python
1944	sp-squared/Turkic-Languages-Audio-to-Text-Transcription Open-source Automatic Speech Recognition (ASR) pipeline for Bashkir...	33	Emerging	automatic-speech-recognition	2	Python
1945	nchudleigh/sc2-ultra Voice-controlled StarCraft II - command Zerg, Protoss, or Terran using...	33	Emerging	voice-controlled-robotics	2	Python
1946	kanttouchthis/text_generation_webui_xtts XTTSv2 Extension for oobabooga text-generation-webui	33	Emerging	voice-assistant-devices	156	Python
1947	andi611/TTS-Tacotron-Pytorch PyTorch implementation of Tacotron, a speech synthesis end-to-end generative...	33	Emerging	tacotron-tts-models	29	Python
1948	skshadan/WhisCall A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo,...	33	Emerging	voice-ai-assistants	29	Python
1949	williamxhero/ttsmaker TTSMaker: A Python library for interacting with the TTSMaker API to easily...	33	Emerging	lightweight-tts-libraries	9	Python
1950	umbertocappellazzo/Omni-AVSR Official PyTorch implementation of "Omni-AVSR: Towards Unified Multimodal...	33	Emerging	multimodal-vision-language	31	Python
1951	lucasnewman/descript-mlx Implementation of the Descript Audio Codec in MLX	33	Emerging	zero-shot-voice-synthesis	10	Python
1952	hanxi/epub2mp3 A tool that uses the Microsoft Edge TTS service to convert EPUB e-books into MP3 audio files.	33	Emerging	ebook-to-audiobook-conversion	71	Python
1953	user3301/ssml_builder A general SSML (Speech Synthesis Markup Language) builder	33	Emerging	aws-polly-tts	10	Python
1954	dmatekenya/Chichewa-Speech2Text Automated Speech Recognition for Chichewa.	33	Emerging	automatic-speech-recognition	24	Jupyter Notebook
1955	Tinkoff/asterisk-voicekit-modules Non-blocking Asterisk modules for accessing VoiceKit services for speech...	33	Emerging	deepgram-starter-projects	36	Shell
1956	warisqr007/vocos Causal version of Vocos (neural vocoders for high-quality audio synthesis)...	33	Emerging	neural-vocoder-implementations	2	Jupyter Notebook
1957	aydinnyunus/LinuxVoiceAssistant Linux Voice Assistant to Make Your Work Easier	33	Emerging	general-purpose-voice-assistants	38	Python
1958	tomik395/ESP32-AI Speak to your ESP32 and it speaks back! Your new personal assistant is...	33	Emerging	edge-camera-ml	77	C++
1959	Ishan7390/Jarvis_AI This is my attempt at building a not so much of an AI, Jarvis	33	Emerging	python-voice-assistants	30	Python
1960	huytd/speech A tool to practice English speaking	33	Emerging	vue-speech-recognition	72	Vue
1961	lang-uk/ukrainian-tts-preprocessing Tools and models for Ukrainian phonemization and lexical stress prediction	33	Emerging	ukrainian-voice-ai	8	Python
1962	codename0og/codename-rvc-fork-3 Codename's rvc fork version 3, based on Applio.	33	Emerging	voice-cloning-tools	37	Python
1963	nowickam/facial-animation Audio-driven facial animation generator with BiLSTM used for transcribing...	33	Emerging	ai-avatar-platforms	36	Jupyter Notebook
1964	felivalencia3/RealVoiceGPT RealVoiceGPT is a web application that lets you have voice conversations...	33	Emerging	voice-chatgpt-interfaces	29	JavaScript
1965	seungwonpark/awesome-tts-samples Awesome list of TTS papers with audio samples	33	Emerging	voice-ai-learning-collections	61	—
1966	ehtisham91/Django-Speech-to-text-Chat This App allows users to convert their speech into text and send that text...	33	Emerging	web-based-tts-apps	20	HTML
1967	Shyguy99/Whatsapp-bot A simple WhatsApp Bot made using open-wa library with some additional features.	33	Emerging	voice-chatgpt-interfaces	30	Python
1968	victor369basu/End2EndAutomaticSpeechRecognition In this repository, I have developed an end to end Automatic speech...	33	Emerging	speaker-diarization-embedding	34	Python
1969	naschorr/hawking The retro text-to-speech bot for Discord	33	Emerging	discord-tts-bots	27	Python
1970	EvilFreelancer/docker-fish-speech-server OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.	33	Emerging	self-hosted-tts-servers	30	Python
1971	aks-devs/mod_google_asr Freeswitch Speech-to-Text module	33	Emerging	vosk-asr-implementations	4	C
1972	saadbutt32/Conversion-of-Pakistan-Sign-Languag-into-Text-and-Speech-using-OpenPose-and-Machine-Learning Real-time translation of Pakistan sign language into text and speech using...	33	Emerging	sign-language-translation	28	Python
1973	Jdreioe/Wingmate A project to make people who cannot speak, speak!	33	Emerging	android-speech-apps	2	Kotlin
1974	atharva-again/indic-asr-onnx Helper package for using quantized versions of the Indic ASR Model by AI4Bharat.	33	Emerging	automatic-speech-recognition	2	Jupyter Notebook
1975	soundhound/houndify-sdk-go The official Houndify SDK for Go	33	Emerging	go-tts-libraries	25	Go
1976	RF5/transfusion-asr Transcribing Speech with Multinomial Diffusion, training code and models.	33	Emerging	end-to-end-asr-frameworks	80	Python
1977	stellarloop/bitbat.ai My father, a journalist, used to painstakingly transcribe interviews from a...	33	Emerging	audio-transcription-tools	81	Svelte
1978	matt-goldman/AI-Panelist An AI Panelist participating in Beer Driven Devs Live 2026	33	Emerging	ai-tutoring-platforms	2	C#
1979	MrAliHasan/Sophia-AI-Assistant Sophia AI Assistant is a Python-based desktop AI that performs a variety of...	33	Emerging	voice-controlled-desktop-automation	30	CSS
1980	HerbertHe/edge-tts-server Server for edge-tts	33	Emerging	edge-tts-implementations	29	TypeScript
1981	Umbaji/NMTMD Official repository for the open-source text dataset for NMT for local languages...	33	Emerging	speech-corpora-datasets	26	—
1982	dokuniev/claude-voice Hear which Claude Code session needs you — speaks the repo and branch name out loud	33	Emerging	voice-enabled-coding-assistants	2	Shell
1983	AdamHolwerda/bloom-cli A command line utility to create a multipage static website from Ulysses export	33	Emerging	web-speech-api-tts	6	JavaScript
1984	matthijsvk/TIMITspeech Speech recognition on the TIMIT (or any other) dataset	33	Emerging	ctc-asr-implementations	44	Python
1985	jekyll2014/VoiceAssistant Locally hosted voice assistant with plugin extension feature	33	Emerging	dotnet-tts-libraries	30	C#
1986	Rubiksman78/RenAI-Chat VN Like Interface for Chatbots	33	Emerging	voice-chatbot-applications	75	Python
1987	rhulha/Speech2Speech A web application that converts speech to speech 100% private	33	Emerging	voice-ai-assistants	84	JavaScript
1988	The-Swarm-Corporation/Voice-Agents Voice-Agents is a production-ready Python library for building...	33	Emerging	voice-agent-applications	6	Python
1989	Enforcer03/voice-cloning Voice cloning with tortoise-tts	33	Emerging	voice-cloning-tools	30	Jupyter Notebook
1990	marytts/gradle-marytts-voicebuilding-plugin A replacement for the legacy VoiceImportTools in MaryTTS	33	Emerging	java-tts-libraries	16	Groovy
1991	charstorm/vilberta Voice chatbot with voice+screen output to show that "not everything needs to...	33	Emerging	voice-chatbot-applications	6	Python
1992	pinch-eng/pinch-python-sdk Real-time voice translation SDK	33	Emerging	voice-ai-sdks	6	Python
1993	eazhary/dctts2 Deep Convolution Text to Speech	33	Emerging	fastspeech-tts-models	34	Python
1994	deepgram-devs/flask-live-chatgpt-text-to-speech Get started using Deepgram's Live ChatGPT Text-to-Speech with this Flask demo app	33	Emerging	deepgram-starter-projects	6	Python
1995	pnkvalavala/digitaltwin Using a single image and just 10 seconds of sample audio, our project...	33	Emerging	voice-cloning-tools	40	Jupyter Notebook
1996	tristan-mcinnis/Multimodal-voice-assistant This project is a multi-modal AI voice assistant that uses LM Studio, OpenAI...	33	Emerging	local-voice-assistants	9	Python
1997	lpalbou/VoiceLLM A modular Python library for voice interactions with AI systems, featuring...	33	Emerging	local-voice-assistants	5	Python
1998	kromme/Teams-Notetaker Let AI create the notes of your Teams Meeting	33	Emerging	meeting-transcription-summarizers	37	Python
1999	hwRG/End-to-End-TTS-Fine-Tune Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.	33	Emerging	fastspeech-tts-models	29	Python
2000	MotazSabri/Hanami-release Live translator that captures any audio that comes from a WINDOWS speaker or...	33	Emerging	real-time-voice-translation	46	—
