All Voice AI Tools

6,981 tools ranked by quality score · Page 17 of 70

Showing 1601–1700 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
1601	DarioFT/ComfyUI-Qwen3-TTS A ComfyUI custom node suite for Qwen3-TTS, supporting 1.7B and 0.6B models,...	37	Emerging	comfyui-extensions	228	Python
1602	rerender2021/echo A simple asr translator powered by avernakis react.	37	Emerging	react-native-voice-libraries	120	TypeScript
1603	Wendison/FCL-taco2 Official implementation of FCL-taco2: Fast, Controllable and Lightweight...	37	Emerging	tacotron-tts-models	40	Python
1604	ScottishFold007/Cosyvoice_DPO_NOTES CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO...	37	Emerging	coqui-tts-applications	121	Python
1605	chenwr727/Stock-Insight-AI Stock-Insight-AI 一键生成股票与期货分析视频	37	Emerging	ai-video-generation	83	Python
1606	SamYuan1990/flet_sherpa_onnx flet_sherpa_onnx an ASR/STT library for flet basing on sherpa-onnx	37	Emerging	dotnet-tts-libraries	3	Dart
1607	byhow/yanyu A Text-to-Speech node package with pinyin audio library.	37	Emerging	google-tts-libraries	9	TypeScript
1608	DrAchernar/location-based-AR-app This Flutter project is an example for a location based AR app with...	37	Emerging	educational-voice-apps	78	Dart
1609	sq2ips/sr0wx Unowocześniony projekt automatycznej radioamatorskiej stacji pogodowej sr0wx	37	Emerging	voice-controlled-robotics	8	Python
1610	LlmKira/fast-langdetect ⚡️ 80x faster Fasttext language detection out of the box \| Split text by language	37	Emerging	speech-translation-apps	300	Python
1611	rcspam/dictee Push-to-talk voice dictation for Linux — 100% local, multilingual (25+...	37	Emerging	voice-dictation-typing	3	Python
1612	BobRandomNumber/ComfyUI-DiaTTS ComfyUI Dia safetensors implementation	37	Emerging	comfyui-tts-nodes	7	Python
1613	Kalebu/image-to-sound-python- A python project for converting an Image into audible sound using OCR and...	37	Emerging	image-caption-generation	68	Python
1614	Kalebu/Python-Speech-Recognition- This consist of basic examples of performing Speech Recognition in Python...	37	Emerging	text-to-speech-conversion	64	Python
1615	Gmzxdotzz/Dia-TTS-Server Self-host the powerful Dia TTS model. This server offers a user-friendly Web...	37	Emerging	self-hosted-tts-servers	4	Python
1616	dpm76/QuickRouteMap Simple route guidance application.	37	Emerging	android-speech-apps	1	Java
1617	KevKibe/African-Whisper 🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual...	37	Emerging	whisper-fine-tuning	37	Python
1618	DeutscheKI/tevr-asr-tool State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines...	37	Emerging	voice-cloning-synthesis	412	C
1619	ninjahuttjr/hal-answering-service I'm sorry, Dave. I'm afraid I can't let that spam call through. — Local AI...	37	Emerging	voice-agent-applications	9	Python
1620	nestyme/Subtitles-generator generates transcript for video from link	37	Emerging	whisper-subtitle-generation	89	Python
1621	elbruno/ElBruno.Realtime Pluggable real-time audio conversation framework for .NET. Local VAD, STT,...	37	Emerging	dotnet-tts-libraries	9	C#
1622	bdim404/Qwen3-TTS-WebUI 基于阿里巴巴 Qwen3-TTS 模型（17 亿参数）的全栈文本转语音 Web 应用，支持语音定制、语音设计和语音克隆，有声书生成功能。A...	37	Emerging	qwen3-tts-applications	16	Python
1623	timoil/whisper-subtitles 🎬 AI-powered localhost subtitle generator for hearing-impaired users....	37	Emerging	whisper-subtitle-generation	38	Python
1624	mozi1924/Qwen3-TTS-EasyFinetuning Easy fine-tuning for Qwen3-TTS: Fast voice cloning and high-quality...	37	Emerging	llm-fine-tuning	32	Python
1625	sai9640nayak/StreamingKokoroJS Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100%...	37	Emerging	kokoro-tts-ecosystem	3	JavaScript
1626	Amanbig/ChatMe ChatMe combines agent-driven AI, cross-platform responsiveness, and voice...	37	Emerging	voice-command-assistants	4	TypeScript
1627	Sri-Krishna-V/Elu AI-powered Chrome extension that makes any web article accessible —...	37	Emerging	browser-tts-extensions	3	JavaScript
1628	scripty-bot/scripty Speech to text bot for Discord	37	Emerging	discord-tts-bots	80	Rust
1629	mxvsh/wave Native macOS dictation app focused on fast voice-to-text workflows.	36	Emerging	local-voice-dictation	2	C++
1630	drivendataorg/childrens-speech-recognition-benchmark-pub Tutorial code for the On Top of Pasketti: Children’s Speech Recognition Challenge	36	Emerging	automatic-speech-recognition	2	Jupyter Notebook
1631	HachiroSan/google-pronouncer 🔊 Download pronunciation audio files from Google's dictionary service....	36	Emerging	lightweight-tts-libraries	3	Python
1632	saurabhdaware/bol Slightly more consistent Text-to-speech for Web and a wrapper around speechSynthesis	36	Emerging	web-speech-api-tts	3	JavaScript
1633	gittyeric/FAlexa Create your own verbal commands that fuzzily map to custom Javascript /...	36	Emerging	vue-speech-recognition	5	TypeScript
1634	PhuocElec/zipformer-asr-api REST-API implementation of ZipFormer for automatic speech recognition (ASR)...	36	Emerging	funasr-speech-recognition	2	Python
1635	mahimairaja/openrtc-python OpenRTC lets developers run multiple LiveKit voice agents in one Python...	36	Emerging	voice-agent-applications	2	Python
1636	wildminder/ComfyUI-KaniTTS ComfyUI node for modular, human‑like Kani TTS. Generate natural,...	36	Emerging	comfyui-tts-nodes	38	Python
1637	ALERTua/styletts2-ukrainian-openai-tts-api OpenAI TTS Compatible Ukrainian TTS StyleTTS2 Pipeline	36	Emerging	ukrainian-voice-ai	38	Python
1638	kaloprojects/KALO-ESP32-Voice-Assistant Code snippets showing how to record I2S audio and store as .wav file on...	36	Emerging	voice-controlled-robotics	42	C++
1639	Sundy1219/ctc_beam_search_lm CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统	36	Emerging	ctc-asr-implementations	48	C++
1640	ayutaz/uPiper Unity TTS plugin: Piper neural synthesis + pure C# G2P (Japanese/English) +...	36	Emerging	piper-tts-ecosystem	21	C#
1641	mgonzs13/piper_ros piper Text-to-Speech for ROS 2	36	Emerging	piper-tts-ecosystem	6	C++
1642	MahtaFetrat/ManaTTS-Persian-Speech-Dataset ManaTTS is the largest open Persian speech dataset with 114+ hours of...	36	Emerging	persian-speech-ai	49	Jupyter Notebook
1643	shanghaimoon888/mod_vadasr This is FreeSwitch module that can do VAD and ASR with IFLYTEK websocket api.	36	Emerging	vosk-asr-implementations	50	C
1644	sooftware/lightning-asr Modular and extensible speech recognition library leveraging...	36	Emerging	end-to-end-asr-frameworks	50	Python
1645	vectominist/MiniASR A mini, simple, and fast end-to-end automatic speech recognition toolkit.	36	Emerging	end-to-end-asr-frameworks	53	Jupyter Notebook
1646	Voice-Privacy-Challenge/Voice-Privacy-Challenge-2020 Baseline Recipe for VoicePrivacy Challenge 2020:...	36	Emerging	automatic-speech-recognition	64	Shell
1647	jcsilva/docker-kaldi-android Dockerfile for compiling Kaldi for Android.	36	Emerging	kaldi-asr-ecosystem	65	Shell
1648	ArchitParnami/Few-Shot-KWS Few-Shot Keyword Spotting	36	Emerging	wake-word-detection	71	Jupyter Notebook
1649	yh1008/speech-to-text mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras	36	Emerging	keyword-speech-recognition	71	Jupyter Notebook
1650	Franck-Dernoncourt/ASR_benchmark Program to benchmark various speech recognition APIs	36	Emerging	automatic-speech-recognition	81	Python
1651	wulee510505/Text2Speach 一句代码搞定语音合成，文字转语音	36	Emerging	java-tts-libraries	68	Java
1652	1038lab/ComfyUI-VoxCPMTTS A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech)...	36	Emerging	comfyui-tts-nodes	36	Python
1653	vigonotion/tts.astromech Text to Astromech integration for Home Assistant (R2D2 Beep Boop Sounds)	36	Emerging	home-assistant-tts	53	Python
1654	nixonyh/UnityASR Automatic Speech Recognition in Unity.	36	Emerging	dotnet-tts-libraries	32	C#
1655	ga642381/FastSpeech2 Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to...	36	Emerging	fastspeech-tts-models	99	Python
1656	usabarashi/voicevox-cli Japanese text-to-speech using VOICEVOX Core	36	Emerging	rust-tts-libraries	6	Rust
1657	LearnedVector/Wav2Letter Speech Recognition model based off of FAIR research paper built using Pytorch.	36	Emerging	wav2vec2-asr-models	87	Python
1658	b4rtaz/voice-assistant Voice assistant for Visual Studio Code.	36	Emerging	voice-command-assistants	296	TypeScript
1659	cdimascio/watson-html5-speech-recognition Speech Recognition for Browsers via Webkit, HTML5, and Watson	36	Emerging	web-speech-api-libraries	4	JavaScript
1660	echonoshy/tingshu Tingshu 听舒｜ Bringing the author’s voice directly to you	36	Emerging	lightweight-tts-runtimes	33	Python
1661	stefantaubert/pronunciation-dictionary-utils Utils to modify pronunciation dictionaries.	36	Emerging	tts-dataset-creation	1	Python
1662	smartgic/docker-mycroft Mycroft AI Voice Assistant Docker images and docker-compose.yml files for...	36	Emerging	coqui-tts-applications	41	Dockerfile
1663	wblgers/hmm_speech_recognition_demo A demo for simple isolated Chinese speech word recognition using GMMHMM in Python	36	Emerging	keyword-speech-recognition	43	Python
1664	bgArray/ZhiYin 知音 - AI音频听觉功能集成软件。提供声乐技术识别分析、伴奏分离等伴奏多种工具。	36	Emerging	funasr-speech-recognition	8	Python
1665	lissettecarlr/kuon 久远：一个开发中的大模型语音助手，当前关注易用性，简单上手，支持对话选择性记忆和Model Context Protocol (MCP)服务。...	36	Emerging	voice-agent-applications	47	Python
1666	Robofied/Voicenet Comprehensive Python library for speech and voice.	36	Emerging	text-to-speech-conversion	32	Jupyter Notebook
1667	Hagsten/Talkify Javascript Text to speech library	36	Emerging	web-speech-api-tts	239	JavaScript
1668	amd/LIRA This tool helps you easily deploy ASR models on NPUs on AMD's Ryzen AI 300...	36	Emerging	speech-to-text-converters	22	Python
1669	aviaryan/Very-Fast-Dictation Instant dictation app for Mac	36	Emerging	audio-transcription-tools	64	Python
1670	jsugg/ser The AI-powered ser Python package is a tool for recognizing and analyzing...	36	Emerging	speech-emotion-recognition	6	Python
1671	rorpage/openfaas-text-to-speech Generate an MP3 of text using Google's Text-to-Speech	36	Emerging	openai-tts-applications	11	Dockerfile
1672	CoffeeMethod/KokoroGUI An advanced TTS software, built for audiobooks, podcasts, videos, and more.	36	Emerging	kokoro-tts-ecosystem	6	Python
1673	soundhound/hound-sdk-web-example An example of how to work with text and voice requests using the Houndify...	36	Emerging	web-speech-api-libraries	7	JavaScript
1674	silenterus/deepspeech-cleaner Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework	36	Emerging	speech-corpora-datasets	48	Python
1675	Picovoice/speech-to-intent-benchmark benchmark for Speech-to-Intent engines	36	Emerging	speech-ai-coursework	17	Python
1676	pingfury108/book2tts 有声书制作工具	36	Emerging	ebook-to-audiobook-conversion	44	Python
1677	t0mer/tts-stt Small pyhon flask container allowing us to convert Text to Speech and Speech to Text	36	Emerging	self-hosted-tts-servers	11	Python
1678	ahaocd/davinci-voice-clone DaVinci Subtitle Alignment + Voice Clone + AI Emotion Optimization \| CosyVoice2 TTS	36	Emerging	voice-cloning-tools	4	Python
1679	niteshsharmacodes/neutts-ultimate NeuTTS-Ultimeate - Advanced Text-to-Speech generation with unlimited...	36	Emerging	coqui-tts-applications	5	Python
1680	DKMitt/speech-to-text-js The Voice Note App's purpose is to experiment with the Web Speech API by...	36	Emerging	web-speech-api-libraries	51	JavaScript
1681	goodmike31/pl-asr-bigos-tools Extendable toolkit for comprehensive evaluation of ASR systems. Currently...	36	Emerging	automatic-speech-recognition	11	Python
1682	gaborvecsei/whisper-live-transcription Live-Transcription (STT) with Whisper PoC	36	Emerging	whisper-transcription-apps	201	Python
1683	resemble-ai/resemble-unity-text-to-speech Resemble's voice cloning engine within Unity	36	Emerging	dotnet-tts-libraries	184	C#
1684	cottongeeks/podscript Generate podcast transcripts using language and speech-to-text models	36	Emerging	ai-podcast-generation	171	TypeScript
1685	atosystem/SpeechCLIP SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model,...	36	Emerging	clip-vision-language	119	Python
1686	titilambert/pynuance Wrapper for Nuance Communications services	36	Emerging	lightweight-tts-libraries	3	Python
1687	Degon3399/XTTS_V2 This repository offers a framework for fine-tuning the XTTS_V2 model,...	36	Emerging	tts-model-finetuning	1	Python
1688	DarkPancakes/clipforge AI-powered short-form video generator. Create viral YouTube Shorts & TikTok...	36	Emerging	ai-video-generation	1	Python
1689	grebtsew/Text_To_Speech_Server_Node A super simple speaking server node that receives requests and reads them...	36	Emerging	self-hosted-tts-servers	1	Python
1690	vkosuri/dialogflow-lite [Maintainer Required] A light-weight python library REST agent for Dialogflow	36	Emerging	voice-command-assistants	2	Python
1691	IBM/watson-streaming-stt Example of using Watson's Streaming Speech to Text websockets interface for...	36	Emerging	ibm-watson-speech	29	Python
1692	takahi-ro/ConvivialChat This system provides the web space where text and speech coexist, and you...	36	Emerging	voice-command-assistants	3	JavaScript
1693	mikopbx/ModuleSmartIVR Модуль умной маршрутизации для 1C:Предприятия	36	Emerging	ai-tutoring-platforms	4	PHP
1694	deepgram-starters/go-voice-agent Get started using Deepgram's Voice Agent with this Go demo app	36	Emerging	deepgram-starter-projects	7	Go
1695	Skeli010/GaryTTS 强大免费的本地文本转语音软件	36	Emerging	lightweight-tts-runtimes	2	—
1696	beyondwords-io/wordpress-plugin BeyondWords is the AI voice platform that brings frictionless audio...	36	Emerging	google-tts-libraries	2	PHP
1697	hopkira/k9 Latest main K9 robot repository with 3D vision, local STT/TTS with GPT-3 and...	36	Emerging	voice-controlled-robotics	24	Python
1698	seanghay/KLEA An open-source Khmer Word to Speech Model. Just single word not sentence!	36	Emerging	tts-model-finetuning	19	Python
1699	alam025/ai-voice-assistant-appointment-booking Enterprise-grade AI voice assistant for automated appointment scheduling...	36	Emerging	voice-agent-applications	24	Python
1700	richardassar/SampleRNN_torch Torch implementation of SampleRNN: An Unconditional End-to-End Neural Audio...	36	Emerging	audio-noise-reduction	156	Lua

« Prev 1 2 3 … 15 16 17 18 19 … 68 69 70 Next »