All Voice AI Tools

6,981 tools ranked by quality score · Page 44 of 70

Showing 4301–4400 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
4301	TicooLiu/HowTo-ASR 开源语音识别自定义数据模型训练指南	20	Experimental	automatic-speech-recognition	13	—
4302	kadirpili/text-to-video-bot-python Script that generates TikTok style videos using ffmpeg, moviepy, chatGPT,...	20	Experimental	text-to-video-generation	11	Python
4303	stefanpantic/asr Automatic speech recognition using neural networks	20	Experimental	automatic-speech-recognition	18	Python
4304	jonbrennecke/CaptionThis "Caption This" is an iOS app that adds real-time captions to videos for...	20	Experimental	live-caption-generation	13	JavaScript
4305	pushkar009/Smart-Room-Assistant This is repository containing Mega project code for Smart Room Assistant.	20	Experimental	general-purpose-voice-assistants	1	Python
4306	Konstantinos123456789/JARVIS_AI A modular Python AI Assistant (Jarvis) featuring Knowledge Graphs...	20	Experimental	python-voice-assistants	1	Python
4307	loganngarcia/chaplin-ui Web interface for a real-time silent speech recognition tool.	20	Experimental	speech-to-text-converters	1	Python
4308	ali7919/Talk-With-LLM-In-Unity Speech Recognition + LLM inference on device in Unity	20	Experimental	ai-virtual-companions	13	C#
4309	uberduck-ai/openduck Building an open-source interactive AI plush toy.	20	Experimental	voice-chatbot-applications	12	Python
4310	SakshiRathi77/hindiSpeechPro-Automatic-Speech-Recognization The project,being part of Kagglex BIPOC Mentorship Program final project,...	20	Experimental	automatic-speech-recognition	11	Jupyter Notebook
4311	iGerman00/buttercup-chrome A Chrome(ium) extension to replace YouTube's auto-captions with...	20	Experimental	live-meeting-translation	25	JavaScript
4312	bluenekozkm/moe-tts-webui The better web ui for MOE-TTS	20	Experimental	voice-cloning-synthesis	24	Python
4313	deepgram-starters/deno-text-to-speech Get started using Deepgram's Text-to-Speech with this Deno demo app	20	Experimental	deepgram-starter-projects	1	TypeScript
4314	incubated-geek-cc/Text-To-Speech-App A Fusion of OCR Technology (Tesseract.js) & Web Speech API. Standalone,...	20	Experimental	text-to-speech-conversion	24	JavaScript
4315	dingdangdog/VwordAi VwordAi 是一款文本转语音工具，支持多种语音服务提供商，让您轻松将文本转为自然流畅的语音。	20	Experimental	google-tts-libraries	1	JavaScript
4316	RhythmusByte/Sign-Language-to-Speech Real-time ASL interpreter using OpenCV and TensorFlow/Keras for hand gesture...	20	Experimental	sign-language-recognition	9	Python
4317	selmetwa/AnkiTTS Add audio to your Anki deck by leveraging eleven labs text-to-speech API	20	Experimental	anki-tts-integration	6	Python
4318	DarkOracle10/Video-to-Persian-Translator---Professional-AI-Translation-Pipeline Professional-AI-Translation-Pipeline	20	Experimental	video-dubbing-tools	1	Python
4319	mathquis/node-kaldi-online-nnet3-decoder ASR online decoding using Kaldi NNet3 GrammarFST	20	Experimental	kaldi-asr-ecosystem	8	C++
4320	miaogang1982/mod_ali FreeSwitch扩展模块，实现基于阿里云的语音合成功能	20	Experimental	vosk-asr-implementations	7	C++
4321	smswg/FreeSwitch-Mod_Asr FreeSWITCH阿里云Mod_ASR模块直连阿里云Asr大模型,全网2026年阿里云最新c++Sdk3.2研发，经过大量生产环境测试稳定。可用于AI智...	20	Experimental	vosk-asr-implementations	6	—
4322	Agrover112/Goodness-of-Pronunciation-Pipelines-for-OOV-Problem Goodness of Pronunciation Pipelines for OOV Removal	20	Experimental	kaldi-asr-ecosystem	10	Perl
4323	bagustris/id Iban-based Kaldi recipe for Indonesian speech Corpus, presented at ASJ Spring 2019.	20	Experimental	kaldi-asr-ecosystem	7	Shell
4324	CaesiumY/dding-dong Claude Code notification plugin — Sound alerts & OS notifications on task...	20	Experimental	voice-enabled-coding-assistants	1	JavaScript
4325	mrhallonline/WhisperXTranscription4Researchers This repository contains a Jupyter notebook for qualitative researchers to...	20	Experimental	whisper-transcription-apps	9	Jupyter Notebook
4326	speechnotes/speechnotes-website New (2023) Doks (hugo + npm) based website for speechnotes.co	20	Experimental	stt	1	HTML
4327	dnkilic/android-sesli-haber DEPRECATED - This application is created by a group of student who finished...	20	Experimental	android-speech-apps	8	Java
4328	Cinnamon/whisper-jargon [SIGDIAL'24] Improving Speech Recognition with Jargon Injection	20	Experimental	whisper-diarization	11	Python
4329	innovate-invent/ChatStream Multiplatform OBS Chat overlay	20	Experimental	twitch-chat-tts	1	TypeScript
4330	RazEini/e_commerce_shop Android E-Commerce App with Firebase Realtime DB, Authentication, Smart...	20	Experimental	android-voice-assistants	1	Java
4331	Philipelima/video-translate Have you ever thought about translate a YouTube video? That is the idea for...	20	Experimental	video-transcription-extraction	12	Python
4332	JarbasAl/pocketsphinx-models-mirror pocketsphinx models for languages originating from the iberian peninsula	20	Experimental	kaldi-asr-ecosystem	8	—
4333	takeoutfm/takeout_assistant Offline voice assistant for Android	20	Experimental	general-purpose-voice-assistants	10	Dart
4334	wq2012/mdeval Python implementation of the NIST md-eval.pl script for evaluating rich...	20	Experimental	asr-evaluation-metrics	1	Python
4335	light12222/Voice2Sub-Whisper-Live-Translator Real-time speech-to-text, subtitle overlay, and translation tool. Powered by...	20	Experimental	video-transcription-extraction	7	Python
4336	Maxborland/mindtype-app MindType — Voice-to-text with AI-powered summaries. 100+ languages, works...	20	Experimental	local-voice-dictation	1	Python
4337	dangvansam/phoneme2grapheme-vietnamese convert phoneme to grapheme vietnames	20	Experimental	grapheme-to-phoneme-conversion	6	Python
4338	rafaotetra/awesome-coding-by-voice A list of videos, papers, tools, APIs and projects about coding by voice	20	Experimental	voice-ai-learning-collections	17	—
4339	ab-smith/kokoro-tts-webui Gradio-based web ui for Kokoro to simplify its usage with multiple voices,...	20	Experimental	kokoro-tts-ecosystem	1	Python
4340	profdilley/markdown-speech-converter This tool converts Markdown files into speech-friendly plain text files....	20	Experimental	content-to-podcast-converters	1	Python
4341	Ziggx5/TalkToText Speech-to-text app bulit with Python and Vosk speech recognition engine	20	Experimental	vosk-asr-implementations	1	Python
4342	fr0stb1rd/Edge-TTS-Subtitle-Dubbing High-performance SRT to Audio Dubbing tool using Microsoft Edge TTS with...	20	Experimental	video-dubbing-tools	1	Python
4343	bionicop/TalkativeSubs Bring your subtitles to life with TalkativeSubs, a tool that converts SRT...	20	Experimental	whisper-subtitle-generation	5	Python
4344	HungerCoder01/jarvis-voice-assistant A Python-based voice assistant built while learning speech recognition,...	20	Experimental	python-voice-assistants	1	Python
4345	qwertypool/Python-Personal-Desktop-Assistant A personal assistant which automate your tasks such as search videos in...	20	Experimental	general-purpose-voice-assistants	9	Python
4346	RamanSharma100/Reactjs-voice-controllable-website this is the voice controllable website using React Js and youtube API	20	Experimental	react-speech-recognition	12	JavaScript
4347	miguelangelnieto/DNN-Speech-Recognizer Built a deep neural network that functions as part of an end-to-end...	20	Experimental	keyword-speech-recognition	5	HTML
4348	Kaljurand/speech-trigger Android Speech Recognizer service based on...	20	Experimental	android-speech-apps	5	Java
4349	nasrul21/kunci-tts-api API untuk mendapatkan kunci jawaban TTS (Teka Teki Silang) Indonesia	20	Experimental	android-speech-apps	9	JavaScript
4350	gastonmorixe/elevenlabs-reader-cli Unofficial ElevenLabs Reader CLI: create, stream, and play TTS with live karaoke	20	Experimental	elevenlabs-integrations	5	Python
4351	row-engineering/ai-narration A WordPress plugin that converts your posts into audio narrations using AI...	20	Experimental	google-tts-libraries	12	PHP
4352	RiccardoGrin/TerminalWhisper Voice-to-text for Windows using OpenAI Whisper. Hold a hotkey, speak, text appears.	20	Experimental	speech-to-text-converters	1	Python
4353	Shuichi346/qwen-voice-clone-webui A Gradio WebUI for voice cloning powered by Qwen3-TTS. Provide reference...	20	Experimental	qwen3-tts-applications	1	Python
4354	ivallesp/Xception1d Xception1d implementation for audio categorization	20	Experimental	keyword-speech-recognition	6	Python
4355	SUBHADIPMAITI-DEV/Speech-Recognition-Alexa A simple Python script that uses various libraries for speech recognition...	20	Experimental	general-purpose-voice-assistants	15	Python
4356	lucaslattari/IAGiroDeNoticias Repositório do projeto apresentado no vídeo "Bot que cria podcast sozinho??...	20	Experimental	general-purpose-voice-assistants	7	Python
4357	rwightman/tensorflow-speech_commands Speech commands training/models from TF repo adapted for speech commands Kaggle	20	Experimental	keyword-speech-recognition	6	Python
4358	useviolet/violetaudio Voice AI infrastructure and audio processing toolkit	20	Experimental	self-hosted-tts-servers	1	Python
4359	Yangyangii/AdvDCTTS Implementation of DCTTS with Adversarial Training	20	Experimental	tacotron-tts-models	12	Python
4360	terry-yip/speech-to-text Speaker diarization and speech to text	20	Experimental	funasr-speech-recognition	14	Python
4361	codersinthestorm/RecurrentNN_SpeechRecognition A model based in Tensorflow to recognize words from the 30 word Speech...	20	Experimental	keyword-speech-recognition	11	Python
4362	mp-web3/jarvis-v3 Fully local voice interface for Claude Code on Apple Silicon. Parakeet STT +...	20	Experimental	python-voice-assistants	15	Python
4363	GhostNaN/silero-webui Silero TTS web UI	20	Experimental	gradio-tts-webuis	15	Python
4364	zssloth/TF-Speech-Recognition Speech Recognition Using Tensorflow	20	Experimental	keyword-speech-recognition	13	Python
4365	AlexKly/Simple-Voice-Activity-Detector-using-MFCC-based-on-FPGA-Kintex Voice Activity Detector based on MFCC features and DNN model	20	Experimental	speaker-diarization-embedding	29	VHDL
4366	ccnixx/rt-stt-demo-app Real-time speech-to-text web app.	20	Experimental	web-speech-api-libraries	5	CSS
4367	innovatorved/tts-app This application converts text or PDF documents into speech using the...	20	Experimental	kokoro-tts-ecosystem	7	Python
4368	raghavkumar06/jarvis-ai-assistant Python-based voice assistant that performs tasks using speech recognition...	20	Experimental	python-voice-assistants	1	Python
4369	EthanC/Eavesdrop Discord Bot that transcribes voice messages and media attachments. Powered...	20	Experimental	speech-to-text-converters	1	Python
4370	syado/discord-vc-tts Discordのテキストチャンネルのメッセージをボイスチャンネルで読み上げるbot	20	Experimental	discord-tts-bots	5	Python
4371	ashfaaqrifath/Casper-PC-Assistant PC assistant with voice/text control, automating tasks using APIs, system...	20	Experimental	voice-controlled-desktop-automation	15	Python
4372	pilot7747/VoxDIY This repository provides data and code for "Vox Populi, Vox DIY: Benchmark...	20	Experimental	tts-dataset-creation	16	Python
4373	caiobd/sprite-ai Sprite AI - An AI companion for your desktop	20	Experimental	local-voice-assistants	11	Python
4374	lugia19/renpyDialogToAudio Takes a renpy dialog export and generates voices using elevenlabs	20	Experimental	elevenlabs-integrations	12	Python
4375	obro79/stormhacks Deploy full-stack web apps with zero typing — just your voice.	20	Experimental	ai-interview-simulators	1	TypeScript
4376	Harras3/unhallucinated-faster-whisper 'unhallucinated-faster-whisper,' a powerful enhancement built on the...	20	Experimental	speech-to-text-converters	12	Python
4377	antonin-lfv/ESP32-robot-piloting-with-TinySpeech Offline Keyword Spotting on ESP32-S3. TinySpeech implementation using...	20	Experimental	wake-word-detection	1	C
4378	speak-rs/speakly High-performance, extensible speech recognition toolkit for Rust — OpenAI...	20	Experimental	rust-speech-recognition	1	Rust
4379	TheMadMartina/Nexa Nexa is a Python AI voice assistant leveraging speech recognition and...	20	Experimental	general-purpose-voice-assistants	1	Python
4380	zry98/pomumd Wyoming Protocol TTS and STT & MLX LLM server for iOS/macOS	20	Experimental	lightweight-tts-runtimes	1	Swift
4381	jakariaemon/WSI Whisper Speaker Identification (WSI), a cutting-edge model for multilingual...	20	Experimental	embedding-model-tuning	26	Python
4382	serkanalgur/turkish-tts Turkish TTS with Piper TTS	20	Experimental	piper-tts-ecosystem	1	Python
4383	KrishnaDN/LAS-Pytorch Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch	19	Experimental	conformer-asr-implementations	7	Python
4384	deepgram-starters/php-text-to-speech Get started using Deepgram's Text-to-Speech with this PHP demo app	19	Experimental	deepgram-starter-projects	—	PHP
4385	CanadianCrafter/EngHacks2021-Text-To-Speech Text to Speech Highlighter is a Chrome extension that allows the user to...	19	Experimental	browser-tts-extensions	6	JavaScript
4386	noly24/spoken-subtitles "Chrome extension that reads subtitles aloud on streaming sites for accessibility"	19	Experimental	browser-tts-extensions	—	JavaScript
4387	adakrupp/voice-cloning Local AI voice cloning with Coqui TTS XTTS-v2 - Docker-ready, GPU-accelerated	19	Experimental	voice-cloning-tools	—	Python
4388	avreliusdante-web-creator/voice-input Browser extension: convert voice to text and send it with one click in open...	19	Experimental	browser-tts-extensions	—	JavaScript
4389	klimromanyuk/tg-tts-sum-bot Telegram bot with LLM (Ollama) and voice synthesis (Qwen3-TTS / Edge-TTS)	19	Experimental	telegram-voice-transcription	—	Python
4390	tqer39/tts-partner TTS Partner repository	19	Experimental	lightweight-tts-libraries	—	Python
4391	elerdg/ASR-for-low-resource-languages Fine-tune wav2vec2-xls-r on data from low-resource-languages	19	Experimental	wav2vec2-asr-models	6	Jupyter Notebook
4392	horatio-sans-serif/speeker TTS MCP, CLI, HTTP API with multiple engines, voice cloning, daemon for...	19	Experimental	voice-enabled-coding-assistants	—	Python
4393	martins-vds/my-assistant A voice-driven personal task-tracking assistant for tech workers who...	19	Experimental	voice-command-assistants	—	C#
4394	Atqarana/AI-Voicebot-for-Kids An interactive companion toy that engages kids with storytelling, singing,...	19	Experimental	voice-command-assistants	10	JavaScript
4395	d1pankarmedhi/CascadeS2S A low-latency (<5s) cascade-style speech-to-speech conversational system	19	Experimental	speech-recognition-apis	—	Python
4396	leonardofmed/stt-chat-tts This is a Python project that uses different modules to capture audio from a...	19	Experimental	voice-chatgpt-interfaces	9	Python
4397	sergicastellasape/gpt-reviews Code for GPT Reviews — a daily AI-generated podcast	19	Experimental	voice-chatgpt-interfaces	18	Python
4398	ThakkarVidhi/ai-banking-agent Vaulta AI is a voice-driven banking agent that authenticates users through a...	19	Experimental	voice-agent-applications	—	Python
4399	Bsh54/AI_Phone_Call Application web qui transforme la synthèse vocale traditionnelle en...	19	Experimental	voice-agent-applications	—	JavaScript
4400	vaishnavipatil29/Voice-Chatbot Voice Chatbot, Course Project, Speech Processing	19	Experimental	voice-chatbot-applications	6	Jupyter Notebook

« Prev 1 2 3 … 42 43 44 45 46 … 68 69 70 Next »