All Voice AI Tools

6,981 tools ranked by quality score · Page 61 of 70

Showing 6001–6100 of 6,981

#	Tool	Score	Tier	Category	Stars	Language
6001	msalhab96/Listen-Attend-and-Spell PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper	12	Experimental	conformer-asr-implementations	12	Python
6002	tuanio/conformer-rnnt Conformer RNN-Transducer	12	Experimental	conformer-asr-implementations	14	Python
6003	zyascend/End-to-End-Speech-Recognition-Learning ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition	12	Experimental	end-to-end-asr-frameworks	12	—
6004	upskyy/RNN-Transducer PyTorch Implementation of RNN-Transducer	12	Experimental	end-to-end-asr-frameworks	3	Python
6005	khaykingleb/automatic-speech-recognition QuartzNet and DeepSpeech implementation for ASR	12	Experimental	end-to-end-asr-frameworks	4	Python
6006	avrtt/MoE-speech-recognition Mixture of experts architecture for speech-to-text and language...	12	Experimental	end-to-end-asr-frameworks	3	Python
6007	yandex-cloud-examples/yc-speechkit-async-recognizer SpeechKit Asynchronous Batch Recognizer.	12	Experimental	yandex-speechkit-tools	1	Python
6008	markus-m-u-e-l-l-e-r/CTC.ISL ISL Speech Recognition Toolkit for training neural networks with the CTC...	12	Experimental	ctc-asr-implementations	4	Python
6009	SrujanHR/Happy-AI-Voice-Assistant Happy is a Python-based personal voice assistant for Windows. It responds to...	12	Experimental	voice-controlled-desktop-automation	1	Python
6010	yehuohan/ln-asr Automatic Speech Recognition	12	Experimental	automatic-speech-recognition	3	C
6011	Omitg24/IIS-ASR Repository for Systems and Network Administration (ASR), a course of the...	12	Experimental	automatic-speech-recognition	4	—
6012	subuhana2303/VaaniRakshak_Offline-Emergency-Voice-Assistant VaaniRakshak is an offline voice assistant built for disaster scenarios,...	12	Experimental	general-purpose-voice-assistants	1	Python
6013	sofiahernandes/speech-sci-calculator A smart scientific calculator app with speech recognition, built in Python...	12	Experimental	voice-controlled-calculators	1	Python
6014	AathifZahir/WhisprSplit A powerful, local speech-to-text transcription system that combines OpenAI's...	12	Experimental	whisper-diarization	1	Python
6015	DanteVela/Python-Voice-Assistant A repository of a speech-driven virtual assistant powered by Speech...	12	Experimental	general-purpose-voice-assistants	1	Python
6016	Brooklyn-Dev/Ultron-AI Voice-controlled AI gaming assistant for Marvel Rivals.	12	Experimental	python-voice-assistants	1	Python
6017	Manan-49/SRT-GENERATOR Offline desktop application for generating accurate subtitles (SRT) from...	12	Experimental	whisper-subtitle-generation	1	Python
6018	asiff00/TTS-Training-Blueprint Intuitive understanding of Autoregressive TTS Models	12	Experimental	fastspeech-tts-models	11	Python
6019	brandonviaje/echo voice assistant discord bot	12	Experimental	general-purpose-voice-assistants	1	Python
6020	Clats97/ClatScribe ClatScribe is a speech-to-text tool that captures real-time audio,...	12	Experimental	speech-to-text-converters	1	Python
6021	zayedalbloushi/AI-Transcription Stream audio from the browser, transcribe it in real time, and get live...	12	Experimental	speech-to-text-converters	1	Python
6022	msadeqsirjani/SubtitleGenerator 🎬 AI-powered subtitle generator using OpenAI Whisper. Multi-language...	12	Experimental	whisper-subtitle-generation	1	Python
6023	tuannho0802/PDFvert-TextToSpeech A web-based application for seamless PDF/DOCX conversion and natural...	12	Experimental	pdf-to-audio-conversion	1	JavaScript
6024	MrFlapstaart/GameOCRTTS Speak out text balloons in games without voice acting to use OCR on the...	12	Experimental	dotnet-tts-libraries	4	C#
6025	taeefnajib/Aximos Aximos is an innovative AI-powered tool that transforms your content into...	12	Experimental	content-to-podcast-converters	4	TypeScript
6026	noAbbreviation/approxima A command line program to loudly tell time (in chunks of 5 minutes).	12	Experimental	go-tts-libraries	1	Go
6027	LiZeC123/legado-tts-tencent Tencent TTS for Legado Reader: a TTS service for Legado (open-source reader) based on the Tencent speech synthesis API.	12	Experimental	edge-tts-implementations	1	Go
6028	Aavache/pdf2speech Reading PDF files and converting them to audio tracks.	12	Experimental	pdf-to-audio-conversion	4	Python
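6029	10809104/taigi-speech-to-text Taiwanese (Taigi) speech-to-text training dataset; data source: the Ministry of Education's Dictionary of Frequently-Used Taiwan Minnan.	12	Experimental	whisper-fine-tuning	1	Python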
6030	benda1989/qwen3-tts qwen3-tts train multi-speaker emotion control	12	Experimental	qwen3-tts-applications	1	Python
6031	Prathuvj/spectrolingua 🎵 Audio Processing Studio - A comprehensive Django API with Streamlit...	12	Experimental	speech-translation-apps	1	Python
6032	alam025/AI-voice-assistant-with-RAG-powered-customer-support Enterprise-grade AI voice assistant with RAG-powered customer support,...	12	Experimental	voice-agent-applications	1	Python
6033	PedritoGMG/GMG-FunMenu Client-side commands for microphone interactions, sound effects, and more,...	12	Experimental	dotnet-tts-libraries	1	Java
6034	shaikhsaif72/Jarvis-Voice-Assistant A voice-activated virtual assistant using Python and OpenAI.	12	Experimental	python-voice-assistants	1	Python
6035	yigitaliayyildiz/SmartSEE Android object detection app using YOLOv8 (TFLite) with Turkish TTS feedback.	12	Experimental	android-voice-assistants	1	Java
6036	AapseMatlb/Pickasso-Speech Speech Interaction Subsystem for Pickasso Autonomous Robot Enables wake word...	12	Experimental	voice-controlled-robotics	1	Python
6037	Swathi-88/JARVIS-AI A voice-controlled desktop AI assistant for Windows featuring OpenAI...	12	Experimental	python-voice-assistants	1	Python
6038	AbhaySingh71/Multimodal-Agentic-Assistant-Clara Clara: An agentic multimodal AI assistant that can see through your webcam,...	12	Experimental	local-voice-assistants	1	Python
6039	isbendiyarovanezrin/SpeechDetection Speech Detection 💬	12	Experimental	web-speech-api-libraries	4	CSS
6040	masonintokyo/voicevox-srt-to-speak Uses the VOICEVOX Engine API to synthesize speech from a SubRip file so that each line fits within its time slot.	12	Experimental	whisper-subtitle-generation	4	Python
6041	Madh93/whisper 🎙️ My Whisper stuff	12	Experimental	whisper-framework-ports	1	Makefile
6042	YoungloLee/tf2-speech-recognition-transformer Tensorflow 2 Speech Recognition Code (Transformer)	12	Experimental	keyword-speech-recognition	25	Python
6043	jmrashed/ai-desktop-assistant A Python-based AI desktop assistant designed to perform various tasks like...	12	Experimental	voice-controlled-desktop-automation	1	Python
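6044	dannis999/trained_SpeechRecognition This project backs up a complete Chinese speech recognition environment, including environment configuration and pretrained models, so it can be used directly.	12	Experimental	keyword-speech-recognition	4	Python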
6045	Masihtabaei/reswhis A lightweight, WebSocket-based server for real-time, remote audio...	12	Experimental	speech-to-text-converters	1	Python
6046	MSAbhishek22/Veronica_Chatbot 🤖 AI Chatbot with Voice Interface - A Flask web app featuring Groq-powered...	12	Experimental	voice-chatbot-applications	1	Python
6047	Hhhpraise/auto-subtitler A Python-based app that generates subtitles, which can also be translated,...	12	Experimental	whisper-subtitle-generation	1	Python
6048	kevin30205/Media-Transcribe Media Transcribe: Seamlessly generate transcripts from your video and audio...	12	Experimental	real-time-voice-translation	1	Python
6049	wazeerc/voxie Voxie, Let Your Notes Speak	12	Experimental	web-speech-api-tts	1	TypeScript
6050	parula-app/assistant Parula - Digital assistant - Running entirely on your own device	12	Experimental	multimodal-medical-assistants	4	JavaScript
6051	CSFelix/audio-to-text 🔊 Extract Text from Audios 🔊	12	Experimental	web-speech-api-tts	4	JavaScript
6052	Renamekk/Voice-Assistant A simple and customizable voice assistant written in Python. Supports adding...	12	Experimental	general-purpose-voice-assistants	1	Python
6053	Akshitha0118/Akshitha-Voice-AI-Voice-Powered-YouTube-Assistant An AI-powered Voice Assistant built using Python and Streamlit that listens...	12	Experimental	general-purpose-voice-assistants	1	Python
6054	dudarev/speechdown CLI tool to transcribe your spoken audio notes into timestamped,...	12	Experimental	voice-dictation-typing	1	Python
6055	druellan/ED-AI-Companion A Python script to monitor the Elite Dangerous journal files and provide...	12	Experimental	voice-controlled-robotics	1	Python
6056	GlobussBiogestion/text-to-signals-and-voice This API works 100% in HTML with JavaScript so it is very light and easy to...	12	Experimental	web-speech-api-tts	3	HTML
6057	jetfontanilla/browser-text-to-speech a demo of what a browser is currently capable of in text-to-speech	12	Experimental	web-speech-api-tts	4	Svelte
6058	passion-27/openai-whisper-api A sample speech transcription app implementing OpenAI Text to Speech API...	12	Experimental	speech-to-text-converters	4	JavaScript
6059	13shivam/yt-agent Offline-friendly backend POC to transcribe YouTube videos and chat with...	12	Experimental	video-transcription-extraction	1	Python
6060	Eng-M-Abdrabbou/Sonix A high-speed speech processing engine that captures and converts spoken...	12	Experimental	speech-to-text-converters	1	Python
6061	kbhujbal/J.A.R.V.I.S-AI-Assistant 🤖 Voice-controlled AI assistant with speech recognition, Wikipedia search,...	12	Experimental	python-voice-assistants	1	Python
6062	mavleo96/whisper-accent Conditioning via Adaptive Layer Norm for accented speech recognition	12	Experimental	whisper-fine-tuning	1	Python
6063	5ekastanx/Text-To-Speech This Django project allows converting text to audio files and saving...	12	Experimental	web-based-tts-apps	1	Python
6064	Aavtic/ena A video generation program using GIFS.	12	Experimental	ai-video-generation	1	Python
6065	sruckh/VibeVoice-finetune-easy Simplified scripts for fine-tuning VibeVoice speech synthesis models with...	12	Experimental	qwen3-tts-applications	1	Jupyter Notebook
6066	gas/pronunza-tts-galego-onnx-colab Colab notebook for speech synthesis (TTS) in Galician using the Celtia ONNX model	12	Experimental	tts-model-finetuning	1	Jupyter Notebook
6067	nmanikiran/ionic-allinone This is to give a demo of each feature that are there in ionic and ionic-native	12	Experimental	web-speech-api-libraries	4	TypeScript
6068	tb0hdan/voiceplay Client-side first music centered voice controlled player	12	Experimental	news-audio-bulletins	4	Python
6069	zefie/multi-tts Docker for multiple TTS engines with a Gradio interface	12	Experimental	self-hosted-tts-servers	13	Jupyter Notebook
6070	Zuellni/XTTS-Server XTTS Server for SillyTavern.	12	Experimental	self-hosted-tts-servers	4	Python
6071	EllangoK/gpt-voice-companion Small, simple chatbot using GPT and ElevenLabs TTS	12	Experimental	voice-chatgpt-interfaces	3	Python
6072	vpakarinen2/text-voice-chatterbox Text-to-speech and voice cloning using Chatterbox Turbo.	12	Experimental	self-hosted-tts-servers	1	TypeScript
6073	ExplainableML/ZerAuCap [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning...	12	Experimental	multimodal-vision-language	18	Python
6074	jmaczan/asr-dysarthria Research on Automatic Speech Recognition for dysarthric speech	12	Experimental	speaker-diarization-embedding	19	Jupyter Notebook
6075	Temerold/TobsTTS Text to speech, Python 3.7. Swedish and English. bye	12	Experimental	lightweight-tts-libraries	3	Python
6076	SSusantAchary/AI_Resources Have read and collected a few interesting papers and projects	12	Experimental	speech-ai-coursework	3	Python
6077	ryanp3343/LiveScreenTranslator LiveScreenTranslator utilizes OCR and translation services to provide...	12	Experimental	real-time-voice-translation	3	Python
6078	Vidyut/vidyut-tts Streamlit frontend for Coqui-tts	12	Experimental	coqui-tts-applications	3	Python
6079	twers1/telegram-bot-audio Telegram bot text-to-speech and speech-to-text	12	Experimental	telegram-voice-transcription	3	Python
6080	khaykingleb/research-playground Efficient ML/DL implementations across multiple domains with K3s multi-node...	12	Experimental	keyword-speech-recognition	3	Python
6081	michaelmior/ha-silero Text-to-speech for Home Assistant using Silero	12	Experimental	home-assistant-tts	3	Python
6082	CaydendW/Cashew A python based virtual assistant	12	Experimental	general-purpose-voice-assistants	3	Python
6083	kuanyshbakytuly/camera-text-speech Blind Text-Assistance	12	Experimental	assistive-vision-ai	3	Python
6084	kunal2812/Programmophone It is a tool to program with speech and is intended to be used by sightless...	12	Experimental	speech-recognition-apis	3	Jupyter Notebook
6085	Joyeah/videomaker Batch-generate videos from images	12	Experimental	ai-video-generation	3	Python
6086	lingualogic/speech-react Speech-React SDK	12	Experimental	react-speech-recognition	4	TypeScript
6087	TejasQ/react-praise A React binding for Praise.	12	Experimental	react-speech-recognition	4	TypeScript
6088	ponchotitlan/google_text-to-speech_prompt_maker Utility for Google Text-To-Speech batch audio files generator. Ideal for...	12	Experimental	lightweight-tts-libraries	3	Python
6089	willwade/TTS-Dataset A workflow to create a dataset of all TTS voices/languages available on...	12	Experimental	tts-dataset-creation	1	Python
6090	kaka-lin/rpi-voice-kit-app Using app to control Voice Kit(smart speaker)	12	Experimental	voice-controlled-robotics	3	Python
6091	Rumeysakeskin/Speech-Datasets-for-ASR Download speech datasets (English and non-English) for Automatic Speech Recognition	12	Experimental	speech-corpora-datasets	15	Jupyter Notebook
6092	arjunbazinga/speak Select any text and have it read out loud	12	Experimental	system-tts-wrappers	3	Shell
6093	OVOSHatchery/ovos-tts-plugin-responsivevoice responsive voice TTS plugin for mycroft	12	Experimental	espeak-ng-ecosystem	3	Python
6094	koth/kokoro.cpp kokoro tts in cpp	12	Experimental	kokoro-tts-ecosystem	9	C++
6095	Erio-Harrison/kokorotts_service A TTS service that deploys Kokoro model inference	12	Experimental	kokoro-tts-ecosystem	11	Rust
6096	robauto/bibli3.0 BiBli 3.0 for Raspberry Pi - Swarm Robotics and IoT Operating System - AI -...	12	Experimental	voice-controlled-robotics	3	Python
6097	Thukyd/OpenAI-Spechify-Your-Docs OpenAI-Spechify-Your-Docs is a Python project that converts text from...	12	Experimental	openai-tts-applications	4	Python
6098	ReadieFur/Stream-Tools A stream chat tool that features AWS text to speech, voice commands, chat...	12	Experimental	twitch-chat-tts	4	TypeScript
6099	zguesmi/image2speech Ethereum ready Dapp to speak your images.	12	Experimental	image-to-speech-synthesis	4	Python
6100	PeterTakahashi/openai-tts OpenAI Text to Speech	12	Experimental	openai-tts-applications	4	Swift