All Voice AI Tools

6,981 tools ranked by quality score · Page 21 of 70

Showing 2001–2100 of 6,981

#	Tool	Score	Tier	Category	Stars	Language
2001	MaxMax2016/Grad-TTS-Chinese Huawei Grad-TTS for Chinese	33	Emerging	lightweight-tts-runtimes	51	Python
2002	tabahi/WebSpeechAnalyzer JS speech analyzer for fast speech analysis and labeling	33	Emerging	web-speech-api-libraries	39	JavaScript
2003	rapidaai/rapida-go Open-source Golang SDK for Rapida to build real-time, observable Voice AI...	33	Emerging	go-tts-libraries	2	Go
2004	AcTePuKc/Kokoro-Local-Gui Hyper-fast, local, high-quality TTS based on Kokoro-82M. PySide6 GUI included.	33	Emerging	kokoro-tts-ecosystem	19	Python
2005	dsi-icl/do-voice-interaction The goal of this project is to provide a voice assistant to the Data...	33	Emerging	general-purpose-voice-assistants	6	HTML
2006	AASHISHAG/DeepSpeech-API The code enables users to use Mozilla's Deep Speech model over the Web Browser.	33	Emerging	web-speech-api-libraries	32	TypeScript
2007	ibm-self-serve-assets/Watson-Speech This collection demonstrates how to help you to quickly embed Watson Speech...	33	Emerging	audio-transcription-apps	17	Jupyter Notebook
2008	ajaygujja/Kahani-Storytelling-App-For-Children-With-Hearing-Impairment Storytelling App For Children With Hearing Impairment	33	Emerging	android-voice-assistants	63	Java
2009	thewh1teagle/vad-rs Speech detection using silero vad in Rust	33	Emerging	rust-speech-recognition	30	Rust
2010	madzadev/voice-cue 📣 Find sentiments, tags, entities, and actions in your voice recordings instantly	33	Emerging	web-speech-api-libraries	58	JavaScript
2011	yyaadet/autosrt_page AutoSRT is a macOS app that automatically generates dual language subtitles...	33	Emerging	text-to-speech-tts	48	HTML
2012	rryam/SakuraKit Swift SDK for Prototyping AI Speech Generation	33	Emerging	ios-speech-frameworks	26	Swift
2013	twn39/EdgeTTS.DotNet EdgeTTS.DotNet is a C# (.NET) library that allows you to use Microsoft...	33	Emerging	edge-tts-implementations	2	C#
2014	muhammadGagah/native-speech-generation NVDA add-on for converting text to natural-sounding speech with Google Gemini AI.	33	Emerging	openai-tts-applications	1	Python
2015	small-cactus/Jarvis-ChatGPT-VoiceAssistant Jarvis powered by GPT-3.5/GPT-4	33	Emerging	python-voice-assistants	27	Python
2016	atakanakin/TutunSabri He is not our hero. He is a silent guardian. A watchful protector.	33	Emerging	telegram-voice-transcription	12	Python
2017	ywatanabe1989/scitex-notification Give your AI agents a voice — TTS, phone calls, SMS, email, webhooks. One...	33	Emerging	voice-enabled-coding-assistants	2	Python
2018	eminemahjoub/pdf-voice-reader "PDF Reader: A Python application for seamless PDF viewing with enhanced...	33	Emerging	pdf-to-audio-conversion	13	Python
2019	rt400/ReversoTTS-HA ReversoTTS component for HomeAssistant	33	Emerging	home-assistant-tts	41	Python
2020	fquirin/speech-recognition-experiments Experiments to test different speech recognition systems for SEPIA Framework	33	Emerging	automatic-speech-recognition	63	Python
2021	eellak/gsoc2019-sphinx Creation of an online Greek mail dictation system, using Sphinx and...	33	Emerging	web-speech-api-libraries	21	Python
2022	aishoot/Multi-Hotword_Spotting Won't it be cool to build a speech assistant like Alexa or Siri yourself...	33	Emerging	wake-word-detection	34	Jupyter Notebook
2023	gheyret/uyghur-asr-ctc Speech Recognition for Uyghur using deep learning	33	Emerging	ctc-asr-implementations	42	Python
2024	FlorianEagox/WeeaBlind A program to dub non-English media with modern AI speech synthesis,...	33	Emerging	video-dubbing-tools	323	Python
2025	vdutts7/ai-rapper Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise...	33	Emerging	speech-to-text-transcription	48	Python
2026	eellak/gsoc2021-audio-annotation-tool Creation of a multi user audio first annotation tool - GSoC 2021	33	Emerging	data-annotation-tools	29	HTML
2027	Harshit-shrivastav/TikTok-TTS-Bot A python TikTok Text to speech generator telegram bot.	33	Emerging	telegram-voice-transcription	15	Python
2028	vroomai/vst 🎹 Generate sounds from words. Directly in your DAW.	33	Emerging	audio-music-learning	139	C++
2029	0xPD33/sonori Sonori is a fully local STT app for Linux (Wayland).	33	Emerging	rust-speech-recognition	17	Rust
2030	Gust4voSales/Marvin-VirtualAssistent A dynamic virtual assistant made with Python, you can easily add more voice...	33	Emerging	general-purpose-voice-assistants	41	Python
2031	shawnrushefsky/talky-talky MCP server for Audio Generation and Analysis with a Variety of Open Models.	33	Emerging	voice-enabled-coding-assistants	2	Python
2032	mramshaw/Speech-Recognition Speech recognition with Python	33	Emerging	automatic-speech-recognition	18	Python
2033	xinjli/ucla-phonetic-corpus Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH...	33	Emerging	speech-corpora-datasets	46	Python
2034	akku2005/VocalInk Next-gen open-source voice-to-blog platform with AI, TTS, gamification, and...	33	Emerging	ai-tutoring-platforms	3	JavaScript
2035	lesleyrs/clipboard-narrator Turn any web page into an audiobook, works in the background on desktop!	33	Emerging	clipboard-text-to-speech	64	GDScript
2036	lucasnewman/vocos-mlx Implementation of 'Vocos: Closing the gap between time-domain and...	33	Emerging	fastspeech-tts-models	24	Python
2037	aks-devs/mod_openai_tts Freeswitch Speech-To-Text module	33	Emerging	vosk-asr-implementations	9	C
2038	speechly/ios-client The iOS client library for Speechly API	33	Emerging	ios-speech-frameworks	65	Swift
2039	wwdok/faster-whisper-webui-cn Cloned from https://huggingface.co/spaces/aadnk/faster-whisper-webui, and...	33	Emerging	speech-to-text-converters	28	Python
2040	jimbobbennett/SpeechToTextSamples Sample code showing how to use the Azure Speech to Text service from Python 🗣	33	Emerging	dotnet-tts-libraries	29	Python
2041	DojoCodingLabs/remotion-superpowers 🎬 Claude Code plugin — full video production studio for Remotion. AI...	33	Emerging	ai-video-generation	6	Shell
2042	royshil/obs-squawk Real-time Text-to-Speech AI Engine built-in OBS, integrative and intuitive	33	Emerging	live-caption-generation	65	C++
2043	lucadellalib/audiocodecs A collection of audio codecs with a standardized API	33	Emerging	neural-vocoder-implementations	36	Python
2044	ShawnPi233/SynParaSpeech Official Repository of Paper: "SynParaSpeech: Automated Synthesis of...	33	Emerging	tts-dataset-creation	66	JavaScript
2045	mravanelli/pytorch_MLP_for_ASR This code implements a basic MLP for speech recognition. The MLP is trained...	33	Emerging	end-to-end-asr-frameworks	40	Perl
2046	pviotti/sayit A text-to-speech command line tool backed by Azure Cognitive Services.	33	Emerging	dotnet-tts-libraries	19	F#
2047	HeyHeyChicken/NOVA-Python NOVA is a customizable voice assistant made with Python.	33	Emerging	voice-assistant-applications	17	Python
2048	ORI-Muchim/One-Click-MB-iSTFT-VITS2 MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making...	33	Emerging	vits-tts-implementations	13	Python
2049	prathamsolanki/gender-recognition-by-voice Identify a voice as male or female.	33	Emerging	speech-ai-coursework	33	Jupyter Notebook
2050	yui-mhcp/text_to_speech (Multi Speaker) Text-To-Speech (TTS) project	33	Emerging	fastspeech-tts-models	10	Python
2051	daisy/obi Obi is an open source audio book production tool that produces digital...	32	Emerging	ebook-to-audiobook-conversion	10	HTML
2052	r1di/neutts-fastapi OpenAI-compatible Text-to-Speech API server powered by NeuTTS. Drop-in...	32	Emerging	self-hosted-tts-servers	1	Python
2053	ga642381/Taiwanese-Whisper Fine-tune Whisper model for Taiwanese speech recognition	32	Emerging	whisper-fine-tuning	37	Python
2054	Citadawn/VoiceDAO 语道 (VoiceDAO) - An Android app focused on text-to-speech	32	Emerging	java-tts-libraries	1	Java
2055	taresh18/orpheus-streaming Orpheus TTS Server with streaming support (TTFB ~160ms)	32	Emerging	gradio-tts-webuis	24	Python
2056	ye-kyaw-thu/myG2P Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary...	32	Emerging	grapheme-to-phoneme-conversion	60	Perl
2057	thevickypedia/Jarvis_UI Light weight UI to interact with Jarvis via API calls	32	Emerging	python-voice-assistants	6	Python
2058	saky-semicolon/Emotion-Aware-AI-Support-System A smart AI-powered platform that detects emotions from student voice input,...	32	Emerging	speech-emotion-recognition	17	HTML
2059	jianchang512/kokoro-uiapi Web UI and OpenAI-compatible API for Kokoro TTS	32	Emerging	kokoro-tts-ecosystem	39	Python
2060	poretsky/ru_tts Compact and portable Russian speech synthesizer	32	Emerging	espeak-ng-ecosystem	27	C
2061	yanghaha0908/FastHuBERT Official implementation for Fast-HuBERT: An Efficient Training Framework for...	32	Emerging	fastspeech-tts-models	97	Python
2062	arunk140/serve-piper-tts Go Lang API Wrapper around Piper TTS - Supports TTS Inference and List of Voices	32	Emerging	go-tts-libraries	31	Go
2063	susilnem/American-sign-Language A CNN based human computer interface for American Sign Language recognition...	32	Emerging	sign-language-translation	22	Python
2064	esoyeon/KoreanTTS Korean Text To Speech Project: Using Tacotron1, Tacotron2, Wavenet and Melgan	32	Emerging	tacotron-tts-models	38	Jupyter Notebook
2065	SCRN-VRC/Voice-Recognition-Shader Audio detection with visemes in a fragment shader	32	Emerging	unity-ml-inference	32	ShaderLab
2066	rcdalj/speech2speech Full speech-to-speech workflow (can be customized to user's requirements)	32	Emerging	voice-chatbot-applications	5	Python
2067	manascb1344/zonos-api Production-ready FastAPI wrapper for Zonos TTS models with GPU acceleration,...	32	Emerging	self-hosted-tts-servers	40	Python
2068	unza-speech-lab/zambezi-voice Repository for multilingual speech data resources for native languages of Zambia.	32	Emerging	speech-corpora-datasets	20	—
2069	biyoml/End-to-End-Mandarin-ASR End-to-end speech recognition on AISHELL dataset.	32	Emerging	end-to-end-asr-frameworks	34	Python
2070	jonaro00/wallace-minion 🔨🙂 Discord Bot for my private friend server	32	Emerging	discord-tts-bots	7	Rust
2071	lcraver/ProxiTalk This is the repo for ProxiTalk OS. ProxiTalk is a custom operating system...	32	Emerging	self-hosted-tts-servers	7	Python
2072	30stomercury/Automatic-Speech-Recognition End-to-End Speech Recognition Using Tensorflow	32	Emerging	ctc-asr-implementations	40	Python
2073	phineas-pta/fine-tune-whisper-vi jupyter notebooks to fine tune whisper models on Vietnamese using Colab...	32	Emerging	whisper-fine-tuning	19	Jupyter Notebook
2074	LedoKun/028-simple-queue-system A real-time, responsive queue calling system designed for TV displays,...	32	Emerging	rust-tts-libraries	1	Rust
2075	ivanvovk/compressed-tacotron2-pytorch Compressed version of Tacotron 2 using Tensor Train + Waveglow.	32	Emerging	tacotron-tts-models	22	Jupyter Notebook
2076	SiddhantSadangi/st_deepgram_playground API playground for Deepgram built with Streamlit	32	Emerging	streamlit-tts-apps	21	Python
2077	DataXujing/ASR-paper 🔥 ASR tutorial: https://dataxujing.github.io/ASR-paper/	32	Emerging	end-to-end-asr-frameworks	25	—
2078	vani-voice/vani Open protocol & middleware for Indian language voice agents — STT→LLM→TTS in...	32	Emerging	voice-agent-applications	1	Python
2079	Ephrem-ETH/E2E-KWS End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM	32	Emerging	wake-word-detection	43	Python
2080	Aditya-ds-1806/dictpress-tts TTS plugin for dictpress	32	Emerging	go-tts-libraries	7	Go
2081	sberdevices/smartspeech SmartSpeech is a service for speech synthesis and recognition	32	Emerging	voice-ai-sdks	31	C++
2082	daymade/chattts-seed-example A ChatTTS audio repository containing different voice timbres generated with different seeds, so you can easily pick the seed you like.	32	Emerging	self-hosted-tts-servers	54	—
2083	stefantaubert/mean-opinion-score Python library for calculating the mean opinion score and 95% confidence...	32	Emerging	lightweight-tts-libraries	24	Python
2084	thewh1teagle/israwave Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet	32	Emerging	grapheme-to-phoneme-conversion	39	Python
2085	funway/audible-epub3-maker Generate audiobooks from plain EPUB files in EPUB 3 Media Overlays format...	32	Emerging	ebook-to-audiobook-conversion	15	Python
2086	Deimos-M/DL-Virtual-Assistant It is a virtual assistant for visually impaired which include models like...	32	Emerging	general-purpose-voice-assistants	44	Python
2087	Yangyangii/TPGST-Tacotron Google's TPGST reimplementation.	32	Emerging	tacotron-tts-models	34	Python
2088	taikun114/VOICEVOX-TTS-for-Home-Assistant Custom integration for Japanese TTS using VOICEVOX in Home Assistant.	32	Emerging	home-assistant-tts	5	Python
2089	mike-nott/smart-announcements Intelligent context-aware voice announcements for Home Assistant....	32	Emerging	home-assistant-tts	7	Python
2090	AkshathRaghav/tinyspeech Code release for "TinySpeech: Attention Condensers for Deep Speech...	32	Emerging	wake-word-detection	21	C
2091	OpenTSLab/BELLE Official implementation of BELLE "Bayesian Speech Synthesizers Can Learn...	32	Emerging	fastspeech-tts-models	7	Python
2092	souvikg544/TTS_Data_Maker Text to speech is an emerging zone of AI. This repository helps to create a...	32	Emerging	tts-dataset-creation	28	Python
2093	ih3xcode/h3xassist Meeting assistant that records, transcribes, and summarizes online meetings...	32	Emerging	meeting-transcription-summarizers	34	TypeScript
2094	brewusinc/Edge-TTS Edge-TTS is a Swift implementation of Microsoft Edge's Text-to-Speech (TTS)...	32	Emerging	edge-tts-implementations	23	Swift
2095	samuelbradshaw/text-to-timestamps Python and command-line utility for aligning audio to a transcript.	32	Emerging	whisper-transcription-apps	15	Python
2096	georgesterpu/Taris Transformer-based online speech recognition system with TensorFlow 2	32	Emerging	ctc-asr-implementations	26	Python
2097	wahyd4/say-it TTS in command line -- Pronounce the Chinese and English words you typed in.	32	Emerging	system-tts-wrappers	20	Go
2098	art1415926535/yandex_speech Generation of speech using Yandex SpeechKit.	32	Emerging	yandex-speechkit-tools	24	Python
2099	oleges1/quartznet-pytorch Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]	32	Emerging	end-to-end-asr-frameworks	26	Jupyter Notebook
2100	mazzasaverio/youtube-auto-dub Automated voice dubbing for YouTube videos using Docker, OpenVoice, and...	32	Emerging	video-dubbing-tools	64	Python
