All Voice AI Tools

6,981 tools ranked by quality score · Page 23 of 70

Showing 2201–2300 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
2201	MbBrainz/ttslab TTSLab is THE place to easily test ANY text to text to speech model on your...	31	Emerging	elevenlabs-integrations	36	TypeScript
2202	kapi2800/qwen3-tts-mac Optimized implementation of Qwen3-TTS for Apple Silicon (M1-M4)	31	Emerging	qwen3-tts-applications	11	Python
2203	sayak-brm/espeakng-python An eSpeak NG TTS binding for Python3.	31	Emerging	espeak-ng-ecosystem	15	Python
2204	GloomyGrave/Sinsy-NG (discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis...	31	Emerging	espeak-ng-ecosystem	21	C++
2205	OpenVoiceOS/ovos-tts-plugin-beepspeak experiment adding new r2d2 tts engine for mycroft	31	Emerging	espeak-ng-ecosystem	4	Python
2206	HelloChatterbox/py_responsivevoice unoficial python api for responsive voice	31	Emerging	espeak-ng-ecosystem	16	Python
2207	gokhaneraslan/tts-dataset-generator With this tool you can create custom TTS dataset from video or audio.	31	Emerging	tts-dataset-creation	13	Python
2208	diggerdu/pytorch_audio audio processing module for pytorch:stft, istft	31	Emerging	neural-vocoder-implementations	36	Python
2209	andi611/CS-Tacotron-Pytorch Pytorch implementation of CS-Tacotron, a code-switching speech synthesis...	31	Emerging	tacotron-tts-models	23	Python
2210	hkdb/offline-tts A Chrome extension that reads web pages and PDFs aloud using Supertonic's...	31	Emerging	browser-tts-extensions	4	JavaScript
2211	USSLab/DolphinAttack Inaudible Voice Commands	31	Emerging	dotnet-tts-libraries	108	—
2212	Proteusiq/saa Making Time Speak! 🎙️	31	Emerging	temporal-expression-parsing	29	Python
2213	go-restream/supertts 🎧 Supertonic TTS ONNX Inference Openai Speech REST API	31	Emerging	lightweight-tts-runtimes	5	Rust
2214	Sciss/SpeechRecognitionHMM Exported from...	31	Emerging	keyword-speech-recognition	12	Java
2215	aidayang/LatentSync-OneClick 免费视频对口型软件LatentSync一键启动整合包	31	Emerging	speech-synthesis-diffusion	28	—
2216	AI-TOOLKIT/VoiceBridge VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit	31	Emerging	lightweight-tts-runtimes	17	C++
2217	npuichigo/ttsflow tensorflow speech synthesis c++ inference for voicenet	31	Emerging	lightweight-tts-runtimes	16	C++
2218	hkilang/TTS 香港圍頭話及客家話文字轉語音朗讀器	31	Emerging	lightweight-tts-runtimes	12	TypeScript
2219	UFOAlastor/AI-Waifu-Project-LaIN 一个拥有长期记忆, 表情动作, 语音对话/打断/声纹识别, FunctionCall, 多模型支持的AI Waifu客户端.	31	Emerging	interactive-ai-avatars	26	Python
2220	Issac-Moses/Beacon Beacon – A lightweight voice-controlled AI assistant using Whisper.cpp. ...	31	Emerging	local-voice-assistants	8	C++
2221	wspr-ncsu/robocall-audio-dataset A dataset of real-world robocall audio recordings	31	Emerging	speech-corpora-datasets	14	—
2222	SEPIA-Framework/sepia-web-audio Create modular, cross-browser, web audio pipelines to record and process...	31	Emerging	web-speech-api-libraries	46	JavaScript
2223	skit-ai/speech-to-intent-dataset Dataset Release for Intent Classification from Speech	31	Emerging	speech-corpora-datasets	48	Python
2224	siddhant-vij/Health-Fitness-Tracker Health & fitness app with natural language processing, custom...	31	Emerging	ai-tutoring-platforms	9	Python
2225	scarletcho/prep4kaldi Data preparation code for building Kaldi ASR system	31	Emerging	kaldi-asr-ecosystem	14	Python
2226	krestaino/prankstr 📞 Prank your friends with text-to-speech phone calls powered by Twilio and...	31	Emerging	ai-tutoring-platforms	21	JavaScript
2227	amirharati/kaldi-alligner scripts to align a given wave to its transcription using trained models by Kaldi	31	Emerging	kaldi-asr-ecosystem	36	Shell
2228	hanxiao/mls MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon	31	Emerging	ios-speech-frameworks	10	HTML
2229	khanld/Wav2vec2-Pretraining Wav2vec 2.0 Self-Supervised Pretraining	31	Emerging	wav2vec2-asr-models	59	Python
2230	IPS-LMU/transcription-portal A portal that offers a transcription chain for multi upload and processing...	31	Emerging	meeting-transcription-summarizers	11	TypeScript
2231	deepgram-devs/deepgram-demos-rust Useful demo applications for Deepgram Voice AI APIs, using the Rust language! 🦀	31	Emerging	deepgram-starter-projects	8	Rust
2232	jopedroliveira/speech_recog_uc Speech processing ROS-package. Performs speech recognition and estimates the...	31	Emerging	automatic-speech-recognition	13	C++
2233	ASR-project/Multilingual-PR Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM....	31	Emerging	speaker-diarization-embedding	258	Python
2234	karrarkazuya/ArabicTTS ArabicTTS (TextToSpeech) Android library with a sample	31	Emerging	android-speech-apps	16	Java
2235	boudhayan-dev/Blind-Reader-project A low cost reading device for blind people.	31	Emerging	assistive-vision-ai	12	Python
2236	mozilla/deepspeech-playbook A crash course for training speech recognition models using DeepSpeech.	31	Emerging	ctc-asr-implementations	24	—
2237	SEPIA-Framework/sepia-docs Documentation and Wiki for SEPIA. Please post your questions and bug-reports...	31	Emerging	voice-chatbot-applications	251	—
2238	xcmyz/FastSpeech2 The Implementation of FastSpeech2 Based on Pytorch.	31	Emerging	fastspeech-tts-models	52	Python
2239	overcrash66/Audio-File-Translator---S2ST Audio file translator is a multilingual speech to speech and speech to text...	31	Emerging	speech-translation-apps	18	Python
2240	ayshrv/memento-app Android App which serves as an AI assistant for human memory	31	Emerging	android-voice-assistants	15	Java
2241	papercast-dev/papercast A Python pipeline tool and plugin ecosystem for processing technical...	31	Emerging	content-to-podcast-converters	54	Python
2242	shreyanspagariya/sankshep Video Summarization - Summarized a video lecture and converted it to a...	31	Emerging	meeting-transcription-summarizers	19	Shell
2243	ondrejklejch/learning_to_adapt Coordinate-wise meta-learner for speaker adaptation of ASR models.	31	Emerging	end-to-end-asr-frameworks	20	Python
2244	The-Data-Dilemma/ParquetToHuggingFace ParquetToHuggingFace processes raw audio data, converts it into Parquet...	31	Emerging	speech-ai-coursework	9	Python
2245	suzuran0y/Live2D-LLM-Chat Live2D + ASR + LLM + TTS → Real-time communication + Offline...	31	Emerging	interactive-ai-avatars	32	Python
2246	zalo/OpenAI-Voice A simple proof of concept for voice-to-voice interaction.	31	Emerging	voice-chatgpt-interfaces	9	JavaScript
2247	ericc-ch/edge-tts Use Microsoft Edge's online text-to-speech service from JS code directly!	31	Emerging	edge-tts-implementations	16	TypeScript
2248	laszukdawid/cracker Usable GUI for text-to-speech services	31	Emerging	lightweight-tts-libraries	5	Python
2249	AshutoshDongare/convo Open source voice bot for Humanoid Robots and virtual digital humans	31	Emerging	voice-chatbot-applications	17	Python
2250	X-LANCE/VoiceFlow-TTS [ICASSP 2024] This is the official code for "VoiceFlow: Efficient...	31	Emerging	fastspeech-tts-models	372	Python
2251	MichalKacprzak99/jarvis Jarvis is a personal voice assistant inspired by the Marvel movie series	31	Emerging	python-voice-assistants	14	Python
2252	jenswittmann/CurlyFramework Tiny Framework for accessibility and sustainability, not only for MODX or Kirby CMS.	31	Emerging	tts	10	HTML
2253	opsdroid/opsdroid-audio 🗣 A companion application for opsdroid which adds hotwords, speech...	31	Emerging	general-purpose-voice-assistants	5	Python
2254	HasnainDarkNet/DarKVoice DarKVoice is an open-source voice assistant and audio processing tool built...	31	Emerging	general-purpose-voice-assistants	5	Python
2255	upskyy/ContextNet PyTorch implementation of "ContextNet: Improving Convolutional Neural...	31	Emerging	end-to-end-asr-frameworks	38	Python
2256	hug33k/PyTalk-R2D2 Python script for R2D2 text-to-speech	31	Emerging	lightweight-tts-libraries	17	Python
2257	zmeet-ai/tts-demo 支持各种感情的男女声音，支持实时和离线文本合成tts语音；支持单模特声音变声，语音速率调整，语音音量大小调整；支持自定义语音模型。	31	Emerging	text-to-speech-frameworks	70	Java
2258	in03/squawk Automatic subtitles for DaVinci Resolve with OpenAI Whisper	31	Emerging	whisper-transcription-apps	38	Python
2259	Ronik22/Voice-Controlled-Email A python-based voice-controlled email application for visually impaired persons.	31	Emerging	voice-controlled-robotics	15	Python
2260	filimo/ReaderTranslator PDF/WebPages Reader with embedded Google Translate and voice engine on...	31	Emerging	llm-translation-tools	123	Swift
2261	ognistik/alfred-superwhisper Use Alfred to Control Superwhisper - AI Powered Voice to Text	31	Emerging	audio-transcription-tools	122	JavaScript
2262	JSON2Video/json2video-php-sdk Video automation with PHP: add watermarks, resize videos, create slideshows,...	31	Emerging	ai-video-generation	25	PHP
2263	telecombcn-dl/2018-dlsl UPC Deep Learning for Speech and Language 2018	31	Emerging	speech-ai-coursework	17	—
2264	azraelkuan/FFTNet FFTNet: a Real-Time Speaker-Dependent Neural Vocoder	31	Emerging	neural-vocoder-implementations	64	Python
2265	ckaytev/tgisper Telegram bot with ASR	31	Emerging	telegram-voice-transcription	22	Python
2266	vorojar/VoiceSnap Open-source offline voice dictation — a free alternative to Typeless. 100%...	31	Emerging	local-voice-dictation	42	Go
2267	ZeroMirai/Waifu_AI_Vtuber Waifu_AI_Vtuber is a AI virtual YouTuber chatbot powered by OpenAI GPT-3.5,...	31	Emerging	interactive-ai-avatars	34	Python
2268	hanifabd/voice-activity-detection-vad-realtime Real-time Voice Activity Detection (VAD) with some example use case like...	31	Emerging	speaker-diarization-embedding	106	Python
2269	hutchresearch/latex2speech TeX2Speech is an application that turns LaTeX documents into spoken audio.	31	Emerging	pdf-to-audio-conversion	19	Python
2270	PowerBeef/QwenVoice Native macOS app for Qwen3-TTS with custom voices, voice design, and voice...	31	Emerging	qwen3-tts-applications	57	Swift
2271	suzumushi0/SoundObject_binary SoundObject binary distribution.	31	Emerging	audio-source-separation	57	—
2272	HCI-LAB-UGSPEECHDATA/speech_data_ghana_ug The dataset comprises of 5000 hours speech corpus in Akan, Ewe, Dagbani,...	31	Emerging	llm-scaling-architecture	11	HTML
2273	kcitlyn/PolyScribe_Desktop Fully-offline transcription and translator w/ speech-to-text and...	31	Emerging	audio-transcription-apps	10	Python
2274	i4Ds/whisper-prep Data preparation utility for the finetuning of OpenAI's Whisper model.	31	Emerging	speech-to-text-transcription	11	Python
2275	indri-voice/audiotoken Audio tokenization, in the fastest way possible!	31	Emerging	tokenization-libraries	53	Python
2276	BraceYourselfGames/UE-BYGTextToSpeech A plugin that uses the Windows Speech API to speak text in Unreal Engine 4.	31	Emerging	dotnet-tts-libraries	22	C++
2277	bensonruan/Speech-Command Speech Command Recognizer using tensorflowjs	31	Emerging	wake-word-detection	17	JavaScript
2278	theaifutureguy/Vocal-Agent A sophisticated real-time voice assistant that seamlessly integrates speech...	31	Emerging	conversational-chatbot-applications	25	Python
2279	led-mirage/VoivoClip VOICEVOXでクリップボードに貼り付けられたテキストを読み上げるアプリです。	31	Emerging	clipboard-text-to-speech	8	Python
2280	masonthemaker/saidwell Open Source Voice AI Dashboard	31	Emerging	ai-chatbot-interfaces	13	TypeScript
2281	lmangani/docker-rtpengine-speech OpenSIPS + RTPEngine Recording + Speech Recognition in HEP	31	Emerging	coqui-tts-applications	21	Shell
2282	hebbihebb/MBook EPUB to M4B using Maya1	31	Emerging	ebook-to-audiobook-conversion	5	Python
2283	gkrsv/split_audio A rough and ready Python utility which splits audio files based on silence...	31	Emerging	speech-to-text-transcription	16	Python
2284	oren-cohen/whatsmybitrate Whatsmybitrate analyzes audio files for quality metrics such as bit rate,...	31	Emerging	audio-music-learning	14	Python
2285	hollygrimm/voice-dataset-creation Tools to create your own voice dataset for TTS training	30	Emerging	tts-dataset-creation	71	Jupyter Notebook
2286	aabdurakhmanov/uzbekcha-gapir Matnni O'zbek tilida talafuz qiluvchi desktop dastur \| Text to speech...	30	Emerging	lightweight-tts-libraries	7	Python
2287	RapDoodle/Web-Real-Time-Speech-Recognition-with-Azure An example project that provides a web interface to real-time speech-to-text...	30	Emerging	dotnet-tts-libraries	3	HTML
2288	calinalexandru/pericles A browser extension offering intuitive text-to-speech functionality, making...	30	Emerging	browser-tts-extensions	15	TypeScript
2289	surajondev/text-to-speech Conver text into speech	30	Emerging	web-speech-api-tts	4	CSS
2290	vectominist/End-to-end-ASR-Pytorch-DLHLP Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation...	30	Emerging	end-to-end-asr-frameworks	17	Python
2291	gokulkarthik/text2speech Towards Building Text-To-Speech Systems for the Next Billion Users -...	30	Emerging	whisper-transcription-apps	57	Jupyter Notebook
2292	weespin/RequestifyTF2 Client side commands for mic spamming and more!	30	Emerging	dotnet-tts-libraries	16	C#
2293	SUNGBEOMCHOI/Korean-Streaming-ASR Korean Streaming ASR(with Denoiser and Conformer CTC)	30	Emerging	conformer-asr-implementations	43	Python
2294	Rongjiehuang/Multiband-WaveRNN An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio...	30	Emerging	neural-vocoder-implementations	28	Python
2295	jesseward/azuretexttospeech A Go library for Azure's Cognitive Services text-to-speech API.	30	Emerging	go-tts-libraries	8	Go
2296	Vazgen005/discord-virtual-micro Says everything you type in discord for you using ai (Silero Models)	30	Emerging	discord-tts-bots	8	Python
2297	betaoverflow/donna Transform your smart devices to intelligent communicators.	30	Emerging	educational-voice-apps	11	Dart
2298	CMsmartvoice/Unet-TTS One-shot TTS with Improved Unseen Speaker and Style Transfer	30	Emerging	zero-shot-voice-synthesis	37	—
2299	mishrababhishek/chatbot AI Chatbot answers students' queries about their college program using...	30	Emerging	voice-chatbot-applications	9	Python
2300	gokhaneraslan/XTTS_V2-finetuning Training XTTS V2 and PEFT LORA Text-to-Speech (TTS)	30	Emerging	tts-model-finetuning	4	Python

« Prev 1 2 3 … 21 22 23 24 25 … 68 69 70 Next »