All Voice AI Tools

6,981 tools ranked by quality score · Page 24 of 70

Showing 2301–2400 of 6,981


#	Tool	Score	Tier	Category	Stars	Language
2301	BayramAnnakov/gmail-to-podcast Transform Gmail newsletters into AI-generated podcast conversations using...	30	Emerging	content-to-podcast-converters	5	Python
2302	adelacvg/detail_tts All generative models in one for a better TTS model	30	Emerging	zero-shot-voice-synthesis	74	Python
2303	jonelo/jAdapterForNativeTTS A simple pure Java library that allows you to use the native Text To Speech...	30	Emerging	java-tts-libraries	12	Java
2304	daslearning-org/text-to-speech-offline A lightweight cross-platform Text-To-Speech application which works on...	30	Emerging	lightweight-tts-libraries	3	Python
2305	FOLLGAD/reddit-video-maker AI video content creation before it was cool	30	Emerging	ai-video-generation	32	JavaScript
2306	black-roland/homeassistant-salutespeech SaluteSpeech integration for Home Assistant providing speech-to-text and...	30	Emerging	home-assistant-tts	17	Python
2307	tts-hub/monotonic_alignment_search Monotonically align text and speech	30	Emerging	asr-evaluation-metrics	4	Python
2308	botbahlul/js-live-audio-video-translate HTML Web template that can RECOGNIZE any live audio/video streaming (using...	30	Emerging	live-meeting-translation	19	JavaScript
2309	Otosaku/OtosakuTTS-iOS Swift library for offline text-to-speech synthesis on iOS/macOS. Generate...	30	Emerging	ios-speech-frameworks	51	Swift
2310	clarinsi/Slovene_ASR_e2e Automatic Speech Recognition tool	30	Emerging	end-to-end-asr-frameworks	20	Python
2311	loushou/flutter_tts_improved A fork of the Flutter_TTS (https://github.com/dlutton/flutter_tts) plugin,...	30	Emerging	educational-voice-apps	10	Java
2312	Julia-Roman/pepega-tts Discord bot for Google and Polly Text-to-Speech	30	Emerging	discord-tts-bots	10	JavaScript
2313	jfainberg/lattice_combination Lattice combination algorithm to combine inaccurate transcripts with...	30	Emerging	automatic-speech-recognition	16	Jupyter Notebook
2314	linto-ai/linto-diarization Speaker diarization service	30	Emerging	whisper-diarization	28	Python
2315	chameleon-ai/vevo Simple GUI for Amphion Vevo	30	Emerging	coqui-tts-applications	14	Python
2316	rshahamiri/SpeechVision Speech Vision (SV) is a Dysarthric Speech Recognition System that adopts a...	30	Emerging	image-caption-generation	9	Jupyter Notebook
2317	daanzu/kaldi_ag_training Docker image and scripts for training finetuned or completely personal Kaldi...	30	Emerging	kaldi-asr-ecosystem	21	Shell
2318	SkyDocs/speaker-identification Speaker Identification using Neural Net.	30	Emerging	keyword-speech-recognition	20	Python
2319	valeriorlandini/sonus A Max/MSP package for sound experimentation and algorithmic composition	30	Emerging	audio-music-learning	16	C++
2320	frrobledo/AutoDub An advanced AI-powered tool that automatically translates and dubs YouTube...	30	Emerging	video-dubbing-tools	16	Python
2321	sahu-adarsh/intervyu Practice job interviews with Neerja, an AI interviewer powered by Claude....	30	Emerging	ai-interview-simulators	3	Python
2322	daswer123/silero-tts-enhanced Silero TTS Enhanced is a Python library that enhances the original Silero...	30	Emerging	gradio-tts-webuis	22	Python
2323	sshh12/Recording-Bot A bot built to record and transcribe audio fragments from Discord.	30	Emerging	discord-tts-bots	27	Python
2324	aws-samples/amazon-transcribe-email-workflow An Amazon Transcribe demo for "speech-to-text" conversion performed through...	30	Emerging	real-time-voice-translation	8	Java
2325	SaptakBhoumik/easySpeech easySpeech is an open-source Python wrapper for google speech to text API...	30	Emerging	speech-recognition-apis	16	Python
2326	OPEXGroup/ITCC.YandexSpeechKitClient Cross-platform client for Yandex SpeechKit Cloud API	30	Emerging	yandex-speechkit-tools	15	C#
2327	thc1006/whisper-colab-tpu-transcriber High-performance Google Colab Notebook for fast & accurate audio...	30	Emerging	whisper-transcription-apps	14	Jupyter Notebook
2328	geekgirljoy/PHP Examples of my PHP Code	30	Emerging	php-tts-libraries	4	PHP
2329	Abhishek-op/SR 💡Kivy-android speech recognition	30	Emerging	automatic-speech-recognition	15	Python
2330	cmsflash/deep-learning-sota State-of-the-art results for deep learning tasks in various fields.	30	Emerging	speech-ai-coursework	15	—
2331	zsl24/Tacotron2-Mandarin-HiFiGAN Implementation of TTS with combination of Tacotron2 and HiFi-GAN	30	Emerging	neural-vocoder-implementations	11	Python
2332	PRITHIVSAKTHIUR/Vision-to-VibeVoice-en A Gradio-based demo for end-to-end vision-to-speech inference: Extract text...	30	Emerging	qwen3-tts-applications	3	Python
2333	robmsmt/SpeechLoop Many ASRs under one roof. With Benchmarking... answering the question. What...	30	Emerging	automatic-speech-recognition	19	Python
2334	JSON2Video/json2video-nodejs-sdk Create videos programmatically in the cloud from NodeJS: add watermarks,...	30	Emerging	ai-video-generation	20	JavaScript
2335	NICEElevateAI/ElevateAIDotNetSDK .Net core 6 SDK for ElevateAI	30	Emerging	dotnet-tts-libraries	115	C#
2336	liou666/audiread 📻 A simple and user-friendly online TTS tool.	30	Emerging	google-tts-libraries	11	TypeScript
2337	brailcom/speechd-el Emacs speech and Braille output interface	30	Emerging	cross-platform-tts-frameworks	14	Emacs Lisp
2338	ARAI-Telegram/teledash-backend-processing Optional AI-powered features of Teledash, an open-source software for...	30	Emerging	telegram-voice-transcription	4	Python
2339	Audio-WestlakeU/UMA-ASR This repository is the official implementation of unimodal aggregation (UMA)...	30	Emerging	automatic-speech-recognition	35	Shell
2340	IndieCoderMM/smart-one-ai 🤖 AI assistant that can listen to user input and provide responses. It...	30	Emerging	voice-controlled-desktop-automation	16	Python
2341	revsic/speechset Numpy-librosa implementation of Speech dataset pipeline	30	Emerging	tts-dataset-creation	9	Python
2342	abinashmeher999/voice-data-extract A command line interface to combine text information from subtitles with...	30	Emerging	speech-recognition-apis	19	Python
2343	jumon/pywer A simple Python package to calculate word error rate (WER).	30	Emerging	asr-evaluation-metrics	5	Python
2344	mikopbx/ModuleRHVoice Text to speech voice generator by the RHVoice algorithm	30	Emerging	php-tts-libraries	2	PHP
2345	tcsenpai/audiocoqui A multilingual tool to convert PDF ebooks to audiobooks using XTTS v2 TTS...	30	Emerging	text-to-speech-conversion	18	Python
2346	eliangerard/simple-tts-mp3 Converts text to mp3 audio using google-tts-api, with no limit	30	Emerging	google-tts-libraries	17	JavaScript
2347	ameerbadri/twilio-asr-realtime-dashboard Twilio ASR and Intent Realtime Dashboard	30	Emerging	ai-tutoring-platforms	15	JavaScript
2348	nico-byte/whisper-web The Whisper Web Transcription Server is a Python-based real-time...	30	Emerging	speech-to-text-converters	3	Python
2349	rapidaai/rapida-python Open-source Python SDK for real-time Voice AI, voice agents, streaming...	30	Emerging	voice-ai-sdks	1	Python
2350	nemoramo/acoustic_model This is a sub-repository in building to create acoustic model in Mandarin...	30	Emerging	end-to-end-asr-frameworks	6	Python
2351	ndenicolais/SpeechAndText Android application built with Kotlin and Jetpack Compose that shows how to...	30	Emerging	android-speech-apps	16	Kotlin
2352	twirapp/silero-tts-api-server This is a simple server that uses Silero models to convert text to audio...	30	Emerging	gradio-tts-webuis	17	Python
2353	jorcelinojunior/whisper-vtt2srt A robust WebVTT to SRT converter optimized for AI transcriptions (Whisper,...	30	Emerging	whisper-subtitle-generation	2	Python
2354	LibraryOfCongress/speech-to-text-viewer AWS Transcribe evaluation pipeline: bulk-process audio files and view the results	30	Emerging	real-time-voice-translation	17	Python
2355	acyclics/speech-to-speech-translator Enables a device to input speech from a microphone, translate speech to a...	30	Emerging	speech-translation-apps	12	C++
2356	build-with-groq/groq-voice-agent-template A real-time voice AI agent built with Groq API that enables natural voice...	30	Emerging	voice-agent-applications	31	Svelte
2357	fano2458/Zhadiger-Kazakh-Language-AI AI services project "Zhadiger" for Kazakh Language developed using NVIDIA...	30	Emerging	ai-tutoring-platforms	4	Python
2358	Martouta/speech_processor Speech-to-text from videos and audios (including YouTube and TikTok links)	30	Emerging	speech-recognition-apis	20	Python
2359	empowerai/fs-middlelayer-api US Forest Service ePermit API	30	Emerging	audio-transcription-apps	5	JavaScript
2360	Adibian/ResGrad Unofficial implementation of ResGrad: Residual Denoising Diffusion...	30	Emerging	fastspeech-tts-models	20	Python
2361	nonwill/GoldenDict-OCR GoldenDict++: Optimizations for faster dictionary loading and searching,...	30	Emerging	ai-powered-ereaders	215	—
2362	kurianbenoy/malayalam_asr_benchmarking A study to benchmark whisper based ASRs in Malayalam	30	Emerging	automatic-speech-recognition	11	Jupyter Notebook
2363	lottev1991/Project-AIdol-Public-English-Dataset Public female English corpus used for Project AI❤dol	30	Emerging	speech-recognition-datasets	14	—
2364	avarayr/yap-for-cursor Yap for Cursor - Voice To Text integration for Cursor IDE	30	Emerging	audio-transcription-apps	35	TypeScript
2365	clloret/speaking-practice An Android application to practice English pronunciation	30	Emerging	android-speech-apps	16	Kotlin
2366	amscotti/hn-podcaster The HackerNews Podcaster is a JavaScript application that utilizes the power...	30	Emerging	content-to-podcast-converters	2	TypeScript
2367	parthgupta1208/VoiceCraft Voice Craft is a desktop AI assistance tool designed to help people with...	30	Emerging	voice-assistant-devices	17	Python
2368	rafaelvalle/asrgen Attacking Speaker Recognition with Deep Generative Models	30	Emerging	neural-vocoder-implementations	34	Jupyter Notebook
2369	I5UCC/VRCTextboxSTT A SpeechToText application that uses OpenAI's whisper via faster-whisper to...	30	Emerging	speech-to-text-converters	63	Python
2370	reyniel26/bleepy Bleepy is a Python program that can block Tagalog and English profanity in...	30	Emerging	meeting-transcription-summarizers	12	Python
2371	qiujiali/lattice_rnn Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation	30	Emerging	ctc-asr-implementations	15	Python
2372	andi611/Conditional-SpecGAN-Tensorflow Text-to-Speech Synthesis by Generating Spectrograms using Generative...	30	Emerging	neural-vocoder-implementations	10	Python
2373	lars76/forced-alignment-chinese Mandarin Chinese audio datasets aligned with Montreal Forced Aligner	30	Emerging	kaldi-asr-ecosystem	17	Python
2374	Yeti47/Vosk4Unity Vosk4Unity is a module for the Unity Engine that provides a simple way to...	30	Emerging	dotnet-tts-libraries	29	C#
2375	xeden3/MSSpeechServer MSSpeechServer is a REST server based on the Microsoft Speech Platform that...	30	Emerging	dotnet-tts-libraries	9	C#
2376	doubleZ0108/Human-Computer-Interaction Human-Computer Interaction | Tongji Univ. SSE Course Projects	30	Emerging	assistive-vision-ai	33	C#
2377	deepily/genie-in-the-box Genie in the Box: Distill Whisper STT => Mistral-7B =>...	30	Emerging	audio-transcription-tools	16	Jupyter Notebook
2378	EuleMitKeule/speaker-recognition Speaker recognition service for Home Assistant using voice embeddings. Train...	30	Emerging	speaker-diarization-embedding	17	Python
2379	jame25/Piper-Tray Piper Tray is a lightweight system tray utility written in C# for use with Piper TTS.	30	Emerging	piper-tts-ecosystem	32	C#
2380	scottgl9/openclaw-matrix-voice Matrix voice call bot with LiveKit, Whisper STT, and Chatterbox TTS,...	30	Emerging	openclaw-voice-assistants	5	TypeScript
2381	zhihanyang2022/gender-audio-classification A speaker gender classifier. MFC feature engineering and a pre-trained...	30	Emerging	keyword-speech-recognition	27	Jupyter Notebook
2382	nezhar/speech-condenser A tool for summarizing dialogues from videos or audio	30	Emerging	meeting-transcription-summarizers	84	Python
2383	FomTarro/word-salad Twitch TTS redeem that uses sentence mixing instead of synthesis.	30	Emerging	twitch-chat-tts	3	JavaScript
2384	EmZod/Speak-Turbo Ultra-fast local TTS for AI agents. ~90ms to first sound.	30	Emerging	voice-ai-agents	16	Python
2385	asaddi/f5-tts-serve A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful...	30	Emerging	self-hosted-tts-servers	14	Python
2386	slackr31337/wyoming-piper-gpu Wyoming Piper docker container with GPU support for Home-Assistant	30	Emerging	piper-tts-ecosystem	54	Python
2387	shafaypro/PYSHA A Simple Virtual Assistant Built in Python 3.5	30	Emerging	general-purpose-voice-assistants	19	Python
2388	NONAN23x/WhisperingNova An AI voice changer harnessing the power of Open AI and VoiceVox for...	30	Emerging	speech-to-text-converters	20	Python
2389	MiguelsPizza/local-transcription-mcp--parakeet-tdt-0.6b-v2-- Local MCP server that converts and transcribes video and audio files 100% on device	30	Emerging	voice-enabled-coding-assistants	10	Python
2390	Rishav-Agarwal/Translate-Language_Translator An Android app that allows you to translate text and phrases between 90+...	30	Emerging	android-speech-apps	13	Java
2391	legekka/GanyuTTS A small VITS+SOVITS/RVC TTS API	30	Emerging	vits-tts-implementations	35	Python
2392	Vishnu-tppr/NEXORA-AI Made with Python, crafted by Vishnu 💻✨ Nexora AI – A smart Python voice...	30	Emerging	general-purpose-voice-assistants	13	Python
2393	koudounasalkis/AI4Voice This repo contains the code for "Voice Disorder Analysis: A...	30	Emerging	audio-classification-transformers	15	Python
2394	msalhab96/MultiSpeech PyTorch implementation for MultiSpeech: Multi-Speaker Text to Speech with...	30	Emerging	fastspeech-tts-models	21	Python
2395	xnmeet/voi A text-to-speech plugin for [Bob](https://bobtranslate.com/) that uses a locally deployed Kokoro model as the speech synthesis service.	30	Emerging	kokoro-tts-ecosystem	11	Python
2396	LuluW8071/Conformer End-to-End Speech Recognition Training with Conformer CTC using PyTorch Lightning⚡	30	Emerging	conformer-asr-implementations	13	Jupyter Notebook
2397	skit-ai/speech-recognition SDKs and docs for Skit's speech to text service	30	Emerging	voice-ai-sdks	21	Python
2398	Yuan-ManX/ComfyUI-ChatterboxTTS ComfyUI-ChatterboxTTS is now available in ComfyUI, Chatterbox is the first...	30	Emerging	comfyui-tts-nodes	13	Python
2399	AsoSoft/AsoSoft-TTS-Speech-Corpus-for-Central-Kurdish AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech	30	Emerging	speech-corpora-datasets	19	—
2400	fernicar/Parakeet_GUI_TINS_Edition A desktop application built using the TINS paradigm for transcribing audio...	30	Emerging	parakeet-asr-implementations	3	Python
