All Voice AI Tools

8,525 tools ranked by quality score · Page 74 of 86

Showing 7301–7400 of 8,525

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
7301	Vlad1343/Sign-Wave Real-time Ukrainian Sign Language translator using computer vision and...	13	Experimental	sign-language-recognition	—	Python
7302	Sumit0ubey/TorvixAI TorchAI is an Android app that combines AI chat and voice assistance with...	13	Experimental	ai-tutoring-platforms	2	Kotlin
7303	funkyfranky/TTS-Radio Create voice overs with radio effects for DCS	13	Experimental	lightweight-tts-libraries	4	Python
7304	cser245086272/ComfyUI-FL-Qwen3TTS 🎤 Create realistic text-to-speech outputs with advanced voice cloning and...	13	Experimental	developer-portfolio-projects	—	—
7305	fclaeys/nix-nerd-dictation 🎤 Nix flake for offline French speech-to-text with nerd-dictation....	13	Experimental	voice-dictation-typing	—	Nix
7306	harlanx/voice_recorder_recognizer An audio recorder and speech to text with commands recognition created using...	13	Experimental	educational-voice-apps	9	Dart
7307	eddiedunn/transcribe [DEPRECATED — superseded by diarized_transcriber] Audio-to-text...	13	Experimental	video-transcription-extraction	—	—
7308	ItxMatti/tts 🗣️ Deploy high-quality text-to-speech services with Gemini, OpenAI, and...	13	Experimental	text-to-speech-tts	—	JavaScript
7309	traceypooh/audio2text creates text from audio of A/V input file, using docker, sphinx. extracts...	13	Experimental	real-time-voice-translation	10	Shell
7310	hannabdul/etf4asr Official repo for the paper "An Effective Training Framework for...	13	Experimental	end-to-end-asr-frameworks	8	Lex
7311	AnshGaikwad/Personal-Voice-Assistant Personal Voice Assistant: Easy to change the code and making it suitable for...	13	Experimental	general-purpose-voice-assistants	9	Python
7312	di37/speech-to-text-fine-tuning-on-unseen-language This projects aims to show how whisper model can be fine-tuned on language...	13	Experimental	video-content-intelligence	11	Jupyter Notebook
7313	Diluksha-Upeka/Voxis Voxis is an intelligent voice assistant powered by Groq's AI models,...	13	Experimental	conversational-chatbot-applications	—	JavaScript
7314	MichaelMBrown/VoiceLab Local Apple Silicon voice studio for Qwen3-TTS with a FastAPI backend and...	13	Experimental	qwen3-tts-applications	—	Python
7315	TJ-Neary/TommyTalker-Pro Privacy-first voice-to-text for macOS — local STT via mlx-whisper with...	13	Experimental	local-voice-dictation	—	—
7316	Karan36k/text2speech A Basic But Useful Online Text to Speech Converter with a male voice...	13	Experimental	web-speech-api-tts	11	HTML
7317	Srinath-N-R/IPA-Wav2Vec2-Phoneme-Recognition End-to-end IPA-based phoneme recognition pipeline using Wav2Vec2, featuring...	13	Experimental	llm-implementation-tutorials	11	Python
7318	IshaanLabs/Text-to-Speech-TTS Open Source Text-to-Speech (TTS) repository	13	Experimental	text-to-speech-tts	—	Jupyter Notebook
7319	NimbleAINinja/swift-scribe-rs Fast, on-device speech-to-text transcription for macOS using Apple's Speech framework	13	Experimental	local-voice-dictation	2	Rust
7320	Gokila-S/smart-translate Smart Translator is a modern MERN stack application that allows users to...	13	Experimental	live-meeting-translation	—	JavaScript
7321	Rayyan9477/speech-app AI Language Processor is a powerful application that leverages...	13	Experimental	audio-transcription-apps	3	TypeScript
7322	rk-vashista/TTS-Story_Generator A versatile app that converts images into short stories and lifelike audio...	13	Experimental	text-to-speech-tts	—	Python
7323	hongkongkiwi/scoop-elevenlabs-cli Official Scoop bucket for installing elevenlabs-cli on Windows.	13	Experimental	elevenlabs-integrations	—	—
7324	oddvoices/oddvoices An indie singing synthesizer	13	Experimental	espeak-ng-ecosystem	11	—
7325	bivex/whisper-large-v3-turbo Whisper Large V3 Turbo - fast speech-to-text model implementation with...	13	Experimental	whisper-fine-tuning	—	Python
7326	labestia2/Qwen3-Audiobook-Converter 🎧 Convert various document formats into high-quality audiobooks with Qwen3...	13	Experimental	stable-diffusion-tools	—	—
7327	upskaling/voice-keyboard an interface for nerd-dictation in gtk	13	Experimental	rust-speech-recognition	—	Rust
7328	Her-mia/Imgspeaker An Android app written in Kotlin that performs OCR on Simplified Chinese...	13	Experimental	android-voice-assistants	5	Kotlin
7329	maycondata/apontamento-op-por-voz Apontamento de produção por voz (Whisper STT + gTTS) com confirmação e...	13	Experimental	speech-to-text-converters	—	Jupyter Notebook
7330	akhilachiju/AI-Audio-Transcriber Audio transcription app using Whisper AI for accurate speech-to-text...	13	Experimental	whisper-transcription-apps	—	JavaScript
7331	metacore-stack/Voice-to-Insights Enterprise AI platform that transforms audio meetings into structured...	13	Experimental	text-to-speech-tts	—	Python
7332	anhuynh219/vietnamese_SVS Demo page for ViSVS: ON AUTOMATIC VIETNAMESE SINGING VOICE SYNTHESIS	13	Experimental	speech-synthesis-diffusion	—	HTML
7333	DemoL2004/Serverless-Content-Generation-Distribution-Pipeline Cloud-native media automation system integrating Reddit, ElevenLabs TTS,...	13	Experimental	ai-video-generation	—	Python
7334	Himanshi-2519/Speech-To-Text-API Capturing the Rhythm of your words. Real-time AI transcription with a...	13	Experimental	text-to-speech-conversion	—	JavaScript
7335	walid-hamdi/fluener_ai-service FastAPI AI microservice for language learning - Provides speech-to-text...	13	Experimental	speech-to-text-converters	—	Python
7336	RedDotz20/speech-to-text-recognition 🎤 Effortlessly integrate speech recognition capabilities into your React...	13	Experimental	react-speech-recognition	—	TypeScript
7337	mocarlaura-source/parakeet 🐦 Customize Fedora Silverblue with niri DE tailored for FriendlyElec NanopPC...	13	Experimental	parakeet-asr-implementations	—	—
7338	stefanpietrusky/QUEST Repository for the QUEST App prototype.	13	Experimental	voice-agent-applications	—	Python
7339	joachimhodana/rtTranslator Simple overlay for Windows, that listens for background sound and translates...	13	Experimental	real-time-voice-translation	9	Python
7340	THE-DEEPDAS/RealTime-Voice-Assistant Voice-activated assistant using Groq API, Streamlit UI, speech recognition, and TTS	13	Experimental	voice-agent-applications	3	Python
7341	SuperKabman/audioNote AI enabled notes taking app	13	Experimental	ai-content-writing	4	JavaScript
7342	elloza/slides2video-pinokio-script Pinokio script for installing the app slides2video	13	Experimental	ai-video-generation	4	JavaScript
7343	morelen17/tts-papers List of papers about TTS / Список статей о TTS	13	Experimental	zero-shot-voice-synthesis	10	—
7344	saroshfarhan/story-teller Story-Teller	13	Experimental	ibm-watson-speech	—	Jupyter Notebook
7345	x-phone/demos Working examples and tutorials for the x-phone ecosystem — xphone-go,...	13	Experimental	deepgram-starter-projects	—	Go
7346	unicodeveloper/voicery Play with voices. Speak any language. Clone your vibe.	13	Experimental	text-to-speech-conversion	5	TypeScript
7347	sj2tpgk/voiceroid-docker Voiceroid+ in docker on X64/Arm linux + web interface (mirrored from...	13	Experimental	coqui-tts-applications	—	Shell
7348	AbhiramMandala/virtual_assistant Voice-controlled virtual assistant built with Python using speech...	13	Experimental	general-purpose-voice-assistants	—	Python
7349	onwurahben/meeting-assistant Transform raw meeting audio into speaker-aware transcripts, summaries, and...	13	Experimental	meeting-transcription-automation	—	Python
7350	NafisRayan/AI-Voice-Assistant-ST AI voice assistant made with Streamlit python and powered by Gemini, Mistral...	13	Experimental	voice-controlled-desktop-automation	13	Python
7351	madebyaris/dsw-voice Real-time voice noise reduction app for macOS with virtual microphone support	13	Experimental	speaker-diarization-embedding	2	Swift
7352	manhph2211/ViTTS In this repo, I developed a step-by-step pipeline for a standard...	13	Experimental	tts-model-finetuning	12	Python
7353	kiraping1337/ChatTwitchTTS Twitch TTS бот с клонированием голоса через XTTS v2. Озвучивание сообщений...	13	Experimental	twitch-chat-tts	2	Python
7354	mccvliqht/signifeye-capstone a capstone project about real-time sign language translator using camera	13	Experimental	sign-language-translation	—	JavaScript
7355	karim23657/ParsiGoo ParsiGoo is a Persian multispeaker dataset for text-to-speech purposes. It...	13	Experimental	voice-cloning-synthesis	10	—
7356	heroic-differentialdiagnosis696/MeetingMindAI Capture, transcribe, and summarize meetings effortlessly with MeetingMindAI,...	13	Experimental	meeting-transcription-summarizers	—	—
7357	YossefMohamed/covid-app-api An Api for testing covid using cough sound	13	Experimental	covid-19-prediction-ml	9	TypeScript
7358	akhileshmanitiwari06/InterviewMentor-AI InterviewMentor AI is an intelligent mock interview assistant designed for...	13	Experimental	ai-interview-simulators	—	Jupyter Notebook
7359	nashalexander/PersonaSpeak Simple but comprehensive TTS GUI tool for use with modern models	13	Experimental	lightweight-tts-libraries	—	Python
7360	abhiFSD/VoiceForge 🎙️ Real-time AI voice assistant — Speak → Whisper STT → Gemini Flash → Edge...	13	Experimental	voice-agent-applications	—	Python
7361	sridattb96/MeetingStory A project I built while doing research for a professor in the Visual &...	13	Experimental	meeting-transcription-summarizers	14	Python
7362	shujaatsunasra/ai-based-expensetracker luminous_flow leverages a multi-layered AI pipeline to deliver personalized,...	13	Experimental	educational-voice-apps	—	Dart
7363	dae9999nam/Memory-Garden This repository is to provide service, Memory-Garden, that create narratives...	13	Experimental	image-to-speech-synthesis	—	Python
7364	ca0wx/Gemini-Talker-Chat 🎙️ Gemini Talker Chat: Ollama ve Edge-TTS tabanlı, gerçek zamanlı sesli...	13	Experimental	multimodal-medical-assistants	—	Python
7365	remsky/prebuilt_tts_wheels Prebult wheels for dependencies of TTS service; Kokoro-FastAPI	13	Experimental	kokoro-tts-ecosystem	3	Dockerfile
7366	max-lt/voxtral-cpp Local implementation for voxtral	13	Experimental	lightweight-tts-runtimes	2	C++
7367	pukaa900/reagana Ko taqaku konqamatuqa mo nqaaqaku meqa.	13	Experimental	lightweight-tts-runtimes	—	JavaScript
7368	RamirJunior/idox-ia-project Projeto MVP com processamento de áudio com IA local	13	Experimental	ai-powered-saas-startups	—	Java
7369	duanxianpi/AI-Voice-Diary Using voice to keep a journal.	13	Experimental	voice-chatbot-applications	9	Python
7370	carlfm01/my-speech-datasets My public domain speech index	13	Experimental	speech-corpora-datasets	13	—
7371	lianghsun/cosyvoice3-api FastAPI wrapper for Fun-CosyVoice3-0.5B: zero-shot voice cloning TTS with...	13	Experimental	coqui-tts-applications	—	Python
7372	nipponjo/tts-german-pytorch 🎙️ German TTS (FastPitch) with Thorsten voice / emotional	13	Experimental	zero-shot-voice-synthesis	9	Python
7373	muurakami/momokiki Open source language learning app — Duolingo alternative with offline...	13	Experimental	android-voice-assistants	—	Dart
7374	Mormolykos/bedvibe-datasets Multilingual emotional speech datasets for TTS training	13	Experimental	speech-corpora-datasets	—	—
7375	kjanjua26/HearPapers HearPapers allows you to listen to PDFs (by converting them to audiobooks,...	13	Experimental	pdf-to-audio-conversion	9	Python
7376	amay09x/TheNewsCoo TheNewsCoo is a desktop AI application that helps users quickly understand...	13	Experimental	news-audio-bulletins	—	Python
7377	BenjaminDanker/Audio-Cleaner-Web AI-powered video audio noise reduction in the cloud using DeepFilterNet3 and...	13	Experimental	audio-noise-reduction	—	JavaScript
7378	LauraKokkarinen/AzureAI.TextToSpeech A console application for converting long-form plain-text files into speech...	13	Experimental	dotnet-tts-libraries	—	C#
7379	Thisen-Ekanayake/sinhala-vision-assist Vision–language assistive pipeline that answers Sinhala voice questions...	13	Experimental	assistive-vision-ai	—	Python
7380	RutronikSystemSolutions/RDK3_BLE_EnOcean Project used to illustrate how to use a RDK3 to interact with EnOcean BLE...	13	Experimental	embedded-tts-systems	—	Assembly
7381	Rumeysakeskin/ASR-Quantization Post-training quantization on Nvidia Nemo ASR model	13	Experimental	automatic-speech-recognition	9	Jupyter Notebook
7382	danielrosehill/ASR-And-STT-AI-Notebook Propmts and outputs (and some notes) on STT + ASR + fine-tuning. LLM: Claude	13	Experimental	speech-ai-coursework	2	Python
7383	NAJL123/voice-ai-assistant Local Voice AI Assistant — faster-whisper STT + Ollama LLM + pyttsx3 TTS	13	Experimental	local-voice-assistants	—	Python
7384	Priyanshu-Yadav19/Call-Voice-Agent Real-time AI Voice Agent using Streaming STT, LLM-based conversation...	13	Experimental	voice-agent-applications	—	Python
7385	laafeiak/ai_text_reader text	13	Experimental	web-speech-api-tts	—	JavaScript
7386	namphung134/ASR-Vietnamese Fine-tuning the openai/whisper-small model on the 250h dataset for...	13	Experimental	whisper-fine-tuning	3	Jupyter Notebook
7387	Giuseppe-Della-Corte/IESTAC A corpus that can be used to train English-to-Italian End-to-End...	13	Experimental	speech-corpora-datasets	11	—
7388	N1kOk/WhispeRu Голос — в текст. Приватно. Локально. Моментально.	13	Experimental	voice-dictation-typing	—	—
7389	allvoicelab/allvoicelab AI-powered audio creation platform offering TTS, Voice Cloning, Voice...	13	Experimental	news-audio-bulletins	6	—
7390	metacore-stack/AuraVoice Production-grade on-device AI meeting assistant featuring real-time...	13	Experimental	text-to-speech-tts	—	Python
7391	jaychampaneri14/ai-voice-cloning Text-to-speech with multiple voice styles using gTTS and pyttsx3	13	Experimental	voice-cloning-tools	—	Python
7392	SMIL-SPCRAS/DAVIS Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle...	13	Experimental	automatic-speech-recognition	9	JavaScript
7393	JonPark0/web_audio_splitter AI-powered audio source separation using Meta Demucs - Split songs into...	13	Experimental	audio-source-separation	—	JavaScript
7394	kocharvishal/Fast-Speech-Transcription-Grammar-Scoring-Engine Built a transcription system using OpenAI’s Whisper and Fine-tuned...	13	Experimental	audio-transcription-tools	—	Jupyter Notebook
7395	lymcho/story-to-video Create a fully narrated YouTube audiobook channel in one command. AI...	13	Experimental	ai-video-generation	—	Python
7396	AleefBilal/tts_srt_gen A runpod serverless docker that generates TTS using chatterbox-tts along with .srt	13	Experimental	self-hosted-tts-servers	—	Python
7397	plandanogtav1-cmd/Conversational-For-Librechat 🎙 Headless real-time voice pipeline for LibreChat — LiveKit WebRTC +...	13	Experimental	ai-avatar-platforms	2	TypeScript
7398	iamvon/AudioRead Turn PDFs into audio with chunked LLMs and OpenAI TTS	13	Experimental	ebook-to-audiobook-conversion	—	Python
7399	adityakamat24/RTGX-Real-Time-Glossary-eXplainer RTGX is an AI-powered real-time glossary explainer that adds contextual...	13	Experimental	live-meeting-translation	—	JavaScript
7400	palaashatri/jvosk Audio transcription using Vosk. Built with Swing.	13	Experimental	vosk-asr-implementations	—	Java

« Prev 1 2 3 … 72 73 74 75 76 … 84 85 86 Next »