All Voice AI Tools
8,525 tools ranked by quality score · Page 74 of 86
| # | Tool | Score | Tier |
|---|---|---|---|
| 7301 | Vlad1343/Sign-Wave: Real-time Ukrainian Sign Language translator using computer vision and... | | Experimental |
| 7302 | Sumit0ubey/TorvixAI: TorchAI is an Android app that combines AI chat and voice assistance with... | | Experimental |
| 7303 | funkyfranky/TTS-Radio: Create voice overs with radio effects for DCS | | Experimental |
| 7304 | cser245086272/ComfyUI-FL-Qwen3TTS: 🎤 Create realistic text-to-speech outputs with advanced voice cloning and... | | Experimental |
| 7305 | fclaeys/nix-nerd-dictation: 🎤 Nix flake for offline French speech-to-text with nerd-dictation.... | | Experimental |
| 7306 | harlanx/voice_recorder_recognizer: An audio recorder and speech to text with commands recognition created using... | | Experimental |
| 7307 | eddiedunn/transcribe: [DEPRECATED — superseded by diarized_transcriber] Audio-to-text... | | Experimental |
| 7308 | ItxMatti/tts: 🗣️ Deploy high-quality text-to-speech services with Gemini, OpenAI, and... | | Experimental |
| 7309 | traceypooh/audio2text: Creates text from the audio of an A/V input file, using Docker and Sphinx. Extracts... | | Experimental |
| 7310 | hannabdul/etf4asr: Official repo for the paper "An Effective Training Framework for... | | Experimental |
| 7311 | AnshGaikwad/Personal-Voice-Assistant: Personal Voice Assistant: Easy to change the code and making it suitable for... | | Experimental |
| 7312 | di37/speech-to-text-fine-tuning-on-unseen-language: This project aims to show how the Whisper model can be fine-tuned on language... | | Experimental |
| 7313 | Diluksha-Upeka/Voxis: Voxis is an intelligent voice assistant powered by Groq's AI models,... | | Experimental |
| 7314 | MichaelMBrown/VoiceLab: Local Apple Silicon voice studio for Qwen3-TTS with a FastAPI backend and... | | Experimental |
| 7315 | TJ-Neary/TommyTalker-Pro: Privacy-first voice-to-text for macOS — local STT via mlx-whisper with... | | Experimental |
| 7316 | Karan36k/text2speech: A Basic But Useful Online Text to Speech Converter with a male voice... | | Experimental |
| 7317 | Srinath-N-R/IPA-Wav2Vec2-Phoneme-Recognition: End-to-end IPA-based phoneme recognition pipeline using Wav2Vec2, featuring... | | Experimental |
| 7318 | IshaanLabs/Text-to-Speech-TTS: Open Source Text-to-Speech (TTS) repository | | Experimental |
| 7319 | NimbleAINinja/swift-scribe-rs: Fast, on-device speech-to-text transcription for macOS using Apple's Speech framework | | Experimental |
| 7320 | Gokila-S/smart-translate: Smart Translator is a modern MERN stack application that allows users to... | | Experimental |
| 7321 | Rayyan9477/speech-app: AI Language Processor is a powerful application that leverages... | | Experimental |
| 7322 | rk-vashista/TTS-Story_Generator: A versatile app that converts images into short stories and lifelike audio... | | Experimental |
| 7323 | hongkongkiwi/scoop-elevenlabs-cli: Official Scoop bucket for installing elevenlabs-cli on Windows. | | Experimental |
| 7324 | oddvoices/oddvoices: An indie singing synthesizer | | Experimental |
| 7325 | bivex/whisper-large-v3-turbo: Whisper Large V3 Turbo - fast speech-to-text model implementation with... | | Experimental |
| 7326 | labestia2/Qwen3-Audiobook-Converter: 🎧 Convert various document formats into high-quality audiobooks with Qwen3... | | Experimental |
| 7327 | upskaling/voice-keyboard: An interface for nerd-dictation in GTK | | Experimental |
| 7328 | Her-mia/Imgspeaker: An Android app written in Kotlin that performs OCR on Simplified Chinese... | | Experimental |
| 7329 | maycondata/apontamento-op-por-voz: Voice-based production reporting (Whisper STT + gTTS) with confirmation and... | | Experimental |
| 7330 | akhilachiju/AI-Audio-Transcriber: Audio transcription app using Whisper AI for accurate speech-to-text... | | Experimental |
| 7331 | metacore-stack/Voice-to-Insights: Enterprise AI platform that transforms audio meetings into structured... | | Experimental |
| 7332 | anhuynh219/vietnamese_SVS: Demo page for ViSVS: ON AUTOMATIC VIETNAMESE SINGING VOICE SYNTHESIS | | Experimental |
| 7333 | DemoL2004/Serverless-Content-Generation-Distribution-Pipeline: Cloud-native media automation system integrating Reddit, ElevenLabs TTS,... | | Experimental |
| 7334 | Himanshi-2519/Speech-To-Text-API: Capturing the Rhythm of your words. Real-time AI transcription with a... | | Experimental |
| 7335 | walid-hamdi/fluener_ai-service: FastAPI AI microservice for language learning - Provides speech-to-text... | | Experimental |
| 7336 | RedDotz20/speech-to-text-recognition: 🎤 Effortlessly integrate speech recognition capabilities into your React... | | Experimental |
| 7337 | mocarlaura-source/parakeet: 🐦 Customize Fedora Silverblue with niri DE tailored for FriendlyElec NanopPC... | | Experimental |
| 7338 | stefanpietrusky/QUEST: Repository for the QUEST App prototype. | | Experimental |
| 7339 | joachimhodana/rtTranslator: Simple overlay for Windows that listens for background sound and translates... | | Experimental |
| 7340 | THE-DEEPDAS/RealTime-Voice-Assistant: Voice-activated assistant using Groq API, Streamlit UI, speech recognition, and TTS | | Experimental |
| 7341 | SuperKabman/audioNote: AI-enabled note-taking app | | Experimental |
| 7342 | elloza/slides2video-pinokio-script: Pinokio script for installing the app slides2video | | Experimental |
| 7343 | morelen17/tts-papers: List of papers about TTS | | Experimental |
| 7344 | saroshfarhan/story-teller: Story-Teller | | Experimental |
| 7345 | x-phone/demos: Working examples and tutorials for the x-phone ecosystem — xphone-go,... | | Experimental |
| 7346 | unicodeveloper/voicery: Play with voices. Speak any language. Clone your vibe. | | Experimental |
| 7347 | sj2tpgk/voiceroid-docker: Voiceroid+ in docker on X64/Arm linux + web interface (mirrored from... | | Experimental |
| 7348 | AbhiramMandala/virtual_assistant: Voice-controlled virtual assistant built with Python using speech... | | Experimental |
| 7349 | onwurahben/meeting-assistant: Transform raw meeting audio into speaker-aware transcripts, summaries, and... | | Experimental |
| 7350 | NafisRayan/AI-Voice-Assistant-ST: AI voice assistant made with Streamlit python and powered by Gemini, Mistral... | | Experimental |
| 7351 | madebyaris/dsw-voice: Real-time voice noise reduction app for macOS with virtual microphone support | | Experimental |
| 7352 | manhph2211/ViTTS: In this repo, I developed a step-by-step pipeline for a standard... | | Experimental |
| 7353 | kiraping1337/ChatTwitchTTS: Twitch TTS bot with voice cloning via XTTS v2. Voices messages... | | Experimental |
| 7354 | mccvliqht/signifeye-capstone: A capstone project about a real-time sign language translator using a camera | | Experimental |
| 7355 | karim23657/ParsiGoo: ParsiGoo is a Persian multispeaker dataset for text-to-speech purposes. It... | | Experimental |
| 7356 | heroic-differentialdiagnosis696/MeetingMindAI: Capture, transcribe, and summarize meetings effortlessly with MeetingMindAI,... | | Experimental |
| 7357 | YossefMohamed/covid-app-api: An API for testing for COVID using cough sounds | | Experimental |
| 7358 | akhileshmanitiwari06/InterviewMentor-AI: InterviewMentor AI is an intelligent mock interview assistant designed for... | | Experimental |
| 7359 | nashalexander/PersonaSpeak: Simple but comprehensive TTS GUI tool for use with modern models | | Experimental |
| 7360 | abhiFSD/VoiceForge: 🎙️ Real-time AI voice assistant — Speak → Whisper STT → Gemini Flash → Edge... | | Experimental |
| 7361 | sridattb96/MeetingStory: A project I built while doing research for a professor in the Visual &... | | Experimental |
| 7362 | shujaatsunasra/ai-based-expensetracker: luminous_flow leverages a multi-layered AI pipeline to deliver personalized,... | | Experimental |
| 7363 | dae9999nam/Memory-Garden: This repository provides a service, Memory-Garden, that creates narratives... | | Experimental |
| 7364 | ca0wx/Gemini-Talker-Chat: 🎙️ Gemini Talker Chat: Ollama and Edge-TTS based, real-time voice... | | Experimental |
| 7365 | remsky/prebuilt_tts_wheels: Prebuilt wheels for dependencies of TTS service; Kokoro-FastAPI | | Experimental |
| 7366 | max-lt/voxtral-cpp: Local implementation for voxtral | | Experimental |
| 7367 | pukaa900/reagana: Ko taqaku konqamatuqa mo nqaaqaku meqa. | | Experimental |
| 7368 | RamirJunior/idox-ia-project: MVP project for audio processing with local AI | | Experimental |
| 7369 | duanxianpi/AI-Voice-Diary: Using voice to keep a journal. | | Experimental |
| 7370 | carlfm01/my-speech-datasets: My public domain speech index | | Experimental |
| 7371 | lianghsun/cosyvoice3-api: FastAPI wrapper for Fun-CosyVoice3-0.5B: zero-shot voice cloning TTS with... | | Experimental |
| 7372 | nipponjo/tts-german-pytorch: 🎙️ German TTS (FastPitch) with Thorsten voice / emotional | | Experimental |
| 7373 | muurakami/momokiki: Open source language learning app — Duolingo alternative with offline... | | Experimental |
| 7374 | Mormolykos/bedvibe-datasets: Multilingual emotional speech datasets for TTS training | | Experimental |
| 7375 | kjanjua26/HearPapers: HearPapers allows you to listen to PDFs (by converting them to audiobooks,... | | Experimental |
| 7376 | amay09x/TheNewsCoo: TheNewsCoo is a desktop AI application that helps users quickly understand... | | Experimental |
| 7377 | BenjaminDanker/Audio-Cleaner-Web: AI-powered video audio noise reduction in the cloud using DeepFilterNet3 and... | | Experimental |
| 7378 | LauraKokkarinen/AzureAI.TextToSpeech: A console application for converting long-form plain-text files into speech... | | Experimental |
| 7379 | Thisen-Ekanayake/sinhala-vision-assist: Vision–language assistive pipeline that answers Sinhala voice questions... | | Experimental |
| 7380 | RutronikSystemSolutions/RDK3_BLE_EnOcean: Project used to illustrate how to use a RDK3 to interact with EnOcean BLE... | | Experimental |
| 7381 | Rumeysakeskin/ASR-Quantization: Post-training quantization on Nvidia Nemo ASR model | | Experimental |
| 7382 | danielrosehill/ASR-And-STT-AI-Notebook: Prompts and outputs (and some notes) on STT + ASR + fine-tuning. LLM: Claude | | Experimental |
| 7383 | NAJL123/voice-ai-assistant: Local Voice AI Assistant — faster-whisper STT + Ollama LLM + pyttsx3 TTS | | Experimental |
| 7384 | Priyanshu-Yadav19/Call-Voice-Agent: Real-time AI Voice Agent using Streaming STT, LLM-based conversation... | | Experimental |
| 7385 | laafeiak/ai_text_reader: text | | Experimental |
| 7386 | namphung134/ASR-Vietnamese: Fine-tuning the openai/whisper-small model on the 250h dataset for... | | Experimental |
| 7387 | Giuseppe-Della-Corte/IESTAC: A corpus that can be used to train English-to-Italian End-to-End... | | Experimental |
| 7388 | N1kOk/WhispeRu: Voice to text. Private. Local. Instant. | | Experimental |
| 7389 | allvoicelab/allvoicelab: AI-powered audio creation platform offering TTS, Voice Cloning, Voice... | | Experimental |
| 7390 | metacore-stack/AuraVoice: Production-grade on-device AI meeting assistant featuring real-time... | | Experimental |
| 7391 | jaychampaneri14/ai-voice-cloning: Text-to-speech with multiple voice styles using gTTS and pyttsx3 | | Experimental |
| 7392 | SMIL-SPCRAS/DAVIS: Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle... | | Experimental |
| 7393 | JonPark0/web_audio_splitter: AI-powered audio source separation using Meta Demucs - Split songs into... | | Experimental |
| 7394 | kocharvishal/Fast-Speech-Transcription-Grammar-Scoring-Engine: Built a transcription system using OpenAI’s Whisper and Fine-tuned... | | Experimental |
| 7395 | lymcho/story-to-video: Create a fully narrated YouTube audiobook channel in one command. AI... | | Experimental |
| 7396 | AleefBilal/tts_srt_gen: A runpod serverless docker that generates TTS using chatterbox-tts along with .srt | | Experimental |
| 7397 | plandanogtav1-cmd/Conversational-For-Librechat: 🎙 Headless real-time voice pipeline for LibreChat — LiveKit WebRTC +... | | Experimental |
| 7398 | iamvon/AudioRead: Turn PDFs into audio with chunked LLMs and OpenAI TTS | | Experimental |
| 7399 | adityakamat24/RTGX-Real-Time-Glossary-eXplainer: RTGX is an AI-powered real-time glossary explainer that adds contextual... | | Experimental |
| 7400 | palaashatri/jvosk: Audio transcription using Vosk. Built with Swing. | | Experimental |