All Voice AI Tools
6,981 tools ranked by quality score · Page 51 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 5001 |
SabaSyed/SpeechAvatarBot
An interactive voice-based chatbot with a visual avatar that runs locally... |
|
Experimental |
| 5002 |
Arushi-Srivastava-16/SpatialAudio
SpatialAudio detects key objects using YOLOv8, identifies their location in... |
|
Experimental |
| 5003 |
vishishttiwari/Android_Application_for_understanding_ASL_using_gesture_recognition
An Android Application that uses gesture recognition to understand alphabets... |
|
Experimental |
| 5004 |
EasyAI-France/Audiobook-Simplifier
Audiobook Simplifier is a tool that creates audiobooks from text documents... |
|
Experimental |
| 5005 |
rossriserose/Real-time-Voice-cloning
Clone a voice to generate arbitrary speech in real-time |
|
Experimental |
| 5006 |
DuyguA/TSD2025-Mind-the-Gap
Innovative ASR model to keep named entities intact, offered as a conference paper. |
|
Experimental |
| 5007 |
Kadir-Atmaca/Asistan-STT-Vosk
Bu depo stt yani speech to text Türkçesiyle sesi yazıya çevirme Türkçe şekilde |
|
Experimental |
| 5008 |
halisuyanik/speech-recognition-note-app-vue.js-regex
Note application that converts voice command to text and performs voice... |
|
Experimental |
| 5009 |
swarnayuroy/Web-Automation-using-speech-recognition
Generate results on web browser i.e. automated after user speaks out the... |
|
Experimental |
| 5010 |
Heatwave114/wazobia-open-speech-mobile
This is an open-source mobile application that augments the wazobia... |
|
Experimental |
| 5011 |
JmKanmo/VoiceRecognitionMemoApp
Speech recognition and memo application |
|
Experimental |
| 5012 |
ItsJamin/another-tts
A program to easily create datasets for training own tts models. |
|
Experimental |
| 5013 |
chirag127/ComicSpeak-AI-Web-Comic-Dubber-Browser-Extension
Transforms web comics into audio with AI-powered OCR and TTS |
|
Experimental |
| 5014 |
NOime22/Web-listen
🎧 AI语音朗读助手 - Chrome浏览器扩展,支持划词朗读和截图OCR朗读 |
|
Experimental |
| 5015 |
neshani/Kitten-Offline-TTS
Kitten Offline Mobile TTS Webapp |
|
Experimental |
| 5016 |
zeeshan020dev/Jarvis-AI-For-Windows-2026
A Python-based voice-controlled AI assistant for Windows using Google Gemini... |
|
Experimental |
| 5017 |
CuteOwOwO/Gale
「讓長輩的每一次伸展,都充滿趣味與溫暖的陪伴。」 |
|
Experimental |
| 5018 |
babadue/seamless-m4t-v2-large-demo
Demonstration features of seamless-m4t-v2-large model |
|
Experimental |
| 5019 |
shujareshi/LearnedCamera
an android application based on machine learning for object recognition... |
|
Experimental |
| 5020 |
chirag127/SpeechFlow-AI-Powered-Text-to-Speech-Browser-Extension
AI-powered text-to-speech browser extension. Transforms web content into... |
|
Experimental |
| 5021 |
zz85/silly-ai
my collection of fully local AI experiments: including a voice-first AI... |
|
Experimental |
| 5022 |
oarthurfc/AI-outgoing-call
An intelligent voice agent that automatically calls leads, promoting... |
|
Experimental |
| 5023 |
dusionlike/unplugin-string-to-audio
在打包过程中自动将字符串转换为语音文件并添加到最终的打包文件里面, 支持Vite and Webpack |
|
Experimental |
| 5024 |
thiswillbeyourgithub/Spotify_tts
Reads title of spotify songs aloud using AI |
|
Experimental |
| 5025 |
nsourlos/voice_cloning_tools
Various tools to clone a voice |
|
Experimental |
| 5026 |
spokestack/spokestack-tray-android
A UI component that makes it easy to add voice interaction to your app. |
|
Experimental |
| 5027 |
EceSenaEtoglu/News-Podcast-Generator
Get breaking news and top headlines in an audited format with this cool bot!... |
|
Experimental |
| 5028 |
host452b/casts_down
Cross-platform CLI to download & transcribe podcasts locally — Apple... |
|
Experimental |
| 5029 |
noor-afshan/video-transcriber
🎥 Transcribe videos quickly with GPU support, offering speaker... |
|
Experimental |
| 5030 |
NJUxlj/hotel-voice-agent-manual
一个RAG语音对话助手,用于上海的旅游信息查询。用户语音输入用ASR转文本,再用智谱api搜知识库+RAG生成回复,最后用TTS转语音输出。 |
|
Experimental |
| 5031 |
emercado72/tts-streamer
Real-time Text-to-Speech streaming with PDF reader, powered by Kokoro-82M |
|
Experimental |
| 5032 |
ruslanmv/VRSecretary
VRSecretary is a production-ready reference implementation for building... |
|
Experimental |
| 5033 |
burritosoftware/mira
A modular text-to-speech Discord bot for Bay Area public transit systems. |
|
Experimental |
| 5034 |
Jobijoba2000/add_dub
Automated video voice-over tool for Windows. Converts subtitles to speech... |
|
Experimental |
| 5035 |
ccj242/Audible-Deaf-Communications
A non-profit app designed to make help the deaf communicate in person and... |
|
Experimental |
| 5036 |
neosun100/fish-speech
🐟 Advanced multilingual Text-to-Speech system with speaker management,... |
|
Experimental |
| 5037 |
voothi/20250421115831-anki-gtts-player
A powerful Anki audio add-on with a 3-tier playback system: prioritizes your... |
|
Experimental |
| 5038 |
ayzem88/text-to-speech-converter
أداة لتحويل النصوص العربية إلى ملفات صوتية باستخدام OpenAI TTS / Tool for... |
|
Experimental |
| 5039 |
mbrotos/SoundSeg
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation |
|
Experimental |
| 5040 |
Entity047/Voice_AI_Creator
Python TTS and voice cloning framework for educational AI/ML demonstrations. |
|
Experimental |
| 5041 |
neosun100/orpheus-tts-docker
Production-ready Docker deployment for Orpheus TTS with GPU management,... |
|
Experimental |
| 5042 |
sap1119/voice_agent_0.02
An open‑source voice AI platform for building real‑time, scalable, and... |
|
Experimental |
| 5043 |
Hauntlight/video_translator
🎥 Translate and dub video audio into another language using AI. Built with... |
|
Experimental |
| 5044 |
MatiousCorp/claude-tts
Text-to-speech plugin for Claude Code — multi-provider support (ElevenLabs,... |
|
Experimental |
| 5045 |
falniak95/TurkishSpeechRecognition
Tamamen Türkçe Konuşma Algılama Sistemi. Google Cloud Platform API desteği... |
|
Experimental |
| 5046 |
Mrzhangxiaoduo/react-native-speech-recognizer
react-native-speech-recognizer |
|
Experimental |
| 5047 |
gregunger-microsoft/Jarvis
AI-powered Microsoft Teams meeting assistant with voice interaction,... |
|
Experimental |
| 5048 |
awesome-german/speaking
Resources and methods to improve spoken German, pronunciation, and real-life... |
|
Experimental |
| 5049 |
Tombarr/TranscriberApp
Local-first macOS Tahoe Transcription App & CLI Tool |
|
Experimental |
| 5050 |
chicogong/ffvoice-engine
🎙️ 高性能 C++ 语音引擎 - 实时音频处理 + AI 语音识别 + 边录边转写 | High-performance C++ voice... |
|
Experimental |
| 5051 |
bonniepeng2002/Apollo
Apollo: your intuitive, virtual nurse. |
|
Experimental |
| 5052 |
sagarpednekar/live-transcript-app
Live Transcription Tool - Real-time speech-to-text transcription |
|
Experimental |
| 5053 |
sse-digital-man/TTS-Core
数字人项目-TTS部分 |
|
Experimental |
| 5054 |
mict-zhaw/chall_e2e_stt
End-to-end ASR experiments for language learning, focusing on... |
|
Experimental |
| 5055 |
Zhima-Mochi/whisper-v3-server
A robust backend server for audio processing, delivering high-accuracy... |
|
Experimental |
| 5056 |
bjornbytes/lua-deepspeech
Lua Library for Speech Recognition |
|
Experimental |
| 5057 |
zhaoyi2/Classical-Speech-Algorithms
Classical speech recognition and speaker recognition algorithms |
|
Experimental |
| 5058 |
Slothologist/AudioSegmenter
Segmentation of audio for a speech pipeline |
|
Experimental |
| 5059 |
GIO443/speech-to-owl
Voice-driven ontology builder. Say “command …” then a sentence (e.g., “the... |
|
Experimental |
| 5060 |
RykerWilder/jarvis
Just A Rather Very Intelligent System |
|
Experimental |
| 5061 |
petitwhito/Speech_to_text_project
Complete Speech-to-Text pipeline: from-scratch architectures (MLP, CNN, RNN,... |
|
Experimental |
| 5062 |
Rohit909-creator/EfficientWordNet_Upgrade
EfficientWordNet enhances wakeword detection with noise-robust similarity... |
|
Experimental |
| 5063 |
dananjaya2002/realtime-voice-assistant
AI-powered desktop voice assistant using OpenAI Whisper and Silero VAD |
|
Experimental |
| 5064 |
MML-Group/code4AVE-Speech
Source Code for AVE Speech Dataset |
|
Experimental |
| 5065 |
webKing021/VoiceFlow-An-Automatic-NLP-Transcriber
VoiceFlow is a Windows push-to-talk voice-to-text application that... |
|
Experimental |
| 5066 |
Nazmul0005/Personal_Voice_Assistant_Mili
Mili is a smart voice assistant built with Python and Google Gemini AI. It... |
|
Experimental |
| 5067 |
cvcwebsolutions/vibe-local
Local voice-to-text with AI-powered text cleanup. Privacy-focused... |
|
Experimental |
| 5068 |
keymastervn/htksupport
Minimal HTK for supporting HTK in Vietnamese. |
|
Experimental |
| 5069 |
lhg96/stt-demo-korean
Korean Speech-to-Text app with Whisper & Vosk | 한국어 음성인식 데모 애플리케이션 |
|
Experimental |
| 5070 |
ayzem88/audio-to-text-converter
أداة متقدمة لتحويل الملفات الصوتية إلى نصوص باستخدام OpenAI Whisper /... |
|
Experimental |
| 5071 |
SunPCSolutions/DiarASR
Enterprise-Grade Secure ASR Diarization Pipeline - HIPAA-compliant speech... |
|
Experimental |
| 5072 |
PrthD/AI-powered-Voice-assisted-Object-Locator
🔍 Real-time object detection with voice command integration using YOLOv5... |
|
Experimental |
| 5073 |
gouhaha/Whisper-App
Windows Whisper transcription app (PyInstaller + ffmpeg) |
|
Experimental |
| 5074 |
vicentezaror/js-web-t2v
Web text to voice utility functions that allows to customize the behavior,... |
|
Experimental |
| 5075 |
alozowski/textplease
Upload an audio/video file, configure settings, and receive a text transcript |
|
Experimental |
| 5076 |
bguerraDev/LoudlyTTS
Native Android app to read your notifications aloud over Bluetooth.... |
|
Experimental |
| 5077 |
lkwbr/structured-prediction
Machine learning algorithms for structured inputs and outputs, such as on... |
|
Experimental |
| 5078 |
sonhm3029/Realtime-Vietnamese-ASR-React-Native-and-Whisper
This project implement end to end realtime vietnamese speech recognition... |
|
Experimental |
| 5079 |
my-north-ai/semantic_audio_filtering
Synthetic data augmentation technique via LLM for Automatic Speech... |
|
Experimental |
| 5080 |
toledomauricio/fastapi-whisper-ollama
FastAPI + Whisper + Ollama: Audio transcription and LLM processing API.... |
|
Experimental |
| 5081 |
wangjialiang678/speaklow-macvoiceinput
SpeakLow — a lightweight macOS menu bar app for voice-to-text input. Press a... |
|
Experimental |
| 5082 |
bobbymay/Dictation-for-macOS
Speech Recognition for macOS that allows you to define words, phrases, or... |
|
Experimental |
| 5083 |
KarinBrisker/Video-Subtitler
Automatically Generating Multilingual Subtitles Using OpenAI's Whisper and... |
|
Experimental |
| 5084 |
labrijisaad/Youtube-video-transcriptor
In this notebook, I implemented a script to transcribe YouTube videos (and... |
|
Experimental |
| 5085 |
luizomf/sussu
CLI educacional para transcrição com OpenAI Whisper |
|
Experimental |
| 5086 |
ElmiraGhorbani/gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4)... |
|
Experimental |
| 5087 |
Caliope-SpeechProcessingLab/SpeechTester
Speech Tester is a set of Python scripts conceived as an extension to HTK... |
|
Experimental |
| 5088 |
OpenVoiceOS/ovos-tts-plugin-SAM
S.A.M - Software Automatic Mouth |
|
Experimental |
| 5089 |
code-spirit-369/text-to-speech-yt
This AI TTS web application allows you to convert any text into realistic,... |
|
Experimental |
| 5090 |
talhabinjaved/voice-ai-agents-openai-telnyx
A FastAPI starter that turns a Telnyx phone number into a realtime,... |
|
Experimental |
| 5091 |
vipyne/american-dream-phone
An AI voice agent to help you call your political representatives. |
|
Experimental |
| 5092 |
wis/speak
a browser extension designed for minimal clicks or presses to start reading... |
|
Experimental |
| 5093 |
Jmi2020/HowdyVox
A privacy focused offline STT TTS interface for your favorite LLM |
|
Experimental |
| 5094 |
Vitgracer/Offline-Voice-LLM-Assistant
Running small but capable language models entirely offline |
|
Experimental |
| 5095 |
Daeels/Smart-E-commerce-Microservices-App
This project is an E-commerce App using the microservices architecture. |
|
Experimental |
| 5096 |
language-org/voice-activ-detect-deepnet
ASR: Light deep net for real-time voice activity detection |
|
Experimental |
| 5097 |
cagataygedik/TTS
Internship Text-to-Speech research project. |
|
Experimental |
| 5098 |
daniel-szulc/Speech_Recognition
🎙 Automatic Keyword Speech Recognition for Polish and English in Tensorflow 🧠 |
|
Experimental |
| 5099 |
shashankchandak/AutoSMSReader
An android application that allows users to read all incoming messages loudly |
|
Experimental |
| 5100 |
pig-mesh/volcengine-tts-spring-boot-starter
火山引擎语音合成(TTS)服务集成 |
|
Experimental |