All Voice AI Tools

6,981 tools ranked by quality score · Page 51 of 70

Showing 5001–5100 of 6,981
# Tool Score Tier
5001 SabaSyed/SpeechAvatarBot

An interactive voice-based chatbot with a visual avatar that runs locally...

16
Experimental
5002 Arushi-Srivastava-16/SpatialAudio

SpatialAudio detects key objects using YOLOv8, identifies their location in...

16
Experimental
5003 vishishttiwari/Android_Application_for_understanding_ASL_using_gesture_recognition

An Android Application that uses gesture recognition to understand alphabets...

16
Experimental
5004 EasyAI-France/Audiobook-Simplifier

Audiobook Simplifier is a tool that creates audiobooks from text documents...

16
Experimental
5005 rossriserose/Real-time-Voice-cloning

Clone a voice to generate arbitrary speech in real-time

16
Experimental
5006 DuyguA/TSD2025-Mind-the-Gap

Innovative ASR model to keep named entities intact, offered as a conference paper.

16
Experimental
5007 Kadir-Atmaca/Asistan-STT-Vosk

This repository covers STT, i.e. speech to text (converting voice to text), for Turkish

16
Experimental
5008 halisuyanik/speech-recognition-note-app-vue.js-regex

Note application that converts voice command to text and performs voice...

16
Experimental
5009 swarnayuroy/Web-Automation-using-speech-recognition

Generates results in a web browser, i.e. automated after the user speaks out the...

16
Experimental
5010 Heatwave114/wazobia-open-speech-mobile

This is an open-source mobile application that augments the wazobia...

16
Experimental
5011 JmKanmo/VoiceRecognitionMemoApp

Speech recognition and memo application

16
Experimental
5012 ItsJamin/another-tts

A program to easily create datasets for training your own TTS models.

16
Experimental
5013 chirag127/ComicSpeak-AI-Web-Comic-Dubber-Browser-Extension

Transforms web comics into audio with AI-powered OCR and TTS

16
Experimental
5014 NOime22/Web-listen

🎧 AI read-aloud assistant - Chrome browser extension that supports reading selected text aloud and reading screenshots aloud via OCR

16
Experimental
5015 neshani/Kitten-Offline-TTS

Kitten Offline Mobile TTS Webapp

16
Experimental
5016 zeeshan020dev/Jarvis-AI-For-Windows-2026

A Python-based voice-controlled AI assistant for Windows using Google Gemini...

16
Experimental
5017 CuteOwOwO/Gale

"Let every stretch an elder does be filled with fun and warm companionship."

16
Experimental
5018 babadue/seamless-m4t-v2-large-demo

Demonstration features of seamless-m4t-v2-large model

16
Experimental
5019 shujareshi/LearnedCamera

An Android application based on machine learning for object recognition...

16
Experimental
5020 chirag127/SpeechFlow-AI-Powered-Text-to-Speech-Browser-Extension

AI-powered text-to-speech browser extension. Transforms web content into...

16
Experimental
5021 zz85/silly-ai

my collection of fully local AI experiments: including a voice-first AI...

16
Experimental
5022 oarthurfc/AI-outgoing-call

An intelligent voice agent that automatically calls leads, promoting...

16
Experimental
5023 dusionlike/unplugin-string-to-audio

Automatically converts strings to audio files during bundling and adds them to the final bundle; supports Vite and Webpack

16
Experimental
5024 thiswillbeyourgithub/Spotify_tts

Reads title of spotify songs aloud using AI

16
Experimental
5025 nsourlos/voice_cloning_tools

Various tools to clone a voice

16
Experimental
5026 spokestack/spokestack-tray-android

A UI component that makes it easy to add voice interaction to your app.

16
Experimental
5027 EceSenaEtoglu/News-Podcast-Generator

Get breaking news and top headlines in an audio format with this cool bot!...

16
Experimental
5028 host452b/casts_down

Cross-platform CLI to download & transcribe podcasts locally — Apple...

16
Experimental
5029 noor-afshan/video-transcriber

🎥 Transcribe videos quickly with GPU support, offering speaker...

16
Experimental
5030 NJUxlj/hotel-voice-agent-manual

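A RAG voice conversation assistant for looking up travel information about Shanghai. The user's voice input is converted to text with ASR, the Zhipu API then searches a knowledge base and generates a reply with RAG, and the reply is finally output as speech via TTS.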

16
Experimental
5031 emercado72/tts-streamer

Real-time Text-to-Speech streaming with PDF reader, powered by Kokoro-82M

16
Experimental
5032 ruslanmv/VRSecretary

VRSecretary is a production-ready reference implementation for building...

16
Experimental
5033 burritosoftware/mira

A modular text-to-speech Discord bot for Bay Area public transit systems.

16
Experimental
5034 Jobijoba2000/add_dub

Automated video voice-over tool for Windows. Converts subtitles to speech...

16
Experimental
5035 ccj242/Audible-Deaf-Communications

A non-profit app designed to help the deaf communicate in person and...

16
Experimental
5036 neosun100/fish-speech

🐟 Advanced multilingual Text-to-Speech system with speaker management,...

16
Experimental
5037 voothi/20250421115831-anki-gtts-player

A powerful Anki audio add-on with a 3-tier playback system: prioritizes your...

16
Experimental
5038 ayzem88/text-to-speech-converter

Tool for converting Arabic text into audio files using OpenAI TTS / Tool for...

16
Experimental
5039 mbrotos/SoundSeg

Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation

16
Experimental
5040 Entity047/Voice_AI_Creator

Python TTS and voice cloning framework for educational AI/ML demonstrations.

16
Experimental
5041 neosun100/orpheus-tts-docker

Production-ready Docker deployment for Orpheus TTS with GPU management,...

16
Experimental
5042 sap1119/voice_agent_0.02

An open‑source voice AI platform for building real‑time, scalable, and...

16
Experimental
5043 Hauntlight/video_translator

🎥 Translate and dub video audio into another language using AI. Built with...

16
Experimental
5044 MatiousCorp/claude-tts

Text-to-speech plugin for Claude Code — multi-provider support (ElevenLabs,...

16
Experimental
5045 falniak95/TurkishSpeechRecognition

Fully Turkish speech recognition system. Google Cloud Platform API support...

16
Experimental
5046 Mrzhangxiaoduo/react-native-speech-recognizer

react-native-speech-recognizer

16
Experimental
5047 gregunger-microsoft/Jarvis

AI-powered Microsoft Teams meeting assistant with voice interaction,...

16
Experimental
5048 awesome-german/speaking

Resources and methods to improve spoken German, pronunciation, and real-life...

16
Experimental
5049 Tombarr/TranscriberApp

Local-first macOS Tahoe Transcription App & CLI Tool

16
Experimental
5050 chicogong/ffvoice-engine

🎙️ High-performance C++ voice engine - real-time audio processing + AI speech recognition + transcription while recording | High-performance C++ voice...

16
Experimental
5051 bonniepeng2002/Apollo

Apollo: your intuitive, virtual nurse.

16
Experimental
5052 sagarpednekar/live-transcript-app

Live Transcription Tool - Real-time speech-to-text transcription

16
Experimental
5053 sse-digital-man/TTS-Core

Digital human project - TTS component

16
Experimental
5054 mict-zhaw/chall_e2e_stt

End-to-end ASR experiments for language learning, focusing on...

16
Experimental
5055 Zhima-Mochi/whisper-v3-server

A robust backend server for audio processing, delivering high-accuracy...

16
Experimental
5056 bjornbytes/lua-deepspeech

Lua Library for Speech Recognition

16
Experimental
5057 zhaoyi2/Classical-Speech-Algorithms

Classical speech recognition and speaker recognition algorithms

16
Experimental
5058 Slothologist/AudioSegmenter

Segmentation of audio for a speech pipeline

16
Experimental
5059 GIO443/speech-to-owl

Voice-driven ontology builder. Say “command …” then a sentence (e.g., “the...

16
Experimental
5060 RykerWilder/jarvis

Just A Rather Very Intelligent System

16
Experimental
5061 petitwhito/Speech_to_text_project

Complete Speech-to-Text pipeline: from-scratch architectures (MLP, CNN, RNN,...

16
Experimental
5062 Rohit909-creator/EfficientWordNet_Upgrade

EfficientWordNet enhances wakeword detection with noise-robust similarity...

16
Experimental
5063 dananjaya2002/realtime-voice-assistant

AI-powered desktop voice assistant using OpenAI Whisper and Silero VAD

16
Experimental
5064 MML-Group/code4AVE-Speech

Source Code for AVE Speech Dataset

16
Experimental
5065 webKing021/VoiceFlow-An-Automatic-NLP-Transcriber

VoiceFlow is a Windows push-to-talk voice-to-text application that...

16
Experimental
5066 Nazmul0005/Personal_Voice_Assistant_Mili

Mili is a smart voice assistant built with Python and Google Gemini AI. It...

16
Experimental
5067 cvcwebsolutions/vibe-local

Local voice-to-text with AI-powered text cleanup. Privacy-focused...

16
Experimental
5068 keymastervn/htksupport

Minimal HTK for supporting HTK in Vietnamese.

16
Experimental
5069 lhg96/stt-demo-korean

Korean Speech-to-Text app with Whisper & Vosk | Korean speech recognition demo application

16
Experimental
5070 ayzem88/audio-to-text-converter

Advanced tool for converting audio files to text using OpenAI Whisper /...

16
Experimental
5071 SunPCSolutions/DiarASR

Enterprise-Grade Secure ASR Diarization Pipeline - HIPAA-compliant speech...

16
Experimental
5072 PrthD/AI-powered-Voice-assisted-Object-Locator

🔍 Real-time object detection with voice command integration using YOLOv5...

16
Experimental
5073 gouhaha/Whisper-App

Windows Whisper transcription app (PyInstaller + ffmpeg)

16
Experimental
5074 vicentezaror/js-web-t2v

Web text-to-voice utility functions that allow customizing the behavior,...

16
Experimental
5075 alozowski/textplease

Upload an audio/video file, configure settings, and receive a text transcript

16
Experimental
5076 bguerraDev/LoudlyTTS

Native Android app to read your notifications aloud over Bluetooth....

16
Experimental
5077 lkwbr/structured-prediction

Machine learning algorithms for structured inputs and outputs, such as on...

16
Experimental
5078 sonhm3029/Realtime-Vietnamese-ASR-React-Native-and-Whisper

This project implements end-to-end real-time Vietnamese speech recognition...

16
Experimental
5079 my-north-ai/semantic_audio_filtering

Synthetic data augmentation technique via LLM for Automatic Speech...

16
Experimental
5080 toledomauricio/fastapi-whisper-ollama

FastAPI + Whisper + Ollama: Audio transcription and LLM processing API....

16
Experimental
5081 wangjialiang678/speaklow-macvoiceinput

SpeakLow — a lightweight macOS menu bar app for voice-to-text input. Press a...

16
Experimental
5082 bobbymay/Dictation-for-macOS

Speech Recognition for macOS that allows you to define words, phrases, or...

16
Experimental
5083 KarinBrisker/Video-Subtitler

Automatically Generating Multilingual Subtitles Using OpenAI's Whisper and...

16
Experimental
5084 labrijisaad/Youtube-video-transcriptor

In this notebook, I implemented a script to transcribe YouTube videos (and...

16
Experimental
5085 luizomf/sussu

Educational CLI for transcription with OpenAI Whisper

16
Experimental
5086 ElmiraGhorbani/gpt-speaker-diarization

Conversational Speaker Diarization using OpenAI Language Models (GPT-4)...

16
Experimental
5087 Caliope-SpeechProcessingLab/SpeechTester

Speech Tester is a set of Python scripts conceived as an extension to HTK...

16
Experimental
5088 OpenVoiceOS/ovos-tts-plugin-SAM

S.A.M - Software Automatic Mouth

16
Experimental
5089 code-spirit-369/text-to-speech-yt

This AI TTS web application allows you to convert any text into realistic,...

16
Experimental
5090 talhabinjaved/voice-ai-agents-openai-telnyx

A FastAPI starter that turns a Telnyx phone number into a realtime,...

16
Experimental
5091 vipyne/american-dream-phone

An AI voice agent to help you call your political representatives.

16
Experimental
5092 wis/speak

A browser extension designed for minimal clicks or presses to start reading...

16
Experimental
5093 Jmi2020/HowdyVox

A privacy focused offline STT TTS interface for your favorite LLM

16
Experimental
5094 Vitgracer/Offline-Voice-LLM-Assistant

Running small but capable language models entirely offline

16
Experimental
5095 Daeels/Smart-E-commerce-Microservices-App

This project is an E-commerce App using the microservices architecture.

16
Experimental
5096 language-org/voice-activ-detect-deepnet

ASR: Light deep net for real-time voice activity detection

16
Experimental
5097 cagataygedik/TTS

Internship Text-to-Speech research project.

16
Experimental
5098 daniel-szulc/Speech_Recognition

🎙 Automatic Keyword Speech Recognition for Polish and English in Tensorflow 🧠

16
Experimental
5099 shashankchandak/AutoSMSReader

An Android application that reads all incoming messages aloud for users

16
Experimental
5100 pig-mesh/volcengine-tts-spring-boot-starter

Volcano Engine (Volcengine) speech synthesis (TTS) service integration

16
Experimental