All Voice AI Tools
6,981 tools ranked by quality score · Page 35 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 3401 |
6Morpheus6/IndicF5
High-Quality Text-to-Speech for Indian Languages |
|
Experimental |
| 3402 |
CodingWithEnjoy/Speech-To-Text-Python
متن به صدا | Text To Speech 😊🤩 |
|
Experimental |
| 3403 |
scraptechguy/SpeechCheck
Speech recognition and subsequent speech evaluation, all driven by Microsoft Azure |
|
Experimental |
| 3404 |
VIKASRAPARTHI/Jarvis-Voice-Assistant
Jarvis is a powerful desktop voice assistant designed to enhance... |
|
Experimental |
| 3405 |
gwihlidal/speechtest-rs
Google Cloud text-to-speech prototype |
|
Experimental |
| 3406 |
violet125qq/my-live-caption-with-translation-for-macos
A Python script that captures microphone and/or system audio in real time,... |
|
Experimental |
| 3407 |
MatteoM95/Smart-Home-Vigilance-System
An indoor video surveillance system capable of recognizing the presence of a... |
|
Experimental |
| 3408 |
z1311/Face-Recognition-with-Voice-Output
Real Time Face Recognition with Voice Output System. |
|
Experimental |
| 3409 |
abhineetraj1/python-voice-command
This is voice command A.I. which give you output according to your predefined codes. |
|
Experimental |
| 3410 |
DmitryCherneckiy/speech-to-text
Telegram bot. Turns a voice message into a text message. |
|
Experimental |
| 3411 |
victormgross/RealVideo
📹 Create engaging video calls with RealVideo, a WebSocket-based system that... |
|
Experimental |
| 3412 |
Agash/TTSTextNormalization
Modern .NET10 / C#14 library to normalize text (emojis, currency, numbers,... |
|
Experimental |
| 3413 |
speechpro/speechpro-cloud-tts-examples
В данном репозитории представлены примеры использования синтеза речи с... |
|
Experimental |
| 3414 |
deepgram-starters/csharp-transcription
Get started using Deepgram's Transcription with this C# demo app |
|
Experimental |
| 3415 |
benrucker/JermaBot
A wacky, sound-oriented Discord bot |
|
Experimental |
| 3416 |
utsavpshah/SpeakingHands
This is an extension to LeapTrainer.js repository. With this project, we... |
|
Experimental |
| 3417 |
herrkaefer/SwiftEdgeTTS
Microsoft's Edge TTS in pure Swift |
|
Experimental |
| 3418 |
jharrilim/RasaDocker
Docker image with Rasa + Anaconda + Tensorflow + portaudio + PyAudio +... |
|
Experimental |
| 3419 |
ascender1729/AudioDictate
An efficient desktop application for transcribing audio files into text... |
|
Experimental |
| 3420 |
siddhantmishra1305/Anuvaad
An iOS translator that supports more that 40 languages. User can add notes... |
|
Experimental |
| 3421 |
haliphax/tts
Twitch text to speech overlay for OBS (using lobe-tts) |
|
Experimental |
| 3422 |
Lordmau5/firebot-script-elevenlabs-tts
A custom Firebot script that adds support for ElevenLabs TTS |
|
Experimental |
| 3423 |
allpaqa-jgk/twitch_text_to_speech_bot
Text to Speech bot using Twitch IRC for mac and (linux and windows |
|
Experimental |
| 3424 |
facejungle/fj_chat_to_speech
FJ Chat to Speech. Text To Speech: YouTube, Twitch |
|
Experimental |
| 3425 |
PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI
Demonstration for the Qwen/Qwen3-TTS-12Hz models using Daggr for modular UI... |
|
Experimental |
| 3426 |
miikkij/Speechos
Local-first speech AI benchmarking — compare STT, TTS, emotion & diarization... |
|
Experimental |
| 3427 |
rdyson/morsel
Forward links, get a daily podcast digest. Scripts that turns article URLs... |
|
Experimental |
| 3428 |
bibinkunjumon2020/Azure-Avatar-AI
The text to speech avatar system is a text to speech feature with vision... |
|
Experimental |
| 3429 |
b4rtaz/voice-assistant-net-server
Voice Assistant Server for VSCode |
|
Experimental |
| 3430 |
stefanwebb/unity-voice-agents
A Unity package for building open-source AI voice agents that run fully... |
|
Experimental |
| 3431 |
jonelo/unlock-win-tts-voices
Unlocks the Microsoft Windows TTS voices for use with other x64 applications... |
|
Experimental |
| 3432 |
QuyAnh2005/vits-japanese
Text to Speech for Japanese |
|
Experimental |
| 3433 |
agusibrahim/tiktok-tts-api
A Text-to-Speech API using TikTok’s private API to convert text into audio,... |
|
Experimental |
| 3434 |
nguyennpa412/simple-multimodal-ai
Simple Gradio application integrated with Hugging Face Multimodals to... |
|
Experimental |
| 3435 |
th33k/Luigi
LUIGI is an interactive pet robot designed for fun, companionship, and... |
|
Experimental |
| 3436 |
SaranshKejriwal/Harold_Finch
Face recognition via voice Commands (OpenCV Python + SpeechRecognition 3.1.3) |
|
Experimental |
| 3437 |
CT83/Hellin-Worki
A video conferencing platform which seamlessly dials your coworkers when you... |
|
Experimental |
| 3438 |
straff2002/OpenGlasses
Use Meta Rayban glasses with alternative providers |
|
Experimental |
| 3439 |
DevashishPrasad/Virtual-AI-assistant
This repository contains my Bachelor's degree final year project. It is a... |
|
Experimental |
| 3440 |
devfinwiz/Python-Voice-Assistant-Virtual-Slave
This voice assistant is buit in VS Code. It has an ability to understand... |
|
Experimental |
| 3441 |
fwcd/okpi
Virtual assistant with offline voice recognition for Raspberry Pi |
|
Experimental |
| 3442 |
2017fandrei/ForcedAlignment
Graphical utility for forced alignment using aeneas, an interactive audio player |
|
Experimental |
| 3443 |
FlyingPolarBear/CityKBQA
Xiaode: a Knowledge Based Question Answering System with Speech IO |
|
Experimental |
| 3444 |
etasdemir/OpticMap
On-device optical character recognition Android application. |
|
Experimental |
| 3445 |
sdsb8432/TextToSpeech-Android
Text to Speech for Android Application with Google API |
|
Experimental |
| 3446 |
sanjifr3/Narrator
An image and video description generator using an CNN-RNN based architecture. |
|
Experimental |
| 3447 |
rodrigosuelli/ditey-web
🎙 Leitor de textos online desenvolvido com React e Web Speech API. Tcc (ETEC) |
|
Experimental |
| 3448 |
zvz23/vProfanity
A software solution that automates the detection and censorship of profanity... |
|
Experimental |
| 3449 |
Otosaku/OtosakuStreamingASR-iOS
OtosakuStreamingASR-iOS is a real-time speech recognition engine for iOS,... |
|
Experimental |
| 3450 |
M-Mowina/TalentTalk---AI-powered-interview-system
AI-powered technical interview system with dynamic resume analysis, voice... |
|
Experimental |
| 3451 |
derek-byte/multilingual-voice-assistant-llm
cohere labs - aya expedition 2025: integrating speech & audio into aya... |
|
Experimental |
| 3452 |
jark006/SummerTTS_VS
SummerTTS... |
|
Experimental |
| 3453 |
hinantin/QuechuaTTS
Hinantin - Text-to-Speech System for Quechua |
|
Experimental |
| 3454 |
vra/supertonic-mnn
A command-line interface for running Supertonic TTS models using MNN. |
|
Experimental |
| 3455 |
stefantaubert/zho-tts
Web app, command-line interface and Python library for synthesizing Chinese... |
|
Experimental |
| 3456 |
6Morpheus6/IndexTTS2
[NVIDIA, MAC, ROCM] Emotionally Expressive and Duration-Controlled... |
|
Experimental |
| 3457 |
sezer-muhammed/EBookReaderFullStack
A local-first EPUB reader with high-fidelity neural text-to-speech,... |
|
Experimental |
| 3458 |
HadrienGardeur/read-aloud-best-practices
Documenting best practices for implementing a read aloud feature in reading apps |
|
Experimental |
| 3459 |
ChanMo/espeak-ng-tts
Espeak-ng TTS 是 Chrome 浏览器的 TTS 插件,使用 本地espeak-ng 作为 TTS 引擎。 |
|
Experimental |
| 3460 |
SzLeaves/asr-webapp
ASR Web APP 中文语音识别实验室APP,使用Django构建,包含中文语音转文字与中文语音聊天机器人模块 |
|
Experimental |
| 3461 |
nexusjuan12/AetherChat
AetherChat local RVC chat interface for Koboldcpp and OpenAI style API |
|
Experimental |
| 3462 |
SeanPLeary/dc_tts-transfer-learning
Transfer learning exploration of dc_tts text-to-speech model |
|
Experimental |
| 3463 |
jonasmore/Cloudflare-Workers-AI-Home-Assistant-Integration
Cloudflare Workers AI integration for Home Assistant - TTS, STT, and... |
|
Experimental |
| 3464 |
Yunichi/livekit-voice-ai-agent-setup
The "livekit-voice-ai-agent-setup" repository provides a comprehensive guide... |
|
Experimental |
| 3465 |
LEMAS-Project/LEMAS-Project
LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with... |
|
Experimental |
| 3466 |
pilarOG/unit_selection_tts
Toy example on how to build a unit selection TTS in Spanish |
|
Experimental |
| 3467 |
crispinprojects/klatt-synthesizer
Klatt speech synthesizer |
|
Experimental |
| 3468 |
amirivojdan/neyshekar
A Large-Scale Open Persian Speech Dataset |
|
Experimental |
| 3469 |
MahtaFetrat/GPTInformal-Persian-Speech-Dataset
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs... |
|
Experimental |
| 3470 |
paulomarcos/pyelant
PyElant is a simple python tool for performing translations and storing it... |
|
Experimental |
| 3471 |
MrSean2d2/mouthwords
A python script to put words in other peoples mouths. |
|
Experimental |
| 3472 |
akabe/obs-transcript
Real-time subtitle generation by speech recognition for OBS Studio |
|
Experimental |
| 3473 |
xuan3986/UDDETTS
The first LLM that unifies discrete and dimensional emotions for... |
|
Experimental |
| 3474 |
racai-ai/TEPROLIN
This is the TEPROLIN Romanian text processing platform, developed in the... |
|
Experimental |
| 3475 |
6ixGODD/audex
Smart Medical Recording & Transcription System with voice recognition and... |
|
Experimental |
| 3476 |
axzml/VoxLinkAI_Client
Native macOS voice input assistant. Hold a hotkey, speak, and let AI... |
|
Experimental |
| 3477 |
HiMeditator/wfts-chinese-tool
使用中文游玩《群星低语》游戏。Playing the game "Whisper from the Stars" in Chinese. |
|
Experimental |
| 3478 |
ZackAkil/global-video-dubbing
Using Googel Cloud Video Intelligence API with Cloud Translation API and... |
|
Experimental |
| 3479 |
KyeongJooni/ai-dubbing-studio
AI-powered dubbing web service - Upload audio/video, get dubbed in any... |
|
Experimental |
| 3480 |
ThomasRigoni7/Audio-emotion-recognition-RAVDESS
Implementation of various models to address the speech emotion recognition... |
|
Experimental |
| 3481 |
LG-1/audio2text
Ease of use for Speech to Text |
|
Experimental |
| 3482 |
Sushkyn/simple-music-player
playing music in shell for linux. |
|
Experimental |
| 3483 |
Saganaki22/kokoro-web
Kokoro TTS Web |
|
Experimental |
| 3484 |
boltomli/speech-api
Demo to show how to use Azure Speech Services API in app |
|
Experimental |
| 3485 |
arjunmahishi/Speech-with-JavaScript
Code sample for speech recognition and syntheses with simple javascript |
|
Experimental |
| 3486 |
jfassis20/aish
🤖 Simplify command execution with AISH, an AI-powered shell assistant that... |
|
Experimental |
| 3487 |
Philipp2211/Udacity-Natural-Language-Processing-Nanodegree
This repository contains all my solutions to the tutorials/projects of the... |
|
Experimental |
| 3488 |
simran2104/Machine-Learning-Projects
It contains different projects made using different algorithms in Machine Learning |
|
Experimental |
| 3489 |
Sim-hu/voicebot-rust
Discordで使用可能な読み上げbot。rust言語で書かれていて、とにかく軽量(なはず) |
|
Experimental |
| 3490 |
yanorei32/aitalked-server
Simple GynoidTalk / VOICEROID Web Server based on aitalked library |
|
Experimental |
| 3491 |
isaacgounton/awesome-tts
A unified Text-to-Speech gateway combining multiple TTS providers (Kokoro... |
|
Experimental |
| 3492 |
VattamBhavaniPrasad5i5/Voice-Cloning-Project
String as a input and extract the youtube video from keyword and extract... |
|
Experimental |
| 3493 |
Nicolas-Prevot/TTS_playground
Unified toolkit for testing and comparing multiple state-of-the-art... |
|
Experimental |
| 3494 |
neosapience/typecast-js
The official Node.js SDK for the Typecast API. |
|
Experimental |
| 3495 |
ShunsukeHayashi/byteplus-voice-ai
BytePlus音声対話AIアプリケーション - ASR, TTS, Voice Cloning統合(WebSocket対応、日本語対応✅) |
|
Experimental |
| 3496 |
Oqaasileriffik/martha
Martha TTS (Greenlandic text-to-speech) documentation, containers, and helpers |
|
Experimental |
| 3497 |
natelindev/voice-agent
Low-latency real-time terminal voice assistant with VAD, ASR, LLM, and TTS |
|
Experimental |
| 3498 |
FUYOH666/VoiceToText
Cross-platform Voice-to-Text application with support for macOS, Linux, and... |
|
Experimental |
| 3499 |
gtiwari333/speech-recognition-java-hidden-markov-model-vq-mfcc
Automatically exported from... |
|
Experimental |
| 3500 |
aishoot/DTWSpeech
A simple application of DTW Algorithm in isolate word speech recognition. |
|
Experimental |