All Voice AI Tools
6,981 tools ranked by quality score · Page 43 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 4201 | josephrocca/lyra-v2-soundstream-web: Lyra V2 (SoundStream) running in the browser | | Experimental |
| 4202 | PierreChouteau/umss_icassp: ICASSP 2024 paper - A Fully Differentiable Model for Unsupervised Singing... | | Experimental |
| 4203 | chandachewe10/whisper-open-ai: Transcribe and translate audio to text using OpenAI Whisper | | Experimental |
| 4204 | paddy41601/faster-whisper-cli: A command-line interface wrapper for Faster Whisper | | Experimental |
| 4205 | D34DC3N73R/ha-chatterbox-tts: Home Assistant TTS integration for Chatterbox-TTS-Server | | Experimental |
| 4206 | aklos/gpt3-personal-assistant: Interact with GPT-3 through speech | | Experimental |
| 4207 | masasibata/t-one-rest-api: Production-ready REST API for Russian speech recognition using T-one model.... | | Experimental |
| 4208 | ali-ibnouf/SmartTalker: Digital Human AI Agent Platform - Real-time talking avatar with Arabic-first support | | Experimental |
| 4209 | famda/semantics: Semantics CLI - Unified interface for media intelligence | | Experimental |
| 4210 | francescomalatesta/php-google-tts-example: A basic example script to use Google Cloud Text-to-Speech APIs | | Experimental |
| 4211 | NDharshan/iNeuron-Blind-Navigation: This project attempts to create a system which would bring in added ease to... | | Experimental |
| 4212 | rupac4530-creator/ai-desktop-assistant: Voice-controlled AI desktop assistant \| 100% local & private \| Whisper +... | | Experimental |
| 4213 | anoyetta/CeVIOAIProxy: An application that adds a TCP socket interface equivalent to Bouyomi-chan's to CeVIO AI. CastCraft... | | Experimental |
| 4214 | godmode2k/whisper.cpp.android: whisper.cpp.android with CLBlast (OpenCL), Translation (Google ML-Kit) and TTS | | Experimental |
| 4215 | KevinSJ/rss-to-speech: Use Google Text-To-Speech to read long articles from an RSS feed | | Experimental |
| 4216 | tjwodud04/Master-Course-Project: Master course team project code files (master's program project code files) | | Experimental |
| 4217 | Tanaka-zi/VoiceR: VoiceR is a Linux voice control app that lets you control games using speech... | | Experimental |
| 4218 | alorbach/open-video-transcribe: Open Video Transcribe - Open-source video transcription tool that emphasizes... | | Experimental |
| 4219 | shreyamalogi/Text-To-Speech: "Transform Your Words into Sonic Spells with Shreya's Text-to-Speech... | | Experimental |
| 4220 | Fortyseven/Vibrance: Local voice-to-text transcription tool 🗣️📢🖮 | | Experimental |
| 4221 | bishal7679/ASL-Transformer: A user-friendly application for converting either audio or text into sign... | | Experimental |
| 4222 | michael-borck/video-lens: Analyzes presentation videos using speech transcription, computer vision,... | | Experimental |
| 4223 | Kaljurand/EKISpeak: Implementation of Android's TextToSpeechService that provides Estonian text-to-speech | | Experimental |
| 4224 | Neka-Ev/Live2D-AI-Vivian: Desktop AI companion "Vivian" built on PyQt5 and Live2D. Integrates LLM dialogue, local/cloud speech recognition (ASR), and highly expressive speech synthesis... | | Experimental |
| 4225 | folubebe/gemini_realtime_speech_to_text: Real-time speech translation using Google Gemini API for free | | Experimental |
| 4226 | rockywuest/kawaii-bath-assistant: 🛁 Cute AI-powered bathroom assistant for M5Stack Core 2 - kawaii face,... | | Experimental |
| 4227 | tihu-nlp/tihu-native: Persian text-to-speech on web and mobile using expo react-native | | Experimental |
| 4228 | yashasviyadav30/Omnibox: 📦 AI-powered CLI utility with voice support - One Tool, Infinite Possibilities | | Experimental |
| 4229 | shinshekai/VoxForge-Pro: VoxForge Pro is a premium, offline audiobook generator powered by Kokoro-82M... | | Experimental |
| 4230 | deepgram-devs/talk-time-analytics: Sample app for generating and displaying speaker talk-time using the... | | Experimental |
| 4231 | lukaszliniewicz/Subdub: A command line Python app offering a video-to-dubbed-video workflow with... | | Experimental |
| 4232 | fuota-io/The-Things-Network-NodeJS-SDK: The user-friendly Node.js SDK to boost connectivity and data management... | | Experimental |
| 4233 | CPCCoder/SoundMatchAnalyser: SoundMatchAnalyser (SMA) is a powerful tool designed to analyze and compare... | | Experimental |
| 4234 | artryazanov/gemini-speech-to-speech-translator: Transform your audio content into any language with high accuracy and... | | Experimental |
| 4235 | LINSUISHENG034/Qwen3-ASR-Desktop: Modern PyQt6 desktop GUI for Qwen3-ASR with batch transcription support | | Experimental |
| 4236 | joe62/TalkingClipboard: Text read-aloud tool; converts text to MP3 | | Experimental |
| 4237 | Xinghui-Wu/KENKU: KENKU: Towards Efficient and Stealthy Black-box Adversarial Attacks against... | | Experimental |
| 4238 | sil-ai/tts-singlish: TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm. | | Experimental |
| 4239 | i-celeste-aurora/katip: An SFSpeechRecognizer-based voice recordings transcriber for macOS | | Experimental |
| 4240 | Acumane/lectern: Listen to PDFs with natural TTS and read-along text prompts | | Experimental |
| 4241 | msalhab96/Conformer: An implementation for "Conformer: Convolution-augmented Transformer for... | | Experimental |
| 4242 | husseinnsourr/NeuralChatter: A Next-Generation Neural TTS Engine. High-quality, human-like voice... | | Experimental |
| 4243 | scruss/micropython-SYN6988: MicroPython library for the VoiceTX SYN6988 text to speech module | | Experimental |
| 4244 | p1an-lin-jung/WavThruVec_pytorch: An implementation of Charactr, Inc's "WavThruVec: Latent speech... | | Experimental |
| 4245 | SVM0N/ttsweb: Convert PDFs/EPUBs to audiobooks with synchronized text highlighting using... | | Experimental |
| 4246 | kyegomez/SoundStream: Implementation of SoundStream from the paper: "SoundStream: An End-to-End... | | Experimental |
| 4247 | Kit4Some/Voice_opencode: The open source vibe_voice coding agent. | | Experimental |
| 4248 | hi-paris/CosyVoice2-EU: Europeanized CosyVoice2 for French & German | | Experimental |
| 4249 | danamini/aichat: Speech-to-Speech conversational AI using Azure OpenAI Service and Azure... | | Experimental |
| 4250 | HelgeSverre/glados: A web interface for GLaDOS text-to-speech with AI conversation capabilities | | Experimental |
| 4251 | tangming579/text-to-speech: Text-to-speech demo implemented with Baidu Cloud, iFLYTEK, and Youdao Cloud respectively | | Experimental |
| 4252 | mahirgul/GoogleTTS.Net: .NET DLL that uses Google's translate text to speech service. | | Experimental |
| 4253 | Ajay-user/Streamlit-ElevenLabs-Text2Speech: Text to Speech by ElevenLabs | | Experimental |
| 4254 | QuantiusBenignus/voluble: Let your GNOME desktop speak to you. Reads your desktop notifications or... | | Experimental |
| 4255 | chirag127/ContextChat-AI-Webpage-Conversational-Browser-Extension: An AI-powered browser extension to chat directly with any webpage's content.... | | Experimental |
| 4256 | iconclub/zalo-tts: Zalo Text-To-Speech for Python | | Experimental |
| 4257 | matlab-deep-learning/Use-a-Python-Speech-Command-Recognition-System-to-MATLAB: Use a Python speech command recognition system in MATLAB | | Experimental |
| 4258 | hecx333/edge-tts-go: A Go library for the Microsoft Edge online text-to-speech service. This project lets you use Microsoft Edge's high-quality neural TTS voices for free. | | Experimental |
| 4259 | siva-sub/pocket-tts-openapi-gpu: GPU-enhanced Pocket TTS with Remotion + TikTok captions | | Experimental |
| 4260 | AmirAbaskohi/Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children: Preschool evaluation is crucial because it gives teachers and parents... | | Experimental |
| 4261 | dog0sd/sven: ElevenLabs-powered TTS utility | | Experimental |
| 4262 | rodrigoguedes09/multimodal-medical-assistant: End-to-end intelligent automation system for medical clinics, combining REST... | | Experimental |
| 4263 | vishwakneelamegam/deepspeech-android: I have built a speech recognition app using Mozilla DeepSpeech | | Experimental |
| 4264 | hyunjoonbok/natural-language-processing: Ready-to-use Implementation of Natural Language Processing models in... | | Experimental |
| 4265 | Kirili4ik/QuartzNet-ASR-pytorch: Automatic Speech Recognition (ASR) model QuartzNet trained on English... | | Experimental |
| 4266 | SPACESODA/read-txt: Read TXT is a lightweight text-to-speech reader with auto language detection... | | Experimental |
| 4267 | virajbhutada/speech-emotion-recognition: This repository houses a robust speech emotion recognition system, featuring... | | Experimental |
| 4268 | ImPavloh/WhiTTsper-The-Lora: Demo combining Whisper for speech recognition and Google TTS for speech... | | Experimental |
| 4269 | oleglegun/polly-ru-ssml: Enhance AWS Polly TTS pronunciation for English words within Russian text | | Experimental |
| 4270 | andybi7676/reborn-uasr: REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training... | | Experimental |
| 4271 | h-iori/AI-Desktop-Assistant-Python-OpenAI: This project is a work-in-progress AI desktop assistant powered by OpenAI... | | Experimental |
| 4272 | AlpinDale/kizuna: Fast TTS Library for Kokoro | | Experimental |
| 4273 | ilyamiro/stewart: Personal voice assistant | | Experimental |
| 4274 | jsbxyyx/tts_java: Microsoft text-to-speech tool | | Experimental |
| 4275 | nickpending/lspeak: Speaks terminal output with semantic caching and serial playback | | Experimental |
| 4276 | ArenAcikgoz/Whisper-Alignment: Forced alignment decoder for Whisper. | | Experimental |
| 4277 | awasthiabhijeet/Error-Driven-ASR-Personalization: Code for "Error-driven Fixed-Budget ASR Personalization for Accented... | | Experimental |
| 4278 | divshekhar/Jarvis: A Voice Assistant - Jarvis | | Experimental |
| 4279 | AliceAuto/obsidian-auto-word-audio: An Obsidian plugin that automatically adds audio pronunciation to vocabulary notes | | Experimental |
| 4280 | mcp-tool-shop-org/audiobooker: AI Audiobook Generator - Convert EPUB/TXT books into professionally narrated... | | Experimental |
| 4281 | nerdpudding/nerdpudding: The proof is in the pudding. Real-time AI video commentary with... | | Experimental |
| 4282 | joszuijderwijk/BarryBox: BarryBox is an MQTT controlled TTS Speaker. You can hook it up to the... | | Experimental |
| 4283 | devp19/MyBuddy: Generative AI Therapist built using Google-Cloud's Speech-To-Text... | | Experimental |
| 4284 | harishkotra/Voice-to-Text-Ionic: Ionic Framework example app for both iOS and Android to convert voice to... | | Experimental |
| 4285 | steveseguin/tts.rocks: Cutting-edge Text-to-Speech in the browser - for free | | Experimental |
| 4286 | symblai/real-time-speech-recognition-with-websockets: Use Symbl.ai's Streaming API to create real-time speech recognition with... | | Experimental |
| 4287 | fxnoob/speech-recognition-toolkit: Voice control for the Chrome browser | | Experimental |
| 4288 | Pogayo/african-voices-web: Website that hosts the African Voices projects. Users can download datasets... | | Experimental |
| 4289 | shreyamalogi/ZAC-the-AI-Assistant: ZAC: Your robotic virtual assistant - Enhancing human-machine interaction... | | Experimental |
| 4290 | qcri/Arabic_speech_code_switching: The first Dialectal Arabic Code Switching - DACS corpus from broadcast... | | Experimental |
| 4291 | igor-lirussi/Dialogue-Pepper-Robot: Provides Pepper Robot with conversation abilities to handle a free open-domain... | | Experimental |
| 4292 | asafu-art/deepspeech-kabyle: Automatic Speech Recognition (ASR) - Kabyle | | Experimental |
| 4293 | nmanikiran/browser-apis: There are a large number of Web / Browser APIs available. This repo... | | Experimental |
| 4294 | Hlid-Systems/vanaheim-audio-generator: 🔊 Professional Audio Simulation Microservice (Hlid Systems). Orchestrates... | | Experimental |
| 4295 | aidayang/index-tts-OneClick: index-tts2 voice cloning software: no-install, one-click-launch bundle | | Experimental |
| 4296 | erasedwalt/CTC-ASR: An implementation of Jasper, QuartzNet, Citrinet and pipeline for training... | | Experimental |
| 4297 | f76tbntbww-crypto/VoiceForge: One-click local AI voice assistant powered by ASR+LLM+TTS, 100% coded by... | | Experimental |
| 4298 | madcato/bl-speech-recognizer: Some implemented use cases for SFSpeechRecognizer | | Experimental |
| 4299 | flumi3/speech-to-text: Transcribe audio files with Azure Cognitive Services | | Experimental |
| 4300 | exemplaryai/ai-engine: Easy to use Multi-Provider ASR/Speech To Text and NLP engine | | Experimental |