All Voice AI Tools
6,981 tools ranked by quality score · Page 33 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 3201 |
CodeSarthak/autopilot-shorts
Fully automated YouTube Shorts pipeline with a self-improving feedback loop.... |
|
Experimental |
| 3202 |
zhurlik/smart-home
A multi-project that contains UDP server, MQTT broker and a few sub-projects... |
|
Experimental |
| 3203 |
khalooei/Voxtral-AI-Demo-Local-Interface
Voxtral is a state-of-the-art model developed to handle both speech... |
|
Experimental |
| 3204 |
kouyt5/lightning-asr
An end-to-end speech recognition model built on the PyTorch Lightning framework; still experimental, with performance being continuously optimized |
|
Experimental |
| 3205 |
wildminder/awesome-ai-voice
List of open-source TTS, voice cloning, and music generation models |
|
Experimental |
| 3206 |
polterguy/magic-menu
An alternative input module for Phosphorus Five, allowing you to use natural... |
|
Experimental |
| 3207 |
AIOCW/EasyAss
Python 3 intelligent voice assistant |
|
Experimental |
| 3208 |
otaviocc/Stenographer
A macOS Tahoe app for transcribing audio/video files using Apple's on-device... |
|
Experimental |
| 3209 |
shricodev/google-sheet-super-agent
An AI Agent to work with Google Sheet using Natural Language |
|
Experimental |
| 3210 |
1epalpyrgou/smartbell-server
A smart bell for our school - 1st Vocational High School of Pyrgos |
|
Experimental |
| 3211 |
wukan1986/KWebSpeaker
A web page read-aloud tool that preserves the original layout and lets you select paragraphs |
|
Experimental |
| 3212 |
Sajith171111/whisper
🗣️ Transcribe your voice to text easily on macOS. Just hold **Fn**, speak,... |
|
Experimental |
| 3213 |
dive2Pro/AI-Waifu-Vtuber
AI Vtuber for Streaming on Youtube/Twitch |
|
Experimental |
| 3214 |
tonywu71/distilling-and-forgetting-in-large-pre-trained-models
Code for my dissertation on "Distilling and Forgetting in Large Pre-Trained... |
|
Experimental |
| 3215 |
nguyenanhtuan203207-arch/AI-Waifu-Vtuber
AI Vtuber for Streaming on Youtube/Twitch |
|
Experimental |
| 3216 |
egorsmkv/ukrainian-onnx-model
An ONNX model for speech recognition of the Ukrainian language |
|
Experimental |
| 3217 |
MaikeMota/comando-voz
Using the HTML5 SpeechRecognizer for command recognition. |
|
Experimental |
| 3218 |
Ben-jilo/awesome-faceless
🚀 Discover AI tools and resources to create faceless content for YouTube,... |
|
Experimental |
| 3219 |
alex-yelisieiev/advanced-speech-transcription
A minimalistic yet powerful voice transcribing app that features precise and... |
|
Experimental |
| 3220 |
jetblinx/sonus
Speech recognition and synthesis app |
|
Experimental |
| 3221 |
dhdaines/soundswallower-demo
Simple demo of client-side speech recognition |
|
Experimental |
| 3222 |
IDEA-Emdoor-Lab/UniTTS
A TTS Trained on Universal Audio. |
|
Experimental |
| 3223 |
pragmatrix/context-switch
Audio Streaming for FreeSWITCH with backends powered by Azure, OpenAI, and Aristech |
|
Experimental |
| 3224 |
aeleraqi/Text-to-Speech-gTTS---English-text
Easy-to-use Python library for converting English text into natural sounding... |
|
Experimental |
| 3225 |
Baisampayan1324/AI-MOM
AI MOM is an AI-powered meeting intelligence platform that delivers... |
|
Experimental |
| 3226 |
mathieutrudeau/Fast-TTS
API that uses Tortoise and RVC to speed up text-to-speech generation. |
|
Experimental |
| 3227 |
anonfaded/robospeaker101
Python tool for text-to-speech conversion with voice selection, usage... |
|
Experimental |
| 3228 |
guibranco/talabat-hackathon-2022
🏃 💡 Talabat Hackathon 2022 API project |
|
Experimental |
| 3229 |
PalabraAI/palabra-ai-java
Java SDK for Palabra AI's real-time speech-to-speech translation API. Break... |
|
Experimental |
| 3230 |
tsengia/SphinxTrainHelper
A Bash script designed to make training sphinx4 and pocketsphinx acoustic... |
|
Experimental |
| 3231 |
Lion-Wu/Voice-Chat
An app that allows you to have voice conversations using the API of AI... |
|
Experimental |
| 3232 |
Zuhef/Text-to-Speech
A simple text-to-speech converter built using HTML, CSS, and JavaScript. |
|
Experimental |
| 3233 |
zzpuser/SnapDict
macOS AI translation dictionary based on DeepSeek, providing smart translation, word-root mnemonics, spelling correction, and read-aloud | AI-powered dictionary app... |
|
Experimental |
| 3234 |
tetratensor/Next.js-Cloudflare-Voice-AI
🎤 A real-time voice AI assistant built with Next.js and deployed on... |
|
Experimental |
| 3235 |
erogol/ddc-samples
🐸💬 Coqui TTS Double Decoder Consistency samples |
|
Experimental |
| 3236 |
deapi-ai/claude-code-skills
YouTube/audio transcription, image, video generation, AI voice (TTS) & OCR... |
|
Experimental |
| 3237 |
speechnotes/speechnotes-speech-recognizer
The speech recognition engine behind Speechnotes, based on the Webspeech-API |
|
Experimental |
| 3238 |
hutchpd/AI-Medical-Scribe
Local-first AI medical scribe running entirely in the browser using Chrome... |
|
Experimental |
| 3239 |
GitPolyakoff/voice-assistant
Voice Assistant: a C# application for controlling a computer by voice.... |
|
Experimental |
| 3240 |
danklabs/tts_dataset_maker
A GUI to help make a text-to-speech dataset. |
|
Experimental |
| 3241 |
eddmann/VoiceScribe
Privacy-first macOS transcription app with global hotkey recording. 100%... |
|
Experimental |
| 3242 |
irismaker/pdf-to-audio
A flexible Python tool for converting PDF documents to audio using various... |
|
Experimental |
| 3243 |
ABashir88/enterprise-voice-ai-architectures
Reference architectures, cost models, and sales-engineering playbooks for... |
|
Experimental |
| 3244 |
milosgajdos/go-playht
PlayHT API client Go module |
|
Experimental |
| 3245 |
Jaffe2718/qwen3asr4j
Java binding for Qwen3 ASR |
|
Experimental |
| 3246 |
falabrasil/cmusphinx-br
Scripts and resources for ASR in Brazilian Portuguese |
|
Experimental |
| 3247 |
kimtth/agentic-connected-vehicle-platform
🚗🤖 Agentic Connected Vehicle Platform — Agent orchestration 🤝, Model Context... |
|
Experimental |
| 3248 |
hpbyte/myanmar-tts
Myanmar Text-to-Speech with End-to-End Speech Synthesis |
|
Experimental |
| 3249 |
treychen-369/WallWhisper
🏠 Turn any IP camera into a smart English tutor for your family. AI-powered,... |
|
Experimental |
| 3250 |
seven-io/swift-client
Official Swift API Client for the seven.io SMS Gateway |
|
Experimental |
| 3251 |
tim-dickey/voice-clone
Consumer-grade voice cloning app using Coqui TTS - clone your voice from... |
|
Experimental |
| 3252 |
YOUSSEF-BT/Ai-Summarizer
AI-powered summarizer for articles, PDFs, and Word documents with... |
|
Experimental |
| 3253 |
arcb01/g-narrator
A screen reading accessibility tool |
|
Experimental |
| 3254 |
BYO-UPM/Neurovoz_Dababase
Neurovoz corpus of parkinsonian speech |
|
Experimental |
| 3255 |
thankcheeses/NHID-Clinical
Non-Human Identity Disclosure Standard for Healthcare Voice Workflows |
|
Experimental |
| 3256 |
kaveenkumar/Speech_Recognition_and_Emotion_Detection_in_English_and_German
Python project for Speech-to-Text and Sentiment Analysis. Supports English... |
|
Experimental |
| 3257 |
heptacode/interactivekiosk
A kiosk improvement project for diverse users ✨ |
|
Experimental |
| 3258 |
Dante9581/laravel-elevenlabs
🎤 Integrate ElevenLabs Text-to-Speech and Speech-to-Text APIs seamlessly... |
|
Experimental |
| 3259 |
oeschsec/Sidekick---voice-controlled-keyboard-and-mouse
Voice controlled keyboard and mouse that is lightweight (minimal... |
|
Experimental |
| 3260 |
mrizwan47/vox
Python Text to Voice Package |
|
Experimental |
| 3261 |
HarikalarKutusu/3d-voice-chess
A voice driven 3D chess game for learning Voice AI |
|
Experimental |
| 3262 |
ZaneH/heybilly
🗣️ It's like Alexa, but for your computer. Highly modular, real-time voice... |
|
Experimental |
| 3263 |
Mildemelwe/Japanese-Tacotron-2-notebook
Training notebook for Japanese TTS model with Tacotron 2 |
|
Experimental |
| 3264 |
jakecyr/llm-voice
Library to reduce latency in voice generations from LLM chat completion streams. |
|
Experimental |
| 3265 |
srinivaspedapati/Voice-Assistant-using-Speech-Recognition
Desktop-based Personal Voice-Assistant Pi |
|
Experimental |
| 3266 |
repodiac/espeak-ng_german_loan_words
Brief tutorial with code where you can automatically create a dictionary... |
|
Experimental |
| 3267 |
Abhradipta/OCR-With-Read-Out-Loud-Using-Python
An Optical Character Recognition (OCR) System designed using Python to read... |
|
Experimental |
| 3268 |
bagustris/speech-recognition-course
Material for learning speech recognition, based on Microsoft teaching material on EdX |
|
Experimental |
| 3269 |
OpenVoiceOS/ovos-tts-plugin-pico
pico-tts-plugin |
|
Experimental |
| 3270 |
spokestack/android-skeleton
A functionless Android app that demonstrates a basic integration with the... |
|
Experimental |
| 3271 |
Dcros/NodeJs-AI-Live-Face-Recognition-Voice-Controlled
It's a voice-controlled AI (Natural Language Processing) with some live face... |
|
Experimental |
| 3272 |
GSA/coe-hud-acq-advanced-analytics
A repository for information related to the Data Analytics team's Advanced... |
|
Experimental |
| 3273 |
AlasdairKing/Calendar-VB6
Simple, accessible Calendar for screenreader and blind users. |
|
Experimental |
| 3274 |
EliFuzz/parakeet-mlx
Parakeet MLX is a next-generation automatic speech recognition (ASR) engine... |
|
Experimental |
| 3275 |
belambert/asr-scripts
Lots of miscellaneous scripts to work with Sphinx ASR files and other... |
|
Experimental |
| 3276 |
funnyzak/aliyun-nls
Node module for Alibaba Cloud intelligent speech processing. |
|
Experimental |
| 3277 |
Alexmhack/Django-PDF-Audio-Reader
Uploading PDF files on Webpage, converting text present in PDF to speech and... |
|
Experimental |
| 3278 |
matin91/Parrot
Parrot is a Talking Alarm App which allows the user to set up to 5 Alarm or... |
|
Experimental |
| 3279 |
BenjaminPoncet/bobby-snips-tts
bobby-snips-tts is an implementation of snips-tts written in Node.js with... |
|
Experimental |
| 3280 |
limbang/text-to-speech
Text-to-speech based on Azure |
|
Experimental |
| 3281 |
shyhirt/AutoDub
Automatic video translator and dubber using Whisper, XTTS v2 for voice... |
|
Experimental |
| 3282 |
IranTechNest/PersianSpeechRecognition
Persian Speech Recognition |
|
Experimental |
| 3283 |
ilbash/made_mail.ru
Code and theory from Big Data Academy |
|
Experimental |
| 3284 |
swiss-ai-center/text-to-speech-service
Queries an API based on Edge-TTS and returns an audio file based on... |
|
Experimental |
| 3285 |
Dawizzer/ComfyUI-Qwen3TTS-Emotional
Voice cloning with 80+ emotions and multi-emotion mixing for ComfyUI |
|
Experimental |
| 3286 |
GSA/coe-hud-acq-data-visualization
A repository for information related to the Data Analytics team's Data... |
|
Experimental |
| 3287 |
Barbany/Multi-speaker-Neural-Vocoder
Bachelor's thesis carried out at Universitat Politecnica de Catalunya in partial... |
|
Experimental |
| 3288 |
Ex094/VoiceCom
A Simple Voice Command Application powered by Java and Sphinx4 Speech... |
|
Experimental |
| 3289 |
palahsu/Greeting-PC
Greeting PC, made with a simple Visual Basic Script. Run the file and it executes... |
|
Experimental |
| 3290 |
bhadrik/Voice-Coding
Voice Coding is all about writing code by voice commands. |
|
Experimental |
| 3291 |
warisqr007/vq-bnf
Vector Quantizing speech representations |
|
Experimental |
| 3292 |
Epistates/rosellas
Automatic speech recognition (ASR) for Apple Silicon |
|
Experimental |
| 3293 |
localzet/tts
Web service for voicing text using Microsoft Edge TTS |
|
Experimental |
| 3294 |
aeleraqi/gTTS---Arabic-text-to-multiple-languages
Converting Arabic text to speech in various languages with the versatile... |
|
Experimental |
| 3295 |
voothi/20240411110510-autohotkey
This repository is a collection of personal AutoHotkey v2 scripts designed... |
|
Experimental |
| 3296 |
Redwiat/Language-Translator
Language Translator App - Translate text into multiple languages. Built with... |
|
Experimental |
| 3297 |
yeyupiaoling/VITS-PaddlePaddle
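This is a speech synthesis project based on PaddlePaddle that uses VITS. VITS is a speech synthesis method; this end-to-end model is very simple to use and does not need text alignment or other overly... |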
|
Experimental |
| 3298 |
cjh0613/vosk-android-demo-chinese
Chinese vosk-android-demo |
|
Experimental |
| 3299 |
taoing/tts-server
Microsoft Xiaoxiao speech synthesis API |
|
Experimental |
| 3300 |
viig99/esolafast
Fast C++ implementation of ESOLA using KFRLib, can be used for online... |
|
Experimental |