All Voice AI Tools
8,165 tools ranked by quality score · Page 73 of 82
| # | Tool | Score | Tier |
|---|---|---|---|
| 7201 | Arushi-Srivastava-16/SpatialAudio: SpatialAudio detects key objects using YOLOv8, identifies their location in... | | Experimental |
| 7202 | NusrathFarheen/talkbuddy: An AI-powered voice chatbot built with JavaScript and Node.js to help you... | | Experimental |
| 7203 | partha-09/VenomX-Windows-Voice-Assistant: VenomX is an advanced voice assistant for Windows, utilizing Python and AI... | | Experimental |
| 7204 | djdhairya/J.A.R: JAR represents a paradigm shift in desktop interaction, optimizing task... | | Experimental |
| 7205 | jayeshbhandarkar/GlobalSpeak: GlobalSpeak is a Python Flask web application designed to overcome language... | | Experimental |
| 7206 | BatuhanYilmaz26/Youtube-Transcriber: Input a YouTube video link and get a transcription as a .txt, .vtt or .srt file. | | Experimental |
| 7207 | zeeshanmahar007/Sign-Talk---Bridge-the-Gap-of-Communication: SignTalk is an Android-based application in which hearing or speech... | | Experimental |
| 7208 | thuantn210823/ASR: This repo utilizes the popular and highly effective Conformer Encoder from... | | Experimental |
| 7209 | louis030195/book2audiobook: text to speech public domain / free audio books | | Experimental |
| 7210 | Alkohole/machine-reading-text: A small extension that adds the ability to voice text from YaBrowser to... | | Experimental |
| 7211 | phil1px/voice-cloner: Flask/FastAPI + Gradio app for voice cloning with Resemble AI — upload,... | | Experimental |
| 7212 | Rulikkk/digit-tutor: Digit Tutor is a simple online game for kids in Svelte, which uses speech... | | Experimental |
| 7213 | Harsha0431/News-Scraper-Summarizer-Text-to-Speech: This project scrapes news articles, summarizes content using BERT or Gemini... | | Experimental |
| 7214 | AbdulGani11/Vocably: Text-to-speech web application built with React, FastAPI, JWT... | | Experimental |
| 7215 | VinnyVanGogh/cli-whisperer: 🎤 Professional Voice-to-Text TUI Application - OpenAI Whisper + GPT with... | | Experimental |
| 7216 | leeorhelps/SpeechBird: Speech Bird is a speech recognition system which makes complete hands-free... | | Experimental |
| 7217 | dhcgn/ai-sample-scripts: Simple shell scripts for AI tasks (image description, transcription, TTS,... | | Experimental |
| 7218 | DJJ547/CMPE273-Book-Reader-Django: An AI-powered book reader app with search, library management, and... | | Experimental |
| 7219 | HuyDang05/Ringurooma: AI-powered Japanese speaking practice platform using n8n and Azure AI for... | | Experimental |
| 7220 | muhammadazhariqbal/schedule-ai-backend: Serverless AI-powered service that helps convert audio to text and extract... | | Experimental |
| 7221 | Mamrez/speech-recognition: Analogue speech recognition based on physical computing | | Experimental |
| 7222 | manasgandhi99/Lappy-Voice-Assistant: A laptop voice assistant built using Python that can perform multiple... | | Experimental |
| 7223 | Cyan903/zundamon-yomitan: Fallback audio source for Yomitan which uses Zundamon (ずんだもん) TTS. | | Experimental |
| 7224 | verbio-technologies/cpp-verbio-speech-center: C++ integration with the Verbio Speech Center Cloud. https://speechcenter.verbio.com/ | | Experimental |
| 7225 | bryanrandell/ChatGPT_speech_to_speech: Uses OpenAI and Google Cloud to get spoken responses to spoken questions from OpenAI ChatGPT | | Experimental |
| 7226 | myselfaryan/uchchaaran: A sophisticated text-to-speech (TTS) system specifically designed for... | | Experimental |
| 7227 | shubhomoydas/ai_raspberrypi: Voice instructions with AI for controlling a LEGO motor connected to a Raspberry Pi 5 | | Experimental |
| 7228 | sungjae-cho/ICASSP2020_STDemo: Show and Tell demonstration homepage | | Experimental |
| 7229 | seven-io/StackStorm: Send SMS and make text-to-speech calls via StackStorm | | Experimental |
| 7230 | MohamedNabill7/Alex-Virtual-Assistant-in-Smart-Home: Machine Learning Behind AI Smart Assistant in Smart Home | | Experimental |
| 7231 | sj2tpgk/voiceroid-docker: Voiceroid+ in Docker on x64/Arm Linux + web interface (mirrored from... | | Experimental |
| 7232 | RichFesler/node-red-tts-flask: Fast local text-to-speech system for Node-RED using Flask, Coqui TTS, and FFmpeg | | Experimental |
| 7233 | driftingruby/395-transcribing-with-artificial-intelligence: In this episode, we look at creating an audio transcription service which... | | Experimental |
| 7234 | Rtiwary-1/Voice-Based-Music-Playlist-Generator: This project was done as coursework for the subject of Database Management System | | Experimental |
| 7235 | iamvon/AudioRead: Turn PDFs into audio with chunked LLMs and OpenAI TTS | | Experimental |
| 7236 | aghezzafmohamed/Chatbot-with-Python-and-Deep-Learning: Chatbot that will help university students. In order to reduce the... | | Experimental |
| 7237 | taresh18/livekit-orpheus: LiveKit TTS plugin with Orpheus streaming support | | Experimental |
| 7238 | test-dan-run/squim-report: Using TorchAudio-SQUIM to create dataset quality reports | | Experimental |
| 7239 | trautodiag/Free-Local-AI-Voice-Cloning-Unlimited: Stop paying monthly fees. High-fidelity voice cloning on your own GPU. No... | | Experimental |
| 7240 | polojudayamani-crypto/Shashitha-voice-assistant: Python voice assistant project | | Experimental |
| 7241 | wasabina67/openai-tts-example: OpenAI TTS example | | Experimental |
| 7242 | fabianzimber/atelier-of-synthetic-voice: A professional, iOS-inspired studio for high-fidelity voice cloning and... | | Experimental |
| 7243 | mrdiamonddirt/python-llm-interpreter: A Python script that listens for a request, plans a response, tells you the... | | Experimental |
| 7244 | Sunkware/notthatstuff: Large-language-model + Text-to-speech + Voice-cloning doppelgangers, trained... | | Experimental |
| 7245 | alifarrokh/asr-from-scratch: ASR models implemented from scratch in PyTorch | | Experimental |
| 7246 | Nexdata-AI/347-Hours-Italian-Speech-Data-Collected-by-Mobile-Phone: Italian Speech Dataset | | Experimental |
| 7247 | rwmicro/voice-backend: Voice backend that provides access to Kokoro, Chatterbox and F5-TTS. | | Experimental |
| 7248 | AleefBilal/tts_srt_gen: A RunPod serverless Docker image that generates TTS using chatterbox-tts along with .srt | | Experimental |
| 7249 | JacobCoffee/dpo-reader: For those too lazy to read a DPO thread that is far too long. Option of good... | | Experimental |
| 7250 | KunalSingh5431/smartPDF: AI-powered PDF summarizer with text-to-speech built using MERN stack. | | Experimental |
| 7251 | r-shafi/bangla-speech-to-text: Automatic speech recognition for the Bangla language, one of the world's... | | Experimental |
| 7252 | agdm/chatterbox-api: Fast API in front of Chatterbox | | Experimental |
| 7253 | 0x3EF8/Unified-API-Server: A modular, auto-loading REST API server built with FastAPI. Drop a service... | | Experimental |
| 7254 | roperi/Podcast2Wordpress: Podcast2Wordpress automates podcast-to-blog post conversion on WordPress. It... | | Experimental |
| 7255 | thomasthaddeus/TTSSolution: TTS Application written in C# | | Experimental |
| 7256 | ittia-research/speak: Education-oriented TTS inference server | | Experimental |
| 7257 | codysnider/kokoro: Dockerized Kokoro TTS | | Experimental |
| 7258 | jpdiazpardo/gutural_nlp: Guttural and scream automatic speech recognition (ASR) system using a... | | Experimental |
| 7259 | GermanCentralLibraryForTheBlind/TTSOnDemand: Text to speech technology to speech-enable web sites | | Experimental |
| 7260 | cookerwatcher/ChopItUp: Python scripts to perform speech recognition on video files, then chop them... | | Experimental |
| 7261 | laafeiak/ai_text_reader: text | | Experimental |
| 7262 | ayushirastogi15/Flask-Application-Development: This repository shows how to develop a Flask application for the speech... | | Experimental |
| 7263 | jefflai108/scale: Some of my public work at https://hltcoe.jhu.edu/research/scale/scale-2017/ | | Experimental |
| 7264 | ReadieFur/AWS-Polly-for-SpeechChat: Reads out Twitch, YouTube and Mixer chat from Speechchat using AWS Polly. | | Experimental |
| 7265 | coco-whisper/Voice-Conversation-Audio-Generation-Platform-TTS-: A self-hosted platform for text-to-speech, voice conversion, and AI audio... | | Experimental |
| 7266 | bliptron/Google-TTS-Server: A FastAPI server for Google Gemini Text-to-Speech with modern web interface.... | | Experimental |
| 7267 | Tailmc/Syaberunoda: A simple read-aloud bot using VoiceVox | | Experimental |
| 7268 | t1seo/karina-voice-notification: Clone any voice from YouTube to create custom Claude Code notification... | | Experimental |
| 7269 | rounayak/Virtual-assistant: Python-based virtual assistant that can understand speech, respond via speech... | | Experimental |
| 7270 | dwain-barnes/vui-fastapi-server: An OpenAI-compatible Text-to-Speech API server powered by VUI - a small... | | Experimental |
| 7271 | roopesharch/EchoSonic: Built and deployed a full-stack AI text-to-speech platform using FastAPI and... | | Experimental |
| 7272 | ohboundless/HeyWindows: A basic voice command interface for Windows. | | Experimental |
| 7273 | MrBlueBird2/jarvis-in-python: An amazing AI which will talk with you and, wikipedia, questions. | | Experimental |
| 7274 | SyedSohail786/SaaS-Website: This project supports text-to-image and text-to-speech functionality which... | | Experimental |
| 7275 | saroshfarhan/story-teller: Story-Teller | | Experimental |
| 7276 | apribeiro/TextToSpeechApp: A simple C# console application that converts user input text to speech. | | Experimental |
| 7277 | jokio/sdk: SDK for building decentralised local-first web apps. Provides TTS AI model... | | Experimental |
| 7278 | E-Asrar-Haghighi/farsi-tts-generator-with-music: Convert Farsi text to speech using OpenAI TTS, with optional background... | | Experimental |
| 7279 | Sarasadeghii/Sharif-Wav2vec2: This repo shows how to fine-tune the wav2vec 2.0 model along with its prerequisites. | | Experimental |
| 7280 | alokbhateshwar/virtual-assistant: Python-based virtual assistant with voice recognition and text-to-speech... | | Experimental |
| 7281 | chasmack/translate: Translation and Text-to-Speech for Anki Card Decks | | Experimental |
| 7282 | Matthias84/speech2josm: JOSM presets via voice control | | Experimental |
| 7283 | G3VV/Twine: 🌿 A tool to automatically generate Reddit TTS, Comment Screenshots and JSON Data | | Experimental |
| 7284 | nisheethjaiswal/Speech-to-Text: Speech to text implementation using transformers in PyTorch. | | Experimental |
| 7285 | IJCS/Trainer-app: A lightweight and highly flexible tool designed to assist coaches... | | Experimental |
| 7286 | nfreear/simple-speak: Power-tool wrapper around the browser Web Speech API | | Experimental |
| 7287 | shestaya-liniya/accentless: Shape your accent with AI | | Experimental |
| 7288 | nafiuny/voice_conversion_dataset: Top dataset for voice conversion models | | Experimental |
| 7289 | caraleeqiu/mememeow: Practice English speaking with a carrot cat! Read along with YouTube/TikTok... | | Experimental |
| 7290 | joeybronner/meeting-live-translation: 🎤 Live translation for your meetings using HTML5 Speech Recognition API | | Experimental |
| 7291 | alecproj/microphone-module: Smart Home Microphone Module | | Experimental |
| 7292 | aloukikjoshi/FinSpeak: 🎙️ Voice-powered mutual fund assistant — Ask about NAV & returns in English,... | | Experimental |
| 7293 | mateogon/Cadence: Cadence: immersive reading pipeline from EPUB to audiobook with synchronized... | | Experimental |
| 7294 | speakingofdata/LJ2_Corpus: Single speaker, 26,200 transcribed audio recordings, 48 hours total | | Experimental |
| 7295 | singleshade8/japanese-subtitle-generator: GPU-accelerated Japanese → English subtitle generator using faster-whisper... | | Experimental |
| 7296 | bivex/voice_to_text: A Python application for real-time Russian voice-to-text transcription and... | | Experimental |
| 7297 | D-Keqi/Implementation-for-ASR-by-API-of-Baidu: This is open-source code that you can use to connect to Baidu's API to... | | Experimental |
| 7298 | balas-world/kitten-tts-web-demo: Kitten TTS Web Demo showcases the Kitten TTS Nano in your browser—a... | | Experimental |
| 7299 | birros/dictations: Experimental progressive web application for dictations | | Experimental |
| 7300 | ArshCypherZ/text-to-speech: Text-to-speech API using Kokoro. | | Experimental |