All Voice AI Tools
6,981 tools ranked by quality score · Page 32 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 3101 | mrmanna/Nvidia_Nemo_FastPitch_TTS_Example<br>How to Build a High-Quality Text-to-Speech (TTS) System Locally with Nvidia... | | Experimental |
| 3102 | Sumit81107/echo-tts<br>🔊 Create lifelike speech from text using a multi-speaker model, enhancing... | | Experimental |
| 3103 | rock3125/tts<br>Simple text to speech server in docker using coqui-ai/TTS | | Experimental |
| 3104 | The-Data-Dilemma/Medibeng-Orpheus-3b-0.1-ft-Fine-Tuning<br>Medibeng-Orpheus-3b-0.1-ft- A TTS model for bilingual Bengali-English... | | Experimental |
| 3105 | mym-br/gama_tts<br>Experimental articulatory speech synthesizer derived from Gnuspeech | | Experimental |
| 3106 | railmapgen/rma<br>Generate the rail announcement from your rmg project! | | Experimental |
| 3107 | MahtaFetrat/VirgoolInformal-Speech-Dataset<br>A dataset of informal Persian audio and text chunks, along with a fully open... | | Experimental |
| 3108 | AssemblyAI-Community/assemblyai-and-python-in-5-minutes<br>Repo for hosting tutorial code associated with the "AssemblyAI and Python in... | | Experimental |
| 3109 | Leapward-Koex/Namida-OCR<br>A purely browser-based OCR tool designed for recognizing, copying, and... | | Experimental |
| 3110 | dpressel/reserve<br>FastAPI + WebSockets + SSE service to interface with Triton/Riva ASR | | Experimental |
| 3111 | helemanc/ambient-intelligence<br>Application for Disruptive Situations Detection in public transports through... | | Experimental |
| 3112 | botbahlul/VOSK-Powered-LIVE-SUBTITLE-V2<br>ANDROID APP that can RECOGNIZE LIVE AUDIO/VIDEO STREAMING (using free VOSK... | | Experimental |
| 3113 | aloproducao/Live-captions-for-broadcast<br>The Real-Time Speech Recognition System is an innovative tool designed to... | | Experimental |
| 3114 | Baibhav-nag/SER-using-MLP-and-CNN<br>Speech emotion recognition using MLP and CNN on four benchmark datasets... | | Experimental |
| 3115 | koesan/Evoars<br>A multi-model AI platform for comics, manga, and videos. It colorizes... | | Experimental |
| 3116 | BobRandomNumber/ComfyUI-KyutaiTTS<br>A non real-time ComfyUI implementation of Kyutai TTS | | Experimental |
| 3117 | itning/hass-aliyun_bailian_tts<br>Home Assistant integrates Alibaba Cloud's BaiLian Platform TTS | | Experimental |
| 3118 | AfkaraLP/qwen3-tts-webui<br>A simple web UI and API for cloning voices with Qwen3-TTS | | Experimental |
| 3119 | tjysdsg/capt-public<br>Public version of my Computer-Aided Pronunciation Training (CAPT) system (server) | | Experimental |
| 3120 | slayerrr12/WaveSlayer<br>AI chatbot that uses speech to operate and respond | | Experimental |
| 3121 | gfrangiamone/audiobook-maker<br>Web tool to convert an EPUB or TXT book into an audiobook via the edge_tts library | | Experimental |
| 3122 | ckaznable/yt-cli-live<br>YouTube Text Live Streaming in CLI | | Experimental |
| 3123 | zugaldia/speedofsound<br>Voice typing for the Linux desktop. | | Experimental |
| 3124 | NN-Project-2/Emotion-TTS-Emebddings<br>This project explores zero-shot emotional speech synthesis using EMOD, a... | | Experimental |
| 3125 | srvk/srvk-eesen-offline-transcriber<br>Top level code to transcribe English audio/video files into text/subtitles | | Experimental |
| 3126 | arthurxlw/cytonNss<br>Cyton Online Neural Sentence Segmentation for Simultaneous Interpretation | | Experimental |
| 3127 | netbuffer/android-technology-test<br>Android technology tests using the Java language: DI, Handler, Hilt, Scheduler, TTS, Log... | | Experimental |
| 3128 | yuvraj108c/ComfyUI-PiperTTS<br>ComfyUI Piper TTS Custom Node | | Experimental |
| 3129 | DmytroNorth/Automated_Subtitles_Generation-Regex_Java<br>An automated workflow that generates timestamped subtitles from a video file... | | Experimental |
| 3130 | sanyasamineva0x/govorun-app<br>Govorun (Говорун): offline Russian-language voice input for macOS (GigaAM-v3 + Silero VAD) | | Experimental |
| 3131 | ibelgin/Text-To-Speech-App<br>This app is made using React Native. | | Experimental |
| 3132 | QuantiusBenignus/NoteWhispers<br>Voice memos recorded from the microphone, transcribed offline to text and... | | Experimental |
| 3133 | FragJage/PicoVoiceCpp<br>PicoVoiceCpp is a simple TTS (text to speech) class based on picovoice (svox). | | Experimental |
| 3134 | nodef/extra-tts<br>Generate speech audio from super long text through machine. | | Experimental |
| 3135 | CingZeoi/YDVoiceTTS<br>Chinese TTS from Yongde screen reader | | Experimental |
| 3136 | jarmitage/tts-cli<br>Simple CLI app for TTS | | Experimental |
| 3137 | dwain-barnes/chatterbox-streaming-api-docker<br>Chatterbox with OpenAI-compatible endpoints, streaming support, multiple... | | Experimental |
| 3138 | pnkvalavala/multivoice<br>Multivoice: Enhance your foreign-language movie and TV show experience with... | | Experimental |
| 3139 | jmdlab/vesper<br>Therapeutic audio pipeline. Faith meets science. Free, static, open source. | | Experimental |
| 3140 | nilakshdas/ADAGIO<br>Adversarial Defense for Audio in a Gadget with Interactive Operations | | Experimental |
| 3141 | ramsrk7/AIVoxPlay<br>AI-powered real-time voice interaction framework for building conversational... | | Experimental |
| 3142 | shreyasnisal/VoiceQuiz-v2<br>Version 2 of the quiz-app; this is the repository for the voice-based quiz.... | | Experimental |
| 3143 | ry-sun/bob-plugin-openai-tts<br>OpenAI TTS for Bob Plugin is a TTS plugin for Bob, a brilliant translation... | | Experimental |
| 3144 | louiscoetzee/mlx-tts-studio<br>Native macOS text-to-speech app powered by Qwen3-TTS and Apple Silicon... | | Experimental |
| 3145 | KVarnitZ/Total-Tank-Simulator-UA<br>Ukrainian localizer for TTS that fully adds the language as a separate option (translated from... | | Experimental |
| 3146 | abumubaarak/Wellbeing-Doctor<br>Doctor management app | | Experimental |
| 3147 | rohanmistry231/Voice-Assistant<br>A Python-based voice assistant that processes voice commands to perform... | | Experimental |
| 3148 | zhongyuchen/speech-classification<br>CNN and VGG speech classification with interactive website for testing | | Experimental |
| 3149 | circle-hotaru/talk-boost-ai<br>A web application that utilizes AI to help you improve your English speaking... | | Experimental |
| 3150 | elie-atia/talk-to-chat-gpt<br>Enables talking to ChatGPT using voice-to-text (record and recognize the... | | Experimental |
| 3151 | sidphbot/visual-to-audio-aid-for-visually-impaired<br>A system to process visual input on timed frames to produce sensible audio... | | Experimental |
| 3152 | luciferchase/chase_hospitals<br>This is a GUI based Python connectivity project on Hospital Management. The... | | Experimental |
| 3153 | Sidra-009/AI-Interview-Coach<br>AuraCoach is an AI-powered interview coach that generates personalized... | | Experimental |
| 3154 | taeefnajib/Vocazee<br>A voice cloning and text-to-speech application that can generate speech in any voice. | | Experimental |
| 3155 | SzLeaves/asr-model-ctc<br>ASR deep learning models (using BiGRU, WaveNet, and CTC), using Tensorflow2... | | Experimental |
| 3156 | adi611/The-CheatGPT<br>Python application that uses GPT-3 language model and Pinecone vector... | | Experimental |
| 3157 | german-asr/kaldi-german<br>Scripts for training Kaldi for German speech recognition (ASR). | | Experimental |
| 3158 | Iroha-P/MiniBox<br>Character voice chatbot with GPT-SoVITS TTS + LLM role-playing, supports Web... | | Experimental |
| 3159 | Saga9103/t2yLLM<br>A voice assistant with local LLM as a backend | | Experimental |
| 3160 | flexhub77/piper-tts-call<br>🎙️ Generate high-quality audio from text in real-time with Piperin, the... | | Experimental |
| 3161 | developer-mezbah/Mock-Test-UI<br>Practice IELTS, TOEFL, & PTE speaking online. This web app offers full test... | | Experimental |
| 3162 | binglel/asr_baidu_web_server<br>ASR web server based on Flask | | Experimental |
| 3163 | myths-labs/prometheus-avatar<br>Open-source SDK for driving Live2D & 3D avatars with LLM output. Give your AI a face. | | Experimental |
| 3164 | dangkhoadl/WER-in-cpp<br>Calculates the word error rate between the reference and hypothesis in ASR,... | | Experimental |
| 3165 | buddheshwarnath/blurtpy<br>Offline, cross-platform Python text-to-speech and sound notifications.... | | Experimental |
| 3166 | mcw519/Brownie<br>Post processing for speech recognition | | Experimental |
| 3167 | vinbhaskara/Digit-Speech-Recognition<br>Using MFCC features on Speech Signals to classify Digits after matching... | | Experimental |
| 3168 | hi5/nvda-autohotkey<br>NVDA and AutoHotkey - Text to Speech (TTS) and Braille from AHK scripts | | Experimental |
| 3169 | nisiddharth/TextToSpeech<br>A simple Java-based Text to Speech converter made using NetBeans 8.2 | | Experimental |
| 3170 | answersolutionsapps/runandread-ios<br>Ultimate Text-to-Speech and Audiobook Player for iOS, macOS | | Experimental |
| 3171 | naturalDesign/fusion-remote<br>Chatbot for Autodesk Fusion 360 with speech recognition | | Experimental |
| 3172 | TCL606/Speech-Number-Recognition<br>Spoken digit recognizer based on digital signal processing | | Experimental |
| 3173 | juan-csv/Alfred-assistant<br>Assistant in which you can program any type of action in the python... | | Experimental |
| 3174 | marttirandma/tipi<br>Tipi Web v2 | | Experimental |
| 3175 | backpropper/DNN-Activation-Brain<br>Code repository for Dissecting the DNN Brain for a Better Insight (ICASSP 2016) | | Experimental |
| 3176 | artemnikitin/tts-test-app<br>Android app for testing Text-to-speech stuff | | Experimental |
| 3177 | mmphego/medium-to-speech<br>Medium posts as Markdown to Speech. | | Experimental |
| 3178 | egorsmkv/radtts-uk<br>🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model | | Experimental |
| 3179 | Inviro/Illud<br>Illud is a smart text analyzer written in pure Java that displays different... | | Experimental |
| 3180 | sidagarwal04/SpeechRecognition-Sphinx-GCP<br>Speech Recognition on edge using CMU Sphinx and on cloud using Google Cloud... | | Experimental |
| 3181 | auroraapi/aurora-python<br>Aurora SDK for Python | | Experimental |
| 3182 | ganlvtech/bing-stt<br>Rust implementation of Bing "Search using voice" button speech recognition... | | Experimental |
| 3183 | denizariyan/Real-Time-Auto-Transcriber<br>Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to... | | Experimental |
| 3184 | stefantaubert/english-text-normalization<br>Command-line interface (CLI) and library to normalize English texts. | | Experimental |
| 3185 | tigjaw/remyme<br>ReMyMe - a basic "Read My Messages" Android application (old) | | Experimental |
| 3186 | alfianlosari/flutter_chatbot_inventory<br>Chatbot Flutter App used to track inventory of product and description using... | | Experimental |
| 3187 | Raj2503/Python-Text-To-Speech-Hindi<br>Python Hindi Concatenative Based TTS using Phoneme Database | | Experimental |
| 3188 | amelielavender/voicely<br>A Discord bot that transmits text-to-speech messages directly to voice channels | | Experimental |
| 3189 | jhdeov/armenian-intonation<br>Repository of question-answer dialogues of Armenian, for an intonation study. | | Experimental |
| 3190 | alihassanml/Voice-Controlled-Agentic-AI-Bot<br>A real-time voice assistant powered by Ollama, Piper TTS, and... | | Experimental |
| 3191 | Philipinho/ThreadVoice<br>Source code for https://twitter.com/threadvoice | | Experimental |
| 3192 | RavnOP/Vehicle-Speed-and-Type-Detection-Using-YoloV8<br>This system uses artificial intelligence to detect vehicles in video... | | Experimental |
| 3193 | Arbazkhan4712/Text-To-Speech<br>A program that can convert Text into Speech using Python | | Experimental |
| 3194 | PiasRoY/Bangla-Spoken-Number-Recognition<br>Recognizing spoken Bangla numbers using MFCCs and CNN. | | Experimental |
| 3195 | JaesungBae/Speech-Command-Recognition-with-Capsule-Network<br>Speech command recognition with capsule network & various NNs / KWS on... | | Experimental |
| 3196 | mehdichaouch/nabstory<br>Let your Nabaztag 🐰 read you a story 📖 | | Experimental |
| 3197 | franchesoni/s2t<br>🗣️ ⌨️ Speech-to-text on key for Linux | | Experimental |
| 3198 | ggh-png/EMOTIBOT<br>Emotion robot using the GPT-3.5 model (EMOTIBOT) | | Experimental |
| 3199 | M-SRIKAR-VARDHAN/speech-to-speech-with-lipsync<br>End-to-end speech-to-speech translation pipeline with voice cloning (RVC)... | | Experimental |
| 3200 | pigzach/MagicSpeechASR<br>MagicSpeech competition recipe | | Experimental |