All Voice AI Tools
6,981 tools ranked by quality score · Page 49 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 4801 | InboraStudio/Google-Cloud-Speech-Recognition-Unity<br>Unity Speech Recognition with Google Cloud. A cross-platform speech... | | Experimental |
| 4802 | Kenjd/student-name-pronunciation-helper<br>A Shiny app to help teachers learn correct pronunciation of student names | | Experimental |
| 4803 | SudharsanSaravanan/JARVIS<br>JARVIS (Just A Rather Very Intelligent System) is a voice-controlled,... | | Experimental |
| 4804 | IceDynamix/iceTTS<br>Twitch Chat TTS with no strings attached | | Experimental |
| 4805 | jinseok19/Intermediate_Level_Project_for_AI-X<br>🤖 AI+X leading talent development intermediate project with KT & Sangmyung University 🤖 | | Experimental |
| 4806 | chrismarquezz/voice-chess<br>An interactive chess app that lets you play and control the game entirely... | | Experimental |
| 4807 | syedjahangirpeeran/Optical-Character-Recognition-and-TTS<br>Written in MATLAB, the project aims to convert handwritten or printed text... | | Experimental |
| 4808 | dcervantes/VoiceFlashcards<br>VoiceFlashcards is an innovative web app that helps users practice language... | | Experimental |
| 4809 | Uriiol1808/Harmon-AI<br>Music AI web-based application. | | Experimental |
| 4810 | Salama1429/speech-to-speech-translation<br>Cascaded speech-to-speech translation (STST), mapping from source speech in... | | Experimental |
| 4811 | imvladikon/wav2vec2-hebrew<br>Speech Recognition for Hebrew (using wav2vec2 models) | | Experimental |
| 4812 | PrarieComamile/speech-to-text<br>Convert your voice to a text file with this program. | | Experimental |
| 4813 | TirtaSandy/AI-Interview-Assist<br>📸 Enhance your job interviews with AI-powered screen analysis and privacy... | | Experimental |
| 4814 | huzaifa-fullstack/eduvox-ai<br>EduVox AI is an AI-powered educational voice companion that delivers... | | Experimental |
| 4815 | OpenLake/Speech-Analyser<br>An App to help you improve your English fluency 🎤 | | Experimental |
| 4816 | vault-42/AIND_DNN_Speech_Recognizer<br>End-to-end speech to text recognition | | Experimental |
| 4817 | romestylez/pocketChat<br>Your stream in your pocket: read, write, and moderate chat, events from... | | Experimental |
| 4818 | williamclavier/Multimodal-Classroom-Video-Recorder<br>A smart multimodal classroom video recording system that automatically... | | Experimental |
| 4819 | rahulsushilsharma/tts-pipelines<br>TTS pipelines for text to speech using Kitten TTS and Piper TTS | | Experimental |
| 4820 | b0o/whispertool<br>🗣️ voice recording and transcription tool built on whisper.cpp | | Experimental |
| 4821 | Usman-bin-Khalid/Jarvis-AI-Voice-and-Text-Assistant-Python-<br>Jarvis AI Voice & Text Assistant – A Python-based desktop AI assistant with... | | Experimental |
| 4822 | joypix-ai/joypix<br>AI Talking Video Generator: Talking Photo (AI lip-sync) + AI Avatar... | | Experimental |
| 4823 | rockerritesh/kitten-tts-android<br>KittenTTS - On-device text-to-speech Android app using ONNX Runtime and espeak-ng | | Experimental |
| 4824 | ZarredFelicite/parakeet-transcriber<br>An audio transcription tool using NVIDIA Parakeet, available as a CLI or... | | Experimental |
| 4825 | thewh1teagle/phonikud-assistant<br>Local AI assistant in Hebrew with Phonikud ✨ | | Experimental |
| 4826 | hwpoison/vosk-voice-recognition-c<br>Offline voice recognition using pure C and vosk lib. (from file and from... | | Experimental |
| 4827 | upstash/radio-hackernews<br>Audio Recap of Top Hackernews Stories | | Experimental |
| 4828 | ss87021456/mfcc_ctc_speech<br>Applies MFCC features of the waveform with the LSTM + CTC loss architecture | | Experimental |
| 4829 | linseycurrie/NHS-Speech-Recognition-App<br>This was a group project created remotely over 7 days using Java, Spring,... | | Experimental |
| 4830 | msalhab96/RNN-Transducer<br>PyTorch implementation of Sequence Transduction with Recurrent Neural... | | Experimental |
| 4831 | LiveTrad/livetrad-io-engine<br>A complete solution for live meeting translations: Extension + DesktopApp + Server | | Experimental |
| 4832 | smsraj2001/PYEDIT-PRO-THE-ULTIMATE-ADVANCED-TEXT-EDITOR<br>An advanced text editor in Python with enhanced and amazing features | | Experimental |
| 4833 | tuanio/e2e-asr-toolkit<br>E2E Speech Recognition Toolkit with Hydra and PyTorch Lightning | | Experimental |
| 4834 | gheyret/uyghur-asr-transformer<br>Speech Recognition for Uyghur using Speech transformer | | Experimental |
| 4835 | mohammadhasananisi/Google-Speech-Recognition<br>Persian-Speech-Recognition | | Experimental |
| 4836 | smswg/FreeSwitch-Mod_FunAsr<br>FreeSWITCH... | | Experimental |
| 4837 | Nono-04/ChannelPoints-TTS<br>A simple TTS rewards script for Twitch channel points | | Experimental |
| 4838 | x2agi/x2agi-speechkit<br>🎧 X2AGI speech services: ASR, diarization, AI reports (gRPC, REST clients) | | Experimental |
| 4839 | OpenVoiceOS/ovos-tts-plugin-google-tx<br>Google Translate TTS plugin for OVOS / Mycroft | | Experimental |
| 4840 | sahilmishra0012/prescription-generator<br>This project aims at generating the prescription dictated by the doctor in a... | | Experimental |
| 4841 | AdilShamim8/BUET-CSE-Fest-2026<br>DL Sprint 4.0 \| BUET CSE Fest 2026: Bengali Long-Form Speech Recognition... | | Experimental |
| 4842 | EN10/Speech-to-Text-WaveNet<br>Speech to Text | | Experimental |
| 4843 | leo01102/lumen<br>Lumen – Empathetic, multimodal AI assistant (face and voice) in real time.... | | Experimental |
| 4844 | rahul6975/Helping-Voice<br>An Android application that works entirely on voice input, which helps... | | Experimental |
| 4845 | Jay113910/Speech-to-Text-Vosk<br>A real-time speech recognition program using a microphone, based on "Vosk" - an... | | Experimental |
| 4846 | neosun100/higgs-audio-web<br>🎵 All-in-One Docker deployment for Higgs Audio v2 with Web UI, REST API &... | | Experimental |
| 4847 | rudhreeshkumaar/Speech-to-Text<br>Speech recognition and text transcription from file or microphone | | Experimental |
| 4848 | MAXBAF1/SpoonEat<br>A mobile application for maintaining a balance in nutrition, with the... | | Experimental |
| 4849 | astrologos/libri-scraper<br>The Public Audiobook Scraper downloads full audiobook MP3s from... | | Experimental |
| 4850 | ngetikin/selfbot-discord<br>This Discord selfbot focuses on voice plus lightweight utilities. The core is already bootstrapped,... | | Experimental |
| 4851 | DrewThomasson/ebook2audiobookEspeak<br>Easily create audiobooks with espeak in a Gradio GUI | | Experimental |
| 4852 | Herobrine25mcpe/text-to-speech_Tkinter<br>So this is a project in which I am working on a simple text to speech... | | Experimental |
| 4853 | DDDeeeee/Teasr<br>Microphone-free speech recognition and text polishing for vibe coding. | | Experimental |
| 4854 | 10raw/Prescription-Generator<br>Android app to generate doctor's prescriptions faster using deep learning | | Experimental |
| 4855 | LuisMiSanVe/AiCursorHelper<br>AI Assistant that helps you move around your Desktop with voice commands | | Experimental |
| 4856 | YoRyan/obicaller<br>Talking caller ID for OBiTALK OBi200 and Raspberry Pi (or other Linux) | | Experimental |
| 4857 | ducnt18121997/Viet-Transformer-TTS<br>This is a PyTorch implementation of A Non-Autoregressive Transformer with... | | Experimental |
| 4858 | DaviBonetto/AETHER-L5-RealTime-Voice-Stream<br>Low-latency bi-directional voice interface with VAD, Whisper STT, and Neural TTS. | | Experimental |
| 4859 | ilya16/isp-tts<br>A simple TTS model developed for the Speech Synthesis and Voice Cloning... | | Experimental |
| 4860 | 1999AZZAR/Telegram-Bot-Playground<br>This repository is a playground for experimenting with several simple... | | Experimental |
| 4861 | RamR3R/InterviewAuto<br>This is an OpenAI-powered interview site where the user can join and take in... | | Experimental |
| 4862 | techieinhouse/chatbot<br>Python ChatterBot using Flask and speech recognition from HTML5 | | Experimental |
| 4863 | dllcnx/tts<br>Android project with offline Yunzhisheng (云之声) integration | | Experimental |
| 4864 | Regaez/datastar-speech<br>A custom Datastar action plugin that leverages the Web Speech API in order... | | Experimental |
| 4865 | RGonza1529/Nura<br>A Full-Stack React/Node.js AI-powered web application that provides... | | Experimental |
| 4866 | ckull/SUKI<br>A Node.js Discord bot | | Experimental |
| 4867 | temp3rr0r/Longsword-Data-MQTT-Publisher<br>Working demo: https://www.youtube.com/watch?v=v7hvOyPQ0EM. The main IoT app.... | | Experimental |
| 4868 | xi-Rick/captains-log<br>A voice transcription and logging web app built with TypeScript, Captain's... | | Experimental |
| 4869 | rupin/WrittenAudio<br>Written Audio uses the Google Text to Speech engine and a configuration file to... | | Experimental |
| 4870 | algorithmio/accent-conversion-ai<br>Real-time accent conversion during phone calls using Twilio, Deepgram, and... | | Experimental |
| 4871 | etornam45/mmt-jepa<br>Using the JEPA architecture for multimodal language translation | | Experimental |
| 4872 | dhia-gharsallaoui/go-elevenlabs<br>Production-ready Go client library for the ElevenLabs Text-to-Speech API with... | | Experimental |
| 4873 | Aprataksh/Python-Files<br>mic_py: Python 3 code for successful use of a microphone on Windows.... | | Experimental |
| 4874 | Sneakyhydra/Sentinel<br>Voice Assistant using Whisper in Python 3 | | Experimental |
| 4875 | ahmedoubadi/kokoro-tts<br>Open-source Kokoro-TTS API server (FastAPI) and web UI (React) for... | | Experimental |
| 4876 | sky-flutter/Python-Jarvis<br>Voice-based assistant for automating tasks | | Experimental |
| 4877 | Shibli-Nomani/Open-Source-Models-with-Hugging-Face<br>Open Source Models With Hugging Face | | Experimental |
| 4878 | openvoicepacks/openvoicepacks<br>Generate and customize complete voice packs for OpenTX and EdgeTX radios. | | Experimental |
| 4879 | MotivationalSpeechSynthesis/motivational-speech-synthesis<br>Artistic research deconstructing the performative excess of motivational... | | Experimental |
| 4880 | icosane/alstroemeria<br>Create and translate subtitles for any video, complete with voiceover capabilities. | | Experimental |
| 4881 | aditeyabaral/natural-language-database-querying<br>A novel approach to data retrieval from tagged databases using only natural... | | Experimental |
| 4882 | LiBinZyu/VAI<br>Implement highly precise natural language voice control in any Unity... | | Experimental |
| 4883 | jaypinho/transcript-accuracy<br>A Streamlit app to evaluate the accuracy of automatic speech recognition... | | Experimental |
| 4884 | kiy0ni/auto-video-editor<br>A Python (Tkinter) tool that automatically generates highlights and... | | Experimental |
| 4885 | shaharpit809/Deep-Learning-Models<br>This repository consists of applications of deep learning models like DNN,... | | Experimental |
| 4886 | v-aibha-v-jain/VA-task-executor<br>A desktop voice assistant system that can execute commands such as opening URLs and apps. | | Experimental |
| 4887 | inforkgodara/python-speech-to-text<br>A few lines of code which convert speech to text. | | Experimental |
| 4888 | brailcom/speechd-java<br>Java client library for Speech Dispatcher | | Experimental |
| 4889 | anubhav-n-mishra/xtts-api<br>Production-ready Text-to-Speech API with XTTS-v2, voice cloning,... | | Experimental |
| 4890 | Lostenergydrink/styletts2-dataset-toolkit<br>Complete Windows-optimized workflow for voice cloning with StyleTTS2.... | | Experimental |
| 4891 | Akash-Apturkar/Sentiment-Analysis-of-speech-using-NLP-with-Android-Connect-feature-and-web-scraping<br>We aim to develop a ‘Smart Speech Ecosystem’ that takes audio input,... | | Experimental |
| 4892 | ChrisRobinT/realtime-translation<br>Real-time WebRTC voice translation using Whisper STT, Azure Translate, and... | | Experimental |
| 4893 | sujalrajpoot/openai-tts<br>A powerful and easy-to-use Python library for generating natural-sounding... | | Experimental |
| 4894 | sedoglia/video-translator<br>Desktop application that automatically translates video audio to any... | | Experimental |
| 4895 | woofie/woof<br>AR Unity virtual pet app that recognises voice commands, performs NLP on... | | Experimental |
| 4896 | elvanselvano/streamlit-whisper<br>Empowering the visually impaired with equal financial access through... | | Experimental |
| 4897 | tsensei/WTF-Does-This-Company-Actually-Do<br>Paste any URL. AI scrapes the website, cross-references reviews & funding... | | Experimental |
| 4898 | blastheart1/voice-ai-braincx<br>🎤 Real-time voice AI conversational agent with LiveKit, FastAPI & React.... | | Experimental |
| 4899 | eauchs/speech-to-speech-pipeline<br>A real-time, interruptible (barge-in) conversational AI pipeline... | | Experimental |
| 4900 | NitinN77/ASL-To-Speech-Rpi<br>A pi setup to recognize ASL signs using a pre-trained CNN model and speak it... | | Experimental |