All Voice AI Tools
6,981 tools ranked by quality score · Page 65 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 6401 |
praveen0777/Multimodal-AI-Assistant-with-Face-Recognition-Emotion-Analysis-GPT-Based-and-Object-Detection-YOLOv8
The project introduces Maya, Multimodal AI Assistant with Face Recognition,... |
|
Experimental |
| 6402 |
kamyusama/kamyu-case-study
Twitch→LLM→TTS→字幕の最小デモ(非商用のケーススタディ)/ Minimal Twitch→LLM→TTS→captions demo... |
|
Experimental |
| 6403 |
KoljaB/ai_cli_tools
AI at your fingertips: powerful CLI tools for speech, text, and language processing |
|
Experimental |
| 6404 |
echogarden-project/macos-native-tts
Node.js binding to the macOS native text-to-speech API (AVSpeechSynthesizer). |
|
Experimental |
| 6405 |
AcTePuKc/bg_g2p_builder
Tools to generate a standardized Bulgarian IPA lexicon for Text-to-Speech... |
|
Experimental |
| 6406 |
eddiedunn/transcribe
[DEPRECATED — superseded by diarized_transcriber] Audio-to-text... |
|
Experimental |
| 6407 |
rounayak/Virtual-assistant
Python based virtual assistant that can understand speech,respond via speech... |
|
Experimental |
| 6408 |
sheng1111/text2srt_tts
Convert text to speech and auto-generate SRT subtitles. A CLI tool for... |
|
Experimental |
| 6409 |
JessicaMulein/asterisk-custom-poly-sounds
Regenerated Asterisk Sounds using Amazon Poly library. Salli voice. |
|
Experimental |
| 6410 |
dfgHiatus/TTSLib
A modular text to speech framework for C# |
|
Experimental |
| 6411 |
A35G/gmCaptcha
A small and simple Proof-of-Concept of Captcha (graphical and mathematical)... |
|
Experimental |
| 6412 |
funkyfranky/TextToSpeechListener
UDP Client that listens for text messages and converts it to speech. |
|
Experimental |
| 6413 |
alexFrankfurt/srt-to-audio-windows-tts
Transform audio to subtitles to timed audio from foreign language to... |
|
Experimental |
| 6414 |
Sgvkamalakar/Translate_Speak
A Flask-based web application that lets you translate text between languages... |
|
Experimental |
| 6415 |
balzhinimaev/translator-app
Telegram Web App для перевода речи в реальном времени с поддержкой двух... |
|
Experimental |
| 6416 |
tagoWorks/pharmor
An easy to use tool to crop videos to 16:9, overlay images, and extract text... |
|
Experimental |
| 6417 |
njuh0/MacDeepTranscriber
Transcriber for MacOS |
|
Experimental |
| 6418 |
amitybell/pikatts
Pika TTS is a small, local text to speech voice synthesizer package based on... |
|
Experimental |
| 6419 |
ameyasutar1/ManagerAI
An AI-driven Talking Manager: a voice-interactive chatbot designed to... |
|
Experimental |
| 6420 |
DrAchernar/tensorflutter
Flutter tflite example |
|
Experimental |
| 6421 |
hesic73/SpeechSynthSubs
Convert text to synchronized speech and subtitles using Google's Text-to-Speech API. |
|
Experimental |
| 6422 |
pigmilcom/google-tts
Google Text-To-Speech |
|
Experimental |
| 6423 |
TJ-Neary/TommyTalker-Pro
Privacy-first voice-to-text for macOS — local STT via mlx-whisper with... |
|
Experimental |
| 6424 |
walid-hamdi/fluener_ai-service
FastAPI AI microservice for language learning - Provides speech-to-text... |
|
Experimental |
| 6425 |
maycondata/apontamento-op-por-voz
Apontamento de produção por voz (Whisper STT + gTTS) com confirmação e... |
|
Experimental |
| 6426 |
BurtTheCoder/aida
AI Digital Assistant |
|
Experimental |
| 6427 |
thiagogre/mimicking
English Pronunciation Improvement App |
|
Experimental |
| 6428 |
bealopc/AudioNotebookLM
Herramientas educativas en Python para generar diálogos a dos voces con... |
|
Experimental |
| 6429 |
Harshit2012/VocalTexter
VocalTexter is a simple and user-friendly web application that enables users... |
|
Experimental |
| 6430 |
SH0-ahacker/bot_SCRO_1011
Woah!, this is very cool! |
|
Experimental |
| 6431 |
FanaticExplorer/SayClip
[Coming soon] A Python app that converts spoken words into text, making it... |
|
Experimental |
| 6432 |
DottedAnt-Dooz/voi-to-voi
Small Framework for linking different AI models to create a Voice-To-Voice interface. |
|
Experimental |
| 6433 |
ahammedrohit/Speech-Recognition-using-wav2vec2-with-minimum-GPU
Python Colab for speech recognition with wav2vec2. Since wav2vec2 requires... |
|
Experimental |
| 6434 |
den0011/SpeechRecognition
Проект представляет собой десктопное приложение для распознавания речи с... |
|
Experimental |
| 6435 |
polojudayamani-crypto/Shashitha-voice-assistant
Python voice assistant project |
|
Experimental |
| 6436 |
nithin-k-shine/text-to-speech-using-Arduino
A health monitoring project to convert the temperature, pulse rate and blood... |
|
Experimental |
| 6437 |
Bhuvaneshw/VoiceDictionary
This is an offline voice based dictionary app for Android. It accepts voice... |
|
Experimental |
| 6438 |
kshibarn/SpeechNotes-Voice-to-Text-Tool
https://speechnotes-voice-to-text.herokuapp.com/ |
|
Experimental |
| 6439 |
chicogong/translate_eval
Translation evaluation tool - LLM-based multilingual translation platform... |
|
Experimental |
| 6440 |
Priyanshu-Yadav19/Call-Voice-Agent
Real-time AI Voice Agent using Streaming STT, LLM-based conversation... |
|
Experimental |
| 6441 |
bivex/whisper-large-v3-turbo
Whisper Large V3 Turbo - fast speech-to-text model implementation with... |
|
Experimental |
| 6442 |
TiagoRueda/TTS_ESP32_IDF
Text-to-Speech (TTS) com ESP32 usando FreeRTOS e Flite |
|
Experimental |
| 6443 |
NullEnt1ty/GoCloudTTS
Translate text to speech using Google Cloud on the command line |
|
Experimental |
| 6444 |
lawitschka/ex_aws_polly
ExAws module for AWS Polly Text-to-Speech |
|
Experimental |
| 6445 |
revolunet/whatever-tts
return MP3 audio as a stream from given text |
|
Experimental |
| 6446 |
aks-arise1600/Text2Speech
Text2Speech is an Qt based UI application that converts written text into... |
|
Experimental |
| 6447 |
kdahlhaus/pyflite
A quick, poorly made, python 3 wrapper for the CMU Flite speech synthesis engine |
|
Experimental |
| 6448 |
IJCS/Trainer-app
A lightweight and highly flexible tool designed to assist coaches.... |
|
Experimental |
| 6449 |
shan4usmani/Emotion_Analyzer
This Project is used to detect emotions using python and Deep learning. It... |
|
Experimental |
| 6450 |
AlexLamson/rpi-greeter
Make your Raspberry Pi speak a greeting when you arrive |
|
Experimental |
| 6451 |
Razorwings18/Medieval-Dynasty-Speak-Up
Medieval Dynasty Speak Up gives voices to NPCs in the Medieval Dynasty video... |
|
Experimental |
| 6452 |
DogeCN/Vogen
A simple AI-powered text-to-speech tool with interactive shell |
|
Experimental |
| 6453 |
sruckh/echoTTS-app
OpenAI TTS WebUI client |
|
Experimental |
| 6454 |
JohnSpahr/MyTalkingRon
Talking Tom webapp made with vanilla JavaScript. Uses speech recognition and... |
|
Experimental |
| 6455 |
skyradez/Speech-Recognition-using-Convolutional-Neural-Network
Tutorial on Speech Recognition using Convolutional Neural Network |
|
Experimental |
| 6456 |
miozilla/aifc01d1
aifc01d1 :sheep::writing_hand::speaking_head: : Text from Image to Speech... |
|
Experimental |
| 6457 |
Timmald/Unnatural_Language_Processing
Text to speech app using a phoneme prediction model |
|
Experimental |
| 6458 |
naemazam/text-to-speak
Natural Reader is a professional text to speech program that converts any... |
|
Experimental |
| 6459 |
Yash-Shinde-24/KokoroTTS-API
KokoroTTS-API is a RESTful API that extends the Kokoro-82M TTS model,... |
|
Experimental |
| 6460 |
Pixel4bit/pxd-tts
A simple yet powerful web application for converting text into... |
|
Experimental |
| 6461 |
MichaelCurrin/kokoro-speech-demo
Explore the Kokoro Python library for text to speech output |
|
Experimental |
| 6462 |
acharan-tech-200037/az_bank_livekit_agent
This application is an AI-powered real-time voice agent platform built... |
|
Experimental |
| 6463 |
deanhouseholder/text-to-speech
This script will allow you to synthesize text into an audio file using the... |
|
Experimental |
| 6464 |
karya-inc/AudioGen
Automated multilingual TTS pipeline: CSV in, ElevenLabs audio out, tracked... |
|
Experimental |
| 6465 |
next-build/sarvam-ai-laravel
Laravel package for Sarvam AI integration with speech-to-text,... |
|
Experimental |
| 6466 |
mateogon/Cadence
Cadence: immersive reading pipeline from EPUB to audiobook with synchronized... |
|
Experimental |
| 6467 |
ArshCypherZ/text-to-speech
Text to Speech API using kokoro. |
|
Experimental |
| 6468 |
maverickg59/ottotone
Mac TTS app running on Faster Whisper |
|
Experimental |
| 6469 |
alecproj/microphone-module
Smart Home Microphone Module |
|
Experimental |
| 6470 |
l3lackcurtains/Index_TTS_Backend
🗂️ Index-TTS A cutting-edge Text-to-Speech (TTS) project powered by Python... |
|
Experimental |
| 6471 |
balas-world/kitten-tts-web-demo
Kitten TTS Web Demo showcases the Kitten TTS Nano in your browser—a... |
|
Experimental |
| 6472 |
hongkongkiwi/scoop-elevenlabs-cli
Official Scoop bucket for installing elevenlabs-cli on Windows. |
|
Experimental |
| 6473 |
speakingofdata/LJ2_Corpus
Single speaker, 26,200 transcribed audio recordings, 48 hours total |
|
Experimental |
| 6474 |
Tiger-Tom/GandalfHome
A smart home assistant written in Python. Includes speech recognition,... |
|
Experimental |
| 6475 |
abdullahashfaqvirk/Speech-Translation-Agent
The Speech Translation Agent is a real time application with a Streamlit... |
|
Experimental |
| 6476 |
Aayush9027/COSMOS_VIRTUAL_ASSISTANT
It is an open source accessibility tool created for better usability and... |
|
Experimental |
| 6477 |
SumanKumar5/revrag-voice-agent
Real-time AI voice agent using LiveKit and Deepgram with interruption... |
|
Experimental |
| 6478 |
marlenezw/speech-to-text
Turn any video or audio recording into a written transcript using python |
|
Experimental |
| 6479 |
caraleeqiu/mememeow
Practice English speaking with a carrot cat! Read along with YouTube/TikTok... |
|
Experimental |
| 6480 |
Abood-devo/EV3-Automation
Lego ev3 brick automation |
|
Experimental |
| 6481 |
leo-aa88/joker-bot
An AI written in Python which tells random jokes |
|
Experimental |
| 6482 |
NomanAhmed234/image2speech
Streamlit app that extracts text from images, converts it to speech, and... |
|
Experimental |
| 6483 |
beratuna/speech-to-text
Minimal, production-ready Streamlit app for Turkish + English speech... |
|
Experimental |
| 6484 |
PrajyotDaphal/WolframAlphal
You can search any question about... |
|
Experimental |
| 6485 |
roopesharch/EchoSonic
Built and deployed a full-stack AI text-to-speech platform using FastAPI and... |
|
Experimental |
| 6486 |
shestaya-liniya/accentless
Shape your accent with AI |
|
Experimental |
| 6487 |
Varsha02nats/RealTime-Voice-Agent-Structured-ASR
Production-ready real-time AI voice agent for structured data extraction... |
|
Experimental |
| 6488 |
dwain-barnes/vui-fastapi-server
A OpenAI-compatible Text-to-Speech API server powered by VUI - a small... |
|
Experimental |
| 6489 |
videosdk-community/videosdk-elevenlabs-ai-game-agent
This project integrates VideoSDK, ElevenLabs, and Deepgram APIs to create an... |
|
Experimental |
| 6490 |
bliptron/Google-TTS-Server
A FastAPI server for Google Gemini Text-to-Speech with modern web interface.... |
|
Experimental |
| 6491 |
zeri27/AI-LLP
AI Language Learning Application for Conversational Agents Course TU Delft |
|
Experimental |
| 6492 |
ty70/voicevox-text-to-speech
A Python-based text-to-speech pipeline using the VOICEVOX engine. Includes... |
|
Experimental |
| 6493 |
coco-whisper/Voice-Conversation-Audio-Generation-Platform-TTS-
A self-hosted platform for text-to-speech, voice conversion, and AI audio... |
|
Experimental |
| 6494 |
soumyaranjansia/nltk_speechassistant
Speech Assistant Using Natural Language Tool Kit |
|
Experimental |
| 6495 |
christalphilip/voice-replication
Clone a voice from a single audio clip using Coqui TTS XTTS v2. |
|
Experimental |
| 6496 |
MohdAqdasAsim/Vocentra
Vocentra is your AI-powered language-learning sidekick, built to turn... |
|
Experimental |
| 6497 |
Vatis-Tech/asr-client-js-react-example
How to use Vatis Tech with React. |
|
Experimental |
| 6498 |
astigPree/GuestOCAI
RoomFinder is an offline voice-enabled desktop application that helps users... |
|
Experimental |
| 6499 |
shravya1125/AI-Pirate-Assistant
Voice-first AI assistant with real-time speech input, LLM conversation, and... |
|
Experimental |
| 6500 |
ittia-research/speak
Education oriented TTS inference server |
|
Experimental |