All Voice AI Tools
6,981 tools ranked by quality score · Page 61 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 6001 | msalhab96/Listen-Attend-and-Spell<br>PyTorch implementation of the Listen, Attend and Spell (LAS) speech recognition paper | | Experimental |
| 6002 | tuanio/conformer-rnnt<br>Conformer RNN-Transducer | | Experimental |
| 6003 | zyascend/End-to-End-Speech-Recognition-Learning<br>ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition | | Experimental |
| 6004 | upskyy/RNN-Transducer<br>PyTorch Implementation of RNN-Transducer | | Experimental |
| 6005 | khaykingleb/automatic-speech-recognition<br>QuartzNet and DeepSpeech implementation for ASR | | Experimental |
| 6006 | avrtt/MoE-speech-recognition<br>Mixture of experts architecture for speech-to-text and language... | | Experimental |
| 6007 | yandex-cloud-examples/yc-speechkit-async-recognizer<br>SpeechKit Asynchronous Batch Recognizer. | | Experimental |
| 6008 | markus-m-u-e-l-l-e-r/CTC.ISL<br>ISL Speech Recognition Toolkit for training neural networks with the CTC... | | Experimental |
| 6009 | SrujanHR/Happy-AI-Voice-Assistant<br>Happy is a Python-based personal voice assistant for Windows. It responds to... | | Experimental |
| 6010 | yehuohan/ln-asr<br>Automatic Speech Recognition | | Experimental |
| 6011 | Omitg24/IIS-ASR<br>Repository for Systems and Network Administration (Administración de Sistemas y Redes, ASR), a course of the... | | Experimental |
| 6012 | subuhana2303/VaaniRakshak_Offline-Emergency-Voice-Assistant<br>VaaniRakshak is an offline voice assistant built for disaster scenarios,... | | Experimental |
| 6013 | sofiahernandes/speech-sci-calculator<br>A smart scientific calculator app with speech recognition, built in Python... | | Experimental |
| 6014 | AathifZahir/WhisprSplit<br>A powerful, local speech-to-text transcription system that combines OpenAI's... | | Experimental |
| 6015 | DanteVela/Python-Voice-Assistant<br>A repository of a speech-driven virtual assistant powered by Speech... | | Experimental |
| 6016 | Brooklyn-Dev/Ultron-AI<br>Voice-controlled AI gaming assistant for Marvel Rivals. | | Experimental |
| 6017 | Manan-49/SRT-GENERATOR<br>Offline desktop application for generating accurate subtitles (SRT) from... | | Experimental |
| 6018 | asiff00/TTS-Training-Blueprint<br>Intuitive understanding of Autoregressive TTS Models | | Experimental |
| 6019 | brandonviaje/echo<br>Voice assistant Discord bot | | Experimental |
| 6020 | Clats97/ClatScribe<br>ClatScribe is a speech-to-text tool that captures real-time audio,... | | Experimental |
| 6021 | zayedalbloushi/AI-Transcription<br>Stream audio from the browser, transcribe it in real time, and get live... | | Experimental |
| 6022 | msadeqsirjani/SubtitleGenerator<br>🎬 AI-powered subtitle generator using OpenAI Whisper. Multi-language... | | Experimental |
| 6023 | tuannho0802/PDFvert-TextToSpeech<br>A web-based application for seamless PDF/DOCX conversion and natural... | | Experimental |
| 6024 | MrFlapstaart/GameOCRTTS<br>Speak out text balloons in games without voice acting to use OCR on the... | | Experimental |
| 6025 | taeefnajib/Aximos<br>Aximos is an innovative AI-powered tool that transforms your content into... | | Experimental |
| 6026 | noAbbreviation/approxima<br>A command line program to loudly tell time (in chunks of 5 minutes). | | Experimental |
| 6027 | LiZeC123/legado-tts-tencent<br>Tencent TTS for Legado Reader: a TTS service for Legado (open-source reader) based on the Tencent speech synthesis API. | | Experimental |
| 6028 | Aavache/pdf2speech<br>Reading PDF files and converting them to audio tracks. | | Experimental |
| 6029 | 10809104/taigi-speech-to-text<br>Taiwanese (Taigi) speech-to-text training dataset. Data source: the Ministry of Education's Dictionary of Frequently-Used Taiwan Minnan. | | Experimental |
| 6030 | benda1989/qwen3-tts<br>qwen3-tts train multi-speaker emotion control | | Experimental |
| 6031 | Prathuvj/spectrolingua<br>🎵 Audio Processing Studio - A comprehensive Django API with Streamlit... | | Experimental |
| 6032 | alam025/AI-voice-assistant-with-RAG-powered-customer-support<br>Enterprise-grade AI voice assistant with RAG-powered customer support,... | | Experimental |
| 6033 | PedritoGMG/GMG-FunMenu<br>Client-side commands for microphone interactions, sound effects, and more,... | | Experimental |
| 6034 | shaikhsaif72/Jarvis-Voice-Assistant<br>A voice-activated virtual assistant using Python and OpenAI. | | Experimental |
| 6035 | yigitaliayyildiz/SmartSEE<br>Android object detection app using YOLOv8 (TFLite) with Turkish TTS feedback. | | Experimental |
| 6036 | AapseMatlb/Pickasso-Speech<br>Speech Interaction Subsystem for Pickasso Autonomous Robot Enables wake word... | | Experimental |
| 6037 | Swathi-88/JARVIS-AI<br>A voice-controlled desktop AI assistant for Windows featuring OpenAI... | | Experimental |
| 6038 | AbhaySingh71/Multimodal-Agentic-Assistant-Clara<br>Clara: An agentic multimodal AI assistant that can see through your webcam,... | | Experimental |
| 6039 | isbendiyarovanezrin/SpeechDetection<br>Speech Detection 💬 | | Experimental |
| 6040 | masonintokyo/voicevox-srt-to-speak<br>Synthesizes speech from SubRip files using the VOICEVOX Engine API so that each line fits within its subtitle time span. | | Experimental |
| 6041 | Madh93/whisper<br>🎙️ My Whisper stuff | | Experimental |
| 6042 | YoungloLee/tf2-speech-recognition-transformer<br>TensorFlow 2 Speech Recognition Code (Transformer) | | Experimental |
| 6043 | jmrashed/ai-desktop-assistant<br>A Python-based AI desktop assistant designed to perform various tasks like... | | Experimental |
| 6044 | dannis999/trained_SpeechRecognition<br>This project backs up a complete Chinese speech recognition environment, including environment configuration and pretrained models, for convenient direct use | | Experimental |
| 6045 | Masihtabaei/reswhis<br>A lightweight, WebSocket-based server for real-time, remote audio... | | Experimental |
| 6046 | MSAbhishek22/Veronica_Chatbot<br>🤖 AI Chatbot with Voice Interface - A Flask web app featuring Groq-powered... | | Experimental |
| 6047 | Hhhpraise/auto-subtitler<br>A Python-based app that generates subtitles, and can also be translated,... | | Experimental |
| 6048 | kevin30205/Media-Transcribe<br>Media Transcribe: Seamlessly generate transcripts from your video and audio... | | Experimental |
| 6049 | wazeerc/voxie<br>Voxie, Let Your Notes Speak | | Experimental |
| 6050 | parula-app/assistant<br>Parula - Digital assistant - Running entirely on your own device | | Experimental |
| 6051 | CSFelix/audio-to-text<br>🔊 Extract Text from Audios 🔊 | | Experimental |
| 6052 | Renamekk/Voice-Assistant<br>A simple and customizable voice assistant written in Python. Supports adding... | | Experimental |
| 6053 | Akshitha0118/Akshitha-Voice-AI-Voice-Powered-YouTube-Assistant<br>An AI-powered Voice Assistant built using Python and Streamlit that listens... | | Experimental |
| 6054 | dudarev/speechdown<br>CLI tool to transcribe your spoken audio notes into timestamped,... | | Experimental |
| 6055 | druellan/ED-AI-Companion<br>A Python script to monitor the Elite Dangerous journal files and provide... | | Experimental |
| 6056 | GlobussBiogestion/text-to-signals-and-voice<br>This API works 100% in HTML with JavaScript so it is very light and easy to... | | Experimental |
| 6057 | jetfontanilla/browser-text-to-speech<br>A demo of what a browser is currently capable of in text-to-speech | | Experimental |
| 6058 | passion-27/openai-whisper-api<br>A sample speech transcription app implementing OpenAI Text to Speech API... | | Experimental |
| 6059 | 13shivam/yt-agent<br>Offline-friendly backend POC to transcribe YouTube videos and chat with... | | Experimental |
| 6060 | Eng-M-Abdrabbou/Sonix<br>A high-speed speech processing engine that captures and converts spoken... | | Experimental |
| 6061 | kbhujbal/J.A.R.V.I.S-AI-Assistant<br>🤖 Voice-controlled AI assistant with speech recognition, Wikipedia search,... | | Experimental |
| 6062 | mavleo96/whisper-accent<br>Conditioning via Adaptive Layer Norm for accented speech recognition | | Experimental |
| 6063 | 5ekastanx/Text-To-Speech<br>This Django project allows converting text to audio files and saving... | | Experimental |
| 6064 | Aavtic/ena<br>A video generation program using GIFs. | | Experimental |
| 6065 | sruckh/VibeVoice-finetune-easy<br>Simplified scripts for fine-tuning VibeVoice speech synthesis models with... | | Experimental |
| 6066 | gas/pronunza-tts-galego-onnx-colab<br>Colab notebook for speech synthesis (TTS) in Galician using Celtia's ONNX model | | Experimental |
| 6067 | nmanikiran/ionic-allinone<br>A demo of each feature available in Ionic and ionic-native | | Experimental |
| 6068 | tb0hdan/voiceplay<br>Client-side-first, music-centered, voice-controlled player | | Experimental |
| 6069 | zefie/multi-tts<br>Docker for multiple TTS Engines with a Gradio interface | | Experimental |
| 6070 | Zuellni/XTTS-Server<br>XTTS Server for SillyTavern. | | Experimental |
| 6071 | EllangoK/gpt-voice-companion<br>Small, simple chatbot using GPT and ElevenLabs TTS | | Experimental |
| 6072 | vpakarinen2/text-voice-chatterbox<br>Text-to-speech and voice cloning using Chatterbox Turbo. | | Experimental |
| 6073 | ExplainableML/ZerAuCap<br>[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning... | | Experimental |
| 6074 | jmaczan/asr-dysarthria<br>Research on Automatic Speech Recognition for dysarthric speech | | Experimental |
| 6075 | Temerold/TobsTTS<br>Text to speech, Python 3.7. Swedish and English. bye | | Experimental |
| 6076 | SSusantAchary/AI_Resources<br>A collection of a few interesting papers and projects I have read | | Experimental |
| 6077 | ryanp3343/LiveScreenTranslator<br>LiveScreenTranslator utilizes OCR and translation services to provide... | | Experimental |
| 6078 | Vidyut/vidyut-tts<br>Streamlit frontend for Coqui-tts | | Experimental |
| 6079 | twers1/telegram-bot-audio<br>Telegram bot text-to-speech and speech-to-text | | Experimental |
| 6080 | khaykingleb/research-playground<br>Efficient ML/DL implementations across multiple domains with K3s multi-node... | | Experimental |
| 6081 | michaelmior/ha-silero<br>Text-to-speech for Home Assistant using Silero | | Experimental |
| 6082 | CaydendW/Cashew<br>A Python-based virtual assistant | | Experimental |
| 6083 | kuanyshbakytuly/camera-text-speech<br>Blind Text-Assistance | | Experimental |
| 6084 | kunal2812/Programmophone<br>It is a tool to program with speech and is intended to be used by sightless... | | Experimental |
| 6085 | Joyeah/videomaker<br>Batch-generate videos from images | | Experimental |
| 6086 | lingualogic/speech-react<br>Speech-React SDK | | Experimental |
| 6087 | TejasQ/react-praise<br>A React binding for Praise. | | Experimental |
| 6088 | ponchotitlan/google_text-to-speech_prompt_maker<br>Utility for Google Text-To-Speech batch audio files generator. Ideal for... | | Experimental |
| 6089 | willwade/TTS-Dataset<br>A workflow to create a dataset of all TTS voices/languages available on... | | Experimental |
| 6090 | kaka-lin/rpi-voice-kit-app<br>An app to control Voice Kit (smart speaker) | | Experimental |
| 6091 | Rumeysakeskin/Speech-Datasets-for-ASR<br>Download speech datasets (English and non-English) for Automatic Speech Recognition | | Experimental |
| 6092 | arjunbazinga/speak<br>Select any text and have it read out loud | | Experimental |
| 6093 | OVOSHatchery/ovos-tts-plugin-responsivevoice<br>ResponsiveVoice TTS plugin for Mycroft | | Experimental |
| 6094 | koth/kokoro.cpp<br>Kokoro TTS in C++ | | Experimental |
| 6095 | Erio-Harrison/kokorotts_service<br>A TTS service that deploys Kokoro model inference | | Experimental |
| 6096 | robauto/bibli3.0<br>BiBli 3.0 for Raspberry Pi - Swarm Robotics and IoT Operating System - AI -... | | Experimental |
| 6097 | Thukyd/OpenAI-Spechify-Your-Docs<br>OpenAI-Spechify-Your-Docs is a Python project that converts text from... | | Experimental |
| 6098 | ReadieFur/Stream-Tools<br>A stream chat tool that features AWS text to speech, voice commands, chat... | | Experimental |
| 6099 | zguesmi/image2speech<br>Ethereum ready Dapp to speak your images. | | Experimental |
| 6100 | PeterTakahashi/openai-tts<br>OpenAI Text to Speech | | Experimental |