All Voice AI Tools
6,981 tools ranked by quality score · Page 34 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 3301 |
tongjinle123/speech-transformer-pytorch_lightning
ASR project with pytorch-lightning |
|
Experimental |
| 3302 |
leaxer-ai/leaxer-qwen3-tts
C++ implementation of Qwen3-TTS running on top of ONNX Runtime. |
|
Experimental |
| 3303 |
AppleHolic/FastSpeech2
Refactored version of https://github.com/ming024/FastSpeech2 |
|
Experimental |
| 3304 |
ictnlp/ComSpeech
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct... |
|
Experimental |
| 3305 |
synesthesiam/pt-synesthesiam
CMU Sphinx acoustic model for Portugese (pt-br) |
|
Experimental |
| 3306 |
angelinekeke/claude-awake-speak
让你的 Claude Code 会说话 — 自动语音朗读中文内容,8种微软官方音色可选,实时切换,免费无需API Key,跨平台支持 |
|
Experimental |
| 3307 |
hyperloop-modules/titanium-speech
Use the iOS 10 SFSpeechRecognizer API in JavaScript with Appcelerator Hyperloop. |
|
Experimental |
| 3308 |
umjammer/vavi-speech2
🗣 Java Text to Speech (JSAPI2) engines (google cloud, cocoa, open jtalk,... |
|
Experimental |
| 3309 |
Niger-Volta-LTI/urhobo-asr-spoken-digits
URH-DIGITS is a connected digits speech recognition task |
|
Experimental |
| 3310 |
jonsafari/buckeye_dict
Buckeye Pronunciation Dictionary |
|
Experimental |
| 3311 |
AI-TOOLKIT/VoiceBridgeProjects
Example projects for VoiceBridge - an AI-TOOLKIT Open Source C++ Speech... |
|
Experimental |
| 3312 |
BBC-Esq/Elegant-Audio-Transcriber
Extremely fast and accurate audio transcrbier surpassing Whisper. Optimized... |
|
Experimental |
| 3313 |
vectominist/rspin
Official inference code for NAACL 2024 paper "R-Spin: Efficient Speaker and... |
|
Experimental |
| 3314 |
Alan-6666/chinese_asr
a demo of chinese asr |
|
Experimental |
| 3315 |
speechly/api
Speechly public API definitions and generated code |
|
Experimental |
| 3316 |
dyankov91/a2pod
Convert articles into podcast-quality audio on Apple Silicon. Local TTS, LLM... |
|
Experimental |
| 3317 |
amolgorithm/speech-gpt
What if ChatGPT had its own voice? What if you could speak to it with your... |
|
Experimental |
| 3318 |
deepgram-starters/django-text-to-speech
Get started using Deepgram's Transcription with this Django demo app |
|
Experimental |
| 3319 |
ansh-info/SpeechSense
This powerful toolkit combines real-time speech recognition with NLP to... |
|
Experimental |
| 3320 |
Flux9665/ArticulatoryTextFrontend
This is a text-processing frontend that converts graphemes to phonemes and... |
|
Experimental |
| 3321 |
lifeiteng/Rabbit
Explore Text-To-Speech |
|
Experimental |
| 3322 |
neurlang/whipstr
Whipstr ASR/STT System |
|
Experimental |
| 3323 |
marvinborner/CTC-LSTM
Spoken word recognition using CTC LSTMs for SWR2 Tübingen |
|
Experimental |
| 3324 |
probablyagoodusername/vesper
Therapeutic audio pipeline. Faith meets science. Free, static, open source. |
|
Experimental |
| 3325 |
xDoritox/Voice-Clone-Studio
🔊 Clone and design voices easily with Voice Clone Studio, a web UI powered... |
|
Experimental |
| 3326 |
vishalnagda1/text-to-speech
Python program to convert text to speech. |
|
Experimental |
| 3327 |
cniweb/podcast_generator
Vollautomatisierter Podcast-Generator: Erstellt komplette Episoden (Audio &... |
|
Experimental |
| 3328 |
myned-ai/interactive-website-navigator
An interactive 3D Gaussian Splatting avatar that guides website visitors... |
|
Experimental |
| 3329 |
rust-han/han-speech
汉语发音系统 |
|
Experimental |
| 3330 |
smcantab/speak11
Select text, press ⌥⇧/, hear it read aloud. macOS text-to-speech powered by... |
|
Experimental |
| 3331 |
loglux/FlexAudioPrint
FlexAudioPrint is a Python-based app for transcribing audio to text using... |
|
Experimental |
| 3332 |
visu123s/MimicKit
🤖 Learn motion imitation with MimicKit, a framework offering advanced... |
|
Experimental |
| 3333 |
slanglabs-projects/asr-wer-bench
Workbench for benchmarking Word Error Rate (WER) of Automatic Speech... |
|
Experimental |
| 3334 |
baochuquan/ios-vad
iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN,... |
|
Experimental |
| 3335 |
mpoyraz/ngram-lm-wiki
Scripts to train a n-gram language models on Wikipedia articles |
|
Experimental |
| 3336 |
awetomate/text-to-speech-streamlit
Text-to-Speech solution using Google's Cloud TTS API and a Streamlit front end |
|
Experimental |
| 3337 |
idiap/TIDIGITSRecipe.jl
A Julia recipe for training an ASR system using the TIDIGITS database |
|
Experimental |
| 3338 |
codassassin/voice-assistant
This is a very simple CLI based voice assistant which does various work on... |
|
Experimental |
| 3339 |
linto-ai/linto-punctuation
LinTO Platform punctuation service. |
|
Experimental |
| 3340 |
Ryan5453/lyricscribe
Automated Lyric Transcription Research |
|
Experimental |
| 3341 |
good-boy01/Quki
A virtual assistant that helps with everyday tasks , Quki is still in the... |
|
Experimental |
| 3342 |
HawksLab/narratify
e-book to audiobook convertor |
|
Experimental |
| 3343 |
andrew-fennell/CogNative
Translated vocal synthesis - Clone a voice and output speech in another language |
|
Experimental |
| 3344 |
findtharun/Railway_bot
Interactive Railway Reservation - BuildIng a ChatBot for a railway... |
|
Experimental |
| 3345 |
guglielmocamporese/learning_invariances_in_speech_recognition
In this work I investigate the speech command task developing and analyzing... |
|
Experimental |
| 3346 |
SoCXin/ASR1606
L4 R2: ASR 624MHz Cortex-R5 Cat.1 SoC (ASR1606/ASR1602) |
|
Experimental |
| 3347 |
SoCXin/ASR1601
L4 R3: ASR Cortex-R5 LTE Cat.1 SoC (ASR1601/ASR1603/ASR3601) |
|
Experimental |
| 3348 |
benank/bento
Your AI cooking companion🍱 Utilizes OpenAI's ChatGPT & Whisper APIs +... |
|
Experimental |
| 3349 |
dacson/Demo-of-Text-to-Speech-based-on-Deep-Learning
text to speech for mandarin, |
|
Experimental |
| 3350 |
WindowsNT/SpeechRec
Continuous Dictation Speech Recognition and Speech Synthesis in Win32 |
|
Experimental |
| 3351 |
mklement0/voices
macOS CLI for changing the default TTS (text-to-speech) voice and printing... |
|
Experimental |
| 3352 |
motelian/NutriSmart
NutriSmart is an AI-based calorie and macro tracking app equipped with NLP... |
|
Experimental |
| 3353 |
burrmill/sph2pipe
sph2pipe v2.5. We do not maintain this, and/or accept pull requests; just... |
|
Experimental |
| 3354 |
MItCHeLPL/Discord-AISupBOT
Discord AI Chat Bot with GPT-3 |
|
Experimental |
| 3355 |
mc095/luma
Personal Voice Agentic AI powered by Agno |
|
Experimental |
| 3356 |
mayank-kumar-giri/Speech-Recognizer-cum-Voice-Typing-Editor
Speech Recognizer cum text editor that facilitates voice typing using Google... |
|
Experimental |
| 3357 |
baocin/hugging_face_example_STT_api
Demonstration of Hugging Face's (https://huggingface.co/) newly released... |
|
Experimental |
| 3358 |
Scisaga/qwen3-asr-openai
自托管 ASR 推理服务 |
|
Experimental |
| 3359 |
aallaguly01/Diplom
Multimodal Python framework for hand gesture and voice control with cursor... |
|
Experimental |
| 3360 |
jxlarrea/homeassistant-voice-recipes
GPU/CUDA-accelerated voice control stack for Home Assistant. Runs on x86/x64... |
|
Experimental |
| 3361 |
Jopex1/real-time-voice-translator
🌍 Capture speech, translate it instantly, and playback audio in a selected... |
|
Experimental |
| 3362 |
HyxiaoGe/ai-audio-assistant-ui
面向音视频内容理解的 AI 助手,支持上传与 YouTube 链接, ASR 转写、结构化摘要与实时进度。 |
|
Experimental |
| 3363 |
prakharjadaun/Voice-Assistant
Created a Voice Assistant with the help of pyttsx3 library. Also, I have... |
|
Experimental |
| 3364 |
aidayang/Faster-whisper-OneClick
Faster-whisper一键启动整合包带GUI界面 |
|
Experimental |
| 3365 |
persanix-llc/chatrpi-app
Chat for Raspberry Pi (Chatrpi) is a voice assistant for the Raspberry Pi... |
|
Experimental |
| 3366 |
mariangle/taskify
AI powered task manager app with speech recognition, twitter-like input... |
|
Experimental |
| 3367 |
goodmike31/pl-asr-speech-data-survey
Survey of available speech datasets for Polish ASR development |
|
Experimental |
| 3368 |
Michaelrace/awesome-voice-agents
🗣️ Explore a curated list of voice AI agents, frameworks, tools, and best... |
|
Experimental |
| 3369 |
Rishabh1925/voiceforge
AI-powered voice automation platform with text-to-speech and automated... |
|
Experimental |
| 3370 |
cycle-sync-ai/livekit-voice-ai-agent-setup
This is the guide to show the method to build your own AI-Powered voice... |
|
Experimental |
| 3371 |
ayutaz/openjtalk-native
Cross-platform OpenJTalk native shared library — Japanese text-to-phoneme C... |
|
Experimental |
| 3372 |
Userdev1213/h3xassist
🤖 Automate your online meetings with H3xAssist to record, transcribe, and... |
|
Experimental |
| 3373 |
vpdl-sys/vpdl-public
Proprietary AI Voice Script Writer for turning written text into natural,... |
|
Experimental |
| 3374 |
pkprajapati7402/Darvin-Chatbot
Darvin is a Python-based voice-activated chatbot that interacts with users... |
|
Experimental |
| 3375 |
SethiPawandeep/kaldi-for-dummies
This is the repository for my version of Kaldi for Dummies example. |
|
Experimental |
| 3376 |
yuhanwang14/ASR-Pipeline
Local GPU-accelerated speech transcription pipeline with speaker diarization... |
|
Experimental |
| 3377 |
tfm000/diana
Locally hosted Text-to-Speech Document Converter |
|
Experimental |
| 3378 |
charlescao460/SpeechRecognitionByGoogleCloud
A .NET program that captures local audio and recognizes speech |
|
Experimental |
| 3379 |
Sariel2018/audio-srt-aligner
Dual-mode subtitle tool: transcript-aware alignment and audio-only auto... |
|
Experimental |
| 3380 |
kofemann/streetguide
An Android app to discover where you drive |
|
Experimental |
| 3381 |
techiaith/seilwaith
Offer hwyluso creu Adnabod Lleferydd Cymraeg gyda HTK, IRSTLM, Julius a... |
|
Experimental |
| 3382 |
theawless/sr-lib
Automatic Speech Recognition library for my BTech Project. |
|
Experimental |
| 3383 |
coo-quack/talkative-lobster
Desktop voice conversation app — speak into your mic and an AI responds out loud |
|
Experimental |
| 3384 |
deepgram/deepgram-js-captions
This package is the JavaScript implementation of Deepgram's WebVTT and SRT... |
|
Experimental |
| 3385 |
HxnDev/Convert-Text-To-Speech
This project uses Google Text to Speech to convert the written text into any... |
|
Experimental |
| 3386 |
Phe0nix/Speech-Email-Sender
Send email with speech recognition means just start talking and send emails.... |
|
Experimental |
| 3387 |
tongplw/ASR-web-based-restaurant
🍔 Foody, a smart voice-assistant web-based restaurant using Kaldi, React, and WebRTC |
|
Experimental |
| 3388 |
ldl805/QuickSpeechPi
Very, very lightweight and simple text to speech (TTS) program that outputs... |
|
Experimental |
| 3389 |
NEURASCOPE/neurascreen
Automate product tour videos with JSON scenarios. Real browser recording, AI... |
|
Experimental |
| 3390 |
aks-devs/mod_whisper_asr
Freeswitch ASR module |
|
Experimental |
| 3391 |
Anthonyiswhy/blind_navigation_aid
Raspberry Pi + ESP32 system for blind assistance using LiDAR, OpenCV, YOLO,... |
|
Experimental |
| 3392 |
pystorage/pyspeechkit
Library for working with a range of technologies for speech recognition and... |
|
Experimental |
| 3393 |
Jithsaavvy/Deploying-an-end-to-end-keyword-spotting-model-into-cloud-server-by-integrating-CI-CD-pipeline
The project is a concoction of research (audio signal processing, keyword... |
|
Experimental |
| 3394 |
Bashvalencia724/xiaomusic
🎶 Stream music effortlessly with XiaoMusic, enhancing your Xiao AI speaker... |
|
Experimental |
| 3395 |
Kaljurand/net-speech-api
Java API for the online speech recognition services provided by phon.ioc.ee |
|
Experimental |
| 3396 |
ParthPipermintwala/Personal-Assistant
🎙 Voice-controlled AI desktop assistant built with Python. Supports voice... |
|
Experimental |
| 3397 |
nikhilkumarsingh/Wit-Speech-API-Wrapper
A python client for interacting with Wit Speech Recognition API |
|
Experimental |
| 3398 |
mahimairaja/vapiserve
A to Z Vapi's custom tools | All your voice agent needs |
|
Experimental |
| 3399 |
abhaymathur21/Aura
A Personal Voice Assistant that performs a multitude of tasks for you on... |
|
Experimental |
| 3400 |
alefiury/SE-R-2022-SER-Track
Code for the winning solution in the SE&R 2022 Challenge - SER track. |
|
Experimental |