All Voice AI Tools
6,981 tools ranked by quality score · Page 21 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 2001 | MaxMax2016/Grad-TTS-Chinese<br>Huawei Grad-TTS for Chinese | | Emerging |
| 2002 | tabahi/WebSpeechAnalyzer<br>JS speech analyzer for fast speech analysis and labeling | | Emerging |
| 2003 | rapidaai/rapida-go<br>Open-source Golang SDK for Rapida to build real-time, observable Voice AI... | | Emerging |
| 2004 | AcTePuKc/Kokoro-Local-Gui<br>Hyper-fast, local, high-quality TTS based on Kokoro-82M. PySide6 GUI included. | | Emerging |
| 2005 | dsi-icl/do-voice-interaction<br>The goal of this project is to provide a voice assistant to the Data... | | Emerging |
| 2006 | AASHISHAG/DeepSpeech-API<br>The code enables users to use Mozilla's Deep Speech model over the Web Browser. | | Emerging |
| 2007 | ibm-self-serve-assets/Watson-Speech<br>This collection demonstrates how to help you to quickly embed Watson Speech... | | Emerging |
| 2008 | ajaygujja/Kahani-Storytelling-App-For-Children-With-Hearing-Impairment<br>Storytelling App For Children With Hearing Impairment | | Emerging |
| 2009 | thewh1teagle/vad-rs<br>Speech detection using silero vad in Rust | | Emerging |
| 2010 | madzadev/voice-cue<br>📣 Find sentiments, tags, entities, and actions in your voice recordings instantly | | Emerging |
| 2011 | yyaadet/autosrt_page<br>AutoSRT is a macOS app that automatically generates dual language subtitles... | | Emerging |
| 2012 | rryam/SakuraKit<br>Swift SDK for Prototyping AI Speech Generation | | Emerging |
| 2013 | twn39/EdgeTTS.DotNet<br>EdgeTTS.DotNet is a C# (.NET) library that allows you to use Microsoft... | | Emerging |
| 2014 | muhammadGagah/native-speech-generation<br>NVDA add-on for converting text into natural-sounding speech with Google Gemini AI. | | Emerging |
| 2015 | small-cactus/Jarvis-ChatGPT-VoiceAssistant<br>Jarvis powered by GPT-3.5/GPT-4 | | Emerging |
| 2016 | atakanakin/TutunSabri<br>He is not our hero. He is a silent guardian. A watchful protector. | | Emerging |
| 2017 | ywatanabe1989/scitex-notification<br>Give your AI agents a voice — TTS, phone calls, SMS, email, webhooks. One... | | Emerging |
| 2018 | eminemahjoub/pdf-voice-reader<br>"PDF Reader: A Python application for seamless PDF viewing with enhanced... | | Emerging |
| 2019 | rt400/ReversoTTS-HA<br>ReversoTTS component for HomeAssistant | | Emerging |
| 2020 | fquirin/speech-recognition-experiments<br>Experiments to test different speech recognition systems for SEPIA Framework | | Emerging |
| 2021 | eellak/gsoc2019-sphinx<br>Creation of an online Greek mail dictation system, using Sphinx and... | | Emerging |
| 2022 | aishoot/Multi-Hotword_Spotting<br>Won't it be cool to build a speech assistant like Alexa or Siri yourself... | | Emerging |
| 2023 | gheyret/uyghur-asr-ctc<br>Speech Recognition for Uyghur using deep learning | | Emerging |
| 2024 | FlorianEagox/WeeaBlind<br>A program to dub non-English media with modern AI speech synthesis,... | | Emerging |
| 2025 | vdutts7/ai-rapper<br>Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise... | | Emerging |
| 2026 | eellak/gsoc2021-audio-annotation-tool<br>Creation of a multi user audio first annotation tool - GSoC 2021 | | Emerging |
| 2027 | Harshit-shrivastav/TikTok-TTS-Bot<br>A Python TikTok text-to-speech generator Telegram bot. | | Emerging |
| 2028 | vroomai/vst<br>🎹 Generate sounds from words. Directly in your DAW. | | Emerging |
| 2029 | 0xPD33/sonori<br>Sonori is a fully local STT app for Linux (Wayland). | | Emerging |
| 2030 | Gust4voSales/Marvin-VirtualAssistent<br>A dynamic virtual assistant made with Python, you can easily add more voice... | | Emerging |
| 2031 | shawnrushefsky/talky-talky<br>MCP server for Audio Generation and Analysis with a Variety of Open Models. | | Emerging |
| 2032 | mramshaw/Speech-Recognition<br>Speech recognition with Python | | Emerging |
| 2033 | xinjli/ucla-phonetic-corpus<br>Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH... | | Emerging |
| 2034 | akku2005/VocalInk<br>Next-gen open-source voice-to-blog platform with AI, TTS, gamification, and... | | Emerging |
| 2035 | lesleyrs/clipboard-narrator<br>Turn any web page into an audiobook, works in the background on desktop! | | Emerging |
| 2036 | lucasnewman/vocos-mlx<br>Implementation of 'Vocos: Closing the gap between time-domain and... | | Emerging |
| 2037 | aks-devs/mod_openai_tts<br>Freeswitch Speech-To-Text module | | Emerging |
| 2038 | speechly/ios-client<br>The iOS client library for Speechly API | | Emerging |
| 2039 | wwdok/faster-whisper-webui-cn<br>Cloned from https://huggingface.co/spaces/aadnk/faster-whisper-webui, and... | | Emerging |
| 2040 | jimbobbennett/SpeechToTextSamples<br>Sample code showing how to use the Azure Speech to Text service from Python 🗣 | | Emerging |
| 2041 | DojoCodingLabs/remotion-superpowers<br>🎬 Claude Code plugin — full video production studio for Remotion. AI... | | Emerging |
| 2042 | royshil/obs-squawk<br>Real-time Text-to-Speech AI Engine built-in OBS, integrative and intuitive | | Emerging |
| 2043 | lucadellalib/audiocodecs<br>A collection of audio codecs with a standardized API | | Emerging |
| 2044 | ShawnPi233/SynParaSpeech<br>Official Repository of Paper: "SynParaSpeech: Automated Synthesis of... | | Emerging |
| 2045 | mravanelli/pytorch_MLP_for_ASR<br>This code implements a basic MLP for speech recognition. The MLP is trained... | | Emerging |
| 2046 | pviotti/sayit<br>A text-to-speech command line tool backed by Azure Cognitive Services. | | Emerging |
| 2047 | HeyHeyChicken/NOVA-Python<br>NOVA is a customizable voice assistant made with Python. | | Emerging |
| 2048 | ORI-Muchim/One-Click-MB-iSTFT-VITS2<br>MB-iSTFT-VITS2 (Data Preprocessing + Whisper + Text Preprocessing + Making... | | Emerging |
| 2049 | prathamsolanki/gender-recognition-by-voice<br>Identify a voice as male or female. | | Emerging |
| 2050 | yui-mhcp/text_to_speech<br>(Multi Speaker) Text-To-Speech (TTS) project | | Emerging |
| 2051 | daisy/obi<br>Obi is an open source audio book production tool that produces digital... | | Emerging |
| 2052 | r1di/neutts-fastapi<br>OpenAI-compatible Text-to-Speech API server powered by NeuTTS. Drop-in... | | Emerging |
| 2053 | ga642381/Taiwanese-Whisper<br>Fine-tune Whisper model for Taiwanese speech recognition | | Emerging |
| 2054 | Citadawn/VoiceDAO<br>语道 (VoiceDAO) - an Android app focused on text-to-speech | | Emerging |
| 2055 | taresh18/orpheus-streaming<br>Orpheus TTS Server with streaming support (TTFB ~160ms) | | Emerging |
| 2056 | ye-kyaw-thu/myG2P<br>Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary... | | Emerging |
| 2057 | thevickypedia/Jarvis_UI<br>Lightweight UI to interact with Jarvis via API calls | | Emerging |
| 2058 | saky-semicolon/Emotion-Aware-AI-Support-System<br>A smart AI-powered platform that detects emotions from student voice input,... | | Emerging |
| 2059 | jianchang512/kokoro-uiapi<br>WebUI and OpenAI-compatible API for Kokoro TTS | | Emerging |
| 2060 | poretsky/ru_tts<br>Compact and portable Russian speech synthesizer | | Emerging |
| 2061 | yanghaha0908/FastHuBERT<br>Official implementation for Fast-HuBERT: An Efficient Training Framework for... | | Emerging |
| 2062 | arunk140/serve-piper-tts<br>Go Lang API Wrapper around Piper TTS - Supports TTS Inference and List of Voices | | Emerging |
| 2063 | susilnem/American-sign-Language<br>A CNN based human computer interface for American Sign Language recognition... | | Emerging |
| 2064 | esoyeon/KoreanTTS<br>Korean Text To Speech Project: Using Tacotron1, Tacotron2, Wavenet and Melgan | | Emerging |
| 2065 | SCRN-VRC/Voice-Recognition-Shader<br>Audio detection with visemes in a fragment shader | | Emerging |
| 2066 | rcdalj/speech2speech<br>Full speech-to-speech workflow (can be customized to user's requirements) | | Emerging |
| 2067 | manascb1344/zonos-api<br>Production-ready FastAPI wrapper for Zonos TTS models with GPU acceleration,... | | Emerging |
| 2068 | unza-speech-lab/zambezi-voice<br>Repository for multilingual speech data resources for native languages of Zambia. | | Emerging |
| 2069 | biyoml/End-to-End-Mandarin-ASR<br>End-to-end speech recognition on AISHELL dataset. | | Emerging |
| 2070 | jonaro00/wallace-minion<br>🔨🙂 Discord Bot for my private friend server | | Emerging |
| 2071 | lcraver/ProxiTalk<br>This is the repo for ProxiTalk OS. ProxiTalk is a custom operating system... | | Emerging |
| 2072 | 30stomercury/Automatic-Speech-Recognition<br>End-to-End Speech Recognition Using Tensorflow | | Emerging |
| 2073 | phineas-pta/fine-tune-whisper-vi<br>Jupyter notebooks to fine-tune Whisper models on Vietnamese using Colab... | | Emerging |
| 2074 | LedoKun/028-simple-queue-system<br>A real-time, responsive queue calling system designed for TV displays,... | | Emerging |
| 2075 | ivanvovk/compressed-tacotron2-pytorch<br>Compressed version of Tacotron 2 using Tensor Train + Waveglow. | | Emerging |
| 2076 | SiddhantSadangi/st_deepgram_playground<br>API playground for Deepgram built with Streamlit | | Emerging |
| 2077 | DataXujing/ASR-paper<br>🔥 ASR tutorial: https://dataxujing.github.io/ASR-paper/ | | Emerging |
| 2078 | vani-voice/vani<br>Open protocol & middleware for Indian language voice agents — STT→LLM→TTS in... | | Emerging |
| 2079 | Ephrem-ETH/E2E-KWS<br>End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM | | Emerging |
| 2080 | Aditya-ds-1806/dictpress-tts<br>TTS plugin for dictpress | | Emerging |
| 2081 | sberdevices/smartspeech<br>SmartSpeech is a service for speech synthesis and recognition | | Emerging |
| 2082 | daymade/chattts-seed-example<br>A ChatTTS audio repository containing different voice timbres generated with different seeds, so you can easily pick the seed you like. | | Emerging |
| 2083 | stefantaubert/mean-opinion-score<br>Python library for calculating the mean opinion score and 95% confidence... | | Emerging |
| 2084 | thewh1teagle/israwave<br>Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet | | Emerging |
| 2085 | funway/audible-epub3-maker<br>Generate audiobooks from plain EPUB files in EPUB 3 Media Overlays format... | | Emerging |
| 2086 | Deimos-M/DL-Virtual-Assistant<br>A virtual assistant for the visually impaired that includes models like... | | Emerging |
| 2087 | Yangyangii/TPGST-Tacotron<br>Google's TPGST reimplementation. | | Emerging |
| 2088 | taikun114/VOICEVOX-TTS-for-Home-Assistant<br>Custom integration for Japanese TTS using VOICEVOX in Home Assistant. | | Emerging |
| 2089 | mike-nott/smart-announcements<br>Intelligent context-aware voice announcements for Home Assistant.... | | Emerging |
| 2090 | AkshathRaghav/tinyspeech<br>Code release for "TinySpeech: Attention Condensers for Deep Speech... | | Emerging |
| 2091 | OpenTSLab/BELLE<br>Official implementation of BELLE "Bayesian Speech Synthesizers Can Learn... | | Emerging |
| 2092 | souvikg544/TTS_Data_Maker<br>Text to speech is an emerging zone of AI. This repository helps to create a... | | Emerging |
| 2093 | ih3xcode/h3xassist<br>Meeting assistant that records, transcribes, and summarizes online meetings... | | Emerging |
| 2094 | brewusinc/Edge-TTS<br>Edge-TTS is a Swift implementation of Microsoft Edge's Text-to-Speech (TTS)... | | Emerging |
| 2095 | samuelbradshaw/text-to-timestamps<br>Python and command-line utility for aligning audio to a transcript. | | Emerging |
| 2096 | georgesterpu/Taris<br>Transformer-based online speech recognition system with TensorFlow 2 | | Emerging |
| 2097 | wahyd4/say-it<br>TTS in command line -- Pronounce the Chinese and English words you typed in. | | Emerging |
| 2098 | art1415926535/yandex_speech<br>Generation of speech using Yandex SpeechKit. | | Emerging |
| 2099 | oleges1/quartznet-pytorch<br>QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261] | | Emerging |
| 2100 | mazzasaverio/youtube-auto-dub<br>Automated voice dubbing for YouTube videos using Docker, OpenVoice, and... | | Emerging |