All Voice AI Tools
6,981 tools ranked by quality score · Page 23 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 2201 | MbBrainz/ttslab: TTSLab is THE place to easily test ANY text-to-speech model on your... | | Emerging |
| 2202 | kapi2800/qwen3-tts-mac: Optimized implementation of Qwen3-TTS for Apple Silicon (M1-M4) | | Emerging |
| 2203 | sayak-brm/espeakng-python: An eSpeak NG TTS binding for Python 3. | | Emerging |
| 2204 | GloomyGrave/Sinsy-NG: (discontinued) 🎵 The Formant-Based All Language Singing Voice Synthesis... | | Emerging |
| 2205 | OpenVoiceOS/ovos-tts-plugin-beepspeak: Experiment adding a new R2D2 TTS engine for Mycroft | | Emerging |
| 2206 | HelloChatterbox/py_responsivevoice: Unofficial Python API for ResponsiveVoice | | Emerging |
| 2207 | gokhaneraslan/tts-dataset-generator: With this tool you can create a custom TTS dataset from video or audio. | | Emerging |
| 2208 | diggerdu/pytorch_audio: Audio processing module for PyTorch: STFT, iSTFT | | Emerging |
| 2209 | andi611/CS-Tacotron-Pytorch: PyTorch implementation of CS-Tacotron, a code-switching speech synthesis... | | Emerging |
| 2210 | hkdb/offline-tts: A Chrome extension that reads web pages and PDFs aloud using Supertonic's... | | Emerging |
| 2211 | USSLab/DolphinAttack: Inaudible Voice Commands | | Emerging |
| 2212 | Proteusiq/saa: Making Time Speak! 🎙️ | | Emerging |
| 2213 | go-restream/supertts: 🎧 Supertonic TTS ONNX Inference OpenAI Speech REST API | | Emerging |
| 2214 | Sciss/SpeechRecognitionHMM: Exported from... | | Emerging |
| 2215 | aidayang/LatentSync-OneClick: Free video lip-sync software: LatentSync one-click launch bundle | | Emerging |
| 2216 | AI-TOOLKIT/VoiceBridge: VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit | | Emerging |
| 2217 | npuichigo/ttsflow: TensorFlow speech synthesis C++ inference for voicenet | | Emerging |
| 2218 | hkilang/TTS: Text-to-speech reader for Hong Kong Waitau and Hakka | | Emerging |
| 2219 | UFOAlastor/AI-Waifu-Project-LaIN: An AI Waifu client with long-term memory, facial expressions and motions, voice conversation/interruption/voiceprint recognition, FunctionCall, and multi-model support. | | Emerging |
| 2220 | Issac-Moses/Beacon: Beacon – A lightweight voice-controlled AI assistant using Whisper.cpp. ... | | Emerging |
| 2221 | wspr-ncsu/robocall-audio-dataset: A dataset of real-world robocall audio recordings | | Emerging |
| 2222 | SEPIA-Framework/sepia-web-audio: Create modular, cross-browser, web audio pipelines to record and process... | | Emerging |
| 2223 | skit-ai/speech-to-intent-dataset: Dataset Release for Intent Classification from Speech | | Emerging |
| 2224 | siddhant-vij/Health-Fitness-Tracker: Health & fitness app with natural language processing, custom... | | Emerging |
| 2225 | scarletcho/prep4kaldi: Data preparation code for building a Kaldi ASR system | | Emerging |
| 2226 | krestaino/prankstr: 📞 Prank your friends with text-to-speech phone calls powered by Twilio and... | | Emerging |
| 2227 | amirharati/kaldi-alligner: Scripts to align a given wave to its transcription using models trained by Kaldi | | Emerging |
| 2228 | hanxiao/mls: MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon | | Emerging |
| 2229 | khanld/Wav2vec2-Pretraining: Wav2vec 2.0 Self-Supervised Pretraining | | Emerging |
| 2230 | IPS-LMU/transcription-portal: A portal that offers a transcription chain for multi upload and processing... | | Emerging |
| 2231 | deepgram-devs/deepgram-demos-rust: Useful demo applications for Deepgram Voice AI APIs, using the Rust language! 🦀 | | Emerging |
| 2232 | jopedroliveira/speech_recog_uc: Speech processing ROS package. Performs speech recognition and estimates the... | | Emerging |
| 2233 | ASR-project/Multilingual-PR: Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM.... | | Emerging |
| 2234 | karrarkazuya/ArabicTTS: ArabicTTS (TextToSpeech) Android library with a sample | | Emerging |
| 2235 | boudhayan-dev/Blind-Reader-project: A low-cost reading device for blind people. | | Emerging |
| 2236 | mozilla/deepspeech-playbook: A crash course for training speech recognition models using DeepSpeech. | | Emerging |
| 2237 | SEPIA-Framework/sepia-docs: Documentation and Wiki for SEPIA. Please post your questions and bug reports... | | Emerging |
| 2238 | xcmyz/FastSpeech2: An implementation of FastSpeech2 based on PyTorch. | | Emerging |
| 2239 | overcrash66/Audio-File-Translator---S2ST: Audio file translator is a multilingual speech-to-speech and speech-to-text... | | Emerging |
| 2240 | ayshrv/memento-app: Android app that serves as an AI assistant for human memory | | Emerging |
| 2241 | papercast-dev/papercast: A Python pipeline tool and plugin ecosystem for processing technical... | | Emerging |
| 2242 | shreyanspagariya/sankshep: Video Summarization - Summarized a video lecture and converted it to a... | | Emerging |
| 2243 | ondrejklejch/learning_to_adapt: Coordinate-wise meta-learner for speaker adaptation of ASR models. | | Emerging |
| 2244 | The-Data-Dilemma/ParquetToHuggingFace: ParquetToHuggingFace processes raw audio data, converts it into Parquet... | | Emerging |
| 2245 | suzuran0y/Live2D-LLM-Chat: Live2D + ASR + LLM + TTS → Real-time communication + Offline... | | Emerging |
| 2246 | zalo/OpenAI-Voice: A simple proof of concept for voice-to-voice interaction. | | Emerging |
| 2247 | ericc-ch/edge-tts: Use Microsoft Edge's online text-to-speech service from JS code directly! | | Emerging |
| 2248 | laszukdawid/cracker: Usable GUI for text-to-speech services | | Emerging |
| 2249 | AshutoshDongare/convo: Open-source voice bot for humanoid robots and virtual digital humans | | Emerging |
| 2250 | X-LANCE/VoiceFlow-TTS: [ICASSP 2024] This is the official code for "VoiceFlow: Efficient... | | Emerging |
| 2251 | MichalKacprzak99/jarvis: Jarvis is a personal voice assistant inspired by the Marvel movie series | | Emerging |
| 2252 | jenswittmann/CurlyFramework: Tiny Framework for accessibility and sustainability, not only for MODX or Kirby CMS. | | Emerging |
| 2253 | opsdroid/opsdroid-audio: 🗣 A companion application for opsdroid which adds hotwords, speech... | | Emerging |
| 2254 | HasnainDarkNet/DarKVoice: DarKVoice is an open-source voice assistant and audio processing tool built... | | Emerging |
| 2255 | upskyy/ContextNet: PyTorch implementation of "ContextNet: Improving Convolutional Neural... | | Emerging |
| 2256 | hug33k/PyTalk-R2D2: Python script for R2D2 text-to-speech | | Emerging |
| 2257 | zmeet-ai/tts-demo: Supports male and female voices with a range of emotions; supports real-time and offline text-to-speech synthesis; supports voice conversion for a single model voice, speech rate adjustment, and volume adjustment; supports custom voice models. | | Emerging |
| 2258 | in03/squawk: Automatic subtitles for DaVinci Resolve with OpenAI Whisper | | Emerging |
| 2259 | Ronik22/Voice-Controlled-Email: A Python-based voice-controlled email application for visually impaired persons. | | Emerging |
| 2260 | filimo/ReaderTranslator: PDF/WebPages Reader with embedded Google Translate and voice engine on... | | Emerging |
| 2261 | ognistik/alfred-superwhisper: Use Alfred to Control Superwhisper - AI Powered Voice to Text | | Emerging |
| 2262 | JSON2Video/json2video-php-sdk: Video automation with PHP: add watermarks, resize videos, create slideshows,... | | Emerging |
| 2263 | telecombcn-dl/2018-dlsl: UPC Deep Learning for Speech and Language 2018 | | Emerging |
| 2264 | azraelkuan/FFTNet: FFTNet: a Real-Time Speaker-Dependent Neural Vocoder | | Emerging |
| 2265 | ckaytev/tgisper: Telegram bot with ASR | | Emerging |
| 2266 | vorojar/VoiceSnap: Open-source offline voice dictation, a free alternative to Typeless. 100%... | | Emerging |
| 2267 | ZeroMirai/Waifu_AI_Vtuber: Waifu_AI_Vtuber is an AI virtual YouTuber chatbot powered by OpenAI GPT-3.5,... | | Emerging |
| 2268 | hanifabd/voice-activity-detection-vad-realtime: Real-time Voice Activity Detection (VAD) with some example use cases like... | | Emerging |
| 2269 | hutchresearch/latex2speech: TeX2Speech is an application that turns LaTeX documents into spoken audio. | | Emerging |
| 2270 | PowerBeef/QwenVoice: Native macOS app for Qwen3-TTS with custom voices, voice design, and voice... | | Emerging |
| 2271 | suzumushi0/SoundObject_binary: SoundObject binary distribution. | | Emerging |
| 2272 | HCI-LAB-UGSPEECHDATA/speech_data_ghana_ug: The dataset comprises a 5,000-hour speech corpus in Akan, Ewe, Dagbani,... | | Emerging |
| 2273 | kcitlyn/PolyScribe_Desktop: Fully offline transcription and translator with speech-to-text and... | | Emerging |
| 2274 | i4Ds/whisper-prep: Data preparation utility for the fine-tuning of OpenAI's Whisper model. | | Emerging |
| 2275 | indri-voice/audiotoken: Audio tokenization, in the fastest way possible! | | Emerging |
| 2276 | BraceYourselfGames/UE-BYGTextToSpeech: A plugin that uses the Windows Speech API to speak text in Unreal Engine 4. | | Emerging |
| 2277 | bensonruan/Speech-Command: Speech Command Recognizer using TensorFlow.js | | Emerging |
| 2278 | theaifutureguy/Vocal-Agent: A sophisticated real-time voice assistant that seamlessly integrates speech... | | Emerging |
| 2279 | led-mirage/VoivoClip: An app that uses VOICEVOX to read aloud text copied to the clipboard. | | Emerging |
| 2280 | masonthemaker/saidwell: Open Source Voice AI Dashboard | | Emerging |
| 2281 | lmangani/docker-rtpengine-speech: OpenSIPS + RTPEngine Recording + Speech Recognition in HEP | | Emerging |
| 2282 | hebbihebb/MBook: EPUB to M4B using Maya1 | | Emerging |
| 2283 | gkrsv/split_audio: A rough-and-ready Python utility which splits audio files based on silence... | | Emerging |
| 2284 | oren-cohen/whatsmybitrate: Whatsmybitrate analyzes audio files for quality metrics such as bit rate,... | | Emerging |
| 2285 | hollygrimm/voice-dataset-creation: Tools to create your own voice dataset for TTS training | | Emerging |
| 2286 | aabdurakhmanov/uzbekcha-gapir: Desktop program that pronounces text in Uzbek \| Text to speech... | | Emerging |
| 2287 | RapDoodle/Web-Real-Time-Speech-Recognition-with-Azure: An example project that provides a web interface to real-time speech-to-text... | | Emerging |
| 2288 | calinalexandru/pericles: A browser extension offering intuitive text-to-speech functionality, making... | | Emerging |
| 2289 | surajondev/text-to-speech: Convert text into speech | | Emerging |
| 2290 | vectominist/End-to-end-ASR-Pytorch-DLHLP: Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation... | | Emerging |
| 2291 | gokulkarthik/text2speech: Towards Building Text-To-Speech Systems for the Next Billion Users -... | | Emerging |
| 2292 | weespin/RequestifyTF2: Client side commands for mic spamming and more! | | Emerging |
| 2293 | SUNGBEOMCHOI/Korean-Streaming-ASR: Korean Streaming ASR (with Denoiser and Conformer CTC) | | Emerging |
| 2294 | Rongjiehuang/Multiband-WaveRNN: An unofficial implementation of the autoregressive vocoder Multiband-WaveRNN. Audio... | | Emerging |
| 2295 | jesseward/azuretexttospeech: A Go library for Azure's Cognitive Services text-to-speech API. | | Emerging |
| 2296 | Vazgen005/discord-virtual-micro: Says everything you type in Discord for you using AI (Silero Models) | | Emerging |
| 2297 | betaoverflow/donna: Transform your smart devices to intelligent communicators. | | Emerging |
| 2298 | CMsmartvoice/Unet-TTS: One-shot TTS with Improved Unseen Speaker and Style Transfer | | Emerging |
| 2299 | mishrababhishek/chatbot: AI Chatbot answers students' queries about their college program using... | | Emerging |
| 2300 | gokhaneraslan/XTTS_V2-finetuning: Training XTTS V2 and PEFT LoRA Text-to-Speech (TTS) | | Emerging |