All Voice AI Tools
6,983 tools ranked by quality score · Page 3 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 201 |
rzru/nightingale
Machine learning powered Karaoke app (with scores!) |
|
Established |
| 202 |
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project. |
|
Established |
| 203 |
asterics/Asterics-AAC
Free, easy-to-use AAC app with offline support, flexible input options,... |
|
Established |
| 204 |
pot-app/pot-desktop
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition. |
|
Established |
| 205 |
supertone-inc/supertonic-py
Lightning-Fast, On-Device TTS — running natively via ONNX. |
|
Established |
| 206 |
jianchang512/ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface... |
|
Established |
| 207 |
Vonage/vonage-ruby-sdk
Vonage REST API client for Ruby. API support for SMS, Voice, Text-to-Speech,... |
|
Established |
| 208 |
Saurav-Paul/AI-virtual-assistant-python
Command line virtual assistant for competitive programming |
|
Established |
| 209 |
pilot51/voicenotify
Android app that speaks notifications |
|
Established |
| 210 |
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model |
|
Established |
| 211 |
Enemyx-net/VibeVoice-ComfyUI
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech... |
|
Established |
| 212 |
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,... |
|
Established |
| 213 |
p0n1/epub_to_audiobook
EPUB to audiobook converter, optimized for Audiobookshelf, WebUI included |
|
Established |
| 214 |
OpenVoiceOS/ovos-tts-plugin-espeakNG
espeakNG plugin |
|
Established |
| 215 |
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented... |
|
Established |
| 216 |
evancohen/sonus
:speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword... |
|
Established |
| 217 |
alphacep/vosk-unity-asr
Automatic Speech Recognition in Unity using Vosk library |
|
Established |
| 218 |
mybigday/whisper.rn
React Native binding of whisper.cpp. |
|
Established |
| 219 |
Femoon/tts-azure-web
TTS Azure Web 是一个 Azure 文本转语音(TTS)网页应用,可以在本地或者云端使用你的 Azure Key 一键部署。TTS... |
|
Established |
| 220 |
arcosoph/nanowakeword
A lightweight, open-source, and intelligent wake word detection engine.... |
|
Established |
| 221 |
HeyWillow/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive... |
|
Established |
| 222 |
SahilAggarwal2004/react-text-to-speech
An easy-to-use React.js library that leverages the Web Speech API to convert... |
|
Established |
| 223 |
mdiller/MangoByte
A discord bot that provides the ability to play dota hero response clips, do... |
|
Established |
| 224 |
antirek/voicer
AGI-server voice recognizer for #Asterisk |
|
Established |
| 225 |
TrevorS/voxtral-mini-realtime-rs
Streaming speech recognition running natively and in the browser. A pure... |
|
Established |
| 226 |
richardr1126/openreader
An open-source read-along document reader server with high-quality TTS... |
|
Established |
| 227 |
RageAgainstThePixel/ElevenLabs-DotNet
A Non-Official ElevenLabs RESTful API Client for dotnet |
|
Established |
| 228 |
BoltzmannEntropy/MimikaStudio
MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic... |
|
Established |
| 229 |
thevickypedia/Jarvis
Fully Functional Voice Based Natural Language UI |
|
Established |
| 230 |
janvarev/Irene-Voice-Assistant
Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы... |
|
Established |
| 231 |
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For... |
|
Established |
| 232 |
canopyai/Orpheus-TTS
Towards Human-Sounding Speech |
|
Established |
| 233 |
yeyupiaoling/YeAudio
Python的音频工具 |
|
Established |
| 234 |
davidacm/NVDA-IBMTTS-Driver
This project is aimed at developing and maintaining the NVDA IBMTTS driver.... |
|
Established |
| 235 |
vivekuppal/transcribe
Transcribe is a real time transcription, conversation, Language learning... |
|
Established |
| 236 |
fishaudio/fish-audio-python
The official Python library for the Fish Audio API. |
|
Established |
| 237 |
marytts/marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system... |
|
Established |
| 238 |
dictation-toolbox/dragonfly
Speech recognition framework allowing powerful Python-based scripting and... |
|
Established |
| 239 |
ttop32/MouseTooltipTranslator
Mouseover Translate Any Language At Once - Chrome Extension: PDF Translator,... |
|
Established |
| 240 |
mlalma/kokoro-ios
Kokoro TTS for iOS and macOSX |
|
Established |
| 241 |
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language |
|
Established |
| 242 |
gooofy/py-kaldi-asr
Some simple wrappers around kaldi-asr intended to make using kaldi's... |
|
Established |
| 243 |
keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with... |
|
Established |
| 244 |
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX |
|
Established |
| 245 |
stefantaubert/pinyin-to-ipa
Command-line interface and Python library to transcribe pinyin to IPA. The... |
|
Established |
| 246 |
jonatasgrosman/huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools |
|
Established |
| 247 |
xiangyuecn/Recorder
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid... |
|
Established |
| 248 |
DevEmperor/Dictate
A powerful Whisper AI keyboard for reliable speech transcription |
|
Established |
| 249 |
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages! |
|
Established |
| 250 |
moonstar-x/discord-tts-bot
A Text-to-Speech bot for Discord. |
|
Established |
| 251 |
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment |
|
Established |
| 252 |
deepgram/deepgram-rust-sdk
Community Rust SDK for Deepgram. |
|
Established |
| 253 |
Blaizzy/mlx-audio-swift
A modular Swift SDK for audio processing with MLX on Apple Silicon |
|
Established |
| 254 |
YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT:... |
|
Established |
| 255 |
capacitor-community/text-to-speech
⚡️ Capacitor plugin for synthesizing speech from text. |
|
Established |
| 256 |
sfortis/openai_tts
Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine... |
|
Established |
| 257 |
dectalk/dectalk
Modern builds for the 90s/00s DECtalk text-to-speech application. |
|
Established |
| 258 |
robdmac/talkito
TalkiTo lets developers interact with AI systems through speech across... |
|
Established |
| 259 |
ai-ng/swift
Fast voice assistant powered by Groq, Cartesia, and Vercel. |
|
Established |
| 260 |
readium/speech
💬 A TypeScript library for implementing read aloud on the Web |
|
Established |
| 261 |
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models |
|
Established |
| 262 |
FirezTheGreat/1SHOT
All my works - https://github.com/FirezTheGreat (latest music commands/djs... |
|
Established |
| 263 |
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for... |
|
Established |
| 264 |
MasuRii/opencode-smart-voice-notify
🔊 Smart voice notification plugin for OpenCode with multiple TTS engines... |
|
Established |
| 265 |
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion |
|
Established |
| 266 |
shivammehta25/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS) |
|
Established |
| 267 |
Gr122lyBr/voicetag
Speaker identification powered by pyannote and resemblyzer |
|
Established |
| 268 |
Picovoice/speech-to-text-benchmark
speech to text benchmark framework |
|
Established |
| 269 |
hkchengrex/MMAudio
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality... |
|
Established |
| 270 |
snakers4/silero-stress
Silero Stress — pre-trained enterprise-grade automated stress and homograph... |
|
Established |
| 271 |
i3thuan5/tai5-uan5_gian5-gi2_kang1-ku7
臺灣言語工具 |
|
Established |
| 272 |
WhisperSpeech/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper. |
|
Established |
| 273 |
petercunha/tts
:pencil: :sound: A simple text-to-speech tool. Converts your text to speech... |
|
Established |
| 274 |
zzw922cn/Automatic_Speech_Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow |
|
Established |
| 275 |
R3gm/SoniTranslate
Synchronized Translation for Videos. Video dubbing |
|
Established |
| 276 |
vox-serve/vox-serve
A Streaming-Native Serving Engine for TTS/STS Models |
|
Established |
| 277 |
pykaldi/pykaldi
A Python wrapper for Kaldi |
|
Established |
| 278 |
alphacep/vosk-android-demo
Offline speech recognition for Android with Vosk library. |
|
Established |
| 279 |
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model... |
|
Established |
| 280 |
midas-research/audino
Open source audio annotation tool for humans |
|
Established |
| 281 |
yeyupiaoling/PaddlePaddle-DeepSpeech
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。 |
|
Established |
| 282 |
funnyzak/tts-now
跨平台基于云平台(阿里云、讯飞等)语音合成 API 的文字转语音助手。支持单文本快速合成和批量合成。支持windows、macOS、Linux。 |
|
Established |
| 283 |
linto-ai/linto-stt
An automatic speech recognition API |
|
Established |
| 284 |
Aivis-Project/AivisSpeech-Engine
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine |
|
Established |
| 285 |
nari-labs/dia
A TTS model capable of generating ultra-realistic dialogue in one pass. |
|
Established |
| 286 |
mgonzs13/whisper_ros
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2 |
|
Established |
| 287 |
mathigatti/midi2voice
Singing synthesis from MIDI file |
|
Established |
| 288 |
soniqo/speech-swift
AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and... |
|
Established |
| 289 |
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level... |
|
Established |
| 290 |
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model. |
|
Emerging |
| 291 |
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without... |
|
Emerging |
| 292 |
analyticsinmotion/werx
🐍📦 Easy-to-use Python package for lightning-fast Word Error Rate (WER) analysis |
|
Emerging |
| 293 |
High-Logic/Genie-TTS
GPT-SoVITS ONNX Inference Engine & Model Converter |
|
Emerging |
| 294 |
lobehub/lobe-tts
🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser |
|
Emerging |
| 295 |
NeonGeckoCom/neon-tts-plugin-coqui
Coqui AI TTS plugin |
|
Emerging |
| 296 |
jeroenterheerdt/pycsspeechtts
Python (py) library to use Microsofts Cognitive Services Speech (csspeech)... |
|
Emerging |
| 297 |
ThioJoe/Auto-Synced-Translated-Dubs
Automatically translates the text of a video based on a subtitle file, and... |
|
Emerging |
| 298 |
sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition... |
|
Emerging |
| 299 |
rwth-i6/rasr
The RWTH ASR Toolkit. |
|
Emerging |
| 300 |
Stypox/dicio-android
Dicio assistant app for Android |
|
Emerging |