All Voice AI Tools
6,983 tools ranked by quality score · Page 2 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 101 |
kurianbenoy/whisper_normalizer
A python package for whisper normalizer |
|
Established |
| 102 |
ieasybooks/tafrigh
Transcribe text and generate SRT and VTT files using Whisper models and wit.ai technology. |
|
Established |
| 103 |
nttcslab-sp/kaldiio
A pure python module for reading and writing kaldi ark files |
|
Established |
| 104 |
PyThaiNLP/pythaiasr
Python Thai Automatic Speech Recognition |
|
Established |
| 105 |
Picovoice/rhino
On-device Speech-to-Intent engine powered by deep learning |
|
Established |
| 106 |
cboard-org/cboard
Augmentative and Alternative Communication (AAC) system with text-to-speech... |
|
Established |
| 107 |
Vonage/vonage-php-sdk-core
Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech,... |
|
Established |
| 108 |
ManimCommunity/manim-voiceover
Manim plugin for all things voiceover |
|
Established |
| 109 |
roryeckel/wyoming_openai
OpenAI-Compatible Proxy Middleware for the Wyoming Protocol |
|
Established |
| 110 |
PyThaiNLP/PyThaiTTS
Open Source Thai Text-to-speech library in Python |
|
Established |
| 111 |
flashlight/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit |
|
Established |
| 112 |
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine |
|
Established |
| 113 |
OpenMOSS/MOSS-TTS
MOSS-TTS Family is an open-source speech and sound generation model family... |
|
Established |
| 114 |
lugia19/elevenlabslib
Full python wrapper for the elevenlabs API. |
|
Established |
| 115 |
amicalhq/amical
🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no... |
|
Established |
| 116 |
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing |
|
Established |
| 117 |
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping. |
|
Established |
| 118 |
tabahi/bournemouth-forced-aligner
Extract phoneme-level timestamps from speech audio. |
|
Established |
| 119 |
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning |
|
Established |
| 120 |
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages |
|
Established |
| 121 |
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time |
|
Established |
| 122 |
MainRo/deepspeech-server
A testing server for a speech to text service based on coqui.ai |
|
Established |
| 123 |
OpenVoiceOS/ovos-tts-server
simple flask server to host OpenVoiceOS tts plugins as a service |
|
Established |
| 124 |
aichaos/rivescript-python
A RiveScript interpreter for Python. RiveScript is a scripting language for... |
|
Established |
| 125 |
software-mansion/react-native-executorch
Declarative way to run AI models in React Native on device, powered by ExecuTorch. |
|
Established |
| 126 |
chinokikiss/GSV-TTS-Lite
GSV-TTS-Lite A high-performance inference engine specifically designed for... |
|
Established |
| 127 |
vilassn/whisper_android
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android |
|
Established |
| 128 |
charleprr/redditube
A video generator from Reddit posts and comments |
|
Established |
| 129 |
altunenes/parakeet-rs
Very fast speech-to-text, diarization, and streaming (even on CPU) with NVIDIA... |
|
Established |
| 130 |
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit |
|
Established |
| 131 |
GitYCC/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin (Bopomofo) or Pinyin (INTERSPEECH 2022) |
|
Established |
| 132 |
MycroftAI/mycroft-precise
A lightweight, simple-to-use, RNN wake word listener |
|
Established |
| 133 |
Wikidepia/g2p-id
Indonesian Grapheme-to-Phoneme (IPA notation) |
|
Established |
| 134 |
n1teshy/yapper-tts
offline text to speech and free SOTA LLM APIs to let your programs speak to you |
|
Established |
| 135 |
AbdullahHendy/live-translation
Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM... |
|
Established |
| 136 |
Vonage/vonage-node-sdk
Vonage API client for Node.js. API support for SMS, Voice, Text-to-Speech,... |
|
Established |
| 137 |
phuc-nt/my-translator
Real-time speech translation — macOS & Windows, free TTS, no server, your... |
|
Established |
| 138 |
haoheliu/voicefixer
General Speech Restoration |
|
Established |
| 139 |
Spr-Aachen/Easy-Voice-Toolkit
A user-friendly audio toolkit for voice recognition, voice transcription,... |
|
Established |
| 140 |
jianchang512/stt
Voice Recognition to Text Tool / An offline, locally run audio and video to subtitle tool that outputs JSON, SRT subtitles, and plain text |
|
Established |
| 141 |
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning) |
|
Established |
| 142 |
wq2012/SimpleDER
A lightweight library to compute Diarization Error Rate (DER). |
|
Established |
| 143 |
astorfi/speechpy
💬 SpeechPy - A Library for Speech Processing and Recognition:... |
|
Established |
| 144 |
sccn/eegprep
EEGPrep is an automated preprocessing tool for human EEG data built on a... |
|
Established |
| 145 |
revdotcom/revai-node-sdk
Node.js SDK for the Rev AI API |
|
Established |
| 146 |
justinsalamon/scaper
A library for soundscape synthesis and augmentation |
|
Established |
| 147 |
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper |
|
Established |
| 148 |
XDcobra/react-native-sherpa-onnx
React Native TurboModule for Sherpa-ONNX offline on-device Speech Processing... |
|
Established |
| 149 |
yandexdataschool/speech_course
YSDA course in Speech Processing. |
|
Established |
| 150 |
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API |
|
Established |
| 151 |
jamsch/expo-speech-recognition
Speech Recognition for React Native Expo projects |
|
Established |
| 152 |
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching |
|
Established |
| 153 |
krillinai/KrillinAI
Video translation and dubbing tool powered by LLMs. The video translator... |
|
Established |
| 154 |
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX |
|
Established |
| 155 |
echogarden-project/echogarden
Cross-platform speech toolset, used from the command-line or as a Node.js... |
|
Established |
| 156 |
linto-ai/WebVoiceSDK
Building blocks for voice-enabled applications in the browser |
|
Established |
| 157 |
deepgram/deepgram-js-sdk
Official JavaScript SDK for Deepgram. |
|
Established |
| 158 |
kstonekuan/tambourine-voice
Your personal voice interface for any app. Speak naturally and your words... |
|
Established |
| 159 |
ken107/read-aloud
An awesome browser extension that reads aloud webpage content with one click |
|
Established |
| 160 |
remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX... |
|
Established |
| 161 |
EddyVerbruggen/nativescript-speech-recognition
💬 Speech to text, using the awesome engines readily available... |
|
Established |
| 162 |
itsmevictor/clean-transcribe
A simple CLI to transcribe Youtube videos or local audio/video files and... |
|
Established |
| 163 |
zuoban/tts
TTS service |
|
Established |
| 164 |
githubharald/CTCWordBeamSearch
Connectionist Temporal Classification (CTC) decoder with dictionary and... |
|
Established |
| 165 |
NVIDIA-AI-Blueprints/pdf-to-podcast
Transform PDFs into AI podcasts for engaging on-the-go audio content. |
|
Established |
| 166 |
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition |
|
Established |
| 167 |
OpenMOSS/MOSS-TTSD
MOSS-TTSD is a spoken dialogue generation model designed for expressive... |
|
Established |
| 168 |
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to... |
|
Established |
| 169 |
met4citizen/HeadTTS
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for... |
|
Established |
| 170 |
Azure-Samples/Cognitive-Speech-TTS
Microsoft Text-to-Speech API sample code in several languages, part of... |
|
Established |
| 171 |
LokerL/tts-vue
🎤 Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite. |
|
Established |
| 172 |
kalliope-project/kalliope
Kalliope is a framework that will help you to create your own personal assistant. |
|
Established |
| 173 |
sandrohanea/whisper.net
Whisper.net. Speech to text made simple using Whisper Models |
|
Established |
| 174 |
VolcanicArts/VRCOSC
A modular node-programming language, program creator, animation system,... |
|
Established |
| 175 |
travisvn/edge-tts-universal
Use Microsoft Edge's online text-to-speech service in Node.js, browsers, or... |
|
Established |
| 176 |
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path,... |
|
Established |
| 177 |
aahl/zai-tts
🗣️ ZAI/GLM TTS to OpenAI Speech API, a free speech synthesis API with voice cloning support, based on Zhipu TTS |
|
Established |
| 178 |
peteonrails/voxtype
Voice-to-text with push-to-talk for Wayland compositors |
|
Established |
| 179 |
dlutton/flutter_tts
Flutter Text to Speech package |
|
Established |
| 180 |
gunthercox/chatterbot-voice
An example of verbal communication using ChatterBot |
|
Established |
| 181 |
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX |
|
Established |
| 182 |
yuga-hashimoto/openclaw-assistant
OpenClaw voice assistant app for Android - Wake word activation & system... |
|
Established |
| 183 |
dputhier/pygtftk
A python package and a set of shell commands to handle GTF files |
|
Established |
| 184 |
Oaklight/asr2clip
handy cli tool to convert your speech to clipboard text |
|
Established |
| 185 |
royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI |
|
Established |
| 186 |
BryceWG/BiBi-Keyboard
BiBi Keyboard (说点啥): a Kotlin-based Android keyboard app for LLM and ASR voice input. An LLM ASR... |
|
Established |
| 187 |
stemrollerapp/stemroller
Isolate vocals, drums, bass, and other instrumental stems from any song |
|
Established |
| 188 |
kishanrajput23/Jarvis-Desktop-Voice-Assistant
A python based desktop voice assistant capable of executing system-level... |
|
Established |
| 189 |
deepgram/deepgram-python-sdk
Official Python SDK for Deepgram. |
|
Established |
| 190 |
stimm-ai/stimm
The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI... |
|
Established |
| 191 |
zai-org/GLM-ASR
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters |
|
Established |
| 192 |
JamesBrill/react-speech-recognition
💬Speech recognition for your React app |
|
Established |
| 193 |
wannaphong/ttsmms
TTS with The Massively Multilingual Speech (MMS) project |
|
Established |
| 194 |
ynop/audiomate
Python library for handling audio datasets. |
|
Established |
| 195 |
sdkcarlos/artyom.js
A voice control - voice commands - speech recognition and speech synthesis... |
|
Established |
| 196 |
Aivis-Project/aivmlib
Aivis Voice Model File (.aivm/.aivmx) Utility Library |
|
Established |
| 197 |
hugobloem/wyoming-microsoft-tts
Wyoming protocol server for Microsoft Azure text-to-speech |
|
Established |
| 198 |
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System |
|
Established |
| 199 |
namastexlabs/murmurai
🎙️ Drop-in replacement for paid transcription APIs. Self-hosted,... |
|
Established |
| 200 |
mkiol/dsnote
Speech Note Linux app. Note taking, reading and translating with offline... |
|
Established |