All Voice AI Tools
6,981 tools ranked by quality score · Page 13 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 1201 |
alias454/YATSEE
YATSEE - Yet Another Tool for Speech Extraction & Enrichment |
|
Emerging |
| 1202 |
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and... |
|
Emerging |
| 1203 |
CoffeeVampir3/audiocraft-webui
Quick webui for audiocraft |
|
Emerging |
| 1204 |
HenestrosaDev/audiotext
A desktop application that transcribes audio from files, microphone input or... |
|
Emerging |
| 1205 |
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation |
|
Emerging |
| 1206 |
TeamAudio/reaspeech
Speech recognition for REAPER |
|
Emerging |
| 1207 |
italankin/samplevoicebot
TTS Telegram bot |
|
Emerging |
| 1208 |
smaranjitghose/AIAudioTranscriber
A minimalistic web app to generate transciption for audio built using Python |
|
Emerging |
| 1209 |
arpy8/ESP32_Voice_Assistant
This project combines embedded system and AI inference to create an... |
|
Emerging |
| 1210 |
MHaggis/ASRGEN
ASR Configurator, Essentials and Atomic Testing |
|
Emerging |
| 1211 |
finos/greenkey-asrtoolkit
A collection of useful tools for handling speech recognition data |
|
Emerging |
| 1212 |
serpapps/ai-voice-cloner
AI Voice Cloning Desktop Application that runs locally on your computer and... |
|
Emerging |
| 1213 |
JosefAlbers/e2tts-mlx
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX |
|
Emerging |
| 1214 |
benmaster82/writher
Voice-powered productivity for Windows |
|
Emerging |
| 1215 |
rishikksh20/gmvae_tacotron
Gaussian Mixture VAE Tacotron |
|
Emerging |
| 1216 |
r0227n/flutter_whisper_kit
🎤 A Flutter plugin for running WhisperKit speech-to-text models on-device,... |
|
Emerging |
| 1217 |
jbelford/Eolian
Eolian is a Discord music bot which provide a very powerful API for queuing... |
|
Emerging |
| 1218 |
AIFSH/ComfyUI-FishSpeech
a custom comfyui node for fish-speech |
|
Emerging |
| 1219 |
sotelo/parrot
RNN-based generative models for speech. |
|
Emerging |
| 1220 |
rainygirl/rspeaker
말귀를 알아듣고 뉴스도 요약해 읽어줍니다 |
|
Emerging |
| 1221 |
petewarden/spchcat
Speech recognition tool to convert audio to text transcripts, for Linux and... |
|
Emerging |
| 1222 |
Kaljurand/Arvutaja
An Android app for voice actions in Estonian and English |
|
Emerging |
| 1223 |
spokestack/spokestack-ios
Spokestack: give your iOS app a voice interface! |
|
Emerging |
| 1224 |
gdoudeng/react-native-baidu-asr
The react-native Baidu voice library provides voice recognition, voice... |
|
Emerging |
| 1225 |
MerlinCN/kinoko7danmaku
调用TTS来播报哔哩哔哩直播中的弹幕、礼物、舰长等 |
|
Emerging |
| 1226 |
mobassir94/comprehensive-bangla-tts
Aiming to achieve ultimate Multilingual TTS pipeline with main focus on... |
|
Emerging |
| 1227 |
botbahlul/VOSK-Powered-Live-Subtitle-V3
ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using free... |
|
Emerging |
| 1228 |
declare-lab/speech-adapters
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient... |
|
Emerging |
| 1229 |
mapluisch/OpenAI-Realtime-API-for-Unity
Implementation of OpenAI's Realtime API in Unity. Easily integrate... |
|
Emerging |
| 1230 |
solyarisoftware/CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server. |
|
Emerging |
| 1231 |
cherts/mspeech
Program for speech recognition using the Google Speech API, voice commands,... |
|
Emerging |
| 1232 |
Andrewcpu/elevenlabs-api
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs... |
|
Emerging |
| 1233 |
Frikallo/parakeet.cpp
Ultra fast and portable Parakeet implementation for on-device inference in... |
|
Emerging |
| 1234 |
Saganaki22/ComfyUI-Step_Audio_EditX_TTS
ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice... |
|
Emerging |
| 1235 |
Kini218/speech-to-text
Speech to text script on python |
|
Emerging |
| 1236 |
yeahhe365/PageTalk
一个简洁且优秀的描述是:这是一款在任何网页上实现无缝语音转文字的 Chrome 扩展,使用先进的 ASR API。 |
|
Emerging |
| 1237 |
Edw590/VISOR---Android-Version-Assistant
V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory! |
|
Emerging |
| 1238 |
AryanVBW/AiVoiceClonerPRO
Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into... |
|
Emerging |
| 1239 |
DeeepMaker/subtitle-to-audio
A python script to generate .wav audio files for .srt subtitle files |
|
Emerging |
| 1240 |
yl4579/HiFTNet
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter... |
|
Emerging |
| 1241 |
itsRares/react-native-deepgram
Brings Deepgram's capabilities to React Native applications, with a focus on... |
|
Emerging |
| 1242 |
huckiyang/Voice2Series-Reprogramming
ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time... |
|
Emerging |
| 1243 |
gladiaio/normalization
A lightweight library for normalizing speech transcripts before computing WER |
|
Emerging |
| 1244 |
Spac5y/Vocal-Agent
A cutting-edge Cascading voice assistant combining real-time speech... |
|
Emerging |
| 1245 |
nodef/extra-amazontts
Generate speech audio from super long text through machine (via "Amazon... |
|
Emerging |
| 1246 |
advanced-media-inc/amivoice-api-client-library
AmiVoice API Client Library and the sample programs |
|
Emerging |
| 1247 |
OvidijusParsiunas/speech-to-element
A simple way to add speech to text functionality to your website :microphone: |
|
Emerging |
| 1248 |
cboard-org/ccboard
Cordova wrapper for the Cboard application |
|
Emerging |
| 1249 |
ae9is/subtitle-chan
Live speech transcription and translation in your browser |
|
Emerging |
| 1250 |
botbahlul/pyvosklivesubtitle
PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23... |
|
Emerging |
| 1251 |
holm-aune-bachelor2018/ctc
Speech recognition with CTC in Keras with Tensorflow backend |
|
Emerging |
| 1252 |
nl8590687/ASRT_SDK_Python3
ASRT语音识别系统的Python版SDK |
|
Emerging |
| 1253 |
kapi2800/qwen3-tts-apple-silicon
Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning,... |
|
Emerging |
| 1254 |
TUD-STKS/VocalTractLabBackend-dev
The VocalTractLab backend sources and C/C++ API |
|
Emerging |
| 1255 |
jing332/tts-server-go
微软TTS服务转发,以便在阅读APP中通过网络导入方式收听微软TTS / Edge大声朗读 |
|
Emerging |
| 1256 |
mattmireles/kokoro-coreml
PyTorch → CoreML conversion pipeline for Kokoro TTS. Unlocks fast on-device... |
|
Emerging |
| 1257 |
Renovamen/Speech-and-Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech... |
|
Emerging |
| 1258 |
alexiokay/AriLink
Modern ARI-STASI server, built on Asterisk ARI with real-time speech-to-text... |
|
Emerging |
| 1259 |
kristofferv98/VoiceProcessingToolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for... |
|
Emerging |
| 1260 |
n0name45/node-red-contrib-yandex-station-management
Модуль node-red-contrib-yandex-station-management для управления умными... |
|
Emerging |
| 1261 |
FedericaPaoli1/stm32-speech-recognition-and-traduction
stm32-speech-recognition-and-traduction is a project developed for the... |
|
Emerging |
| 1262 |
kaituoxu/Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End... |
|
Emerging |
| 1263 |
GlobalTechInfo/gspeak
Google Text to Speech for Node.js — modern, typed, zero deprecated dependencies. |
|
Emerging |
| 1264 |
bnsantoso/sub-to-audio
Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS... |
|
Emerging |
| 1265 |
satyam9090/Automatic-Indian-Sign-Language-Translator-ISL
I created an application which takes in live speech or audio recording as... |
|
Emerging |
| 1266 |
awslabs/speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) |
|
Emerging |
| 1267 |
fcjr/ltts
Quick CLI for local text-to-speech using Qwen3-TTS or Kokoro TTS. |
|
Emerging |
| 1268 |
holgern/kokorog2p
A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS. |
|
Emerging |
| 1269 |
kosich/rxjs-tts
RxJS wrapper for Text-to-Speech Web API |
|
Emerging |
| 1270 |
zh217/torch-asg
Auto Segmentation Criterion (ASG) implemented in pytorch |
|
Emerging |
| 1271 |
Nighthawk42/mOrpheus
Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant. |
|
Emerging |
| 1272 |
ritazh/EchoML
🔉 A web app to play, visualize, and annotate your audio files for machine learning |
|
Emerging |
| 1273 |
deepgram-starters/flask-text-to-speech
Get started using Deepgram's Text-to-Speech with this Flask demo app |
|
Emerging |
| 1274 |
DangerDaza/Dooms-Enhancement-Suite
An immersive RPG enhancement extension for SillyTavern — character tracking,... |
|
Emerging |
| 1275 |
lokkelvin2/tacotron2-tts-GUI
Text To Speech (TTS) GUI wrapper for NVIDIA Tacotron 2+Waveglow. For custom... |
|
Emerging |
| 1276 |
shahules786/mayavoz
Pytorch based speech enhancement toolkit. |
|
Emerging |
| 1277 |
tianbot/rosecho
Tianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用 |
|
Emerging |
| 1278 |
weimeng23/speech-recognition-learning-resources
:white_check_mark: A list of speech recognition learning resources including... |
|
Emerging |
| 1279 |
nikhilunni/demucs-rs
Rust powered waveform source separation |
|
Emerging |
| 1280 |
deepgram-starters/flask-voice-agent
Flask WebSocket proxy for Deepgram's Voice Agent API |
|
Emerging |
| 1281 |
smx-smx/KodiSharp
Use Kodi python APIs in C#, and write rich addons using the .NET framework/Mono |
|
Emerging |
| 1282 |
PhilippeRo/IBus-Speech-To-Text
A speech to text IBus engine using VOSK |
|
Emerging |
| 1283 |
sl5net/SL5-aura-service
Your offline, privacy-first voice assistant framework. Transform speech into... |
|
Emerging |
| 1284 |
maum-ai/wavegrad2
Unofficial Pytorch Implementation of WaveGrad2 |
|
Emerging |
| 1285 |
oscie57/tiktok-voice
Simple Python script to interact with the TikTok TTS API |
|
Emerging |
| 1286 |
EndlessReform/fish-speech.rs
A Fish Speech implementation in Rust, with Candle.rs |
|
Emerging |
| 1287 |
qforge-dev/qspeak
qSpeak is a powerful voice transcription and AI assistant tool that helps... |
|
Emerging |
| 1288 |
robmsmt/ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR... |
|
Emerging |
| 1289 |
loretoparisi/wave2vec-recognize-docker
Wave2vec 2.0 Recognize pipeline |
|
Emerging |
| 1290 |
FaceOnLive/Spleeter-Android-iOS
On-device, Offline Spleeter Solution For Mobile |
|
Emerging |
| 1291 |
Igorcbraz/Calculadora
📐 Calculadora simples e intuitiva com suporte a comandos de voz e temas... |
|
Emerging |
| 1292 |
definitio/ha-rhvoice
Home Assistant integration for RHVoice - a local text-to-speech engine. |
|
Emerging |
| 1293 |
DrewThomasson/ebook2audiobookpiper-tts
Converts ebooks into audiobooks with piper-tts |
|
Emerging |
| 1294 |
tugstugi/mongolian-speech-recognition
Mongolian speech recognition with PyTorch |
|
Emerging |
| 1295 |
carleeno/elevenlabs_tts
Custom TTS Integration using ElevenLabs API |
|
Emerging |
| 1296 |
sunshine0523/MNNServer
A third-party MNN server supporting external calls, embedding model, TTS... |
|
Emerging |
| 1297 |
keonlee9420/Robust_Fine_Grained_Prosody_Control
PyTorch Implementation of Robust and fine-grained prosody control of... |
|
Emerging |
| 1298 |
QiBowen2008/SuperTextToolBox
一个免费的文字处理工具箱 |
|
Emerging |
| 1299 |
coqui-ai/STT-models
Open models for Coqui STT |
|
Emerging |
| 1300 |
jing332/tts-server-android
这是一个Android系统TTS应用,内置微软演示接口,可自定义HTTP请求,可导入其他本地TTS引擎,以及根据中文双引号的简单旁白/对话识别朗读... |
|
Emerging |