All Voice AI Tools
6,981 tools ranked by quality score · Page 46 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 4501 |
hyqzz/ICodeStar-text2speech-mp3
Simple Python tool to convert text to speech (TTS) and save as MP3 files.... |
|
Experimental |
| 4502 |
LetterLiGo/Inaudible-Adversarial-Perturbation-Vrifle
[NDSS'24] Inaudible Adversarial Perturbation: Manipulating the Recognition... |
|
Experimental |
| 4503 |
kauer3/Slang-Text-to-Speech
💻🔊 A chrome extension that converts text on the web to speech. This was my... |
|
Experimental |
| 4504 |
bryanstevensacosta/tts-studio
Personal voice cloning CLI tool using XTTS-v2 |
|
Experimental |
| 4505 |
yotsuda/Speech
PowerShell modules for text-to-speech (TTS) and speech-to-text (STT) across... |
|
Experimental |
| 4506 |
mcp-tool-shop-org/soundboard-maui
Cross-platform .NET MAUI desktop client for the Sound Board voice engine. |
|
Experimental |
| 4507 |
chychen/srt_to_tts
use pysrt to parse the time in .srt file, and then call google cloud... |
|
Experimental |
| 4508 |
godspirit00/ListeningTestAudioMaker
一个可以帮助您快速制作外语考试中听力部分的音频的工具。 / A tool that helps you quickly generate... |
|
Experimental |
| 4509 |
monocasual/vocoder
Probably one of the best text-to-speech online apps in the world (if your... |
|
Experimental |
| 4510 |
rishikksh20/voxtral-codec-pytoch
Voxtral Codec : Combining Semantic VQ and Acoustic FSQ for Ultra-Low Bitrate... |
|
Experimental |
| 4511 |
partigabor/read-aloud
A rudimentary text-to-speech engine for reading PDF files aloud in English |
|
Experimental |
| 4512 |
axynos/STARK
CSGO Audio File playback and Text-to-Speech |
|
Experimental |
| 4513 |
ChrisBrooksbank/Vox
Open-source screen reader for Windows 11 — built in C#/.NET 9 with UI... |
|
Experimental |
| 4514 |
mcp-tool-shop-org/avatar-face-mvp
Real-time VRM avatar lipsync MVP — Godot 4 + FFT visemes + OpenSeeFace |
|
Experimental |
| 4515 |
RonanDavalan/PiperRead
Privacy-First Neural Text-to-Speech for Linux (Wayland & X11). |
|
Experimental |
| 4516 |
haya256/random-read-in-computer-voice-interval-cli
テキストファイルからランダムに1行を選び、一定間隔でmacOSの音声で読み上げる学習用CLIツール |
|
Experimental |
| 4517 |
hiansit/ankiflow
ブラウザで動く汎用暗記カードアプリ「AnkiFlow」。自動読み上げ(TTS)機能を搭載し、画面を見ない「聞き流し学習」にも対応しています。 |
|
Experimental |
| 4518 |
neon-aiart/spitch-omakase-connect
Setup VOICEVOX & RVC on Google Colab. / GoogleColabでVOICEVOXとRVCの環境構築 |
|
Experimental |
| 4519 |
voothi/20231001193911-tts
A small collection of helper scripts for working with Google Text-to-Speech... |
|
Experimental |
| 4520 |
AmourWaltz/BayesLMs
Project of IEEE/ACM TASLP “Bayesian Neural Network Language Modeling for... |
|
Experimental |
| 4521 |
cmaroti/speech_recognition
Convolutional Neural Network for Speech Recognition, implemented in Ms. Pacman game |
|
Experimental |
| 4522 |
ssharanyab/persona-tts
PersonaTTS is a personalized neural text-to-speech system that learns a... |
|
Experimental |
| 4523 |
npuichigo/grpc_gateway_demo
Audio streaming transfer demo with google.api.HttpBody and grpc gateway for... |
|
Experimental |
| 4524 |
dylanbretzjr/anki-kokoro-tts
Generates audio for Anki flashcards using the Kokoro TTS engine with direct... |
|
Experimental |
| 4525 |
Pchambet/tp-hmm-markov
Markov Chains and Hidden Markov Models: weather modeling with discrete... |
|
Experimental |
| 4526 |
Muthu-Mkode/audify
An asynchronous Python desktop application that extracts text from PDFs and... |
|
Experimental |
| 4527 |
0x0501/Apora
Anki plugin for using Apora platform. |
|
Experimental |
| 4528 |
marcusau2/VOX-1-Audiobook-Maker
VOX-1 Audiobook Maker is a local, GPU-accelerated studio for creating... |
|
Experimental |
| 4529 |
DevBytAmir/vocaudio
CLI tool to generate spoken vocabulary study audio from a JSON deck.... |
|
Experimental |
| 4530 |
rudil24/pdf-audio-reader
Javascript leveraging browser-native Web Speech API to convert any PDF to... |
|
Experimental |
| 4531 |
JaesungHuh/look-listen-recognise
Dataset page for Look, Listen and Recognise : character-aware audio-visual... |
|
Experimental |
| 4532 |
vgarciasc/tts2pnp
Python tool that transforms Tabletop Simulator internal pictures into... |
|
Experimental |
| 4533 |
sseanik/google-home-task-relay
Voice-guided task routines for Google Home using Google Assistant |
|
Experimental |
| 4534 |
saxil/mareen
Mareen - A privacy-focused voice assistant with 3D orb UI, powered by Ollama... |
|
Experimental |
| 4535 |
NihaalNO/Voxis
A private, offline-capable Voice AI Assistant that runs locally on your... |
|
Experimental |
| 4536 |
enrelu/AITranslator
Gemini-powered Chrome extension for smart translations. Features... |
|
Experimental |
| 4537 |
lxaw/GoogleSubtitleGenerator
Using GoogleTranslate to generate automatic subtitles of videos. |
|
Experimental |
| 4538 |
MushroomFleet/QwenTTS-UI
Qwen 3 TTS web UI | https://www.scuffedepoch.com | https://www.oragenai.com... |
|
Experimental |
| 4539 |
EN10/SimpleSpeech
Simple Audio Recognition |
|
Experimental |
| 4540 |
sknadig/ASR_2018_T01
Example repository for 2018 DS/NC 821 / Automatic Speech Recognition projects |
|
Experimental |
| 4541 |
kdelmotte/Mumble
A simple, fast and free speech to text app running on OpenAI's Whisper Large v3 |
|
Experimental |
| 4542 |
parvatijay2901/Hindi-ASR-and-TTS
EC499: Major Project |
|
Experimental |
| 4543 |
Thijsn04/MediClear-AI
An intelligent medical translator powered by Google Gemini 2.5. Simplifies... |
|
Experimental |
| 4544 |
princesingh-ai-dev/JARVIS-Voice-Assistant
🤖 AI-powered voice assistant with Whisper STT, Groq LLM, real-time TTS,... |
|
Experimental |
| 4545 |
jcsilva/asr-benchmark
Benchmark of industrial Speech Recognition systems for Brazilian Portuguese |
|
Experimental |
| 4546 |
pyromage/lazy-podinator-public
Create your own AI generated daily summary podcasts from news feeds |
|
Experimental |
| 4547 |
powerpig99/readaloud
Local-first text-to-speech reader powered by Qwen3-TTS. 9 voices, 10... |
|
Experimental |
| 4548 |
vpakarinen2/omnilocal
Local voice-enabled assistant. |
|
Experimental |
| 4549 |
KaMeLoTmArMoT/Qwen_TTS_Api
FastAPI wrapper for Qwen3-TTS CustomVoice: generate chapter WAV from... |
|
Experimental |
| 4550 |
Sundy1219/RNNLM
Using RNNLM rescoring a sentence in Chinese ASR system |
|
Experimental |
| 4551 |
diogobr90/AI-Narrator
A global, lightweight Text-to-Speech engine using the Kokoro model with... |
|
Experimental |
| 4552 |
lhfer/video-dub-studio
Convert YouTube/local videos into multilingual dubbed audio with Qwen ASR +... |
|
Experimental |
| 4553 |
al3xsus/AI-powered-waste-sorting-station
This is a concept for an AI-powered waste sorting station, that helps people... |
|
Experimental |
| 4554 |
FardinHash/TTS-Node
The TTS Engine is a sophisticated web-based platform designed to transform... |
|
Experimental |
| 4555 |
egorsmkv/ukrainian-tts-datasets
🇺🇦 Open Source Ukrainian Text-to-Speech datasets |
|
Experimental |
| 4556 |
moto-pu/claude-code-voicevox-notify
Claude Code hooks for VOICEVOX voice notifications on task completion and... |
|
Experimental |
| 4557 |
lvecsey/pushup1000
Perform timed pushups as a fitness routine, with text to speech. |
|
Experimental |
| 4558 |
Mx0M/speech-to-text-rust
A high-performance speech-to-text CLI tool written in Rust, powered by... |
|
Experimental |
| 4559 |
mcp-tool-shop-org/soundboard-plugin
Give Claude Code a voice. TTS plugin with emotion-aware speech,... |
|
Experimental |
| 4560 |
hecko-yes/tts-dataset-prompts
Finally, some decent sample sentences |
|
Experimental |
| 4561 |
darwinva97/yarvis-android
Asistente de voz para Android con reconocimiento de voz continuo,... |
|
Experimental |
| 4562 |
azandabot/asizwe-ai
Real-time AI-powered translation for vernacular, slang, and regional... |
|
Experimental |
| 4563 |
NawrizTurjo/Agri-Smart-BD
Empowering Bangladesh farmers with AI-driven price forecasts, market... |
|
Experimental |
| 4564 |
laustke/jimlet_classic
Offline text-to-speech GUI converter with drag-and-drop support,... |
|
Experimental |
| 4565 |
giriaryan694-a11y/Paste2Listen
A simple, privacy-friendly tool to convert text into speech. Instead of... |
|
Experimental |
| 4566 |
DarthJahus/azure-simple-tts
Simple Text-to-Speech web interface. |
|
Experimental |
| 4567 |
icosane/hyacinthia
Simple graphical front‑end for F5‑TTS |
|
Experimental |
| 4568 |
JuanJRA20/Conversor-Texto-a-Voz
🎙️ Sistema inteligente de conversión de texto a audio con detección... |
|
Experimental |
| 4569 |
skye-cyber/ttskit3
A lightweight text to speeach toolkit |
|
Experimental |
| 4570 |
ANVEAI/voice-ai-resources
A curated collection of voice AI tools, libraries, datasets, and learning resources |
|
Experimental |
| 4571 |
ANVEAI/open-source-voice-ai
Open source voice AI tools, models, and libraries for speech recognition and... |
|
Experimental |
| 4572 |
hongkongkiwi/action-elevenlabs-cli
GitHub Action for ElevenLabs CLI: TTS, STT, voice, knowledge, and usage operations. |
|
Experimental |
| 4573 |
deepgram-starters/cpp-text-to-speech
Get started using Deepgram's Text-to-Speech with this C++ demo app |
|
Experimental |
| 4574 |
zippyclawdbot-lab/zippy-voice
🎤 Voice-to-voice PWA for Clawdbot — talk to your AI assistant hands-free,... |
|
Experimental |
| 4575 |
R1ckShi/SeACo-Paraformer
[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer. |
|
Experimental |
| 4576 |
neosapience/typecast-skills
The official Typecast Claude Skils. |
|
Experimental |
| 4577 |
Echoshard/AudiobookStudio
Desktop app for PocketTTS with voice cloning audiobook creation,... |
|
Experimental |
| 4578 |
aashish-joshi/tts-bulk
Tool for generating TTS files in bulk. |
|
Experimental |
| 4579 |
shrey802/PyTTSeval
Evaluation tool for TTS systems |
|
Experimental |
| 4580 |
StuMason/claude-tts
Text-to-speech for AI coding assistants. Give your AI a voice with emotional... |
|
Experimental |
| 4581 |
deepgram-starters/rust-text-to-speech
Get started using Deepgram's Text-to-Speech with this Rust demo app |
|
Experimental |
| 4582 |
001kenji/Text_To_Speech_AI
A modern web application that converts text to speech using advanced TTS... |
|
Experimental |
| 4583 |
patelritiq/CodeClause-Internship-Projects
A comprehensive collection of 4 Python applications developed during a... |
|
Experimental |
| 4584 |
YChenL/UniVR
An official implement of "UniVR: A Unified Framework for Pitch-Shifted Voice... |
|
Experimental |
| 4585 |
hubetcardenasi/SpeechApp
Convertir tu celular en una aplicación de voz. |
|
Experimental |
| 4586 |
sujitpanda/Google-Cloud-Speech-API
Google Cloud Speech API Android Project Demo |
|
Experimental |
| 4587 |
icantc0de1/Qwen3-TTS-FastAPI
An OpenAI-compatible Text-to-Speech (TTS) API server for the Qwen3-TTS model series. |
|
Experimental |
| 4588 |
martinp95/meeting-transcriber
AI-powered meeting transcription tool that converts audio and video files... |
|
Experimental |
| 4589 |
nl8590687/asrt-sdk-go
ASRT Speech Recognition SDK for Golang. 用于ASRT语音识别系统的Golang SDK |
|
Experimental |
| 4590 |
pyzskw/meeting-teleprompter
线上会议提词器 - 语音识别自动跟读、防截屏、专注模式、离线模型 | Meeting Teleprompter with offline ASR |
|
Experimental |
| 4591 |
Zhennor/Multimodal-Video-Retrieval-Engine-with-Vision-and-Text
A video search engine combining OCR, ASR, CLIP, Image Captioning, Object &... |
|
Experimental |
| 4592 |
yaya-sy/speechscorer
unsupervised spoken utterances scoring |
|
Experimental |
| 4593 |
NickBouwhuis/QwenTTS
AI-powered text-to-speech for macOS with voice design and voice cloning.... |
|
Experimental |
| 4594 |
rahulm-28/celebrity-voice-panel-qwen3-tts
AI voice cloning panel that generates multi-speaker discussions between... |
|
Experimental |
| 4595 |
ttsaigit/tts-ios
TTS.ai iOS app — 18 AI text-to-speech models, voice cloning, speech-to-text |
|
Experimental |
| 4596 |
Ploscha/Awesome-Audio-Generation
Awesome-Audio-Generation is a collection of resources for Text-to-Audio... |
|
Experimental |
| 4597 |
danielrosehill/Speech-To-Text-System-Prompt-Library
An updated skeleton library of system prompts for using LLMs to refine STT output |
|
Experimental |
| 4598 |
SysAdminDoc/Qwen3-TTS-Studio
Install and create TTS with AI voice generator powered by Alibaba's Qwen3-TTS. |
|
Experimental |
| 4599 |
surpoloyang/Audio-Chatbot
Intelligent Voice Interaction System Project |
|
Experimental |
| 4600 |
MrKruemel/VoicePaste
Voice-controlled transcription, AI summarization, and paste — triggered by... |
|
Experimental |