All Voice AI Tools
6,981 tools ranked by quality score · Page 16 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 1501 |
holgern/pykokoro
A Python library for Kokoro TTS (Text-to-Speech) using ONNX runtime. |
|
Emerging |
| 1502 |
wannaphong/KhanomTan-TTS-v1.0
KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that... |
|
Emerging |
| 1503 |
deepgram-starters/django-transcription
Get started using Deepgram's Transcription with this Django demo app |
|
Emerging |
| 1504 |
p-groarke/wsay
Windows "say" |
|
Emerging |
| 1505 |
ibotplus/kbase-media
视频、音频、图片内容识别、语音转写、语音合成 / easy convert video audio image to text, and revert... |
|
Emerging |
| 1506 |
tochilkinva/tg_bot_stt_tts
Telegram bot with voice message recognition and generation. Speech to Text... |
|
Emerging |
| 1507 |
wdbm/deep_throat
speech synthesis program |
|
Emerging |
| 1508 |
wxkingstar/TransEcho
macOS 实时同声传译 - 捕获系统音频,实时翻译字幕 + 语音同传 | Real-time simultaneous interpretation for macOS |
|
Emerging |
| 1509 |
inevolin/DiscordSpeechBot
A speech-to-text bot for discord with music commands and more using NodeJS.... |
|
Emerging |
| 1510 |
CarrotYuan/openclaw-voice-control
A macOS local voice-control companion for OpenClaw with Siri-like wakeword... |
|
Emerging |
| 1511 |
aeleraqi/Text-to-Speech-gTTS---Arabic-text
Google Text-to-Speech API to convert text input into audio files |
|
Emerging |
| 1512 |
34j/mecab-text-cleaner
Simple Python package (CLI/Python API) for getting japanese readings... |
|
Emerging |
| 1513 |
sciforce/phones-las
Articulatory features estimation using Listen Attend and Spell architecture. |
|
Emerging |
| 1514 |
manhph2211/ViSR
This repo builds an end-to-end deep learning application that supports... |
|
Emerging |
| 1515 |
Troyanovsky/awesome-TTS-Colab
Collection of awesome TTS and voice cloning models to run with Google Colab |
|
Emerging |
| 1516 |
Kyubyong/specAugment
Tensor2tensor experiment with SpecAugment |
|
Emerging |
| 1517 |
shijincai/VibeVoice
Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup... |
|
Emerging |
| 1518 |
BlinkTagInc/gtfs-tts
Review GTFS stop pronunciations to determine which stops need a tts_stop_name value. |
|
Emerging |
| 1519 |
Dostoyewski/django_voice_bot
Package for django onpage support bot with speech recognition and voice commands |
|
Emerging |
| 1520 |
falabrasil/kaldi-br
☕🇧🇷 Scripts para o Kaldi em Português Brasileiro |
|
Emerging |
| 1521 |
ng-web-apis/speech
A library for using Web Speech API with Angular |
|
Emerging |
| 1522 |
IceFog72/pocket-tts-openapi
Fast, local, OpenAI-compatible TTS server with voice cloning support powered... |
|
Emerging |
| 1523 |
linagora-labs/ssak
SSAK contains helpers and tools to process data and train/infer ASR models. |
|
Emerging |
| 1524 |
naeruru/mimiuchi
a free, customizable, osc capable speech-to-text interface for relaying text... |
|
Emerging |
| 1525 |
sskorol/vosk-api-gpu
Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC |
|
Emerging |
| 1526 |
sexfrance/RecaptchaV2-Solver
A Python-based solution for solving Google's reCAPTCHA v2 challenges... |
|
Emerging |
| 1527 |
DrDroidLab/voicesummary
Open Source AI Database for Voice Agent Transcripts | Call Analysis &... |
|
Emerging |
| 1528 |
leduckhai/wav2graph
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech |
|
Emerging |
| 1529 |
QuantiusBenignus/BlahST
Input text from speech in any Linux window, the lean, fast and accurate way,... |
|
Emerging |
| 1530 |
noco-ai/spellbook-docker
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many... |
|
Emerging |
| 1531 |
husniadil/cc-hooks
Audio feedback plugin for Claude Code with TTS announcements, sound effects,... |
|
Emerging |
| 1532 |
HawkAaron/E2E-ASR
PyTorch Implementations for End-to-End Automatic Speech Recognition |
|
Emerging |
| 1533 |
rxlabz/sytody
a Flutter "speech to todo" app example |
|
Emerging |
| 1534 |
keenresearch/KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING:... |
|
Emerging |
| 1535 |
HawkAaron/RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction... |
|
Emerging |
| 1536 |
kroko-ai/kroko-onnx
Kroko ASR - Speech-to-text |
|
Emerging |
| 1537 |
CMsmartvoice/One-Shot-Voice-Cloning
:relaxed: One Shot Voice Cloning base on Unet-TTS |
|
Emerging |
| 1538 |
ybouhjira/claude-code-tts
🔊 Text-to-Speech MCP plugin for Claude Code - hear audio feedback while... |
|
Emerging |
| 1539 |
rishikksh20/UnivNet-pytorch
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators... |
|
Emerging |
| 1540 |
binzhouchn/masr
中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。 |
|
Emerging |
| 1541 |
persiandataset/PersianSpeech
Persian ASR dataset |
|
Emerging |
| 1542 |
stevenhillis/awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs |
|
Emerging |
| 1543 |
seven-io/home-assistant
HACS supporting Home Assistant integration for seven |
|
Emerging |
| 1544 |
thinh-vu/ur_audio_sub
Generate text captions for audio files & youtube video using OpenAI Whisper... |
|
Emerging |
| 1545 |
talin190/Qwen3-TTS-Daggr-UI
🎤 Create dynamic voice experiences with Qwen3-TTS-Daggr-UI, a Gradio app for... |
|
Emerging |
| 1546 |
fqueis/pollinationsai
🔥 TypeScript SDK wrapper for Pollinations AI services |
|
Emerging |
| 1547 |
Bunlong/react-webspeech
The official WebSpeech for React. |
|
Emerging |
| 1548 |
habla-liaa/ser-with-w2v2
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from... |
|
Emerging |
| 1549 |
hypeapps/black-mirror
A voice controlled smart mirror powered by Raspberry Pi3 and AndroidThings. |
|
Emerging |
| 1550 |
LetsPlayNow/Speech_AI
Speech to speech bot built with Python |
|
Emerging |
| 1551 |
aks-devs/mod_openai_asr
Freeswitch Speech-To-Text module |
|
Emerging |
| 1552 |
j3soon/speech-to-windows-input
Perform speech-to-text (STT/ASR) with Azure speech service and simulate... |
|
Emerging |
| 1553 |
audioku/cross-accent-maml-asr
Meta-learning model agnostic (MAML) implementation for cross-accented ASR |
|
Emerging |
| 1554 |
botbahlul/vosk_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using... |
|
Emerging |
| 1555 |
tmanderson/ivona-node
Ivona Cloud (via Amazon services) client library for Node |
|
Emerging |
| 1556 |
yzfly/awesome-voice-agents
A curated list of voice AI agent frameworks, tools, resources, and best practices |
|
Emerging |
| 1557 |
hcy71o/MB-iSTFT-VITS-with-AutoVocoder
Incorporating AutoVocoder to MB-iSTFT-VITS |
|
Emerging |
| 1558 |
jaywcjlove/TextSoundSaver
Using the TextSoundSaver application, you can convert text into realistic... |
|
Emerging |
| 1559 |
shi-gg/Auditional-Text
The source code of the Auditional Text discord Boat |
|
Emerging |
| 1560 |
hcoles/voices
Fast, in-process text to speech for Java |
|
Emerging |
| 1561 |
mrf345/flask_gtts
A Flask extension to add gTTS Google text to speech |
|
Emerging |
| 1562 |
jianchang512/chatterbox-api
一个基于 Chatterbox-TTS的文字转语音(TTS)服务。提供与 OpenAI TTS 兼容的 API 接口并支持声音克隆,附带简洁的 Web 用户界面。 |
|
Emerging |
| 1563 |
MartinMashalov/VoiceCloning
Generative voice cloning model using TTS synthesis with state-of-the-art... |
|
Emerging |
| 1564 |
johnGettings/LIHQ
Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter) |
|
Emerging |
| 1565 |
ismailperim/reportcast
Transform reports into podcasts with AI - Nobody reads your reports. But... |
|
Emerging |
| 1566 |
blip-radar/vatsim-parser
Parser for a variety of VATSIM-related file formats |
|
Emerging |
| 1567 |
VoXera/VoXera
An Open-Source Persian Language Techs Toolkit with Python |
|
Emerging |
| 1568 |
outspeed-ai/voice-devtools
Developer tools to debug and build realtime voice agents. Supports multiple models. |
|
Emerging |
| 1569 |
wangz-code/legado-edge-tts
edge大声朗读微软TTS服务, 在阅读legado中配置语音引擎方式收听微软TTS / Edge大声朗读, 如果没有 vps 部署可以看看阅读内置... |
|
Emerging |
| 1570 |
ORI-Muchim/PolyLangVITS
Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH) |
|
Emerging |
| 1571 |
gianpaj/sexyvoice
Voice Cloning, Voice Call and Text to Speech platform. Perfect for content... |
|
Emerging |
| 1572 |
rishikksh20/iSTFT-Avocodo-pytorch
Ultrafast GAN based Vocoder for Text to Speech |
|
Emerging |
| 1573 |
kaloprojects/KALO-ESP32-Voice-Chat-AI-Friends
ESP32-based voice device for chatting with multiple custom AI bots.... |
|
Emerging |
| 1574 |
JstnMcBrd/dectalk-tts
API wrapper for the Dectalk TTS system |
|
Emerging |
| 1575 |
thewh1teagle/piper-onnx
Use piper TTS with onnxruntime |
|
Emerging |
| 1576 |
verbio-technologies/python-verbio-speech-center
Python integration with the Verbio Speech Center Cloud.... |
|
Emerging |
| 1577 |
HordRicJr/HordVoice
HordVoice - AI-powered voice assistant built with Flutter and Azure AI... |
|
Emerging |
| 1578 |
SpenserCai/cosyvoice3.rs
Python bindings for CosyVoice3 TTS using Candle. Has the characteristics of... |
|
Emerging |
| 1579 |
Sundy1219/eesen-for-thchs30
ASR for Chinese Mandarin |
|
Emerging |
| 1580 |
hipnologo/EchoForge_Studio
Multi-LLM writing and voice production workspace built with Streamlit. |
|
Emerging |
| 1581 |
khakers/go-subgen
Automatically generate subtitles for your media using whisper.cpp via... |
|
Emerging |
| 1582 |
alamparelli/mcp-claude-say
Voice interaction for Claude Code - Talk to Claude and hear responses using... |
|
Emerging |
| 1583 |
bookbot-kids/speech-recognizer-bahasa-indonesian
A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer... |
|
Emerging |
| 1584 |
hhguo/SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications |
|
Emerging |
| 1585 |
mattzzz/rick-voice
Give any bot the voice of Rick Sanchez |
|
Emerging |
| 1586 |
AlexxIT/FasterWhisper
Faster Whisper for Home Assistant - custom integration with a local... |
|
Emerging |
| 1587 |
sljavi/handsfree-for-web-zoom-module
Zoom module implementation for Handsfree for web |
|
Emerging |
| 1588 |
zassou65535/VITS
VITSによるテキスト読み上げ器&ボイスチェンジャー |
|
Emerging |
| 1589 |
NotAbhinavGamerz/emotion-aware-automatic-speech-recognition
🎤 Enhance speech recognition by detecting emotions in spoken language,... |
|
Emerging |
| 1590 |
zabir-nabil/bangla-tts
Bangla text to speech, Multilingual (Bangla, English) real-time speech... |
|
Emerging |
| 1591 |
mravanelli/pySpeechRev
This python code performs an efficient speech reverberation starting from a... |
|
Emerging |
| 1592 |
tuanh123789/AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for... |
|
Emerging |
| 1593 |
OpenASR/idiolect
🎙️ Handsfree Audio Development Interface |
|
Emerging |
| 1594 |
anyvoiceai/Barkify
Barkify: an unoffical training implementation of Bark TTS by suno-ai |
|
Emerging |
| 1595 |
e-c-k-e-r/vall-e
An unofficial PyTorch implementation of VALL-E |
|
Emerging |
| 1596 |
XilinJia/Podcini
Open source podcast instrument for Android supporting contents from YouTube... |
|
Emerging |
| 1597 |
soniqo/speech-android
On-device speech SDK for Android — ASR, TTS, VAD, and noise cancellation... |
|
Emerging |
| 1598 |
erogol/FFTNet
FFTNet vocoder implementation |
|
Emerging |
| 1599 |
GinoShun/Accent-Activation-Steering
Official code for "Activation Steering for Accent Adaptation in Speech... |
|
Emerging |
| 1600 |
zolomohan/speech-recognition-in-javascript
Final Code for Speech Recognition in JavaScript tutorial. |
|
Emerging |