Trending Voice AI Tools
Tools with the biggest quality score improvements over the last 8 days.
| # | Tool | Change | Score | Tier |
|---|---|---|---|---|
| 1 |
holgern/kokorog2p
A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS. |
+18 | 40 | Emerging |
| 2 |
holgern/pykokoro
A Python library for Kokoro TTS (Text-to-Speech) using ONNX runtime. |
+17 | 38 | Emerging |
| 3 |
GlobalTechInfo/gspeak
Google Text to Speech for Node.js — modern, typed, zero deprecated dependencies. |
+17 | 40 | Emerging |
| 4 |
atharva-again/indic-asr-onnx
Helper package for using quantized versions of the Indic ASR Model by AI4Bharat. |
+16 | 33 | Emerging |
| 5 |
codyw912/open-asr-server
OpenAI-compatible ASR server with pluggable local backends (Parakeet,... |
+16 | 40 | Emerging |
| 6 |
Gautham495/react-native-speech-recognition-kit
React Native Turbo Module to access Speech Recognition in Android & iOS |
+15 | 49 | Emerging |
| 7 |
PraaneshSelvaraj/speech_engine
Speech Engine is a Python package that provides a simple interface for... |
+15 | 45 | Emerging |
| 8 |
robmsmt/CommonCorrections
Easily fix common corrections in speech! |
+15 | 27 | Experimental |
| 9 |
rapidaai/rapida-python
Open-source Python SDK for real-time Voice AI, voice agents, streaming... |
+15 | 30 | Emerging |
| 10 |
OpenVoiceOS/ovos-tts-plugin-espeakNG
espeakNG plugin |
+15 | 52 | Established |
| 11 |
pystorage/pyspeechkit
Library for working with a range of technologies for speech recognition and... |
+14 | 24 | Experimental |
| 12 |
David-Antolick/REX_voice_assistant
Lightweight offline voice assistant for hands-free music control (YouTube... |
+14 | 26 | Experimental |
| 13 |
nikkoxgonzales/streaming-tts
A streamlined, Kokoro-based text-to-speech library with streaming support. |
+14 | 22 | Experimental |
| 14 |
stefantaubert/pronunciation-dictionary-utils
Utils to modify pronunciation dictionaries. |
+14 | 36 | Emerging |
| 15 |
neosapience/n8n-nodes-typecast
Integrate Typecast AI TTS into your n8n workflows with this community node. |
+14 | 34 | Emerging |
| 16 |
oovz/expo-edge-speech
Microsoft Edge text-to-speech for Expo and React Native |
+14 | 26 | Experimental |
| 17 |
twangodev/speak-mintlify
Automatically generate voice narration for your Mintlify documentation. |
+14 | 38 | Emerging |
| 18 |
JstnMcBrd/dectalk-tts
API wrapper for the Dectalk TTS system |
+14 | 37 | Emerging |
| 19 |
holgern/ttsforge
Convert EPUB files to audiobooks using Kokoro ONNX TTS |
+14 | 34 | Emerging |
| 20 |
sebastienrousseau/akande
An innovative, open-source voice assistant powered by OpenAI's GPT-3,... |
+13 | 35 | Emerging |
| 21 |
funnyzak/aliyun-nls
阿里云智能语音处理 Node 模块。 |
+13 | 24 | Experimental |
| 22 |
LG-1/audio2text
Ease of use for Speech to Text |
+13 | 23 | Experimental |
| 23 |
nfreear/simple-speak
Power-tool wrapper around the browser Web Speech API — |
+13 | 15 | Experimental |
| 24 |
nodef/extra-tts
Generate speech audio from super long text through machine. |
+13 | 25 | Experimental |
| 25 |
KillovSky/gTTS
Repositório do módulo de geração de texto para fala Google, gTTS. |
+12 | 22 | Experimental |
| 26 |
thaispalmer/talkify-tts-api
Library to generate TTS directly from Talkify.net APIs |
+12 | 23 | Experimental |
| 27 |
alttch/ttsbroker
Simple TTS (Text-To-Speech) broker for Python |
+12 | 22 | Experimental |
| 28 |
jhermann/kopfkino
Syntactic sugar sprinkled on top of MoviePy and AI components to allow... |
+12 | 34 | Emerging |
| 29 |
HachiroSan/google-pronouncer
🔊 Download pronunciation audio files from Google's dictionary service.... |
+12 | 36 | Emerging |
| 30 |
lmk123/cvox
Get spoken alerts when Claude Code needs permission or finishes a task — so... |
+12 | 25 | Experimental |
| 31 |
OnesAndZer0s/node-dectalk
Node.js module that provides bindings for the DecTalk Text-To-Speech library |
+12 | 15 | Experimental |
| 32 |
saurabhdaware/bol
Slightly more consistent Text-to-speech for Web and a wrapper around speechSynthesis |
+12 | 36 | Emerging |
| 33 |
buddheshwarnath/blurtpy
Offline, cross-platform Python text-to-speech and sound notifications.... |
+12 | 24 | Experimental |
| 34 |
vani-voice/vani
Open protocol & middleware for Indian language voice agents — STT→LLM→TTS in... |
+12 | 32 | Emerging |
| 35 |
kaiaai/kaia.js
Kaia.ai platform's JS client library |
+12 | 34 | Emerging |
| 36 |
sljavi/handsfree-for-web-control-speech-recognition-module
Handsfree for Web module useful to ask for start or stop listening for voice commands |
+12 | 35 | Emerging |
| 37 |
vkosuri/dialogflow-lite
[Maintainer Required] A light-weight python library REST agent for Dialogflow |
+12 | 36 | Emerging |
| 38 |
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs,... |
+12 | 90 | Verified |
| 39 |
far-analytics/dialog
A modular framework for building VoIP-Agent applications. |
+11 | 31 | Emerging |
| 40 |
erich2s/native-speak
A simple text-to-speech library using system native tts engines for Node.js |
+11 | 14 | Experimental |
| 41 |
OpenVoiceOS/ovos-tts-plugin-cotovia
galician tts plugin for OVOS |
+11 | 45 | Emerging |
| 42 |
BattlefieldDuck/HTML-Speaker
🔈 A custom html element makes Text-To-Speech function easier to use on your... |
+11 | 22 | Experimental |
| 43 |
maxpatiiuk/text-hoarder
A browser extension for Google Chrome. Provides reader view, saving articles... |
+11 | 35 | Emerging |
| 44 |
Gaurav890/vocal-stack
vocal-stack is a high-performance utility library for developers building... |
+11 | 32 | Emerging |
| 45 |
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance. |
+11 | 69 | Established |
| 46 |
filippo-fonseca/durat
💬 A JS/TS framework for opening the possibilities for what you can do with text. |
+11 | 21 | Experimental |
| 47 |
Sec-ant/etts
edge-tts in Bun. |
+11 | 15 | Experimental |
| 48 |
oleglegun/polly-ru-ssml
Enhance AWS Polly TTS pronunciation for english words within russian text |
+10 | 20 | Experimental |
| 49 |
Vicopem01/srttossml
Using AWS Polly requires SSML files for a better optimised text to speech... |
+10 | 25 | Experimental |
| 50 |
18566246732/tts-player
a cross-platform tts(text to speak) player |
+10 | 12 | Experimental |
| 51 |
Picovoice/porcupine
On-device wake word detection powered by deep learning |
+10 | 70 | Verified |
| 52 |
AFine970/ttspeech
A Promise tts api, it depend on browser api window.speechSynthesis |
+10 | 12 | Experimental |
| 53 |
flogy/gatsby-transformer-polly
Generate AWS Polly speech output data from SSML files! |
+10 | 22 | Experimental |
| 54 |
8G6/rtts
rtts is an open source JavaScript package for text to speech conversion |
+10 | 14 | Experimental |
| 55 |
istupakov/onnx-asr
A lightweight Python package for Automatic Speech Recognition using ONNX models |
+10 | 66 | Established |
| 56 |
marianapatcosta/talk-to-me
Package that allows the user to talk/text to a customizable avatar. Uses... |
+10 | 14 | Experimental |
| 57 |
osteele/speech-provider
A unified TypeScript interface for browser speech synthesis and Eleven Labs... |
+10 | 26 | Experimental |
| 58 |
HerambVD/spoken2written
A source of python package which converts language styles in speech to its... |
+9 | 26 | Experimental |
| 59 |
jorcelinojunior/whisper-vtt2srt
A robust WebVTT to SRT converter optimized for AI transcriptions (Whisper,... |
+9 | 30 | Emerging |
| 60 |
ywatanabe1989/scitex-notification
Give your AI agents a voice — TTS, phone calls, SMS, email, webhooks. One... |
+9 | 33 | Emerging |
| 61 |
headlessripper/NectarSTT
NectarSTT (Nectar Speech To Text) is a Python-based speech recognition... |
+9 | 25 | Experimental |
| 62 |
revolunet/whatever-tts
return MP3 audio as a stream from given text |
+9 | 11 | Experimental |
| 63 |
kosich/rxjs-stt
RxJS wrapper for speech recognition Web API |
+9 | 21 | Experimental |
| 64 |
kurianbenoy/whisper_normalizer
A python package for whisper normalizer |
+8 | 60 | Established |
| 65 |
TrevorS/voxtral-mini-realtime-rs
Streaming speech recognition running natively and in the browser. A pure... |
+7 | 52 | Established |
| 66 |
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning) |
+7 | 56 | Established |
| 67 |
livekit/livekit
End-to-end realtime stack for connecting humans and AI |
+7 | 69 | Established |
| 68 |
pot-app/pot-desktop
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition. |
+7 | 53 | Established |
| 69 |
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project. |
+7 | 53 | Established |
| 70 |
rhasspy/piper
A fast, local neural text to speech system |
+7 | 45 | Emerging |
| 71 |
krillinai/KrillinAI
Video translation and dubbing tool powered by LLMs. The video translator... |
+7 | 55 | Established |
| 72 |
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation.... |
+7 | 47 | Emerging |
| 73 |
jianchang512/clone-voice
A sound cloning tool with a web interface, using your voice or any sound to... |
+7 | 46 | Emerging |
| 74 |
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统 |
+7 | 53 | Established |
| 75 |
jianchang512/ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface... |
+7 | 53 | Established |
| 76 |
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support... |
+7 | 48 | Emerging |
| 77 |
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,... |
+7 | 52 | Established |
| 78 |
LokerL/tts-vue
🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。 |
+7 | 54 | Established |
| 79 |
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper |
+7 | 56 | Established |
| 80 |
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art... |
+7 | 66 | Established |
| 81 |
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E |
+7 | 48 | Emerging |
| 82 |
Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to... |
+7 | 42 | Emerging |
| 83 |
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI |
+7 | 45 | Emerging |
| 84 |
readbeyond/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize... |
+7 | 63 | Established |
| 85 |
rhasspy/rhasspy
Offline private voice assistant for many human languages |
+7 | 45 | Emerging |
| 86 |
6drf21e/ChatTTS_colab
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。 |
+7 | 39 | Emerging |
| 87 |
jdepoix/youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a... |
+7 | 86 | Verified |
| 88 |
readest/readest
Readest is a modern, feature-rich ebook reader designed for avid readers... |
+7 | 69 | Established |
| 89 |
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper. |
+7 | 68 | Established |
| 90 |
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit |
+7 | 57 | Established |
| 91 |
WhisperSpeech/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper. |
+7 | 50 | Established |
| 92 |
jing332/tts-server-android
这是一个Android系统TTS应用,内置微软演示接口,可自定义HTTP请求,可导入其他本地TTS引擎,以及根据中文双引号的简单旁白/对话识别朗读... |
+7 | 40 | Emerging |
| 93 |
CheshireCC/faster-whisper-GUI
faster_whisper GUI with PySide6 |
+7 | 44 | Emerging |
| 94 |
marytts/marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system... |
+7 | 51 | Established |
| 95 |
tensorflow/lingvo
Lingvo |
+7 | 62 | Established |
| 96 |
openctp/openctp
openctp提供CTP股票期权、中泰证券XTP、华鑫证券奇点TORA、东方证券OST、东方财富证券EMT、盈透证券TWS、易盛TAP、量投QDP等各通道... |
+7 | 61 | Established |
| 97 |
snakers4/silero-models
Silero Models: pre-trained text-to-speech models made embarrassingly simple |
+7 | 64 | Established |
| 98 |
cmusphinx/pocketsphinx
A small speech recognizer |
+7 | 84 | Verified |
| 99 |
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in... |
+7 | 62 | Established |
| 100 |
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System |
+7 | 63 | Established |