Trending Voice AI Tools

Tools with the biggest quality score improvements over the last 8 days.

# Tool Change Score Tier
1 holgern/kokorog2p

A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS.

+18 40 Emerging
2 holgern/pykokoro

A Python library for Kokoro TTS (Text-to-Speech) using ONNX runtime.

+17 38 Emerging
3 GlobalTechInfo/gspeak

Google Text to Speech for Node.js — modern, typed, zero deprecated dependencies.

+17 40 Emerging
4 atharva-again/indic-asr-onnx

Helper package for using quantized versions of the Indic ASR Model by AI4Bharat.

+16 33 Emerging
5 codyw912/open-asr-server

OpenAI-compatible ASR server with pluggable local backends (Parakeet,...

+16 40 Emerging
6 Gautham495/react-native-speech-recognition-kit

React Native Turbo Module to access Speech Recognition in Android & iOS

+15 49 Emerging
7 PraaneshSelvaraj/speech_engine

Speech Engine is a Python package that provides a simple interface for...

+15 45 Emerging
8 robmsmt/CommonCorrections

Easily fix common corrections in speech!

+15 27 Experimental
9 rapidaai/rapida-python

Open-source Python SDK for real-time Voice AI, voice agents, streaming...

+15 30 Emerging
10 OpenVoiceOS/ovos-tts-plugin-espeakNG

espeakNG plugin

+15 52 Established
11 pystorage/pyspeechkit

Library for working with a range of technologies for speech recognition and...

+14 24 Experimental
12 David-Antolick/REX_voice_assistant

Lightweight offline voice assistant for hands-free music control (YouTube...

+14 26 Experimental
13 nikkoxgonzales/streaming-tts

A streamlined, Kokoro-based text-to-speech library with streaming support.

+14 22 Experimental
14 stefantaubert/pronunciation-dictionary-utils

Utils to modify pronunciation dictionaries.

+14 36 Emerging
15 neosapience/n8n-nodes-typecast

Integrate Typecast AI TTS into your n8n workflows with this community node.

+14 34 Emerging
16 oovz/expo-edge-speech

Microsoft Edge text-to-speech for Expo and React Native

+14 26 Experimental
17 twangodev/speak-mintlify

Automatically generate voice narration for your Mintlify documentation.

+14 38 Emerging
18 JstnMcBrd/dectalk-tts

API wrapper for the Dectalk TTS system

+14 37 Emerging
19 holgern/ttsforge

Convert EPUB files to audiobooks using Kokoro ONNX TTS

+14 34 Emerging
20 sebastienrousseau/akande

An innovative, open-source voice assistant powered by OpenAI's GPT-3,...

+13 35 Emerging
21 funnyzak/aliyun-nls

阿里云智能语音处理 Node 模块。

+13 24 Experimental
22 LG-1/audio2text

Ease of use for Speech to Text

+13 23 Experimental
23 nfreear/simple-speak

Power-tool wrapper around the browser Web Speech API —

+13 15 Experimental
24 nodef/extra-tts

Generate speech audio from super long text through machine.

+13 25 Experimental
25 KillovSky/gTTS

Repositório do módulo de geração de texto para fala Google, gTTS.

+12 22 Experimental
26 thaispalmer/talkify-tts-api

Library to generate TTS directly from Talkify.net APIs

+12 23 Experimental
27 alttch/ttsbroker

Simple TTS (Text-To-Speech) broker for Python

+12 22 Experimental
28 jhermann/kopfkino

Syntactic sugar sprinkled on top of MoviePy and AI components to allow...

+12 34 Emerging
29 HachiroSan/google-pronouncer

🔊 Download pronunciation audio files from Google's dictionary service....

+12 36 Emerging
30 lmk123/cvox

Get spoken alerts when Claude Code needs permission or finishes a task — so...

+12 25 Experimental
31 OnesAndZer0s/node-dectalk

Node.js module that provides bindings for the DecTalk Text-To-Speech library

+12 15 Experimental
32 saurabhdaware/bol

Slightly more consistent Text-to-speech for Web and a wrapper around speechSynthesis

+12 36 Emerging
33 buddheshwarnath/blurtpy

Offline, cross-platform Python text-to-speech and sound notifications....

+12 24 Experimental
34 vani-voice/vani

Open protocol & middleware for Indian language voice agents — STT→LLM→TTS in...

+12 32 Emerging
35 kaiaai/kaia.js

Kaia.ai platform's JS client library

+12 34 Emerging
36 sljavi/handsfree-for-web-control-speech-recognition-module

Handsfree for Web module useful to ask for start or stop listening for voice commands

+12 35 Emerging
37 vkosuri/dialogflow-lite

[Maintainer Required] A light-weight python library REST agent for Dialogflow

+12 36 Emerging
38 Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs,...

+12 90 Verified
39 far-analytics/dialog

A modular framework for building VoIP-Agent applications.

+11 31 Emerging
40 erich2s/native-speak

A simple text-to-speech library using system native tts engines for Node.js

+11 14 Experimental
41 OpenVoiceOS/ovos-tts-plugin-cotovia

galician tts plugin for OVOS

+11 45 Emerging
42 BattlefieldDuck/HTML-Speaker

🔈 A custom html element makes Text-To-Speech function easier to use on your...

+11 22 Experimental
43 maxpatiiuk/text-hoarder

A browser extension for Google Chrome. Provides reader view, saving articles...

+11 35 Emerging
44 Gaurav890/vocal-stack

vocal-stack is a high-performance utility library for developers building...

+11 32 Emerging
45 IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

+11 69 Established
46 filippo-fonseca/durat

💬 A JS/TS framework for opening the possibilities for what you can do with text.

+11 21 Experimental
47 Sec-ant/etts

edge-tts in Bun.

+11 15 Experimental
48 oleglegun/polly-ru-ssml

Enhance AWS Polly TTS pronunciation for english words within russian text

+10 20 Experimental
49 Vicopem01/srttossml

Using AWS Polly requires SSML files for a better optimised text to speech...

+10 25 Experimental
50 18566246732/tts-player

a cross-platform tts(text to speak) player

+10 12 Experimental
51 Picovoice/porcupine

On-device wake word detection powered by deep learning

+10 70 Verified
52 AFine970/ttspeech

A Promise tts api, it depend on browser api window.speechSynthesis

+10 12 Experimental
53 flogy/gatsby-transformer-polly

Generate AWS Polly speech output data from SSML files!

+10 22 Experimental
54 8G6/rtts

rtts is an open source JavaScript package for text to speech conversion

+10 14 Experimental
55 istupakov/onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models

+10 66 Established
56 marianapatcosta/talk-to-me

Package that allows the user to talk/text to a customizable avatar. Uses...

+10 14 Experimental
57 osteele/speech-provider

A unified TypeScript interface for browser speech synthesis and Eleven Labs...

+10 26 Experimental
58 HerambVD/spoken2written

A source of python package which converts language styles in speech to its...

+9 26 Experimental
59 jorcelinojunior/whisper-vtt2srt

A robust WebVTT to SRT converter optimized for AI transcriptions (Whisper,...

+9 30 Emerging
60 ywatanabe1989/scitex-notification

Give your AI agents a voice — TTS, phone calls, SMS, email, webhooks. One...

+9 33 Emerging
61 headlessripper/NectarSTT

NectarSTT (Nectar Speech To Text) is a Python-based speech recognition...

+9 25 Experimental
62 revolunet/whatever-tts

return MP3 audio as a stream from given text

+9 11 Experimental
63 kosich/rxjs-stt

RxJS wrapper for speech recognition Web API

+9 21 Experimental
64 kurianbenoy/whisper_normalizer

A python package for whisper normalizer

+8 60 Established
65 TrevorS/voxtral-mini-realtime-rs

Streaming speech recognition running natively and in the browser. A pure...

+7 52 Established
66 RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

+7 56 Established
67 livekit/livekit

End-to-end realtime stack for connecting humans and AI

+7 69 Established
68 pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

+7 53 Established
69 kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

+7 53 Established
70 rhasspy/piper

A fast, local neural text to speech system

+7 45 Emerging
71 krillinai/KrillinAI

Video translation and dubbing tool powered by LLMs. The video translator...

+7 55 Established
72 open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation....

+7 47 Emerging
73 jianchang512/clone-voice

A sound cloning tool with a web interface, using your voice or any sound to...

+7 46 Emerging
74 nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

+7 53 Established
75 jianchang512/ChatTTS-ui

一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface...

+7 53 Established
76 myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support...

+7 48 Emerging
77 abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,...

+7 52 Established
78 LokerL/tts-vue

🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。

+7 54 Established
79 MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

+7 56 Established
80 TensorSpeech/TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art...

+7 66 Established
81 enhuiz/vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

+7 48 Emerging
82 Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to...

+7 42 Emerging
83 Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

+7 45 Emerging
84 readbeyond/aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize...

+7 63 Established
85 rhasspy/rhasspy

Offline private voice assistant for many human languages

+7 45 Emerging
86 6drf21e/ChatTTS_colab

🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。

+7 39 Emerging
87 jdepoix/youtube-transcript-api

This is a python API which allows you to get the transcript/subtitles for a...

+7 86 Verified
88 readest/readest

Readest is a modern, feature-rich ebook reader designed for avid readers...

+7 69 Established
89 collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

+7 68 Established
90 wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

+7 57 Established
91 WhisperSpeech/WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

+7 50 Established
92 jing332/tts-server-android

这是一个Android系统TTS应用,内置微软演示接口,可自定义HTTP请求,可导入其他本地TTS引擎,以及根据中文双引号的简单旁白/对话识别朗读...

+7 40 Emerging
93 CheshireCC/faster-whisper-GUI

faster_whisper GUI with PySide6

+7 44 Emerging
94 marytts/marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system...

+7 51 Established
95 tensorflow/lingvo

Lingvo

+7 62 Established
96 openctp/openctp

openctp提供CTP股票期权、中泰证券XTP、华鑫证券奇点TORA、东方证券OST、东方财富证券EMT、盈透证券TWS、易盛TAP、量投QDP等各通道...

+7 61 Established
97 snakers4/silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

+7 64 Established
98 cmusphinx/pocketsphinx

A small speech recognizer

+7 84 Verified
99 TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in...

+7 62 Established
100 index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

+7 63 Established