Trending Voice AI Tools

Tools with the biggest quality score improvements over the last 8 days.

#	Tool	Change	Score	Tier	Category	Stars
1	holgern/kokorog2p A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS.	+18	40	Emerging	grapheme-to-phoneme-conversion	3
2	holgern/pykokoro A Python library for Kokoro TTS (Text-to-Speech) using ONNX runtime.	+17	38	Emerging	kokoro-tts-ecosystem	2
3	GlobalTechInfo/gspeak Google Text to Speech for Node.js — modern, typed, zero deprecated dependencies.	+17	40	Emerging	google-tts-libraries	1
4	atharva-again/indic-asr-onnx Helper package for using quantized versions of the Indic ASR Model by AI4Bharat.	+16	33	Emerging	automatic-speech-recognition	2
5	codyw912/open-asr-server OpenAI-compatible ASR server with pluggable local backends (Parakeet,...	+16	40	Emerging	parakeet-asr-implementations	2
6	Gautham495/react-native-speech-recognition-kit React Native Turbo Module to access Speech Recognition in Android & iOS	+15	49	Emerging	react-native-voice-libraries	3
7	PraaneshSelvaraj/speech_engine Speech Engine is a Python package that provides a simple interface for...	+15	45	Emerging	lightweight-tts-libraries	3
8	robmsmt/CommonCorrections Easily fix common corrections in speech!	+15	27	Experimental	automatic-speech-recognition	3
9	rapidaai/rapida-python Open-source Python SDK for real-time Voice AI, voice agents, streaming...	+15	30	Emerging	voice-ai-sdks	1
10	OpenVoiceOS/ovos-tts-plugin-espeakNG espeakNG plugin	+15	52	Established	espeak-ng-ecosystem	2
11	pystorage/pyspeechkit Library for working with a range of technologies for speech recognition and...	+14	24	Experimental	yandex-speechkit-tools	1
12	David-Antolick/REX_voice_assistant Lightweight offline voice assistant for hands-free music control (YouTube...	+14	26	Experimental	local-voice-assistants	1
13	nikkoxgonzales/streaming-tts A streamlined, Kokoro-based text-to-speech library with streaming support.	+14	22	Experimental	kokoro-tts-ecosystem	1
14	stefantaubert/pronunciation-dictionary-utils Utils to modify pronunciation dictionaries.	+14	36	Emerging	tts-dataset-creation	1
15	neosapience/n8n-nodes-typecast Integrate Typecast AI TTS into your n8n workflows with this community node.	+14	34	Emerging	google-tts-libraries	1
16	oovz/expo-edge-speech Microsoft Edge text-to-speech for Expo and React Native	+14	26	Experimental	edge-tts-implementations	1
17	twangodev/speak-mintlify Automatically generate voice narration for your Mintlify documentation.	+14	38	Emerging	web-speech-api-tts	2
18	JstnMcBrd/dectalk-tts API wrapper for the Dectalk TTS system	+14	37	Emerging	dotnet-tts-libraries	1
19	holgern/ttsforge Convert EPUB files to audiobooks using Kokoro ONNX TTS	+14	34	Emerging	ebook-to-audiobook-conversion	1
20	sebastienrousseau/akande An innovative, open-source voice assistant powered by OpenAI's GPT-3,...	+13	35	Emerging	voice-chatgpt-interfaces	3
21	funnyzak/aliyun-nls 阿里云智能语音处理 Node 模块。	+13	24	Experimental	google-tts-libraries	2
22	LG-1/audio2text Ease of use for Speech to Text	+13	23	Experimental	speech-to-text-converters	1
23	nfreear/simple-speak Power-tool wrapper around the browser Web Speech API —	+13	15	Experimental	web-speech-api-tts	1
24	nodef/extra-tts Generate speech audio from super long text through machine.	+13	25	Experimental	google-tts-libraries	1
25	KillovSky/gTTS Repositório do módulo de geração de texto para fala Google, gTTS.	+12	22	Experimental	google-tts-libraries	1
26	thaispalmer/talkify-tts-api Library to generate TTS directly from Talkify.net APIs	+12	23	Experimental	google-tts-libraries	2
27	alttch/ttsbroker Simple TTS (Text-To-Speech) broker for Python	+12	22	Experimental	lightweight-tts-libraries	1
28	jhermann/kopfkino Syntactic sugar sprinkled on top of MoviePy and AI components to allow...	+12	34	Emerging	ai-video-generation	1
29	HachiroSan/google-pronouncer 🔊 Download pronunciation audio files from Google's dictionary service....	+12	36	Emerging	lightweight-tts-libraries	3
30	lmk123/cvox Get spoken alerts when Claude Code needs permission or finishes a task — so...	+12	25	Experimental	voice-enabled-coding-assistants	2
31	OnesAndZer0s/node-dectalk Node.js module that provides bindings for the DecTalk Text-To-Speech library	+12	15	Experimental	dotnet-tts-libraries	2
32	saurabhdaware/bol Slightly more consistent Text-to-speech for Web and a wrapper around speechSynthesis	+12	36	Emerging	web-speech-api-tts	3
33	buddheshwarnath/blurtpy Offline, cross-platform Python text-to-speech and sound notifications....	+12	24	Experimental	lightweight-tts-libraries	1
34	vani-voice/vani Open protocol & middleware for Indian language voice agents — STT→LLM→TTS in...	+12	32	Emerging	voice-agent-applications	1
35	kaiaai/kaia.js Kaia.ai platform's JS client library	+12	34	Emerging	google-tts-libraries	1
36	sljavi/handsfree-for-web-control-speech-recognition-module Handsfree for Web module useful to ask for start or stop listening for voice commands	+12	35	Emerging	web-speech-api-libraries	2
37	vkosuri/dialogflow-lite [Maintainer Required] A light-weight python library REST agent for Dialogflow	+12	36	Emerging	voice-command-assistants	2
38	Uberi/speech_recognition Speech recognition module for Python, supporting several engines and APIs,...	+12	90	Verified	automatic-speech-recognition	8,959
39	far-analytics/dialog A modular framework for building VoIP-Agent applications.	+11	31	Emerging	voice-command-assistants	1
40	erich2s/native-speak A simple text-to-speech library using system native tts engines for Node.js	+11	14	Experimental	google-tts-libraries	2
41	OpenVoiceOS/ovos-tts-plugin-cotovia galician tts plugin for OVOS	+11	45	Emerging	espeak-ng-ecosystem	3
42	BattlefieldDuck/HTML-Speaker 🔈 A custom html element makes Text-To-Speech function easier to use on your...	+11	22	Experimental	web-speech-api-tts	2
43	maxpatiiuk/text-hoarder A browser extension for Google Chrome. Provides reader view, saving articles...	+11	35	Emerging	browser-tts-extensions	2
44	Gaurav890/vocal-stack vocal-stack is a high-performance utility library for developers building...	+11	32	Emerging	ai-tutoring-platforms	2
45	IAHispano/Applio A simple, high-quality voice conversion tool focused on ease of use and performance.	+11	69	Established	voice-cloning-tools	3,070
46	filippo-fonseca/durat 💬 A JS/TS framework for opening the possibilities for what you can do with text.	+11	21	Experimental	web-speech-api-tts	1
47	Sec-ant/etts edge-tts in Bun.	+11	15	Experimental	edge-tts-implementations	1
48	oleglegun/polly-ru-ssml Enhance AWS Polly TTS pronunciation for english words within russian text	+10	20	Experimental	aws-polly-tts	1
49	Vicopem01/srttossml Using AWS Polly requires SSML files for a better optimised text to speech...	+10	25	Experimental	aws-polly-tts	2
50	18566246732/tts-player a cross-platform tts(text to speak) player	+10	12	Experimental	web-speech-api-tts	1
51	Picovoice/porcupine On-device wake word detection powered by deep learning	+10	70	Verified	wake-word-detection	4,743
52	AFine970/ttspeech A Promise tts api, it depend on browser api window.speechSynthesis	+10	12	Experimental	web-speech-api-tts	1
53	flogy/gatsby-transformer-polly Generate AWS Polly speech output data from SSML files!	+10	22	Experimental	aws-polly-tts	3
54	8G6/rtts rtts is an open source JavaScript package for text to speech conversion	+10	14	Experimental	web-speech-api-tts	3
55	istupakov/onnx-asr A lightweight Python package for Automatic Speech Recognition using ONNX models	+10	66	Established	automatic-speech-recognition	281
56	marianapatcosta/talk-to-me Package that allows the user to talk/text to a customizable avatar. Uses...	+10	14	Experimental	vue-speech-recognition	3
57	osteele/speech-provider A unified TypeScript interface for browser speech synthesis and Eleven Labs...	+10	26	Experimental	web-speech-api-tts	1
58	HerambVD/spoken2written A source of python package which converts language styles in speech to its...	+9	26	Experimental	speech-recognition-apis	2
59	jorcelinojunior/whisper-vtt2srt A robust WebVTT to SRT converter optimized for AI transcriptions (Whisper,...	+9	30	Emerging	whisper-subtitle-generation	2
60	ywatanabe1989/scitex-notification Give your AI agents a voice — TTS, phone calls, SMS, email, webhooks. One...	+9	33	Emerging	voice-enabled-coding-assistants	2
61	headlessripper/NectarSTT NectarSTT (Nectar Speech To Text) is a Python-based speech recognition...	+9	25	Experimental	lightweight-tts-libraries	1
62	revolunet/whatever-tts return MP3 audio as a stream from given text	+9	11	Experimental	google-tts-libraries	1
63	kosich/rxjs-stt RxJS wrapper for speech recognition Web API	+9	21	Experimental	web-speech-api-libraries	3
64	kurianbenoy/whisper_normalizer A python package for whisper normalizer	+8	60	Established	speech-to-text-converters	76
65	TrevorS/voxtral-mini-realtime-rs Streaming speech recognition running natively and in the browser. A pure...	+7	52	Established	rust-speech-recognition	710
66	RVC-Boss/GPT-SoVITS 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)	+7	56	Established	vits-tts-implementations	55,896
67	livekit/livekit End-to-end realtime stack for connecting humans and AI	+7	69	Established	ai-avatar-platforms	17,671
68	pot-app/pot-desktop 🌈一个跨平台的划词翻译和OCR软件 \| A cross-platform software for text translation and recognition.	+7	53	Established	ios-speech-frameworks	17,383
69	kaldi-asr/kaldi kaldi-asr/kaldi is the official location of the Kaldi project.	+7	53	Established	kaldi-asr-ecosystem	15,346
70	rhasspy/piper A fast, local neural text to speech system	+7	45	Emerging	piper-tts-ecosystem	10,694
71	krillinai/KrillinAI Video translation and dubbing tool powered by LLMs. The video translator...	+7	55	Established	video-dubbing-tools	9,724
72	open-mmlab/Amphion Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation....	+7	47	Emerging	streamlit-tts-apps	9,712
73	jianchang512/clone-voice A sound cloning tool with a web interface, using your voice or any sound to...	+7	46	Emerging	voice-cloning-tools	8,922
74	nl8590687/ASRT_SpeechRecognition A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统	+7	53	Established	ctc-asr-implementations	8,359
75	jianchang512/ChatTTS-ui 一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface...	+7	53	Established	self-hosted-tts-servers	7,521
76	myshell-ai/MeloTTS High-quality multi-lingual text-to-speech library by MyShell.ai. Support...	+7	48	Emerging	lightweight-tts-runtimes	7,267
77	abus-aikorea/voice-pro Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,...	+7	52	Established	gradio-tts-webuis	6,366
78	LokerL/tts-vue 🎤 微软语音合成工具，使用 Electron + Vue + ElementPlus + Vite 构建。	+7	54	Established	google-tts-libraries	6,099
79	MahmoudAshraf97/whisper-diarization Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper	+7	56	Established	whisper-diarization	5,437
80	TensorSpeech/TensorFlowTTS :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art...	+7	66	Established	fastspeech-tts-models	3,995
81	enhuiz/vall-e An unofficial PyTorch implementation of the audio LM VALL-E	+7	48	Emerging	tacotron-tts-models	2,992
82	Purfview/whisper-standalone-win Whisper & Faster-Whisper standalone executables for those who don't want to...	+7	42	Emerging	speech-to-text-converters	2,921
83	Camb-ai/MARS5-TTS MARS5 speech model (TTS) from CAMB.AI	+7	45	Emerging	voice-cloning-tools	2,814
84	readbeyond/aeneas aeneas is a Python/C library and a set of tools to automagically synchronize...	+7	63	Established	asr-evaluation-metrics	2,811
85	rhasspy/rhasspy Offline private voice assistant for many human languages	+7	45	Emerging	general-purpose-voice-assistants	2,725
86	6drf21e/ChatTTS_colab 🚀 一键部署（含离线整合包）！基于 ChatTTS ，支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用，无需复杂安装。	+7	39	Emerging	self-hosted-tts-servers	2,578
87	jdepoix/youtube-transcript-api This is a python API which allows you to get the transcript/subtitles for a...	+7	86	Verified	video-transcription-extraction	7,078
88	readest/readest Readest is a modern, feature-rich ebook reader designed for avid readers...	+7	69	Established	ai-powered-ereaders	18,791
89	collabora/WhisperLive A nearly-live implementation of OpenAI's Whisper.	+7	68	Established	speech-to-text-converters	3,894
90	wenet-e2e/wenet Production First and Production Ready End-to-End Speech Recognition Toolkit	+7	57	Established	end-to-end-asr-frameworks	5,056
91	WhisperSpeech/WhisperSpeech An Open Source text-to-speech system built by inverting Whisper.	+7	50	Established	speech-to-text-converters	4,575
92	jing332/tts-server-android 这是一个Android系统TTS应用，内置微软演示接口，可自定义HTTP请求，可导入其他本地TTS引擎，以及根据中文双引号的简单旁白/对话识别朗读...	+7	40	Emerging	java-tts-libraries	4,315
93	CheshireCC/faster-whisper-GUI faster_whisper GUI with PySide6	+7	44	Emerging	speech-to-text-converters	2,911
94	marytts/marytts MARY TTS -- an open-source, multilingual text-to-speech synthesis system...	+7	51	Established	java-tts-libraries	2,573
95	tensorflow/lingvo Lingvo	+7	62	Established	automatic-speech-recognition	2,857
96	openctp/openctp openctp提供CTP股票期权、中泰证券XTP、华鑫证券奇点TORA、东方证券OST、东方财富证券EMT、盈透证券TWS、易盛TAP、量投QDP等各通道...	+7	61	Established	system-tts-wrappers	2,715
97	snakers4/silero-models Silero Models: pre-trained text-to-speech models made embarrassingly simple	+7	64	Established	gradio-tts-webuis	5,822
98	cmusphinx/pocketsphinx A small speech recognizer	+7	84	Verified	automatic-speech-recognition	4,278
99	TensorSpeech/TensorFlowASR :zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in...	+7	62	Established	end-to-end-asr-frameworks	1,005
100	index-tts/index-tts An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System	+7	63	Established	zero-shot-voice-synthesis	19,454