All Voice AI Tools

6,983 tools ranked by quality score · Page 3 of 70

Showing 201–300 of 6,983
# Tool Score Tier
201 rzru/nightingale

Machine learning powered Karaoke app (with scores!)

53
Established
202 kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

53
Established
203 asterics/Asterics-AAC

Free, easy-to-use AAC app with offline support, flexible input options,...

53
Established
204 pot-app/pot-desktop

🌈 A cross-platform software for text translation and OCR.

53
Established
205 supertone-inc/supertonic-py

Lightning-Fast, On-Device TTS — running natively via ONNX.

53
Established
206 jianchang512/ChatTTS-ui

A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use.

53
Established
207 Vonage/vonage-ruby-sdk

Vonage REST API client for Ruby. API support for SMS, Voice, Text-to-Speech,...

53
Established
208 Saurav-Paul/AI-virtual-assistant-python

Command line virtual assistant for competitive programming

53
Established
209 pilot51/voicenotify

Android app that speaks notifications

52
Established
210 FunAudioLLM/SenseVoice

Multilingual Voice Understanding Model

52
Established
211 Enemyx-net/VibeVoice-ComfyUI

A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech...

52
Established
212 abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,...

52
Established
213 p0n1/epub_to_audiobook

EPUB to audiobook converter, optimized for Audiobookshelf, WebUI included

52
Established
214 OpenVoiceOS/ovos-tts-plugin-espeakNG

espeakNG plugin

52
Established
215 sooftware/conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented...

52
Established
216 evancohen/sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword...

52
Established
217 alphacep/vosk-unity-asr

Automatic Speech Recognition in Unity using Vosk library

52
Established
218 mybigday/whisper.rn

React Native binding of whisper.cpp.

52
Established
219 Femoon/tts-azure-web

TTS Azure Web is an Azure text-to-speech (TTS) web app that can be deployed with one click, locally or in the cloud, using your Azure Key. TTS...

52
Established
220 arcosoph/nanowakeword

A lightweight, open-source, and intelligent wake word detection engine....

52
Established
221 HeyWillow/willow

Open source, local, and self-hosted Amazon Echo/Google Home competitive...

52
Established
222 SahilAggarwal2004/react-text-to-speech

An easy-to-use React.js library that leverages the Web Speech API to convert...

52
Established
223 mdiller/MangoByte

A discord bot that provides the ability to play dota hero response clips, do...

52
Established
224 antirek/voicer

AGI-server voice recognizer for #Asterisk

52
Established
225 TrevorS/voxtral-mini-realtime-rs

Streaming speech recognition running natively and in the browser. A pure...

52
Established
226 richardr1126/openreader

An open-source read-along document reader server with high-quality TTS...

51
Established
227 RageAgainstThePixel/ElevenLabs-DotNet

A Non-Official ElevenLabs RESTful API Client for dotnet

51
Established
228 BoltzmannEntropy/MimikaStudio

MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic...

51
Established
229 thevickypedia/Jarvis

Fully Functional Voice Based Natural Language UI

51
Established
230 janvarev/Irene-Voice-Assistant

Irene is a Russian voice assistant for offline use. Supports skills...

51
Established
231 bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For...

51
Established
232 canopyai/Orpheus-TTS

Towards Human-Sounding Speech

51
Established
233 yeyupiaoling/YeAudio

Audio tools for Python

51
Established
234 davidacm/NVDA-IBMTTS-Driver

This project is aimed at developing and maintaining the NVDA IBMTTS driver....

51
Established
235 vivekuppal/transcribe

Transcribe is a real time transcription, conversation, Language learning...

51
Established
236 fishaudio/fish-audio-python

The official Python library for the Fish Audio API.

51
Established
237 marytts/marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system...

51
Established
238 dictation-toolbox/dragonfly

Speech recognition framework allowing powerful Python-based scripting and...

51
Established
239 ttop32/MouseTooltipTranslator

Mouseover Translate Any Language At Once - Chrome Extension: PDF Translator,...

51
Established
240 mlalma/kokoro-ios

Kokoro TTS for iOS and macOSX

51
Established
241 EveryVoiceTTS/EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language

51
Established
242 gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's...

51
Established
243 keithito/tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with...

51
Established
244 lucasnewman/nanospeech

A simple, hackable text-to-speech system in PyTorch and MLX

51
Established
245 stefantaubert/pinyin-to-ipa

Command-line interface and Python library to transcribe pinyin to IPA. The...

51
Established
246 jonatasgrosman/huggingsound

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

51
Established
247 xiangyuecn/Recorder

HTML5 JS audio recording in mp3, wav, ogg, webm, amr, g711a, and g711u formats; supports PC and some Android and iOS browsers, Hybrid...

51
Established
248 DevEmperor/Dictate

A powerful Whisper AI keyboard for reliable speech transcription

51
Established
249 DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

51
Established
250 moonstar-x/discord-tts-bot

A Text-to-Speech bot for Discord.

51
Established
251 gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

51
Established
252 deepgram/deepgram-rust-sdk

Community Rust SDK for Deepgram.

51
Established
253 Blaizzy/mlx-audio-swift

A modular Swift SDK for audio processing with MLX on Apple Silicon

50
Established
254 YuanGongND/whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT:...

50
Established
255 capacitor-community/text-to-speech

⚡️ Capacitor plugin for synthesizing speech from text.

50
Established
256 sfortis/openai_tts

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine...

50
Established
257 dectalk/dectalk

Modern builds for the 90s/00s DECtalk text-to-speech application.

50
Established
258 robdmac/talkito

TalkiTo lets developers interact with AI systems through speech across...

50
Established
259 ai-ng/swift

Fast voice assistant powered by Groq, Cartesia, and Vercel.

50
Established
260 readium/speech

💬 A TypeScript library for implementing read aloud on the Web

50
Established
261 kadirnar/VoiceHub

VoiceHub: A Unified Inference Interface for TTS Models

50
Established
262 FirezTheGreat/1SHOT

All my works - https://github.com/FirezTheGreat (latest music commands/djs...

50
Established
263 jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for...

50
Established
264 MasuRii/opencode-smart-voice-notify

🔊 Smart voice notification plugin for OpenCode with multiple TTS engines...

50
Established
265 svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

50
Established
266 shivammehta25/Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

50
Established
267 Gr122lyBr/voicetag

Speaker identification powered by pyannote and resemblyzer

50
Established
268 Picovoice/speech-to-text-benchmark

speech to text benchmark framework

50
Established
269 hkchengrex/MMAudio

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality...

50
Established
270 snakers4/silero-stress

Silero Stress — pre-trained enterprise-grade automated stress and homograph...

50
Established
271 i3thuan5/tai5-uan5_gian5-gi2_kang1-ku7

Taiwanese language tools

50
Established
272 WhisperSpeech/WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

50
Established
273 petercunha/tts

📝 🔊 A simple text-to-speech tool. Converts your text to speech...

50
Established
274 zzw922cn/Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

50
Established
275 R3gm/SoniTranslate

Synchronized Translation for Videos. Video dubbing

50
Established
276 vox-serve/vox-serve

A Streaming-Native Serving Engine for TTS/STS Models

50
Established
277 pykaldi/pykaldi

A Python wrapper for Kaldi

50
Established
278 alphacep/vosk-android-demo

Offline speech recognition for Android with Vosk library.

50
Established
279 stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model...

50
Established
280 midas-research/audino

Open source audio annotation tool for humans

50
Established
281 yeyupiaoling/PaddlePaddle-DeepSpeech

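Speech recognition implemented with PaddlePaddle, for Chinese speech recognition. The project is complete and recognition quality is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.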

50
Established
282 funnyzak/tts-now

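A cross-platform text-to-speech assistant built on the speech synthesis APIs of cloud platforms (Alibaba Cloud, iFlytek, etc.). Supports quick single-text synthesis and batch synthesis. Supports Windows, macOS, and Linux.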

50
Established
283 linto-ai/linto-stt

An automatic speech recognition API

50
Established
284 Aivis-Project/AivisSpeech-Engine

AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine

50
Established
285 nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

50
Established
286 mgonzs13/whisper_ros

Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2

50
Established
287 mathigatti/midi2voice

Singing synthesis from MIDI file

50
Established
288 soniqo/speech-swift

AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and...

50
Established
289 jim60105/docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level...

50
Established
290 myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

49
Emerging
291 yeyupiaoling/Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without...

49
Emerging
292 analyticsinmotion/werx

🐍📦 Easy-to-use Python package for lightning-fast Word Error Rate (WER) analysis

49
Emerging
293 High-Logic/Genie-TTS

GPT-SoVITS ONNX Inference Engine & Model Converter

49
Emerging
294 lobehub/lobe-tts

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

49
Emerging
295 NeonGeckoCom/neon-tts-plugin-coqui

Coqui AI TTS plugin

49
Emerging
296 jeroenterheerdt/pycsspeechtts

Python (py) library to use Microsoft's Cognitive Services Speech (csspeech)...

49
Emerging
297 ThioJoe/Auto-Synced-Translated-Dubs

Automatically translates the text of a video based on a subtitle file, and...

49
Emerging
298 sindresorhus/awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition...

49
Emerging
299 rwth-i6/rasr

The RWTH ASR Toolkit.

49
Emerging
300 Stypox/dicio-android

Dicio assistant app for Android

49
Emerging