All Voice AI Tools

6,983 tools ranked by quality score · Page 2 of 70

Showing 101–200 of 6,983
# Tool Score Tier
101 kurianbenoy/whisper_normalizer

A python package for whisper normalizer

60
Established
102 ieasybooks/tafrigh

تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.

60
Established
103 nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

60
Established
104 PyThaiNLP/pythaiasr

Python Thai Automatic Speech Recognition

60
Established
105 Picovoice/rhino

On-device Speech-to-Intent engine powered by deep learning

60
Established
106 cboard-org/cboard

Augmentative and Alternative Communication (AAC) system with text-to-speech...

60
Established
107 Vonage/vonage-php-sdk-core

Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech,...

60
Established
108 ManimCommunity/manim-voiceover

Manim plugin for all things voiceover

60
Established
109 roryeckel/wyoming_openai

OpenAI-Compatible Proxy Middleware for the Wyoming Protocol

60
Established
110 PyThaiNLP/PyThaiTTS

Open Source Thai Text-to-speech library in Python

60
Established
111 flashlight/wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

59
Established
112 netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

59
Established
113 OpenMOSS/MOSS-TTS

MOSS‑TTS Family is an open‑source speech and sound generation model family...

59
Established
114 lugia19/elevenlabslib

Full python wrapper for the elevenlabs API.

59
Established
115 amicalhq/amical

🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no...

59
Established
116 Kieirra/murmure

Fully local, private and cross platform Speech-to-Text with LLM Post-processing

59
Established
117 r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

59
Established
118 tabahi/bournemouth-forced-aligner

Extract phoneme-level timestamps from speeh audio.

59
Established
119 Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

58
Established
120 xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

58
Established
121 babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

58
Established
122 MainRo/deepspeech-server

A testing server for a speech to text service based on coqui.ai

58
Established
123 OpenVoiceOS/ovos-tts-server

simple flask server to host OpenVoiceOS tts plugins as a service

58
Established
124 aichaos/rivescript-python

A RiveScript interpreter for Python. RiveScript is a scripting language for...

58
Established
125 software-mansion/react-native-executorch

Declarative way to run AI models in React Native on device, powered by ExecuTorch.

58
Established
126 chinokikiss/GSV-TTS-Lite

GSV-TTS-Lite A high-performance inference engine specifically designed for...

57
Established
127 vilassn/whisper_android

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

57
Established
128 charleprr/redditube

A video generator from Reddit posts and comments

57
Established
129 altunenes/parakeet-rs

very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA...

57
Established
130 wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

57
Established
131 GitYCC/g2pW

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

57
Established
132 MycroftAI/mycroft-precise

A lightweight, simple-to-use, RNN wake word listener

57
Established
133 Wikidepia/g2p-id

Indonesian Grapheme-to-Phoneme (IPA notation)

57
Established
134 n1teshy/yapper-tts

offline text to speech and free SOTA LLM APIs to let your programs speak to you

57
Established
135 AbdullahHendy/live-translation

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM...

57
Established
136 Vonage/vonage-node-sdk

Vonage API client for Node.js. API support for SMS, Voice, Text-to-Speech,...

57
Established
137 phuc-nt/my-translator

Real-time speech translation — macOS & Windows, free TTS, no server, your...

57
Established
138 haoheliu/voicefixer

General Speech Restoration

57
Established
139 Spr-Aachen/Easy-Voice-Toolkit

A user-friendly audio toolkit for voice recognition, voice transcription,...

56
Established
140 jianchang512/stt

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

56
Established
141 RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

56
Established
142 wq2012/SimpleDER

A lightweight library to compute Diarization Error Rate (DER).

56
Established
143 astorfi/speechpy

:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition:...

56
Established
144 sccn/eegprep

EEGPrep is an automated preprocessing tool for human EEG data built on a...

56
Established
145 revdotcom/revai-node-sdk

Node.js SDK for the Rev AI API

56
Established
146 justinsalamon/scaper

A library for soundscape synthesis and augmentation

56
Established
147 MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

56
Established
148 XDcobra/react-native-sherpa-onnx

React Native TurboModule for Sherpa-ONNX offline on-device Speech Processing...

56
Established
149 yandexdataschool/speech_course

YSDA course in Speech Processing.

56
Established
150 ahmetoner/whisper-asr-webservice

OpenAI Whisper ASR Webservice API

56
Established
151 jamsch/expo-speech-recognition

Speech Recognition for React Native Expo projects

55
Established
152 shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

55
Established
153 krillinai/KrillinAI

Video translation and dubbing tool powered by LLMs. The video translator...

55
Established
154 lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

55
Established
155 echogarden-project/echogarden

Cross-platform speech toolset, used from the command-line or as a Node.js...

55
Established
156 linto-ai/WebVoiceSDK

Buildings block for voice-enabled applications in the browser

55
Established
157 deepgram/deepgram-js-sdk

Official JavaScript SDK for Deepgram.

55
Established
158 kstonekuan/tambourine-voice

Your personal voice interface for any app. Speak naturally and your words...

55
Established
159 ken107/read-aloud

An awesome browser extension that reads aloud webpage content with one click

55
Established
160 remsky/Kokoro-FastAPI

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX...

55
Established
161 EddyVerbruggen/nativescript-speech-recognition

:speech_balloon: Speech to text, using the awesome engines readily available...

55
Established
162 itsmevictor/clean-transcribe

A simple CLI to transcribe Youtube videos or local audio/video files and...

54
Established
163 zuoban/tts

tts 服务

54
Established
164 githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and...

54
Established
165 NVIDIA-AI-Blueprints/pdf-to-podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

54
Established
166 dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

54
Established
167 OpenMOSS/MOSS-TTSD

MOSS-TTSD is a spoken dialogue generation model designed for expressive...

54
Established
168 Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to...

54
Established
169 met4citizen/HeadTTS

HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for...

54
Established
170 Azure-Samples/Cognitive-Speech-TTS

Microsoft Text-to-Speech API sample code in several languages, part of...

54
Established
171 LokerL/tts-vue

🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。

54
Established
172 kalliope-project/kalliope

Kalliope is a framework that will help you to create your own personal assistant.

54
Established
173 sandrohanea/whisper.net

Whisper.net. Speech to text made simple using Whisper Models

54
Established
174 VolcanicArts/VRCOSC

A modular node-programming language, program creator, animation system,...

54
Established
175 travisvn/edge-tts-universal

Use Microsoft Edge's online text-to-speech service in Node.js, browsers, or...

54
Established
176 githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path,...

54
Established
177 aahl/zai-tts

🗣️ ZAI/GLM TTS to OpenAI Speech API, 免费的语音合成API,支持克隆音色,基于智谱TTS

54
Established
178 peteonrails/voxtype

Voice-to-text with push-to-talk for Wayland compositors

54
Established
179 dlutton/flutter_tts

Flutter Text to Speech package

54
Established
180 gunthercox/chatterbot-voice

A example of verbal communication using ChatterBot

54
Established
181 pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

54
Established
182 yuga-hashimoto/openclaw-assistant

OpenClaw voice assistant app for Android - Wake word activation & system...

54
Established
183 dputhier/pygtftk

A python package and a set of shell commands to handle GTF files

54
Established
184 Oaklight/asr2clip

handy cli tool to convert your speech to clipboard text

54
Established
185 royshil/obs-localvocal

OBS plugin for local speech recognition and captioning using AI

54
Established
186 BryceWG/BiBi-Keyboard

说点啥(BiBi Keyboard):一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR...

54
Established
187 stemrollerapp/stemroller

Isolate vocals, drums, bass, and other instrumental stems from any song

54
Established
188 kishanrajput23/Jarvis-Desktop-Voice-Assistant

A python based desktop voice assistant capable of executing system-level...

54
Established
189 deepgram/deepgram-python-sdk

Official Python SDK for Deepgram.

53
Established
190 stimm-ai/stimm

The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI...

53
Established
191 zai-org/GLM-ASR

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

53
Established
192 JamesBrill/react-speech-recognition

💬Speech recognition for your React app

53
Established
193 wannaphong/ttsmms

TTS with The Massively Multilingual Speech (MMS) project

53
Established
194 ynop/audiomate

Python library for handling audio datasets.

53
Established
195 sdkcarlos/artyom.js

A voice control - voice commands - speech recognition and speech synthesis...

53
Established
196 Aivis-Project/aivmlib

Aivis Voice Model File (.aivm/.aivmx) Utility Library

53
Established
197 hugobloem/wyoming-microsoft-tts

Wyoming protocol server for Microsoft Azure text-to-speech

53
Established
198 nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

53
Established
199 namastexlabs/murmurai

🎙️ Drop-in replacement for paid transcription APIs. Self-hosted,...

53
Established
200 mkiol/dsnote

Speech Note Linux app. Note taking, reading and translating with offline...

53
Established