All Voice AI Tools

6,981 tools ranked by quality score · Page 13 of 70

Showing 1201–1300 of 6,981
# Tool Score Tier
1201 alias454/YATSEE

YATSEE - Yet Another Tool for Speech Extraction & Enrichment

41
Emerging
1202 modelscope/FunCodec

FunCodec is a research-oriented toolkit for audio quantization and...

41
Emerging
1203 CoffeeVampir3/audiocraft-webui

Quick webui for audiocraft

41
Emerging
1204 HenestrosaDev/audiotext

A desktop application that transcribes audio from files, microphone input or...

41
Emerging
1205 kahne/SpeechTransProgress

Tracking the progress in end-to-end speech translation

41
Emerging
1206 TeamAudio/reaspeech

Speech recognition for REAPER

41
Emerging
1207 italankin/samplevoicebot

TTS Telegram bot

41
Emerging
1208 smaranjitghose/AIAudioTranscriber

A minimalistic web app to generate transcriptions for audio, built using Python

41
Emerging
1209 arpy8/ESP32_Voice_Assistant

This project combines embedded system and AI inference to create an...

41
Emerging
1210 MHaggis/ASRGEN

ASR Configurator, Essentials and Atomic Testing

41
Emerging
1211 finos/greenkey-asrtoolkit

A collection of useful tools for handling speech recognition data

41
Emerging
1212 serpapps/ai-voice-cloner

AI Voice Cloning Desktop Application that runs locally on your computer and...

41
Emerging
1213 JosefAlbers/e2tts-mlx

Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX

41
Emerging
1214 benmaster82/writher

Voice-powered productivity for Windows

41
Emerging
1215 rishikksh20/gmvae_tacotron

Gaussian Mixture VAE Tacotron

41
Emerging
1216 r0227n/flutter_whisper_kit

🎤 A Flutter plugin for running WhisperKit speech-to-text models on-device,...

41
Emerging
1217 jbelford/Eolian

Eolian is a Discord music bot which provides a very powerful API for queuing...

41
Emerging
1218 AIFSH/ComfyUI-FishSpeech

a custom comfyui node for fish-speech

41
Emerging
1219 sotelo/parrot

RNN-based generative models for speech.

41
Emerging
1220 rainygirl/rspeaker

Understands what you say and also summarizes and reads out the news

41
Emerging
1221 petewarden/spchcat

Speech recognition tool to convert audio to text transcripts, for Linux and...

41
Emerging
1222 Kaljurand/Arvutaja

An Android app for voice actions in Estonian and English

41
Emerging
1223 spokestack/spokestack-ios

Spokestack: give your iOS app a voice interface!

41
Emerging
1224 gdoudeng/react-native-baidu-asr

The react-native Baidu voice library provides voice recognition, voice...

41
Emerging
1225 MerlinCN/kinoko7danmaku

Uses TTS to announce danmaku (bullet comments), gifts, captain subscriptions, and more in Bilibili live streams

41
Emerging
1226 mobassir94/comprehensive-bangla-tts

Aiming to achieve ultimate Multilingual TTS pipeline with main focus on...

41
Emerging
1227 botbahlul/VOSK-Powered-Live-Subtitle-V3

ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using free...

41
Emerging
1228 declare-lab/speech-adapters

Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient...

41
Emerging
1229 mapluisch/OpenAI-Realtime-API-for-Unity

Implementation of OpenAI's Realtime API in Unity. Easily integrate...

41
Emerging
1230 solyarisoftware/CoquiSTTJs

Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.

41
Emerging
1231 cherts/mspeech

Program for speech recognition using the Google Speech API, voice commands,...

41
Emerging
1232 Andrewcpu/elevenlabs-api

🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs...

41
Emerging
1233 Frikallo/parakeet.cpp

Ultra fast and portable Parakeet implementation for on-device inference in...

41
Emerging
1234 Saganaki22/ComfyUI-Step_Audio_EditX_TTS

ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice...

41
Emerging
1235 Kini218/speech-to-text

Speech-to-text script in Python

41
Emerging
1236 yeahhe365/PageTalk

A Chrome extension for seamless speech-to-text on any web page, using an advanced ASR API.

41
Emerging
1237 Edw590/VISOR---Android-Version-Assistant

V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!

41
Emerging
1238 AryanVBW/AiVoiceClonerPRO

Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into...

41
Emerging
1239 DeeepMaker/subtitle-to-audio

A python script to generate .wav audio files for .srt subtitle files

41
Emerging
1240 yl4579/HiFTNet

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter...

41
Emerging
1241 itsRares/react-native-deepgram

Brings Deepgram's capabilities to React Native applications, with a focus on...

41
Emerging
1242 huckiyang/Voice2Series-Reprogramming

ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time...

41
Emerging
1243 gladiaio/normalization

A lightweight library for normalizing speech transcripts before computing WER

41
Emerging
1244 Spac5y/Vocal-Agent

A cutting-edge Cascading voice assistant combining real-time speech...

41
Emerging
1245 nodef/extra-amazontts

Generate speech audio from super long text through machine (via "Amazon...

41
Emerging
1246 advanced-media-inc/amivoice-api-client-library

AmiVoice API Client Library and the sample programs

41
Emerging
1247 OvidijusParsiunas/speech-to-element

A simple way to add speech-to-text functionality to your website 🎤

41
Emerging
1248 cboard-org/ccboard

Cordova wrapper for the Cboard application

41
Emerging
1249 ae9is/subtitle-chan

Live speech transcription and translation in your browser

41
Emerging
1250 botbahlul/pyvosklivesubtitle

PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23...

41
Emerging
1251 holm-aune-bachelor2018/ctc

Speech recognition with CTC in Keras with Tensorflow backend

41
Emerging
1252 nl8590687/ASRT_SDK_Python3

Python SDK for the ASRT speech recognition system

41
Emerging
1253 kapi2800/qwen3-tts-apple-silicon

Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning,...

41
Emerging
1254 TUD-STKS/VocalTractLabBackend-dev

The VocalTractLab backend sources and C/C++ API

41
Emerging
1255 jing332/tts-server-go

Forwards the Microsoft TTS service so that reading apps can listen to Microsoft TTS / Edge Read Aloud via network import

41
Emerging
1256 mattmireles/kokoro-coreml

PyTorch → CoreML conversion pipeline for Kokoro TTS. Unlocks fast on-device...

41
Emerging
1257 Renovamen/Speech-and-Text

Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech...

41
Emerging
1258 alexiokay/AriLink

Modern ARI-STASI server, built on Asterisk ARI with real-time speech-to-text...

41
Emerging
1259 kristofferv98/VoiceProcessingToolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for...

40
Emerging
1260 n0name45/node-red-contrib-yandex-station-management

The node-red-contrib-yandex-station-management module for controlling smart...

40
Emerging
1261 FedericaPaoli1/stm32-speech-recognition-and-traduction

stm32-speech-recognition-and-traduction is a project developed for the...

40
Emerging
1262 kaituoxu/Listen-Attend-Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End...

40
Emerging
1263 GlobalTechInfo/gspeak

Google Text to Speech for Node.js — modern, typed, zero deprecated dependencies.

40
Emerging
1264 bnsantoso/sub-to-audio

Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS...

40
Emerging
1265 satyam9090/Automatic-Indian-Sign-Language-Translator-ISL

I created an application which takes in live speech or audio recording as...

40
Emerging
1266 awslabs/speech-representations

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

40
Emerging
1267 fcjr/ltts

Quick CLI for local text-to-speech using Qwen3-TTS or Kokoro TTS.

40
Emerging
1268 holgern/kokorog2p

A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS.

40
Emerging
1269 kosich/rxjs-tts

RxJS wrapper for Text-to-Speech Web API

40
Emerging
1270 zh217/torch-asg

Auto Segmentation Criterion (ASG) implemented in PyTorch

40
Emerging
1271 Nighthawk42/mOrpheus

Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.

40
Emerging
1272 ritazh/EchoML

🔉 A web app to play, visualize, and annotate your audio files for machine learning

40
Emerging
1273 deepgram-starters/flask-text-to-speech

Get started using Deepgram's Text-to-Speech with this Flask demo app

40
Emerging
1274 DangerDaza/Dooms-Enhancement-Suite

An immersive RPG enhancement extension for SillyTavern — character tracking,...

40
Emerging
1275 lokkelvin2/tacotron2-tts-GUI

Text To Speech (TTS) GUI wrapper for NVIDIA Tacotron 2+Waveglow. For custom...

40
Emerging
1276 shahules786/mayavoz

PyTorch-based speech enhancement toolkit.

40
Emerging
1277 tianbot/rosecho

Tianbot Rosecho (Tianecho), a Chinese-language voice human-machine interaction module with plug-and-play ROS support

40
Emerging
1278 weimeng23/speech-recognition-learning-resources

✅ A list of speech recognition learning resources including...

40
Emerging
1279 nikhilunni/demucs-rs

Rust powered waveform source separation

40
Emerging
1280 deepgram-starters/flask-voice-agent

Flask WebSocket proxy for Deepgram's Voice Agent API

40
Emerging
1281 smx-smx/KodiSharp

Use Kodi python APIs in C#, and write rich addons using the .NET framework/Mono

40
Emerging
1282 PhilippeRo/IBus-Speech-To-Text

A speech to text IBus engine using VOSK

40
Emerging
1283 sl5net/SL5-aura-service

Your offline, privacy-first voice assistant framework. Transform speech into...

40
Emerging
1284 maum-ai/wavegrad2

Unofficial PyTorch Implementation of WaveGrad2

40
Emerging
1285 oscie57/tiktok-voice

Simple Python script to interact with the TikTok TTS API

40
Emerging
1286 EndlessReform/fish-speech.rs

A Fish Speech implementation in Rust, with Candle.rs

40
Emerging
1287 qforge-dev/qspeak

qSpeak is a powerful voice transcription and AI assistant tool that helps...

40
Emerging
1288 robmsmt/ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR...

40
Emerging
1289 loretoparisi/wave2vec-recognize-docker

wav2vec 2.0 Recognize pipeline

40
Emerging
1290 FaceOnLive/Spleeter-Android-iOS

On-device, Offline Spleeter Solution For Mobile

40
Emerging
1291 Igorcbraz/Calculadora

📐 Simple and intuitive calculator with support for voice commands and themes...

40
Emerging
1292 definitio/ha-rhvoice

Home Assistant integration for RHVoice - a local text-to-speech engine.

40
Emerging
1293 DrewThomasson/ebook2audiobookpiper-tts

Converts ebooks into audiobooks with piper-tts

40
Emerging
1294 tugstugi/mongolian-speech-recognition

Mongolian speech recognition with PyTorch

40
Emerging
1295 carleeno/elevenlabs_tts

Custom TTS Integration using ElevenLabs API

40
Emerging
1296 sunshine0523/MNNServer

A third-party MNN server supporting external calls, embedding model, TTS...

40
Emerging
1297 keonlee9420/Robust_Fine_Grained_Prosody_Control

PyTorch Implementation of Robust and fine-grained prosody control of...

40
Emerging
1298 QiBowen2008/SuperTextToolBox

A free text-processing toolbox

40
Emerging
1299 coqui-ai/STT-models

Open models for Coqui STT

40
Emerging
1300 jing332/tts-server-android

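An Android system TTS app with a built-in Microsoft demo interface, customizable HTTP requests, support for importing other local TTS engines, and read-aloud with simple narration/dialogue detection based on Chinese double quotation marks...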

40
Emerging