All Voice AI Tools

6,981 tools ranked by quality score · Page 22 of 70

Showing 2101–2200 of 6,981
# Tool Score Tier
2101 ekleziast/kiwi-voice

Voice interface for OpenClaw with speaker recognition, voice-gated security,...

32
Emerging
2102 seanhweb/Twitch-Text-to-Speech

Text to speech tool for twitch

32
Emerging
2103 umair13adil/background_stt

A flutter plugin to run always-on speech to text service in the background.

32
Emerging
2104 harisbinzia/PronouncUR

PronouncUR: An Urdu Pronunciation Lexicon Generator

32
Emerging
2105 Kyubyong/speaker_adapted_tts

Making a TTS model with 1 minute of speech samples within 10 minutes

32
Emerging
2106 aldysetiaa/voice_indonesian

Contoh kode program teks ke suara offline, text to speech offline, biasanya...

32
Emerging
2107 nishanth-kj/VoxLabs

Text to Speech

32
Emerging
2108 SyntropyLabs/react-web-speech

Add voice input to React apps in minutes. useSpeechInput handles mic...

32
Emerging
2109 Wikidepia/indonesian-tts

Indonesian TTS (text-to-speech) using Coqui TTS

32
Emerging
2110 hathibelagal-dev/str2speech

An easy-to-use library and command-line tool for TTS

32
Emerging
2111 manishdhakal/ASR-Nepali-using-CNN-BiLSTM-ResNet

Automatic speech recognition for the Nepali language using CNN,...

32
Emerging
2112 consulfedor/VoiceGrab

🎙️ Voice-to-Text Bridge for AI & Any Application. Record voice → Get text →...

32
Emerging
2113 ScottishFold007/TTSAudioNormalizer

TTSAudioNormalizer is a specialized tool for TTS data production,...

32
Emerging
2114 fewieden/MMM-TTS

Text-To-Speech Module for MagicMirror²

32
Emerging
2115 tariqjamel/Flutter-Chat-Bot

A Flutter-based AI chatbot that allows interaction through text, voice, and...

32
Emerging
2116 Gaurav890/vocal-stack

vocal-stack is a high-performance utility library for developers building...

32
Emerging
2117 NICEElevateAI/ElevateAIPythonSDK

ElevateAI - Speech-to-text API Python SDK

32
Emerging
2118 guan-yuan/Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion

A paper and project list about the cutting edge Speech Synthesis,...

32
Emerging
2119 zzw922cn/LPC_for_TTS

Linear Prediction Coefficients estimation from mel-spectrogram implemented...

32
Emerging
2120 bkbilly/asterisk-assistant

📞 AGI interface with python for speech recognition

32
Emerging
2121 TeaPoly/CTC-OptimizedLoss

Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge...

32
Emerging
2122 ancs21/awesome-openai-whisper

A curated list of awesome OpenAI's Whisper

32
Emerging
2123 atrzaska/VoiceStressAnalysis

VoiceStressAnalysis - Detects stress in your voice

32
Emerging
2124 USStateDept/State-TalentMAP-API

Source Code - https://github.com/USStateDept/State-TalentMAP

32
Emerging
2125 hmartelb/speech-denoising

Speech Denoising project for the Deep Learning course at Tsinghua...

32
Emerging
2126 ProperCode/Work-by-Speech

Windows app which allows efficient work on a computer by speech alone.

32
Emerging
2127 aks-devs/mod_google_tts

Freeswitch Text-To-Speech module

32
Emerging
2128 soldier444xd/KittenTTS

KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with...

32
Emerging
2129 srinivr/kaldi-long-audio-alignment

Long audio alignment using Kaldi

32
Emerging
2130 AntoBrandi/Robotics-and-ROS-Learn-by-Doing-Manipulators

3D Printed robot arm powered by ROS and Arduino and controlled via MoveIt!...

32
Emerging
2131 smtiitm/Fastspeech2_MFA

Indic TTS for Indian Languages: This is a project on developing...

32
Emerging
2132 asrajeh/arabic-tts

Arabic TTS ( الناطق العربي )

32
Emerging
2133 1038lab/ComfyUI-FireRedTTS

A ComfyUI integration for FireRedTTS‑2, a real-time multi-speaker TTS system...

32
Emerging
2134 tnicola/vue-voice

Speech to text and text to speech Vue library

32
Emerging
2135 allseeteam/ai-secretary

Smart assistant in Telegram bot format for transcribing online meetings

32
Emerging
2136 ShawnPi233/HQ-SVC

Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice...

32
Emerging
2137 ACT900/faster-whisper-railway

Deploy Faster Whisper on Railway — Speech-to-Text & Text-to-Speech API with 52 voices

32
Emerging
2138 GmEsoft/SP0256_CTS256A-AL2

G.I./Microchip SP0256 Speech Processor and CTS256A-AL2 Text-To-Speech...

32
Emerging
2139 aperepel/claude-mlx-tts

Voice-cloned smart attention TTS notifications for Claude Code. AI...

32
Emerging
2140 llm-believer/slide-to-video

A tool that converts a slide deck into a video, complete with your voice...

32
Emerging
2141 GreenSheep01201/claw-voice-chat

Push-to-talk voice chat interface for OpenClaw channels

32
Emerging
2142 wamich/personal-vocabulary

「个人词库」是一款浏览器插件。 用于英文阅读时,不断记住生词,构建个人词库。

32
Emerging
2143 zhaopuyang/golang-tts

Microsoft TTS (Text-To-Speech) for golang

32
Emerging
2144 bundlab/voice-stream

🎙️ Lightweight offline Python TTS engine. Thread-safe, CLI-ready, and...

32
Emerging
2145 nuance-communications/mix-demo-client-azstaticwebapps

Nuance Mix Demo Client for use with Azure Static Web Apps

32
Emerging
2146 Anwarvic/RasaChatbot-with-ASR-and-TTS

This repository contains an attempt to incorporate Rasa Chatbot with...

32
Emerging
2147 bhashini-ai/bhashini-api-examples

Sample programs for calling Bhashini.ai REST/WebSocket APIs - TTS, STT/ASR,...

32
Emerging
2148 gauthelo/kallaama-speech-dataset

A transcribed speech dataset in Wolof, Pulaar and Sereer, to support...

32
Emerging
2149 klima-sec/piper-tts-call

Python wrapper for Piper TTS with real-time CLI/GUI, global hotkeys, and...

32
Emerging
2150 YChenL/DS-TDNN

Official implement of "Dual-stream Time-Delay Neural Network with Dynamic...

32
Emerging
2151 nodef/wikipedia-tts

Crawl Wikipedia pages and upload TTS to Youtube.

32
Emerging
2152 egorsmkv/speech-recognition-uk

🇺🇦 Speech Recognition & Synthesis for Ukrainian

32
Emerging
2153 moutaouakkil/tts-text-to-speech

Text-to-Speech (TTS) enables developers to synthesize natural-sounding...

32
Emerging
2154 Anwarvic/Arabic-Speech-Recognition

This repository contains my attempt to use two famous speech recognition...

32
Emerging
2155 M0Rf30/shisper

A quick & dirty script to generate and view subtitles and transcriptions for...

32
Emerging
2156 loretoparisi/htk

HTK Toolkit with Linux 64 bit and Docker support

32
Emerging
2157 hariketsheth/Article_Repository_Management_System

In this Tech Savvy era, with lot of advancements in the field of AI, ML, IoT...

32
Emerging
2158 EtienneAb3d/WhisperHallu

Experimental code: sound file preprocessing to optimize Whisper...

32
Emerging
2159 dennisbergevin/cypress-voice-plugin

Cypress plugin to announce spec result and time in Cypress Test Runner

32
Emerging
2160 paladini/voice-separator-demucs

A simple and efficient self-hosted application to separate vocals from music...

32
Emerging
2161 sipeter/CloneTTS

A lightweight, offline Android Text-to-Speech (TTS) engine enabling seamless...

32
Emerging
2162 nickpending/clarvis

Jarvis-style voice notifications for Claude Code that transforms AI...

32
Emerging
2163 18F/tts-buy-bug-bounty

Solicitation and acquisition documents created for the TTS Bug Bounty...

32
Emerging
2164 lifeiteng/TTS-TextAnalyzer

TTS Text Analyzer

32
Emerging
2165 just-ai/aimybox-ios-sdk

Voice assistant SDK for iOS devices written in Swift

32
Emerging
2166 collectivat/cmusphinx-models

Acoustic and language models for minorised languages.

32
Emerging
2167 mostafaelaraby/Tensorflow-Keyword-Spotting

Keyword spotting using various architecture like convolutional vggnet , 1D...

32
Emerging
2168 privapps/TTS-Mandarin

text to speech in mandarin

32
Emerging
2169 uiuc-sst/asr24

24-hour Automatic Speech Recognition

32
Emerging
2170 Harium/espeak-java

espeak java wrapper

32
Emerging
2171 Wurielle/izabela-desktop

A proof of concept text-to-speech application allowing global typing. Can be...

32
Emerging
2172 appurist/say2file

This utility uses either ElevenLabs or IBM's Watson AI text-to-speech API to...

32
Emerging
2173 IBM/mic-sts-nlu-weather-tone-analyzer

# WARNING: This repository is no longer maintained :warning: > This...

31
Emerging
2174 mdingena/att-voodoo

A community-made magic mod for A Township Tale, a VR MMORPG game.

31
Emerging
2175 deepgram-devs/dg-translation-chrome-ext

A TypeScript chrome extension that uses Deepgram to provide live...

31
Emerging
2176 Better-Player/espeakng-sys

Rust bindings to eSpeak NG

31
Emerging
2177 d-j-e/SNPPar

Parallel/Homoplasic SNP Finder

31
Emerging
2178 m15-ai/Faster-Local-Voice-AI

A real-time, fully local voice AI system optimized for low-resource devices...

31
Emerging
2179 ttop32/wav2vec2-live-japanese-translator

real time japanese speech recognition translator using wav2vec2

31
Emerging
2180 rishikksh20/Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based...

31
Emerging
2181 LM-Kit/LynxTranscribe

LynxTranscribe is a comprehensive, professional-grade audio transcription...

31
Emerging
2182 henry-richard7/Natural-Text-to-Speech

This python program uses https://naturaltts.com API to convert given text to...

31
Emerging
2183 AndroidCodility/SpeechToText

Android application to text through which you can provide speech input to...

31
Emerging
2184 stensmir/mimir

Offline voice-to-text for macOS. No cloud, no tracking.

31
Emerging
2185 botbahlul/whisper_autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using...

31
Emerging
2186 amadeomano/persian-tts

🔊 A simple human-based text-to-speach synthesiser and ReactNative app for...

31
Emerging
2187 kangyiwen/TTSlist

10000 chatTTS voices !chatTTS...

31
Emerging
2188 asiff00/Training-TTS

Train and finutune text-to-speech models for Bengali and many other languages!

31
Emerging
2189 lars76/fastspeech2-clean

Clean and modernized implementation of FastSpeech2/LightSpeech using IPA

31
Emerging
2190 huakunyang/SummerAsr

SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese...

31
Emerging
2191 AsaoluElijah/say-it

A mobile web application that helps you convert spoken words to...

31
Emerging
2192 far-analytics/dialog

A modular framework for building VoIP-Agent applications.

31
Emerging
2193 umutciftci/mp3totext

Convert audio file to text

31
Emerging
2194 Deepak5j/PyTranscriber

Speech to Text

31
Emerging
2195 wattyven/Live-Stream-TL

A real-time translation application that uses Vosk and the OpenAI API, with...

31
Emerging
2196 nssharmaofficial/reddit-hole

Automated reddit scraper and video creator

31
Emerging
2197 tristan-mcinnis/Simultaneous-Interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous...

31
Emerging
2198 IBM/text-to-speech-code-pattern

WARNING: This repository is no longer maintained

31
Emerging
2199 ottoweiss/pdf-to-audiobook

Uses OpenAI API to clean pdf then converts it to professional grade...

31
Emerging
2200 sglkc/tts-api

Free, minimal, unlimited*, CORS-friendly Google Translate Text to Speech API...

31
Emerging
« Prev 1 2 3 20 21 22 23 24 68 69 70 Next »