All Voice AI Tools

6,981 tools ranked by quality score · Page 43 of 70

Showing 4201–4300 of 6,981
# Tool Score Tier
4201 josephrocca/lyra-v2-soundstream-web

Lyra V2 (SoundStream) running in the browser

20
Experimental
4202 PierreChouteau/umss_icassp

ICASSP 2024 paper - A Fully Differentiable Model for Unsupervised Singing...

20
Experimental
4203 chandachewe10/whisper-open-ai

Transcribe and Translate Audio to Text using whisper open-ai

20
Experimental
4204 paddy41601/faster-whisper-cli

A command-line interface wrapper for Faster Whisper

20
Experimental
4205 D34DC3N73R/ha-chatterbox-tts

Home Assistant TTS integration for Chatterbox-TTS-Server

20
Experimental
4206 aklos/gpt3-personal-assistant

Interact with GPT-3 through speech

20
Experimental
4207 masasibata/t-one-rest-api

Production-ready REST API for Russian speech recognition using T-one model....

20
Experimental
4208 ali-ibnouf/SmartTalker

Digital Human AI Agent Platform — Real-time talking avatar with Arabic-first support

20
Experimental
4209 famda/semantics

Semantics CLI - Unified interface for media intelligence

20
Experimental
4210 francescomalatesta/php-google-tts-example

A basic example script to use Google Cloud Text-to-Speech APIs

20
Experimental
4211 NDharshan/iNeuron-Blind-Navigation

This project attempts to create a system which would bring in added ease to...

20
Experimental
4212 rupac4530-creator/ai-desktop-assistant

Voice-controlled AI desktop assistant | 100% local & private | Whisper +...

20
Experimental
4213 anoyetta/CeVIOAIProxy

CeVIO AI に棒読みちゃんと同等のTCPソケットインターフェースを生やすアプリケーションです。CastCraft...

20
Experimental
4214 godmode2k/whisper.cpp.android

whisper.cpp.android with CLBlast(OpenCL), Translation (Google ML-Kit) and TTS

20
Experimental
4215 KevinSJ/rss-to-speech

Use Google Text-To-Speech to read long articles from rss feed

20
Experimental
4216 tjwodud04/Master-Course-Project

Master course team project code files (석사과정 참여과제 코드 파일)

20
Experimental
4217 Tanaka-zi/VoiceR

VoiceR is a Linux voice control app that lets you control games using speech...

20
Experimental
4218 alorbach/open-video-transcribe

Open Video Transcribe - Open-source video transcription tool that emphasizes...

20
Experimental
4219 shreyamalogi/Text-To-Speech

"Transform Your Words into Sonic Spells with Shreya's Text-to-Speech...

20
Experimental
4220 Fortyseven/Vibrance

Local voice-to-text transcription tool 🗣️📢🖮

20
Experimental
4221 bishal7679/ASL-Transformer

A user-friendly application for converting either audio or text into sign...

20
Experimental
4222 michael-borck/video-lens

Analyzes presentation videos using speech transcription, computer vision,...

20
Experimental
4223 Kaljurand/EKISpeak

Implementation of Android's TextToSpeechService that provides Estonian text-to-speech

20
Experimental
4224 Neka-Ev/Live2D-AI-Vivian

基于 PyQt5 与 Live2D 的桌面 AI 伴侣“薇薇安”。集成了 LLM 对话、本地/云端语音识别 (ASR) 与高表现力语音合成...

20
Experimental
4225 folubebe/gemini_realtime_speech_to_text

Real-time speech translation using Google Gemini API for free

20
Experimental
4226 rockywuest/kawaii-bath-assistant

🛁 Cute AI-powered bathroom assistant for M5Stack Core 2 — kawaii face,...

20
Experimental
4227 tihu-nlp/tihu-native

Persian text-to-speech on web and mobile using expo react-native

20
Experimental
4228 yashasviyadav30/Omnibox

📦 AI-powered CLI utility with voice support - One Tool, Infinite Possibilities

20
Experimental
4229 shinshekai/VoxForge-Pro

VoxForge Pro is a premium, offline audiobook generator powered by Kokoro-82M...

20
Experimental
4230 deepgram-devs/talk-time-analytics

Sample app for generating and displaying speaker talk-time using the...

20
Experimental
4231 lukaszliniewicz/Subdub

A command line Python app offering a video-to-dubbed-video workflow with...

20
Experimental
4232 fuota-io/The-Things-Network-NodeJS-SDK

The user-friendly Node.js SDK to boost connectivity and data management...

20
Experimental
4233 CPCCoder/SoundMatchAnalyser

SoundMatchAnalyser (SMA) is a powerful tool designed to analyze and compare...

20
Experimental
4234 artryazanov/gemini-speech-to-speech-translator

Transform your audio content into any language with high accuracy and...

20
Experimental
4235 LINSUISHENG034/Qwen3-ASR-Desktop

Modern PyQt6 desktop GUI for Qwen3-ASR with batch transcription support

20
Experimental
4236 joe62/TalkingClipboard

文本朗读工具,文本转MP3

20
Experimental
4237 Xinghui-Wu/KENKU

KENKU: Towards Efficient and Stealthy Black-box Adversarial Attacks against...

20
Experimental
4238 sil-ai/tts-singlish

TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.

20
Experimental
4239 i-celeste-aurora/katip

A SFSpeechRecognizer-based voice recordings transcriber for macOS

20
Experimental
4240 Acumane/lectern

Listen to PDFs with natural TTS and read-along text prompts

20
Experimental
4241 msalhab96/Conformer

An implementation for "Conformer: Convolution-augmented Transformer for...

20
Experimental
4242 husseinnsourr/NeuralChatter

A Next-Generation Neural TTS Engine. High-quality, human-like voice...

20
Experimental
4243 scruss/micropython-SYN6988

MicroPython library for the VoiceTX SYN6988 text to speech module

20
Experimental
4244 p1an-lin-jung/WavThruVec_pytorch

An implementation of Charactr, Inc's "WavThruVec: Latent speech...

20
Experimental
4245 SVM0N/ttsweb

Convert PDFs/EPUBs to audiobooks with synchronized text highlighting using...

20
Experimental
4246 kyegomez/SoundStream

Implementation of SoundtStream from the paper: "SoundStream: An End-to-End...

20
Experimental
4247 Kit4Some/Voice_opencode

The open source vibe_voice coding agent.

20
Experimental
4248 hi-paris/CosyVoice2-EU

Europeanized CosyVoice2 for French & German

20
Experimental
4249 danamini/aichat

Speech-to-Speech conversational AI using Azure OpenAI Service and Azure...

20
Experimental
4250 HelgeSverre/glados

A web interface for GLaDOS text-to-speech with AI conversation capabilities

20
Experimental
4251 tangming579/text-to-speech

文字转语音Demo,分别使用百度云、科大讯飞、有道云实现

20
Experimental
4252 mahirgul/GoogleTTS.Net

.Net dll that uses Google's translate text to speech service.

20
Experimental
4253 Ajay-user/Streamlit-ElevenLabs-Text2Speech

Text to Speech by ElevenLabs

20
Experimental
4254 QuantiusBenignus/voluble

Let your GNOME desktop speak to you. Reads your desktop notifications or...

20
Experimental
4255 chirag127/ContextChat-AI-Webpage-Conversational-Browser-Extension

An AI-powered browser extension to chat directly with any webpage's content....

20
Experimental
4256 iconclub/zalo-tts

Zalo Text-To-Speech for python

20
Experimental
4257 matlab-deep-learning/Use-a-Python-Speech-Command-Recognition-System-to-MATLAB

Use a Python speech command recognition system in MATLAB

20
Experimental
4258 hecx333/edge-tts-go

一个用于 Microsoft Edge 在线文本转语音服务的 Go 语言库。 本项目允许您免费使用 Microsoft Edge 的高质量神经 TTS 语音。

20
Experimental
4259 siva-sub/pocket-tts-openapi-gpu

GPU-enhanced Pocket TTS with Remotion + TikTok captions

20
Experimental
4260 AmirAbaskohi/Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children

Preschool evaluation is crucial because it gives teachers and parents...

20
Experimental
4261 dog0sd/sven

elevenlabs powered TTS utility

20
Experimental
4262 rodrigoguedes09/multimodal-medical-assistant

End-to-end intelligent automation system for medical clinics, combining REST...

20
Experimental
4263 vishwakneelamegam/deepspeech-android

i have build speech recognition app using mozilla deepspeech

20
Experimental
4264 hyunjoonbok/natural-language-processing

Ready-to-use Implementation of Natural Language Processing models in...

20
Experimental
4265 Kirili4ik/QuartzNet-ASR-pytorch

Automatic Speech Recognition (ASR) model QuartzNet trained on English...

20
Experimental
4266 SPACESODA/read-txt

Read TXT is a lightweight text-to-speech reader with auto language detection...

20
Experimental
4267 virajbhutada/speech-emotion-recognition

This repository houses a robust speech emotion recognition system, featuring...

20
Experimental
4268 ImPavloh/WhiTTsper-The-Lora

Demo combining Whisper for speech recognition and Google TTS for speech...

20
Experimental
4269 oleglegun/polly-ru-ssml

Enhance AWS Polly TTS pronunciation for english words within russian text

20
Experimental
4270 andybi7676/reborn-uasr

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training...

20
Experimental
4271 h-iori/AI-Desktop-Assistant-Python-OpenAI

This project is a work-in-progress AI desktop assistant powered by OpenAI...

20
Experimental
4272 AlpinDale/kizuna

Fast TTS Library for Kokoro

20
Experimental
4273 ilyamiro/stewart

Personal voice assistant

20
Experimental
4274 jsbxyyx/tts_java

微软文本转语音工具

20
Experimental
4275 nickpending/lspeak

Speaks terminal output with semantic caching and serial playback

20
Experimental
4276 ArenAcikgoz/Whisper-Alignment

Forced alignment decoder for Whisper.

20
Experimental
4277 awasthiabhijeet/Error-Driven-ASR-Personalization

Code for "Error-driven Fixed-Budget ASR Personalization for Accented...

20
Experimental
4278 divshekhar/Jarvis

A Voice Assistant - Jarvis

20
Experimental
4279 AliceAuto/obsidian-auto-word-audio

一个为 Obsidian 单词笔记自动添加音频发音的插件

20
Experimental
4280 mcp-tool-shop-org/audiobooker

AI Audiobook Generator - Convert EPUB/TXT books into professionally narrated...

20
Experimental
4281 nerdpudding/nerdpudding

The proof is in the pudding. Real-time AI video commentary with...

20
Experimental
4282 joszuijderwijk/BarryBox

BarryBox is an MQTT controlled TTS Speaker. You can hook it up to the...

20
Experimental
4283 devp19/MyBuddy

Generative AI Therapist built using Google-Cloud's Speech-To-Text...

20
Experimental
4284 harishkotra/Voice-to-Text-Ionic

Ionic Framework example app for both iOS and Android to convert voice to...

20
Experimental
4285 steveseguin/tts.rocks

Cutting-edge Text-to-Speech in the browser - for free

20
Experimental
4286 symblai/real-time-speech-recognition-with-websockets

Use Symbl.ai's Streaming API to create real-time speech recognition with...

20
Experimental
4287 fxnoob/speech-recognition-toolkit

Voice control for chrome browser

20
Experimental
4288 Pogayo/african-voices-web

Website that hosts the African Voices projects. Users can download datasets...

20
Experimental
4289 shreyamalogi/ZAC-the-AI-Assistant

ZAC: Your robotic virtual assistant - Enhancing human-machine interaction...

20
Experimental
4290 qcri/Arabic_speech_code_switching

The first Dialectal Arabic Code Switching - DACS corpus from broadcast...

20
Experimental
4291 igor-lirussi/Dialogue-Pepper-Robot

it provides Pepper Robot conversation abilities to handle a free open-domain...

20
Experimental
4292 asafu-art/deepspeech-kabyle

Automatic Speech Recognition (ASR) - Kabyle

20
Experimental
4293 nmanikiran/browser-apis

There are a large number of Web / Browser APIs available. This repo...

20
Experimental
4294 Hlid-Systems/vanaheim-audio-generator

🔊 Professional Audio Simulation Microservice (Hlid Systems). Orchestrates...

20
Experimental
4295 aidayang/index-tts-OneClick

index-tts2声音克隆软件免安装一键启动整合包

20
Experimental
4296 erasedwalt/CTC-ASR

An implementation of Jasper, QuartzNet, Citrinet and pipeline for training...

20
Experimental
4297 f76tbntbww-crypto/VoiceForge

One-click local AI voice assistant powered by ASR+LLM+TTS, 100% coded by...

20
Experimental
4298 madcato/bl-speech-recognizer

Some implemented use cases for SFSpeechRecognizer

20
Experimental
4299 flumi3/speech-to-text

Transcribe audio files with Azure Cognitive Services

20
Experimental
4300 exemplaryai/ai-engine

Easy to use Multi-Provider ASR/Speech To Text and NLP engine

20
Experimental
« Prev 1 2 3 41 42 43 44 45 68 69 70 Next »