All Voice AI Tools

6,981 tools ranked by quality score · Page 33 of 70

Showing 3201–3300 of 6,981
# Tool Score Tier
3201 CodeSarthak/autopilot-shorts

Fully automated YouTube Shorts pipeline with a self-improving feedback loop....

24
Experimental
3202 zhurlik/smart-home

A multi-project that contains UDP server, MQTT broker and a few sub-projects...

24
Experimental
3203 khalooei/Voxtral-AI-Demo-Local-Interface

Voxtral is a state-of-the-art model developed to handle both speech...

24
Experimental
3204 kouyt5/lightning-asr

基于pytorch-lighting框架搭建的端到端语音识别模型,目前还在实验中,性能在不断优化

24
Experimental
3205 wildminder/awesome-ai-voice

List of open-source TTS, voice cloning, and music generation models

24
Experimental
3206 polterguy/magic-menu

An alternative input module for Phosphorus Five, allowing you to use natural...

24
Experimental
3207 AIOCW/EasyAss

Python3智能语音助手

24
Experimental
3208 otaviocc/Stenographer

A macOS Tahoe app for transcribing audio/video files using Apple's on-device...

24
Experimental
3209 shricodev/google-sheet-super-agent

An AI Agent to work with Google Sheet using Natural Language

24
Experimental
3210 1epalpyrgou/smartbell-server

Ένα έξυπνο κουδούνι για το σχολείο μας - 1ο Επαγγελματικό Λύκειο Πύργου

24
Experimental
3211 wukan1986/KWebSpeaker

保持原排版可选段的网页朗读神器

24
Experimental
3212 Sajith171111/whisper

🗣️ Transcribe your voice to text easily on macOS. Just hold **Fn**, speak,...

24
Experimental
3213 dive2Pro/AI-Waifu-Vtuber

AI Vtuber for Streaming on Youtube/Twitch

24
Experimental
3214 tonywu71/distilling-and-forgetting-in-large-pre-trained-models

Code for my dissertation on "Distilling and Forgetting in Large Pre-Trained...

24
Experimental
3215 nguyenanhtuan203207-arch/AI-Waifu-Vtuber

AI Vtuber for Streaming on Youtube/Twitch

24
Experimental
3216 egorsmkv/ukrainian-onnx-model

An ONNX model for speech recognition of the Ukrainian language

24
Experimental
3217 MaikeMota/comando-voz

Utilizando HTML5 SpeechRecognizer para Reconhecimento de Comandos.

24
Experimental
3218 Ben-jilo/awesome-faceless

🚀 Discover AI tools and resources to create faceless content for YouTube,...

24
Experimental
3219 alex-yelisieiev/advanced-speech-transcription

A minimalistic yet powerful voice transcribing app that features precise and...

24
Experimental
3220 jetblinx/sonus

Speech recognition and synthesis app

24
Experimental
3221 dhdaines/soundswallower-demo

Simple demo of client-side speech recognition

24
Experimental
3222 IDEA-Emdoor-Lab/UniTTS

A TTS Trained on Universal Audio.

24
Experimental
3223 pragmatrix/context-switch

Audio Streaming for FreeSWITCH with backends powered by Azure, OpenAI, and Aristech

24
Experimental
3224 aeleraqi/Text-to-Speech-gTTS---English-text

Easy-to-use Python library for converting English text into natural sounding...

24
Experimental
3225 Baisampayan1324/AI-MOM

Al MOM is an Al-powered meeting intelligence platform that delivers...

24
Experimental
3226 mathieutrudeau/Fast-TTS

API that uses Tortoise and RVC to speed up text-to-speech generation.

24
Experimental
3227 anonfaded/robospeaker101

Python tool for text-to-speech conversion with voice selection, usage...

24
Experimental
3228 guibranco/talabat-hackathon-2022

🏃 💡 Talabat Hackathon 2022 API project

24
Experimental
3229 PalabraAI/palabra-ai-java

Java SDK for Palabra AI's real-time speech-to-speech translation API. Break...

24
Experimental
3230 tsengia/SphinxTrainHelper

A Bash script designed to make training sphinx4 and pocketsphinx acoustic...

24
Experimental
3231 Lion-Wu/Voice-Chat

An app that allows you to have voice conversations using the API of AI...

24
Experimental
3232 Zuhef/Text-to-Speech

USING HTML , CSS AND JAVASCRIPT I HAVE BUILD A SIMPLE TEXT TO SPEECH CONVERTER.

24
Experimental
3233 zzpuser/SnapDict

macOS AI 翻译词典,基于 DeepSeek 提供智能翻译、词根助记、拼写纠正和语音朗读 | AI-powered dictionary app...

24
Experimental
3234 tetratensor/Next.js-Cloudflare-Voice-AI

🎤 A real-time voice AI assistant built with Next.js and deployed on...

24
Experimental
3235 erogol/ddc-samples

🐸💬 Coqui TTS Double Decoder Consistency samples

24
Experimental
3236 deapi-ai/claude-code-skills

YouTube/audio transcription, image, video generation, AI voice (TTS) & OCR...

24
Experimental
3237 speechnotes/speechnotes-speech-recognizer

The speech recognition engine behind Speechnotes, based on the Webspeech-API

24
Experimental
3238 hutchpd/AI-Medical-Scribe

Local-first AI medical scribe running entirely in the browser using Chrome...

24
Experimental
3239 GitPolyakoff/voice-assistant

Voice Assistant — приложение на C# для управления компьютером голосом....

24
Experimental
3240 danklabs/tts_dataset_maker

A gui to help make a text to speech dataset.

24
Experimental
3241 eddmann/VoiceScribe

Privacy-first macOS transcription app with global hotkey recording. 100%...

24
Experimental
3242 irismaker/pdf-to-audio

A flexible Python tool for converting PDF documents to audio using various...

24
Experimental
3243 ABashir88/enterprise-voice-ai-architectures

Reference architectures, cost models, and sales-engineering playbooks for...

24
Experimental
3244 milosgajdos/go-playht

PlayHT API client Go module

24
Experimental
3245 Jaffe2718/qwen3asr4j

Java binding for Qwen3 ASR

24
Experimental
3246 falabrasil/cmusphinx-br

Scripts e recursos para ASR em Português Brasileiro

24
Experimental
3247 kimtth/agentic-connected-vehicle-platform

🚗🤖 Agentic Connected Vehicle Platform — Agent orchestration 🤝, Model Context...

24
Experimental
3248 hpbyte/myanmar-tts

Myanmar Text-to-Speech with End-to-End Speech Synthesis

24
Experimental
3249 treychen-369/WallWhisper

🏠 Turn any IP camera into a smart English tutor for your family. AI-powered,...

24
Experimental
3250 seven-io/swift-client

Official Swift API Client for the seven.io SMS Gateway

24
Experimental
3251 tim-dickey/voice-clone

Consumer-grade voice cloning app using Coqui TTS - clone your voice from...

24
Experimental
3252 YOUSSEF-BT/Ai-Summarizer

AI-powered summarizer for articles, PDFs, and Word documents with...

24
Experimental
3253 arcb01/g-narrator

A screen reading accessibility tool

24
Experimental
3254 BYO-UPM/Neurovoz_Dababase

Neurovoz corpus of parkinosnian speech

24
Experimental
3255 thankcheeses/NHID-Clinical

Non-Human Identity Disclosure Standard for Healthcare Voice Workflows

24
Experimental
3256 kaveenkumar/Speech_Recognition_and_Emotion_Detection_in_English_and_German

Python project for Speech-to-Text and Sentiment Analysis. Supports English...

24
Experimental
3257 heptacode/interactivekiosk

다양한 사용자를 위한 키오스크 개선 프로젝트 ✨

24
Experimental
3258 Dante9581/laravel-elevenlabs

🎤 Integrate ElevenLabs Text-to-Speech and Speech-to-Text APIs seamlessly...

24
Experimental
3259 oeschsec/Sidekick---voice-controlled-keyboard-and-mouse

Voice controlled keyboard and mouse that is lightweight (minimal...

24
Experimental
3260 mrizwan47/vox

Python Text to Voice Package

24
Experimental
3261 HarikalarKutusu/3d-voice-chess

A voice driven 3D chess game for learning Voice AI

24
Experimental
3262 ZaneH/heybilly

🗣️ It's like Alexa, but for your computer. Highly modular, real-time voice...

24
Experimental
3263 Mildemelwe/Japanese-Tacotron-2-notebook

Training notebook for Japanese TTS model with Tacotron 2

24
Experimental
3264 jakecyr/llm-voice

Library to reduce latency in voice generations from LLM chat completion streams.

24
Experimental
3265 srinivaspedapati/Voice-Assistant-using-Speech-Recognition

Desktop based Personal Voice-Assistant Pi

24
Experimental
3266 repodiac/espeak-ng_german_loan_words

Brief tutorial with code where you can automatically create a dictionary...

24
Experimental
3267 Abhradipta/OCR-With-Read-Out-Loud-Using-Python

An Optical Character Recognition (OCR) System designed using Python to read...

24
Experimental
3268 bagustris/speech-recognition-course

Material for learning speech recognition, based on Microsoft teaching material on EdX

24
Experimental
3269 OpenVoiceOS/ovos-tts-plugin-pico

pico-tts-plugin

24
Experimental
3270 spokestack/android-skeleton

A functionless Android app that demonstrates a basic integration with the...

24
Experimental
3271 Dcros/NodeJs-AI-Live-Face-Recognition-Voice-Controlled

Its a Voice Controlled AI (Natural Language Processing) with some live face...

24
Experimental
3272 GSA/coe-hud-acq-advanced-analytics

A repository for information related to the Data Analytics team's Advanced...

24
Experimental
3273 AlasdairKing/Calendar-VB6

Simple, accessible Calendar for screenreader and blind users.

24
Experimental
3274 EliFuzz/parakeet-mlx

Parakeet MLX is a next-generation automatic speech recognition (ASR) engine...

24
Experimental
3275 belambert/asr-scripts

Lots of miscellaneous scripts to work with Sphinx ASR files and other...

24
Experimental
3276 funnyzak/aliyun-nls

阿里云智能语音处理 Node 模块。

24
Experimental
3277 Alexmhack/Django-PDF-Audio-Reader

Uploading PDF files on Webpage, converting text present in PDF to speech and...

24
Experimental
3278 matin91/Parrot

Parrot is a Talking Alarm App which allows the user to set up to 5 Alarm or...

24
Experimental
3279 BenjaminPoncet/bobby-snips-tts

bobby-snips-tts is an implementation of snips-tts written in Node.js with...

24
Experimental
3280 limbang/text-to-speech

基于 Azure 文本转语音

24
Experimental
3281 shyhirt/AutoDub

Automatic video translator and dubber using Whisper, XTTS v2 for voice...

24
Experimental
3282 IranTechNest/PersianSpeechRecognition

Persian Speech Recognition

24
Experimental
3283 ilbash/made_mail.ru

Code and theory from Big Data Academy

24
Experimental
3284 swiss-ai-center/text-to-speech-service

Queries an API based on Edge-TTS and returns an audio file based on...

24
Experimental
3285 Dawizzer/ComfyUI-Qwen3TTS-Emotional

Voice cloning with 80+ emotions and multi-emotion mixing for ComfyUI

24
Experimental
3286 GSA/coe-hud-acq-data-visualization

A repository for information related to the Data Analytics team's Data...

24
Experimental
3287 Barbany/Multi-speaker-Neural-Vocoder

Bachelor's thesis carried at Universitat Politecnica de Catalunya in partial...

24
Experimental
3288 Ex094/VoiceCom

A Simple Voice Command Application powered by Java and Sphinx4 Speech...

24
Experimental
3289 palahsu/Greeting-PC

Greeting PC, made with simple Visual Basic Script. Run file it will executes...

24
Experimental
3290 bhadrik/Voice-Coding

Voice Coding is all about writing code by voice commands.

24
Experimental
3291 warisqr007/vq-bnf

Vector Quantizing speech representations

24
Experimental
3292 Epistates/rosellas

Automatic speech recognition (ASR) for Apple Silicon

24
Experimental
3293 localzet/tts

Веб-сервис для озвучки текстов с использованием Microsoft Edge TTS

24
Experimental
3294 aeleraqi/gTTS---Arabic-text-to-multiple-languages

Converting Arabic text to speech in various languages with the versatile...

24
Experimental
3295 voothi/20240411110510-autohotkey

This repository is a collection of personal AutoHotkey v2 scripts designed...

24
Experimental
3296 Redwiat/Language-Translator

Language Translator App - Translate text into multiple languages. Built with...

24
Experimental
3297 yeyupiaoling/VITS-PaddlePaddle

本项目是基于PaddlePaddle的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复...

24
Experimental
3298 cjh0613/vosk-android-demo-chinese

中文 vosk-android-demo

24
Experimental
3299 taoing/tts-server

微软晓晓 语音合成接口

24
Experimental
3300 viig99/esolafast

Fast C++ implementation of ESOLA using KFRLib, can be used for online...

24
Experimental
« Prev 1 2 3 31 32 33 34 35 68 69 70 Next »