All Voice AI Tools

6,981 tools ranked by quality score · Page 35 of 70

Showing 3401–3500 of 6,981
# Tool Score Tier
3401 6Morpheus6/IndicF5

High-Quality Text-to-Speech for Indian Languages

24
Experimental
3402 CodingWithEnjoy/Speech-To-Text-Python

متن به صدا | Text To Speech 😊🤩

24
Experimental
3403 scraptechguy/SpeechCheck

Speech recognition and subsequent speech evaluation, all driven by Microsoft Azure

24
Experimental
3404 VIKASRAPARTHI/Jarvis-Voice-Assistant

Jarvis is a powerful desktop voice assistant designed to enhance...

24
Experimental
3405 gwihlidal/speechtest-rs

Google Cloud text-to-speech prototype

24
Experimental
3406 violet125qq/my-live-caption-with-translation-for-macos

A Python script that captures microphone and/or system audio in real time,...

24
Experimental
3407 MatteoM95/Smart-Home-Vigilance-System

An indoor video surveillance system capable of recognizing the presence of a...

24
Experimental
3408 z1311/Face-Recognition-with-Voice-Output

Real Time Face Recognition with Voice Output System.

24
Experimental
3409 abhineetraj1/python-voice-command

This is voice command A.I. which give you output according to your predefined codes.

24
Experimental
3410 DmitryCherneckiy/speech-to-text

Telegram bot. Turns a voice message into a text message.

24
Experimental
3411 victormgross/RealVideo

📹 Create engaging video calls with RealVideo, a WebSocket-based system that...

23
Experimental
3412 Agash/TTSTextNormalization

Modern .NET10 / C#14 library to normalize text (emojis, currency, numbers,...

23
Experimental
3413 speechpro/speechpro-cloud-tts-examples

В данном репозитории представлены примеры использования синтеза речи с...

23
Experimental
3414 deepgram-starters/csharp-transcription

Get started using Deepgram's Transcription with this C# demo app

23
Experimental
3415 benrucker/JermaBot

A wacky, sound-oriented Discord bot

23
Experimental
3416 utsavpshah/SpeakingHands

This is an extension to LeapTrainer.js repository. With this project, we...

23
Experimental
3417 herrkaefer/SwiftEdgeTTS

Microsoft's Edge TTS in pure Swift

23
Experimental
3418 jharrilim/RasaDocker

Docker image with Rasa + Anaconda + Tensorflow + portaudio + PyAudio +...

23
Experimental
3419 ascender1729/AudioDictate

An efficient desktop application for transcribing audio files into text...

23
Experimental
3420 siddhantmishra1305/Anuvaad

An iOS translator that supports more that 40 languages. User can add notes...

23
Experimental
3421 haliphax/tts

Twitch text to speech overlay for OBS (using lobe-tts)

23
Experimental
3422 Lordmau5/firebot-script-elevenlabs-tts

A custom Firebot script that adds support for ElevenLabs TTS

23
Experimental
3423 allpaqa-jgk/twitch_text_to_speech_bot

Text to Speech bot using Twitch IRC for mac and (linux and windows

23
Experimental
3424 facejungle/fj_chat_to_speech

FJ Chat to Speech. Text To Speech: YouTube, Twitch

23
Experimental
3425 PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI

Demonstration for the Qwen/Qwen3-TTS-12Hz models using Daggr for modular UI...

23
Experimental
3426 miikkij/Speechos

Local-first speech AI benchmarking — compare STT, TTS, emotion & diarization...

23
Experimental
3427 rdyson/morsel

Forward links, get a daily podcast digest. Scripts that turns article URLs...

23
Experimental
3428 bibinkunjumon2020/Azure-Avatar-AI

The text to speech avatar system is a text to speech feature with vision...

23
Experimental
3429 b4rtaz/voice-assistant-net-server

Voice Assistant Server for VSCode

23
Experimental
3430 stefanwebb/unity-voice-agents

A Unity package for building open-source AI voice agents that run fully...

23
Experimental
3431 jonelo/unlock-win-tts-voices

Unlocks the Microsoft Windows TTS voices for use with other x64 applications...

23
Experimental
3432 QuyAnh2005/vits-japanese

Text to Speech for Japanese

23
Experimental
3433 agusibrahim/tiktok-tts-api

A Text-to-Speech API using TikTok’s private API to convert text into audio,...

23
Experimental
3434 nguyennpa412/simple-multimodal-ai

Simple Gradio application integrated with Hugging Face Multimodals to...

23
Experimental
3435 th33k/Luigi

LUIGI is an interactive pet robot designed for fun, companionship, and...

23
Experimental
3436 SaranshKejriwal/Harold_Finch

Face recognition via voice Commands (OpenCV Python + SpeechRecognition 3.1.3)

23
Experimental
3437 CT83/Hellin-Worki

A video conferencing platform which seamlessly dials your coworkers when you...

23
Experimental
3438 straff2002/OpenGlasses

Use Meta Rayban glasses with alternative providers

23
Experimental
3439 DevashishPrasad/Virtual-AI-assistant

This repository contains my Bachelor's degree final year project. It is a...

23
Experimental
3440 devfinwiz/Python-Voice-Assistant-Virtual-Slave

This voice assistant is buit in VS Code. It has an ability to understand...

23
Experimental
3441 fwcd/okpi

Virtual assistant with offline voice recognition for Raspberry Pi

23
Experimental
3442 2017fandrei/ForcedAlignment

Graphical utility for forced alignment using aeneas, an interactive audio player

23
Experimental
3443 FlyingPolarBear/CityKBQA

Xiaode: a Knowledge Based Question Answering System with Speech IO

23
Experimental
3444 etasdemir/OpticMap

On-device optical character recognition Android application.

23
Experimental
3445 sdsb8432/TextToSpeech-Android

Text to Speech for Android Application with Google API

23
Experimental
3446 sanjifr3/Narrator

An image and video description generator using an CNN-RNN based architecture.

23
Experimental
3447 rodrigosuelli/ditey-web

🎙 Leitor de textos online desenvolvido com React e Web Speech API. Tcc (ETEC)

23
Experimental
3448 zvz23/vProfanity

A software solution that automates the detection and censorship of profanity...

23
Experimental
3449 Otosaku/OtosakuStreamingASR-iOS

OtosakuStreamingASR-iOS is a real-time speech recognition engine for iOS,...

23
Experimental
3450 M-Mowina/TalentTalk---AI-powered-interview-system

AI-powered technical interview system with dynamic resume analysis, voice...

23
Experimental
3451 derek-byte/multilingual-voice-assistant-llm

cohere labs - aya expedition 2025: integrating speech & audio into aya...

23
Experimental
3452 jark006/SummerTTS_VS

SummerTTS...

23
Experimental
3453 hinantin/QuechuaTTS

Hinantin - Text-to-Speech System for Quechua

23
Experimental
3454 vra/supertonic-mnn

A command-line interface for running Supertonic TTS models using MNN.

23
Experimental
3455 stefantaubert/zho-tts

Web app, command-line interface and Python library for synthesizing Chinese...

23
Experimental
3456 6Morpheus6/IndexTTS2

[NVIDIA, MAC, ROCM] Emotionally Expressive and Duration-Controlled...

23
Experimental
3457 sezer-muhammed/EBookReaderFullStack

A local-first EPUB reader with high-fidelity neural text-to-speech,...

23
Experimental
3458 HadrienGardeur/read-aloud-best-practices

Documenting best practices for implementing a read aloud feature in reading apps

23
Experimental
3459 ChanMo/espeak-ng-tts

Espeak-ng TTS 是 Chrome 浏览器的 TTS 插件,使用 本地espeak-ng 作为 TTS 引擎。

23
Experimental
3460 SzLeaves/asr-webapp

ASR Web APP 中文语音识别实验室APP,使用Django构建,包含中文语音转文字与中文语音聊天机器人模块

23
Experimental
3461 nexusjuan12/AetherChat

AetherChat local RVC chat interface for Koboldcpp and OpenAI style API

23
Experimental
3462 SeanPLeary/dc_tts-transfer-learning

Transfer learning exploration of dc_tts text-to-speech model

23
Experimental
3463 jonasmore/Cloudflare-Workers-AI-Home-Assistant-Integration

Cloudflare Workers AI integration for Home Assistant - TTS, STT, and...

23
Experimental
3464 Yunichi/livekit-voice-ai-agent-setup

The "livekit-voice-ai-agent-setup" repository provides a comprehensive guide...

23
Experimental
3465 LEMAS-Project/LEMAS-Project

LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with...

23
Experimental
3466 pilarOG/unit_selection_tts

Toy example on how to build a unit selection TTS in Spanish

23
Experimental
3467 crispinprojects/klatt-synthesizer

Klatt speech synthesizer

23
Experimental
3468 amirivojdan/neyshekar

A Large-Scale Open Persian Speech Dataset

23
Experimental
3469 MahtaFetrat/GPTInformal-Persian-Speech-Dataset

A free licensed Persian TTS dataset including 6+ hours of audio-text pairs...

23
Experimental
3470 paulomarcos/pyelant

PyElant is a simple python tool for performing translations and storing it...

23
Experimental
3471 MrSean2d2/mouthwords

A python script to put words in other peoples mouths.

23
Experimental
3472 akabe/obs-transcript

Real-time subtitle generation by speech recognition for OBS Studio

23
Experimental
3473 xuan3986/UDDETTS

The first LLM that unifies discrete and dimensional emotions for...

23
Experimental
3474 racai-ai/TEPROLIN

This is the TEPROLIN Romanian text processing platform, developed in the...

23
Experimental
3475 6ixGODD/audex

Smart Medical Recording & Transcription System with voice recognition and...

23
Experimental
3476 axzml/VoxLinkAI_Client

Native macOS voice input assistant. Hold a hotkey, speak, and let AI...

23
Experimental
3477 HiMeditator/wfts-chinese-tool

使用中文游玩《群星低语》游戏。Playing the game "Whisper from the Stars" in Chinese.

23
Experimental
3478 ZackAkil/global-video-dubbing

Using Googel Cloud Video Intelligence API with Cloud Translation API and...

23
Experimental
3479 KyeongJooni/ai-dubbing-studio

AI-powered dubbing web service - Upload audio/video, get dubbed in any...

23
Experimental
3480 ThomasRigoni7/Audio-emotion-recognition-RAVDESS

Implementation of various models to address the speech emotion recognition...

23
Experimental
3481 LG-1/audio2text

Ease of use for Speech to Text

23
Experimental
3482 Sushkyn/simple-music-player

playing music in shell for linux.

23
Experimental
3483 Saganaki22/kokoro-web

Kokoro TTS Web

23
Experimental
3484 boltomli/speech-api

Demo to show how to use Azure Speech Services API in app

23
Experimental
3485 arjunmahishi/Speech-with-JavaScript

Code sample for speech recognition and syntheses with simple javascript

23
Experimental
3486 jfassis20/aish

🤖 Simplify command execution with AISH, an AI-powered shell assistant that...

23
Experimental
3487 Philipp2211/Udacity-Natural-Language-Processing-Nanodegree

This repository contains all my solutions to the tutorials/projects of the...

23
Experimental
3488 simran2104/Machine-Learning-Projects

It contains different projects made using different algorithms in Machine Learning

23
Experimental
3489 Sim-hu/voicebot-rust

Discordで使用可能な読み上げbot。rust言語で書かれていて、とにかく軽量(なはず)

23
Experimental
3490 yanorei32/aitalked-server

Simple GynoidTalk / VOICEROID Web Server based on aitalked library

23
Experimental
3491 isaacgounton/awesome-tts

A unified Text-to-Speech gateway combining multiple TTS providers (Kokoro...

23
Experimental
3492 VattamBhavaniPrasad5i5/Voice-Cloning-Project

String as a input and extract the youtube video from keyword and extract...

23
Experimental
3493 Nicolas-Prevot/TTS_playground

Unified toolkit for testing and comparing multiple state-of-the-art...

23
Experimental
3494 neosapience/typecast-js

The official Node.js SDK for the Typecast API.

23
Experimental
3495 ShunsukeHayashi/byteplus-voice-ai

BytePlus音声対話AIアプリケーション - ASR, TTS, Voice Cloning統合(WebSocket対応、日本語対応✅)

23
Experimental
3496 Oqaasileriffik/martha

Martha TTS (Greenlandic text-to-speech) documentation, containers, and helpers

23
Experimental
3497 natelindev/voice-agent

Low-latency real-time terminal voice assistant with VAD, ASR, LLM, and TTS

23
Experimental
3498 FUYOH666/VoiceToText

Cross-platform Voice-to-Text application with support for macOS, Linux, and...

23
Experimental
3499 gtiwari333/speech-recognition-java-hidden-markov-model-vq-mfcc

Automatically exported from...

23
Experimental
3500 aishoot/DTWSpeech

A simple application of DTW Algorithm in isolate word speech recognition.

23
Experimental
« Prev 1 2 3 33 34 35 36 37 68 69 70 Next »