All Voice AI Tools

8,525 tools ranked by quality score · Page 74 of 86

Showing 7301–7400 of 8,525
# Tool Score Tier
7301 Vlad1343/Sign-Wave

Real-time Ukrainian Sign Language translator using computer vision and...

13
Experimental
7302 Sumit0ubey/TorvixAI

TorchAI is an Android app that combines AI chat and voice assistance with...

13
Experimental
7303 funkyfranky/TTS-Radio

Create voice overs with radio effects for DCS

13
Experimental
7304 cser245086272/ComfyUI-FL-Qwen3TTS

🎤 Create realistic text-to-speech outputs with advanced voice cloning and...

13
Experimental
7305 fclaeys/nix-nerd-dictation

🎤 Nix flake for offline French speech-to-text with nerd-dictation....

13
Experimental
7306 harlanx/voice_recorder_recognizer

An audio recorder and speech to text with commands recognition created using...

13
Experimental
7307 eddiedunn/transcribe

[DEPRECATED — superseded by diarized_transcriber] Audio-to-text...

13
Experimental
7308 ItxMatti/tts

🗣️ Deploy high-quality text-to-speech services with Gemini, OpenAI, and...

13
Experimental
7309 traceypooh/audio2text

creates text from audio of A/V input file, using docker, sphinx. extracts...

13
Experimental
7310 hannabdul/etf4asr

Official repo for the paper "An Effective Training Framework for...

13
Experimental
7311 AnshGaikwad/Personal-Voice-Assistant

Personal Voice Assistant: Easy to change the code and making it suitable for...

13
Experimental
7312 di37/speech-to-text-fine-tuning-on-unseen-language

This projects aims to show how whisper model can be fine-tuned on language...

13
Experimental
7313 Diluksha-Upeka/Voxis

Voxis is an intelligent voice assistant powered by Groq's AI models,...

13
Experimental
7314 MichaelMBrown/VoiceLab

Local Apple Silicon voice studio for Qwen3-TTS with a FastAPI backend and...

13
Experimental
7315 TJ-Neary/TommyTalker-Pro

Privacy-first voice-to-text for macOS — local STT via mlx-whisper with...

13
Experimental
7316 Karan36k/text2speech

A Basic But Useful Online Text to Speech Converter with a male voice...

13
Experimental
7317 Srinath-N-R/IPA-Wav2Vec2-Phoneme-Recognition

End-to-end IPA-based phoneme recognition pipeline using Wav2Vec2, featuring...

13
Experimental
7318 IshaanLabs/Text-to-Speech-TTS

Open Source Text-to-Speech (TTS) repository

13
Experimental
7319 NimbleAINinja/swift-scribe-rs

Fast, on-device speech-to-text transcription for macOS using Apple's Speech framework

13
Experimental
7320 Gokila-S/smart-translate

Smart Translator is a modern MERN stack application that allows users to...

13
Experimental
7321 Rayyan9477/speech-app

AI Language Processor is a powerful application that leverages...

13
Experimental
7322 rk-vashista/TTS-Story_Generator

A versatile app that converts images into short stories and lifelike audio...

13
Experimental
7323 hongkongkiwi/scoop-elevenlabs-cli

Official Scoop bucket for installing elevenlabs-cli on Windows.

13
Experimental
7324 oddvoices/oddvoices

An indie singing synthesizer

13
Experimental
7325 bivex/whisper-large-v3-turbo

Whisper Large V3 Turbo - fast speech-to-text model implementation with...

13
Experimental
7326 labestia2/Qwen3-Audiobook-Converter

🎧 Convert various document formats into high-quality audiobooks with Qwen3...

13
Experimental
7327 upskaling/voice-keyboard

an interface for nerd-dictation in gtk

13
Experimental
7328 Her-mia/Imgspeaker

An Android app written in Kotlin that performs OCR on Simplified Chinese...

13
Experimental
7329 maycondata/apontamento-op-por-voz

Apontamento de produção por voz (Whisper STT + gTTS) com confirmação e...

13
Experimental
7330 akhilachiju/AI-Audio-Transcriber

Audio transcription app using Whisper AI for accurate speech-to-text...

13
Experimental
7331 metacore-stack/Voice-to-Insights

Enterprise AI platform that transforms audio meetings into structured...

13
Experimental
7332 anhuynh219/vietnamese_SVS

Demo page for ViSVS: ON AUTOMATIC VIETNAMESE SINGING VOICE SYNTHESIS

13
Experimental
7333 DemoL2004/Serverless-Content-Generation-Distribution-Pipeline

Cloud-native media automation system integrating Reddit, ElevenLabs TTS,...

13
Experimental
7334 Himanshi-2519/Speech-To-Text-API

Capturing the Rhythm of your words. Real-time AI transcription with a...

13
Experimental
7335 walid-hamdi/fluener_ai-service

FastAPI AI microservice for language learning - Provides speech-to-text...

13
Experimental
7336 RedDotz20/speech-to-text-recognition

🎤 Effortlessly integrate speech recognition capabilities into your React...

13
Experimental
7337 mocarlaura-source/parakeet

🐦 Customize Fedora Silverblue with niri DE tailored for FriendlyElec NanopPC...

13
Experimental
7338 stefanpietrusky/QUEST

Repository for the QUEST App prototype.

13
Experimental
7339 joachimhodana/rtTranslator

Simple overlay for Windows, that listens for background sound and translates...

13
Experimental
7340 THE-DEEPDAS/RealTime-Voice-Assistant

Voice-activated assistant using Groq API, Streamlit UI, speech recognition, and TTS

13
Experimental
7341 SuperKabman/audioNote

AI enabled notes taking app

13
Experimental
7342 elloza/slides2video-pinokio-script

Pinokio script for installing the app slides2video

13
Experimental
7343 morelen17/tts-papers

List of papers about TTS / Список статей о TTS

13
Experimental
7344 saroshfarhan/story-teller

Story-Teller

13
Experimental
7345 x-phone/demos

Working examples and tutorials for the x-phone ecosystem — xphone-go,...

13
Experimental
7346 unicodeveloper/voicery

Play with voices. Speak any language. Clone your vibe.

13
Experimental
7347 sj2tpgk/voiceroid-docker

Voiceroid+ in docker on X64/Arm linux + web interface (mirrored from...

13
Experimental
7348 AbhiramMandala/virtual_assistant

Voice-controlled virtual assistant built with Python using speech...

13
Experimental
7349 onwurahben/meeting-assistant

Transform raw meeting audio into speaker-aware transcripts, summaries, and...

13
Experimental
7350 NafisRayan/AI-Voice-Assistant-ST

AI voice assistant made with Streamlit python and powered by Gemini, Mistral...

13
Experimental
7351 madebyaris/dsw-voice

Real-time voice noise reduction app for macOS with virtual microphone support

13
Experimental
7352 manhph2211/ViTTS

In this repo, I developed a step-by-step pipeline for a standard...

13
Experimental
7353 kiraping1337/ChatTwitchTTS

Twitch TTS бот с клонированием голоса через XTTS v2. Озвучивание сообщений...

13
Experimental
7354 mccvliqht/signifeye-capstone

a capstone project about real-time sign language translator using camera

13
Experimental
7355 karim23657/ParsiGoo

ParsiGoo is a Persian multispeaker dataset for text-to-speech purposes. It...

13
Experimental
7356 heroic-differentialdiagnosis696/MeetingMindAI

Capture, transcribe, and summarize meetings effortlessly with MeetingMindAI,...

13
Experimental
7357 YossefMohamed/covid-app-api

An Api for testing covid using cough sound

13
Experimental
7358 akhileshmanitiwari06/InterviewMentor-AI

InterviewMentor AI is an intelligent mock interview assistant designed for...

13
Experimental
7359 nashalexander/PersonaSpeak

Simple but comprehensive TTS GUI tool for use with modern models

13
Experimental
7360 abhiFSD/VoiceForge

🎙️ Real-time AI voice assistant — Speak → Whisper STT → Gemini Flash → Edge...

13
Experimental
7361 sridattb96/MeetingStory

A project I built while doing research for a professor in the Visual &...

13
Experimental
7362 shujaatsunasra/ai-based-expensetracker

luminous_flow leverages a multi-layered AI pipeline to deliver personalized,...

13
Experimental
7363 dae9999nam/Memory-Garden

This repository is to provide service, Memory-Garden, that create narratives...

13
Experimental
7364 ca0wx/Gemini-Talker-Chat

🎙️ Gemini Talker Chat: Ollama ve Edge-TTS tabanlı, gerçek zamanlı sesli...

13
Experimental
7365 remsky/prebuilt_tts_wheels

Prebult wheels for dependencies of TTS service; Kokoro-FastAPI

13
Experimental
7366 max-lt/voxtral-cpp

Local implementation for voxtral

13
Experimental
7367 pukaa900/reagana

Ko taqaku konqamatuqa mo nqaaqaku meqa.

13
Experimental
7368 RamirJunior/idox-ia-project

Projeto MVP com processamento de áudio com IA local

13
Experimental
7369 duanxianpi/AI-Voice-Diary

Using voice to keep a journal.

13
Experimental
7370 carlfm01/my-speech-datasets

My public domain speech index

13
Experimental
7371 lianghsun/cosyvoice3-api

FastAPI wrapper for Fun-CosyVoice3-0.5B: zero-shot voice cloning TTS with...

13
Experimental
7372 nipponjo/tts-german-pytorch

🎙️ German TTS (FastPitch) with Thorsten voice / emotional

13
Experimental
7373 muurakami/momokiki

Open source language learning app — Duolingo alternative with offline...

13
Experimental
7374 Mormolykos/bedvibe-datasets

Multilingual emotional speech datasets for TTS training

13
Experimental
7375 kjanjua26/HearPapers

HearPapers allows you to listen to PDFs (by converting them to audiobooks,...

13
Experimental
7376 amay09x/TheNewsCoo

TheNewsCoo is a desktop AI application that helps users quickly understand...

13
Experimental
7377 BenjaminDanker/Audio-Cleaner-Web

AI-powered video audio noise reduction in the cloud using DeepFilterNet3 and...

13
Experimental
7378 LauraKokkarinen/AzureAI.TextToSpeech

A console application for converting long-form plain-text files into speech...

13
Experimental
7379 Thisen-Ekanayake/sinhala-vision-assist

Vision–language assistive pipeline that answers Sinhala voice questions...

13
Experimental
7380 RutronikSystemSolutions/RDK3_BLE_EnOcean

Project used to illustrate how to use a RDK3 to interact with EnOcean BLE...

13
Experimental
7381 Rumeysakeskin/ASR-Quantization

Post-training quantization on Nvidia Nemo ASR model

13
Experimental
7382 danielrosehill/ASR-And-STT-AI-Notebook

Propmts and outputs (and some notes) on STT + ASR + fine-tuning. LLM: Claude

13
Experimental
7383 NAJL123/voice-ai-assistant

Local Voice AI Assistant — faster-whisper STT + Ollama LLM + pyttsx3 TTS

13
Experimental
7384 Priyanshu-Yadav19/Call-Voice-Agent

Real-time AI Voice Agent using Streaming STT, LLM-based conversation...

13
Experimental
7385 laafeiak/ai_text_reader

text

13
Experimental
7386 namphung134/ASR-Vietnamese

Fine-tuning the openai/whisper-small model on the 250h dataset for...

13
Experimental
7387 Giuseppe-Della-Corte/IESTAC

A corpus that can be used to train English-to-Italian End-to-End...

13
Experimental
7388 N1kOk/WhispeRu

Голос — в текст. Приватно. Локально. Моментально.

13
Experimental
7389 allvoicelab/allvoicelab

AI-powered audio creation platform offering TTS, Voice Cloning, Voice...

13
Experimental
7390 metacore-stack/AuraVoice

Production-grade on-device AI meeting assistant featuring real-time...

13
Experimental
7391 jaychampaneri14/ai-voice-cloning

Text-to-speech with multiple voice styles using gTTS and pyttsx3

13
Experimental
7392 SMIL-SPCRAS/DAVIS

Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle...

13
Experimental
7393 JonPark0/web_audio_splitter

AI-powered audio source separation using Meta Demucs - Split songs into...

13
Experimental
7394 kocharvishal/Fast-Speech-Transcription-Grammar-Scoring-Engine

Built a transcription system using OpenAI’s Whisper and Fine-tuned...

13
Experimental
7395 lymcho/story-to-video

Create a fully narrated YouTube audiobook channel in one command. AI...

13
Experimental
7396 AleefBilal/tts_srt_gen

A runpod serverless docker that generates TTS using chatterbox-tts along with .srt

13
Experimental
7397 plandanogtav1-cmd/Conversational-For-Librechat

🎙 Headless real-time voice pipeline for LibreChat — LiveKit WebRTC +...

13
Experimental
7398 iamvon/AudioRead

Turn PDFs into audio with chunked LLMs and OpenAI TTS

13
Experimental
7399 adityakamat24/RTGX-Real-Time-Glossary-eXplainer

RTGX is an AI-powered real-time glossary explainer that adds contextual...

13
Experimental
7400 palaashatri/jvosk

Audio transcription using Vosk. Built with Swing.

13
Experimental
« Prev 1 2 3 72 73 74 75 76 84 85 86 Next »