All Voice AI Tools

6,981 tools ranked by quality score · Page 40 of 70

Showing 3901–4000 of 6,981
# Tool Score Tier
3901 wkdrns202/TTSDataSetCleanser

TTSDataSetCleanser. This program can do the labeling work for the Raw Speech...

22
Experimental
3902 hasscc/ai-conversation

🤖 AI Conversation Agent for Home Assistant. Compatible with any OpenAI...

22
Experimental
3903 Alcidespb24/podcast-workflow

Automated pipeline: Obsidian markdown → AI podcast scripts → TTS audio → RSS...

22
Experimental
3904 icsboyx/bottarga

Bottarga is a simle Text to Speech bot for Twitch chat. It can read chat...

22
Experimental
3905 iuliiakr/TTS-Project-Framework

Architecture framework for building production-grade text-to-speech systems,...

22
Experimental
3906 usrbinbrain/kokoro-tts-container

A Docker container for running Kokoro Text-to-Speech engine v.1, providing...

22
Experimental
3907 Mmiglio/SpeechRecognition

Small-footprint Keyword Spotting

22
Experimental
3908 mahathir444/yc-ui

Modern UI component library for Vue.js. `yc-ui` offers reusable,...

22
Experimental
3909 DevLoyola/talkingBot

A simple JavaScript voice chat bot using web speech API

22
Experimental
3910 iamnortey/ninolex-gh

Open Ghanaian pronunciation dictionary for TTS and AI systems — IPA, CSV,...

22
Experimental
3911 xuan139/ai-publisher-local-studio

Local audiobook production studio MVP with FastAPI, SQLite, review workflow,...

22
Experimental
3912 stutstev/pimp

Hackable music player optimized for use on screenless SBCs in cars.

22
Experimental
3913 Nandan-k-s-27/varna-voice-assistant

🌻 VARNA — A free, fully offline voice assistant for Windows. 160+ voice...

22
Experimental
3914 InuInu2022/NodoAme.Home

An official website for NodoAme

22
Experimental
3915 CookSleep/EasyTTS

EasyTTS是一个便捷的工具,旨在方便地使用第三方API服务来调用OpenAI的文本转语音(TTS)功能。...

22
Experimental
3916 ywatanabe1989/scitex-audio

Text-to-Speech with Multiple Backend Fallback (elevenlabs → luxtts → gtts → pyttsx3)

22
Experimental
3917 Ryadel/ClawTalk

Chrome side panel extension (MV3) that connects to an OpenClaw Gateway and...

22
Experimental
3918 rookiemann/LocalSoundsAPI

Portable offline AI audio studio with web UI & local API – XTTS, Fish...

22
Experimental
3919 nikkoxgonzales/streaming-tts

A streamlined, Kokoro-based text-to-speech library with streaming support.

22
Experimental
3920 matievisthekat/MyOnlyFriend

A program I made so I could talk to someone ;(

22
Experimental
3921 stkzlv/ContentEngineAI

AI-powered video production pipeline that scrapes e-commerce products,...

22
Experimental
3922 deepgram-starters/go-text-to-speech

Get started using Deepgram's Text-to-Speech with this Go demo app

22
Experimental
3923 DarkSide7839/PytDm

🌐 Streamline your downloads with PytDm, a modern Python download manager...

22
Experimental
3924 alexjsteffen/ttsrs

The ai-tts.rs project provides a command-line tool for generating spoken...

22
Experimental
3925 Stawa/GTTS

This project converts written material into speech by using Google AI...

22
Experimental
3926 anatolykoptev/moonshine-whisper

Fast speech-to-text HTTP service powered by Moonshine + sherpa-onnx. Beats...

22
Experimental
3927 JeanCaro/Babelin

Babelin Speach, for voice recognition and real-time translation, services...

22
Experimental
3928 Adr0it/Sub2Dub

A simple program that converts subtitles (.srt, .vtt) to an AI voiceover...

22
Experimental
3929 Hexanol777/Kikiyomu

聞き読む. real-time text-to-speech tool for VNs

22
Experimental
3930 seanox/seanox-ai-podcast

Automated podcast generation pipeline using a YAML-defined structure and...

22
Experimental
3931 Shuichi346/ja-dubbing

英語動画を話者の声質を保ったまま日本語吹替動画に変換。MioTTSによるボイスクローニング、PLaMo-2翻訳、2種のASRエンジン(Whisper /...

22
Experimental
3932 denz-pro/CoAI-PCB

CoAI-PCB offers an AI-driven PCB inspection module that detects defects with...

22
Experimental
3933 chandler767/Read-The-Room

This demo processes conversations in real-time with the Amazon Comprehend...

22
Experimental
3934 samnaveenkumaroff/Indic-F5

IndicF5: High-Quality Text-to-Speech for Indian Languages , including voice cloning

22
Experimental
3935 Hayder-IRAQ/srt-to-podcast

🎙️ Convert multilingual SRT subtitles (Arabic/Russian/English) into podcast...

22
Experimental
3936 XiaoYi2018/OfflineRealtimeTranslator

Fully offline Android real-time Russian-to-Chinese simultaneous interpreter...

22
Experimental
3937 deepgram-starters/csharp-live-text-to-speech

Get started using Deepgram's Live Text-to-Speech with this C# demo app

22
Experimental
3938 JJsilvera1/STT-Windows

An easy STT option to dictate text with your voice to your cursor using...

22
Experimental
3939 moimart/geppetto

GPT-Whisper-based Voice Assistant for Home Assistant (Experimental)

22
Experimental
3940 ronb1964/TalkType

Privacy-first voice dictation for Linux Wayland — press a key to talk,...

22
Experimental
3941 donapart/klatsch

Klatsch 🐾 — OpenClaw Local Agent: always-on voice assistant, peer...

22
Experimental
3942 kvnpetit/BetterFrenchTTS

Intelligent Android TTS wrapper optimized for French — Kotlin DSL, SSML...

22
Experimental
3943 KrishnaDN/BERTphone

Implementation of the paper "BERTphone: Phonetically-aware Encoder...

22
Experimental
3944 zsoltfrks/multimodal-story-generator

A rather simple story generator from images with text-to-speech integration...

22
Experimental
3945 s-l-h/cat

A basic toolkit for speech analytics, using GPT and Whisper-X

22
Experimental
3946 led-mirage/CoeiroClip

COEIROINKでクリップボードに貼り付けられたテキストを読み上げるアプリです。

22
Experimental
3947 Eleven1111/groq-whisper

Groq-powered OpenClaw speech tools for local audio transcription and...

22
Experimental
3948 dense-analysis/vim-speech

Vim Speech Recognition Experiments

22
Experimental
3949 josharsh/terminal-voice

Voice input for the terminal. Speak, and it types. Local transcription,...

22
Experimental
3950 Ilikepizza2/localspeech-AI

A one command Voice AI deployment script for MacOS. Supports Sesame, Kokoro,...

22
Experimental
3951 stanlsv/sayboard

Privacy-first AI voice keyboard for iOS that turns speech into ready-to-send...

22
Experimental
3952 fizoxt/openwhisper-app

Transcribe speech to text on macOS locally and offline with OpenWhisper, a...

22
Experimental
3953 yohasebe/speechdock

Turn any audio into text and any text into speech — menu bar app with...

22
Experimental
3954 rhulha/EchoMate

A web application that converts speech to speech 100% private using VAD...

22
Experimental
3955 faze-sway1/notthatstuff

Large-language-model + Text-to-speech + Voice-cloning doppelgangers, trained...

22
Experimental
3956 rcspam/animation-speech

Configurable transparent speech animation overlay for Wayland (KDE Plasma,...

22
Experimental
3957 Jhanwi/Intelligent-Desktop-Companion

This project developed a personalized Python-based voice controlled...

22
Experimental
3958 HRN-Projects/AVA---Accessibility-Virtual-Assistant

It is an open source accessibility tool created for better usability and...

22
Experimental
3959 marvitek0/Talk-to-Typer

Experience voice typing with Talk-to-Typer, a kid-friendly app that helps...

22
Experimental
3960 Ponyu-dev/Unity-Sherpa-ONNX

Unity plugin for sherpa-onnx — offline TTS, ASR, and VAD with one-click setup

22
Experimental
3961 yshnv/metavoice

Metavoice is text to speech convertor developed using Ionic Framework and CapacitorJS

22
Experimental
3962 stephenombuya/Virtual-Personal-Assistant

Production-grade Python virtual assistant with full asynchronous support....

22
Experimental
3963 lyncisdev/voco

Create a speech recognition system for programming by voice using Kaldi

22
Experimental
3964 svrooij/node-sonos-tts-polly

A text-to-speech server for node-sonos-ts

22
Experimental
3965 RenzMc/Renz_Assistant

ai assistant termux use voice

22
Experimental
3966 ryuuji06/keyword-spotting

In this repository, I implement a system for detecting specific spoken words...

22
Experimental
3967 Luzivog/Neuron

Personal voice assistant using Vosk for speech recognition

22
Experimental
3968 AbdulBasit-MrRobo/Real-Time-Speech-Emotion-Recognition

Code for the paper "Real Time Speech Emotion Recognition using Machine Learning"

22
Experimental
3969 djelia-org/djelia-js-sdk

Javascript client for interaction with djelia models throught it's API

22
Experimental
3970 chinasilva/MySmartPc

利用微信文件助手,进行语音或者文字控制电脑

22
Experimental
3971 LucaVitali/AzureTTSVoiceGeneratorGUI

PowerShell script to generate Voice Messages with Azure Cognitive Services...

22
Experimental
3972 funmaker/4voiced

4chan voiced

22
Experimental
3973 laithisgood/kokoclone

Deliver fast, real-time multilingual voice cloning with an efficient neural...

22
Experimental
3974 makemebitter/polly_srt2audio

Create audio voiceover from srt subtitle files using AWS Polly

22
Experimental
3975 leejgdh/GPT-SoVITS-ko

한국어 전용 GPT-SoVITS TTS 서비스

22
Experimental
3976 JuanSoFly/Ferrous

A high-performance, offline-first e-reader for Android built with a hybrid...

22
Experimental
3977 imanousar/Automatic-Subtitles-Synchronization

A project about learning how to synchronize subtitles in movies using...

22
Experimental
3978 GodzCursed/whisper-vtt2srt

🎥 Convert WebVTT to SRT easily, refining messy AI transcripts into clear...

22
Experimental
3979 Tina-1300/WinSpeech

WinSpeech is a C++ text-to-speech library available on Windows

22
Experimental
3980 isothermal-capitalgainstax520/Whisper-Transcriber

🎤 Transcribe audio and video files into text or subtitles effortlessly on...

22
Experimental
3981 grantCelley/Shout-Scribe

A completely free and open source dictation program

22
Experimental
3982 mobassir94/Multilingual-Speech-to-Speech-Translator

Multilingual Speech to Speech (STS) Translator is the First Ever Code-mixed...

22
Experimental
3983 abcname61/audiobook-creator

🎧 Convert MP3 files into professional-quality audiobooks in M4B format with...

22
Experimental
3984 LohChiaHeung/TechTutor

TechTutor is an Augmented Reality (AR) and AI-assisted mobile learning...

22
Experimental
3985 byraphaelmedeiros/pdf2mp3

Convert PDF documents into MP3 audio with text-to-speech (TTS).

22
Experimental
3986 OpenVoiceOS/ovos-docker-tts

Open Voice OS TTS Docker images

22
Experimental
3987 samriddhiyadav/live-audio-translation-captions

Portfolio project: live-audio-translation-captions

22
Experimental
3988 botbahlul/VOSK-Powered-LIVE-SUBTITLE

ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using VOSK...

22
Experimental
3989 GmEsoft/CTS256A-AL2

Commented disassembly of the GI(tm) CTS256A-AL2(tm) Code-To-Speech Processor

22
Experimental
3990 mecparts/Talker

A code-to-speech board based on the General Instrument 1980s chip set

22
Experimental
3991 victoryangzhijie/stt-server

Real-time speech-to-text WebSocket server with pluggable ASR backends,...

22
Experimental
3992 thiswillbeyourgithub/Voice2Anki

A powerful tool that converts voice recordings into high-quality Anki...

22
Experimental
3993 NietteLabs/NietteTTS

TTS Engine for Linux using Festival Speech Synthesis System

22
Experimental
3994 reservamos/speech-to-text-demo

Flutter app with implementation of openAI tools (ChatGPT & Whisper)

22
Experimental
3995 Shadowsith/qpicospeaker

Qt frontend for pico2wave text to speech console program

22
Experimental
3996 TCBOMC/audio-book-TTS-tool

一个可以快速对大批量长文本(百万字量级)的文章/小说/剧本等进行AI标注角色以及语言合成的软件

22
Experimental
3997 12alz/fun-with-clip-path

🎨 Explore clip-path techniques in HTML and CSS to create interactive menus...

22
Experimental
3998 supevil/SoulX-Singer-Eval

🎤 Evaluate zero-shot Singing Voice Synthesis systems for quality, accuracy,...

22
Experimental
3999 huuhka/t-pain

T-Pain Bot is a telegram bot that helps the user track their daily pain...

22
Experimental
4000 mprzewie/haspell

An awesome text-to-speech engine written in Haskell

22
Experimental
« Prev 1 2 3 38 39 40 41 42 68 69 70 Next »