All Voice AI Tools

6,981 tools ranked by quality score · Page 46 of 70

Showing 4501–4600 of 6,981
# Tool Score Tier
4501 hyqzz/ICodeStar-text2speech-mp3

Simple Python tool to convert text to speech (TTS) and save as MP3 files....

19
Experimental
4502 LetterLiGo/Inaudible-Adversarial-Perturbation-Vrifle

[NDSS'24] Inaudible Adversarial Perturbation: Manipulating the Recognition...

19
Experimental
4503 kauer3/Slang-Text-to-Speech

💻🔊 A chrome extension that converts text on the web to speech. This was my...

19
Experimental
4504 bryanstevensacosta/tts-studio

Personal voice cloning CLI tool using XTTS-v2

19
Experimental
4505 yotsuda/Speech

PowerShell modules for text-to-speech (TTS) and speech-to-text (STT) across...

19
Experimental
4506 mcp-tool-shop-org/soundboard-maui

Cross-platform .NET MAUI desktop client for the Sound Board voice engine.

19
Experimental
4507 chychen/srt_to_tts

use pysrt to parse the time in .srt file, and then call google cloud...

19
Experimental
4508 godspirit00/ListeningTestAudioMaker

一个可以帮助您快速制作外语考试中听力部分的音频的工具。 / A tool that helps you quickly generate...

19
Experimental
4509 monocasual/vocoder

Probably one of the best text-to-speech online apps in the world (if your...

19
Experimental
4510 rishikksh20/voxtral-codec-pytoch

Voxtral Codec : Combining Semantic VQ and Acoustic FSQ for Ultra-Low Bitrate...

19
Experimental
4511 partigabor/read-aloud

A rudimentary text-to-speech engine for reading PDF files aloud in English

19
Experimental
4512 axynos/STARK

CSGO Audio File playback and Text-to-Speech

19
Experimental
4513 ChrisBrooksbank/Vox

Open-source screen reader for Windows 11 — built in C#/.NET 9 with UI...

19
Experimental
4514 mcp-tool-shop-org/avatar-face-mvp

Real-time VRM avatar lipsync MVP — Godot 4 + FFT visemes + OpenSeeFace

19
Experimental
4515 RonanDavalan/PiperRead

Privacy-First Neural Text-to-Speech for Linux (Wayland & X11).

19
Experimental
4516 haya256/random-read-in-computer-voice-interval-cli

テキストファイルからランダムに1行を選び、一定間隔でmacOSの音声で読み上げる学習用CLIツール

19
Experimental
4517 hiansit/ankiflow

ブラウザで動く汎用暗記カードアプリ「AnkiFlow」。自動読み上げ(TTS)機能を搭載し、画面を見ない「聞き流し学習」にも対応しています。

19
Experimental
4518 neon-aiart/spitch-omakase-connect

Setup VOICEVOX & RVC on Google Colab. / GoogleColabでVOICEVOXとRVCの環境構築

19
Experimental
4519 voothi/20231001193911-tts

A small collection of helper scripts for working with Google Text-to-Speech...

19
Experimental
4520 AmourWaltz/BayesLMs

Project of IEEE/ACM TASLP “Bayesian Neural Network Language Modeling for...

19
Experimental
4521 cmaroti/speech_recognition

Convolutional Neural Network for Speech Recognition, implemented in Ms. Pacman game

19
Experimental
4522 ssharanyab/persona-tts

PersonaTTS is a personalized neural text-to-speech system that learns a...

19
Experimental
4523 npuichigo/grpc_gateway_demo

Audio streaming transfer demo with google.api.HttpBody and grpc gateway for...

19
Experimental
4524 dylanbretzjr/anki-kokoro-tts

Generates audio for Anki flashcards using the Kokoro TTS engine with direct...

19
Experimental
4525 Pchambet/tp-hmm-markov

Markov Chains and Hidden Markov Models: weather modeling with discrete...

19
Experimental
4526 Muthu-Mkode/audify

An asynchronous Python desktop application that extracts text from PDFs and...

19
Experimental
4527 0x0501/Apora

Anki plugin for using Apora platform.

19
Experimental
4528 marcusau2/VOX-1-Audiobook-Maker

VOX-1 Audiobook Maker is a local, GPU-accelerated studio for creating...

19
Experimental
4529 DevBytAmir/vocaudio

CLI tool to generate spoken vocabulary study audio from a JSON deck....

19
Experimental
4530 rudil24/pdf-audio-reader

Javascript leveraging browser-native Web Speech API to convert any PDF to...

19
Experimental
4531 JaesungHuh/look-listen-recognise

Dataset page for Look, Listen and Recognise : character-aware audio-visual...

19
Experimental
4532 vgarciasc/tts2pnp

Python tool that transforms Tabletop Simulator internal pictures into...

19
Experimental
4533 sseanik/google-home-task-relay

Voice-guided task routines for Google Home using Google Assistant

19
Experimental
4534 saxil/mareen

Mareen - A privacy-focused voice assistant with 3D orb UI, powered by Ollama...

19
Experimental
4535 NihaalNO/Voxis

A private, offline-capable Voice AI Assistant that runs locally on your...

19
Experimental
4536 enrelu/AITranslator

Gemini-powered Chrome extension for smart translations. Features...

19
Experimental
4537 lxaw/GoogleSubtitleGenerator

Using GoogleTranslate to generate automatic subtitles of videos.

19
Experimental
4538 MushroomFleet/QwenTTS-UI

Qwen 3 TTS web UI | https://www.scuffedepoch.com | https://www.oragenai.com...

19
Experimental
4539 EN10/SimpleSpeech

Simple Audio Recognition

19
Experimental
4540 sknadig/ASR_2018_T01

Example repository for 2018 DS/NC 821 / Automatic Speech Recognition projects

19
Experimental
4541 kdelmotte/Mumble

A simple, fast and free speech to text app running on OpenAI's Whisper Large v3

19
Experimental
4542 parvatijay2901/Hindi-ASR-and-TTS

EC499: Major Project

19
Experimental
4543 Thijsn04/MediClear-AI

An intelligent medical translator powered by Google Gemini 2.5. Simplifies...

19
Experimental
4544 princesingh-ai-dev/JARVIS-Voice-Assistant

🤖 AI-powered voice assistant with Whisper STT, Groq LLM, real-time TTS,...

19
Experimental
4545 jcsilva/asr-benchmark

Benchmark of industrial Speech Recognition systems for Brazilian Portuguese

19
Experimental
4546 pyromage/lazy-podinator-public

Create your own AI generated daily summary podcasts from news feeds

19
Experimental
4547 powerpig99/readaloud

Local-first text-to-speech reader powered by Qwen3-TTS. 9 voices, 10...

19
Experimental
4548 vpakarinen2/omnilocal

Local voice-enabled assistant.

19
Experimental
4549 KaMeLoTmArMoT/Qwen_TTS_Api

FastAPI wrapper for Qwen3-TTS CustomVoice: generate chapter WAV from...

19
Experimental
4550 Sundy1219/RNNLM

Using RNNLM rescoring a sentence in Chinese ASR system

19
Experimental
4551 diogobr90/AI-Narrator

A global, lightweight Text-to-Speech engine using the Kokoro model with...

19
Experimental
4552 lhfer/video-dub-studio

Convert YouTube/local videos into multilingual dubbed audio with Qwen ASR +...

19
Experimental
4553 al3xsus/AI-powered-waste-sorting-station

This is a concept for an AI-powered waste sorting station, that helps people...

19
Experimental
4554 FardinHash/TTS-Node

The TTS Engine is a sophisticated web-based platform designed to transform...

19
Experimental
4555 egorsmkv/ukrainian-tts-datasets

🇺🇦 Open Source Ukrainian Text-to-Speech datasets

19
Experimental
4556 moto-pu/claude-code-voicevox-notify

Claude Code hooks for VOICEVOX voice notifications on task completion and...

19
Experimental
4557 lvecsey/pushup1000

Perform timed pushups as a fitness routine, with text to speech.

19
Experimental
4558 Mx0M/speech-to-text-rust

A high-performance speech-to-text CLI tool written in Rust, powered by...

19
Experimental
4559 mcp-tool-shop-org/soundboard-plugin

Give Claude Code a voice. TTS plugin with emotion-aware speech,...

19
Experimental
4560 hecko-yes/tts-dataset-prompts

Finally, some decent sample sentences

19
Experimental
4561 darwinva97/yarvis-android

Asistente de voz para Android con reconocimiento de voz continuo,...

19
Experimental
4562 azandabot/asizwe-ai

Real-time AI-powered translation for vernacular, slang, and regional...

19
Experimental
4563 NawrizTurjo/Agri-Smart-BD

Empowering Bangladesh farmers with AI-driven price forecasts, market...

19
Experimental
4564 laustke/jimlet_classic

Offline text-to-speech GUI converter with drag-and-drop support,...

19
Experimental
4565 giriaryan694-a11y/Paste2Listen

A simple, privacy-friendly tool to convert text into speech. Instead of...

19
Experimental
4566 DarthJahus/azure-simple-tts

Simple Text-to-Speech web interface.

19
Experimental
4567 icosane/hyacinthia

Simple graphical front‑end for F5‑TTS

19
Experimental
4568 JuanJRA20/Conversor-Texto-a-Voz

🎙️ Sistema inteligente de conversión de texto a audio con detección...

19
Experimental
4569 skye-cyber/ttskit3

A lightweight text to speeach toolkit

19
Experimental
4570 ANVEAI/voice-ai-resources

A curated collection of voice AI tools, libraries, datasets, and learning resources

19
Experimental
4571 ANVEAI/open-source-voice-ai

Open source voice AI tools, models, and libraries for speech recognition and...

19
Experimental
4572 hongkongkiwi/action-elevenlabs-cli

GitHub Action for ElevenLabs CLI: TTS, STT, voice, knowledge, and usage operations.

19
Experimental
4573 deepgram-starters/cpp-text-to-speech

Get started using Deepgram's Text-to-Speech with this C++ demo app

19
Experimental
4574 zippyclawdbot-lab/zippy-voice

🎤 Voice-to-voice PWA for Clawdbot — talk to your AI assistant hands-free,...

19
Experimental
4575 R1ckShi/SeACo-Paraformer

[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.

19
Experimental
4576 neosapience/typecast-skills

The official Typecast Claude Skils.

19
Experimental
4577 Echoshard/AudiobookStudio

Desktop app for PocketTTS with voice cloning audiobook creation,...

19
Experimental
4578 aashish-joshi/tts-bulk

Tool for generating TTS files in bulk.

19
Experimental
4579 shrey802/PyTTSeval

Evaluation tool for TTS systems

19
Experimental
4580 StuMason/claude-tts

Text-to-speech for AI coding assistants. Give your AI a voice with emotional...

19
Experimental
4581 deepgram-starters/rust-text-to-speech

Get started using Deepgram's Text-to-Speech with this Rust demo app

19
Experimental
4582 001kenji/Text_To_Speech_AI

A modern web application that converts text to speech using advanced TTS...

19
Experimental
4583 patelritiq/CodeClause-Internship-Projects

A comprehensive collection of 4 Python applications developed during a...

19
Experimental
4584 YChenL/UniVR

An official implement of "UniVR: A Unified Framework for Pitch-Shifted Voice...

19
Experimental
4585 hubetcardenasi/SpeechApp

Convertir tu celular en una aplicación de voz.

19
Experimental
4586 sujitpanda/Google-Cloud-Speech-API

Google Cloud Speech API Android Project Demo

19
Experimental
4587 icantc0de1/Qwen3-TTS-FastAPI

An OpenAI-compatible Text-to-Speech (TTS) API server for the Qwen3-TTS model series.

19
Experimental
4588 martinp95/meeting-transcriber

AI-powered meeting transcription tool that converts audio and video files...

19
Experimental
4589 nl8590687/asrt-sdk-go

ASRT Speech Recognition SDK for Golang. 用于ASRT语音识别系统的Golang SDK

19
Experimental
4590 pyzskw/meeting-teleprompter

线上会议提词器 - 语音识别自动跟读、防截屏、专注模式、离线模型 | Meeting Teleprompter with offline ASR

19
Experimental
4591 Zhennor/Multimodal-Video-Retrieval-Engine-with-Vision-and-Text

A video search engine combining OCR, ASR, CLIP, Image Captioning, Object &...

19
Experimental
4592 yaya-sy/speechscorer

unsupervised spoken utterances scoring

19
Experimental
4593 NickBouwhuis/QwenTTS

AI-powered text-to-speech for macOS with voice design and voice cloning....

19
Experimental
4594 rahulm-28/celebrity-voice-panel-qwen3-tts

AI voice cloning panel that generates multi-speaker discussions between...

19
Experimental
4595 ttsaigit/tts-ios

TTS.ai iOS app — 18 AI text-to-speech models, voice cloning, speech-to-text

19
Experimental
4596 Ploscha/Awesome-Audio-Generation

Awesome-Audio-Generation is a collection of resources for Text-to-Audio...

19
Experimental
4597 danielrosehill/Speech-To-Text-System-Prompt-Library

An updated skeleton library of system prompts for using LLMs to refine STT output

19
Experimental
4598 SysAdminDoc/Qwen3-TTS-Studio

Install and create TTS with AI voice generator powered by Alibaba's Qwen3-TTS.

19
Experimental
4599 surpoloyang/Audio-Chatbot

Intelligent Voice Interaction System Project

19
Experimental
4600 MrKruemel/VoicePaste

Voice-controlled transcription, AI summarization, and paste — triggered by...

19
Experimental
« Prev 1 2 3 44 45 46 47 48 68 69 70 Next »