All Voice AI Tools

6,983 tools ranked by quality score · Page 5 of 70

Showing 401–500 of 6,983
# Tool Score Tier
401 hehehai/voxt

🎙️Voice input and translation app for macOS. Press to talk, release to paste.

46
Emerging
402 manyeyes/ManySpeech

AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment...

46
Emerging
403 davidamacey/OpenTranscribe

Self-hosted AI-powered transcription platform with speaker diarization,...

46
Emerging
404 yanorei32/discord-tts

TTS Discord Bot [VOICEROID, VOICEVOX, AivisSpeech, kttsproject, WinRT, and...

46
Emerging
405 Henry-23/VideoChat

实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human,...

46
Emerging
406 primepake/wav2lip_288x288

Wav2Lip version 288 and pipeline to train

46
Emerging
407 jpreprocess/jbonsai

Voice synthesis library for Text-to-Speech applications (Currently HTS...

46
Emerging
408 common-voice/cv-dataset

Metadata and versioning details for the Common Voice dataset

46
Emerging
409 hetpandya/youtube_tts_data_generator

A python library to generate speech dataset from Youtube videos

46
Emerging
410 aahl/qwen-asr2api

🎤 Qwen 3 ASR to OpenAI API, 免费STT语音识别模型

46
Emerging
411 IhorShevchuk/piper-app

The original Piper, now on iOS and macOS

46
Emerging
412 hgneng/ekho

Chinese text-to-speech engine

46
Emerging
413 PaciStardust/HOSCY

Companion for OSC and Communication

46
Emerging
414 Macoron/whisper.unity

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

46
Emerging
415 Notely-Voice/NotelyVoice

A 100% private AI voice transcription app that converts speech to text in...

46
Emerging
416 mlalma/KokoroTestApp

Test application for Kokoro TTS model

46
Emerging
417 solyarisoftware/voskJs

Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.

46
Emerging
418 emnikhil/Sign-Language-To-Text-Conversion

Sign Language to Text Conversion is a real-time system that uses a camera to...

46
Emerging
419 jianchang512/clone-voice

A sound cloning tool with a web interface, using your voice or any sound to...

46
Emerging
420 Lex-au/Orpheus-FastAPI

High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices,...

46
Emerging
421 FunAudioLLM/Fun-ASR

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

46
Emerging
422 BolajiAyodeji/chat-with-siri

🤖 A text-to-speech chatbot built using Nextjs, OpenAI, and ElevenLabs.

46
Emerging
423 pnlpal/dictionariez

📚 A customizable dictionary extension that supports double-click lookups in...

46
Emerging
424 wxxxcxx/ms-ra-forwarder

免费的在线文本转语音API

46
Emerging
425 atomiechen/FunASR-Client

Really easy-to-use Python client for FunASR runtime server.

46
Emerging
426 PraaneshSelvaraj/speech_engine

Speech Engine is a Python package that provides a simple interface for...

45
Emerging
427 AIGC-Audio/AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

45
Emerging
428 ArdaGnsrn/elevenlabs-laravel

This is an Open Source PHP Laravel package for ElevenLabs Text to Speech API.

45
Emerging
429 PrzemyslawSwiderski/python-gradle-plugin

Gradle plugin to run Python projects.

45
Emerging
430 gabriele-mastrapasqua/qwen3-tts

Pure C inference engine for Qwen3-TTS text-to-speech. No Python, no PyTorch...

45
Emerging
431 mgonzs13/audio_common

A PortAudio based audio_common with text to speech for ROS 2

45
Emerging
432 deepgram-devs/nextjs-text-to-speech

Get started using Deepgram's Text-to-Speech with this Next.js demo app

45
Emerging
433 233stone/vocotype-cli

VocoType 是一款运行在本地端侧的隐私安全语音输入工具,通过快捷键即可将语音实时转换为文字并自动输入到当前应用。支持语音转文字MCP、AI...

45
Emerging
434 misyaguziya/VRCT

VRCT(VRChat Chatbox Translator & Transcription)

45
Emerging
435 artibex/piper-http

Creates a docker image that runs the piper http service

45
Emerging
436 Picovoice/leopard

On-device speech-to-text engine powered by deep learning

45
Emerging
437 rhasspy/piper

A fast, local neural text to speech system

45
Emerging
438 vannu07/jarvis

🤖 Jarvis - AI Voice Assistant with Face Recognition | Hacktoberfest 2025...

45
Emerging
439 createcandle/voco

Privacy friendly voice control for the Candle Controller / WebThings...

45
Emerging
440 Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

45
Emerging
441 alphacep/awesome-russian-speech

Russian speech technology links

45
Emerging
442 asiff00/On-Device-Speech-to-Speech-Conversational-AI

This is an on-CPU real-time conversational system for two-way speech...

45
Emerging
443 Weilbyte/tiktok-tts

Generate TikTok Text-to-Speech voices in your browser

45
Emerging
444 avinashvarna/sanskrit_tts

Sanskrit text to speech

45
Emerging
445 mlalma/MisakiSwift

Swift port of Misaki G2P (grapheme-to-phoneme) library that can be used e.g....

45
Emerging
446 BuildWithAIs/voicekey

Voice to text, one key to input.

45
Emerging
447 rhasspy/rhasspy

Offline private voice assistant for many human languages

45
Emerging
448 gooofy/zerovox

zero-shot realtime TTS system, fully offline, free and open source

45
Emerging
449 shashank2122/Local-Voice

A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local...

45
Emerging
450 sanchit-gandhi/whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

45
Emerging
451 FENRlR/MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

45
Emerging
452 Purple-Horizons/openclaw-voice

🦞 Open-source browser-based voice chat for AI assistants. Self-hosted,...

45
Emerging
453 Ashish-Patnaik/kokoclone

Voice Cloning, Now Inside Kokoro. Generate natural multilingual speech and...

45
Emerging
454 huggingface/distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller,...

45
Emerging
455 Thiagohgl/ai-pronunciation-trainer

This tool uses AI to evaluate your pronunciation.

45
Emerging
456 ceuk/speech-recognition-aws-polyfill

Polyfill for the SpeechRecognition browser API using AWS Transcribe as a fallback

45
Emerging
457 areebbeigh/winspeech

Speech recognition and synthesis library for Windows - Python 2 and 3.

45
Emerging
458 h5p/h5p-speak-the-words

Create questions answered through speech

45
Emerging
459 adrianlyjak/obsidian-aloud-tts

Obsidian TTS Plugin

45
Emerging
460 OpenVoiceOS/ovos-tts-plugin-cotovia

galician tts plugin for OVOS

45
Emerging
461 shhossain/BanglaTTS

BanglaTTS is a text-to-speech (TTS) system for Bangla language that works in...

45
Emerging
462 reazon-research/ReazonSpeech

Massive open Japanese speech corpus

45
Emerging
463 thorstenMueller/Thorsten-Voice

Thorsten-Voice: A free to use, offline working, high quality german TTS...

45
Emerging
464 saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper...

45
Emerging
465 athena-team/athena

an open-source implementation of sequence-to-sequence based speech processing engine

44
Emerging
466 gotev/android-speech

Android speech recognition and text to speech made easy

44
Emerging
467 i4Ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

44
Emerging
468 totalvoice/totalvoice-node

Client em NodeJS para API da Totalvoice

44
Emerging
469 thinhlpg/vixtts-demo

A Vietnamese Voice Cloning Text-to-Speech Model ✨

44
Emerging
470 petermg/Chatterbox-TTS-Extended

Modified version of Chatterbox that accepts text files as input and no...

44
Emerging
471 zw76859420/ASR_Theory

语音识别理论、论文和PPT

44
Emerging
472 MycroftAI/adapt

Adapt Intent Parser

44
Emerging
473 cosin2077/easyVoice

开源文本转语音工具,支持超长文本,多角色配音

44
Emerging
474 gooofy/py-nltools

A collection of basic python modules for spoken natural language processing

44
Emerging
475 AutoArk/GPA

[AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion...

44
Emerging
476 mutablelogic/go-whisper

Speech-to-Text in golang

44
Emerging
477 tover0314-w/opentypeless

Talkmore with Opentypeless. Type with your voice. Anywhere. Talk -...

44
Emerging
478 speechio/chinese_text_normalization

Chinese text normalization for speech processing

44
Emerging
479 react-native-voice/voice

:microphone: React Native Voice Recognition library for iOS and Android...

44
Emerging
480 rse/speechflow

Speech Processing Flow Graph

44
Emerging
481 lifeiteng/vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo...

44
Emerging
482 r9y9/deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech...

44
Emerging
483 NVIDIA/OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

44
Emerging
484 spring-media/TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...

44
Emerging
485 xcmyz/FastSpeech

The Implementation of FastSpeech based on pytorch.

44
Emerging
486 ggeop/Python-ai-assistant

Python AI assistant 🧠

44
Emerging
487 soobinseo/Transformer-TTS

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

44
Emerging
488 shhossain/BanglaSpeech2Text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla...

44
Emerging
489 Azure-Samples/SpeechToText-WebSockets-Javascript

SDK & Sample to do speech recognition using websockets in Javascript

44
Emerging
490 google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural...

44
Emerging
491 pannous/tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework,...

44
Emerging
492 Amey-Thakur/DEEPFAKE-AUDIO

🎙️ Deepfake Audio – A neural voice cloning studio powered by SV2TTS technology.

44
Emerging
493 jaywalnut310/glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

44
Emerging
494 bambocher/pocketsphinx-python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

44
Emerging
495 whitphx/streamlit-stt-app

Real time web based Speech-to-Text app with Streamlit

44
Emerging
496 fatchord/WaveRNN

WaveRNN Vocoder + TTS

44
Emerging
497 ArkanDash/Multi-Model-RVC-Inference

RVC Inference with multiple model and huggingface support

44
Emerging
498 alumae/kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit...

44
Emerging
499 symblai/getting-started-samples

Code samples to Get started quickly with Symbl's Voice SDK and APIs:...

44
Emerging
500 wildminder/ComfyUI-VibeVoice

ComfyUI custom node for the VibeVoice TTS. Expressive, long-form,...

44
Emerging
« Prev 1 2 3 4 5 6 7 68 69 70 Next »