All Voice AI Tools
6,983 tools ranked by quality score · Page 5 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 401 | hehehai/voxt: 🎙️ Voice input and translation app for macOS. Press to talk, release to paste. | | Emerging |
| 402 | manyeyes/ManySpeech: AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment... | | Emerging |
| 403 | davidamacey/OpenTranscribe: Self-hosted AI-powered transcription platform with speaker diarization,... | | Emerging |
| 404 | yanorei32/discord-tts: TTS Discord Bot [VOICEROID, VOICEVOX, AivisSpeech, kttsproject, WinRT, and... | | Emerging |
| 405 | Henry-23/VideoChat: Real-time interactive digital human with customizable appearance and voice, voice cloning support, and conversation latency as low as 3 s... | | Emerging |
| 406 | primepake/wav2lip_288x288: Wav2Lip version 288 and pipeline to train | | Emerging |
| 407 | jpreprocess/jbonsai: Voice synthesis library for Text-to-Speech applications (Currently HTS... | | Emerging |
| 408 | common-voice/cv-dataset: Metadata and versioning details for the Common Voice dataset | | Emerging |
| 409 | hetpandya/youtube_tts_data_generator: A Python library to generate speech dataset from YouTube videos | | Emerging |
| 410 | aahl/qwen-asr2api: 🎤 Qwen 3 ASR to OpenAI API, a free STT speech recognition model | | Emerging |
| 411 | IhorShevchuk/piper-app: The original Piper, now on iOS and macOS | | Emerging |
| 412 | hgneng/ekho: Chinese text-to-speech engine | | Emerging |
| 413 | PaciStardust/HOSCY: Companion for OSC and Communication | | Emerging |
| 414 | Macoron/whisper.unity: Running speech to text model (whisper.cpp) in Unity3d on your local machine. | | Emerging |
| 415 | Notely-Voice/NotelyVoice: A 100% private AI voice transcription app that converts speech to text in... | | Emerging |
| 416 | mlalma/KokoroTestApp: Test application for Kokoro TTS model | | Emerging |
| 417 | solyarisoftware/voskJs: Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server. | | Emerging |
| 418 | emnikhil/Sign-Language-To-Text-Conversion: Sign Language to Text Conversion is a real-time system that uses a camera to... | | Emerging |
| 419 | jianchang512/clone-voice: A sound cloning tool with a web interface, using your voice or any sound to... | | Emerging |
| 420 | Lex-au/Orpheus-FastAPI: High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices,... | | Emerging |
| 421 | FunAudioLLM/Fun-ASR: Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab. | | Emerging |
| 422 | BolajiAyodeji/chat-with-siri: 🤖 A text-to-speech chatbot built using Nextjs, OpenAI, and ElevenLabs. | | Emerging |
| 423 | pnlpal/dictionariez: 📚 A customizable dictionary extension that supports double-click lookups in... | | Emerging |
| 424 | wxxxcxx/ms-ra-forwarder: Free online text-to-speech API | | Emerging |
| 425 | atomiechen/FunASR-Client: Really easy-to-use Python client for FunASR runtime server. | | Emerging |
| 426 | PraaneshSelvaraj/speech_engine: Speech Engine is a Python package that provides a simple interface for... | | Emerging |
| 427 | AIGC-Audio/AudioGPT: AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head | | Emerging |
| 428 | ArdaGnsrn/elevenlabs-laravel: This is an Open Source PHP Laravel package for ElevenLabs Text to Speech API. | | Emerging |
| 429 | PrzemyslawSwiderski/python-gradle-plugin: Gradle plugin to run Python projects. | | Emerging |
| 430 | gabriele-mastrapasqua/qwen3-tts: Pure C inference engine for Qwen3-TTS text-to-speech. No Python, no PyTorch... | | Emerging |
| 431 | mgonzs13/audio_common: A PortAudio based audio_common with text to speech for ROS 2 | | Emerging |
| 432 | deepgram-devs/nextjs-text-to-speech: Get started using Deepgram's Text-to-Speech with this Next.js demo app | | Emerging |
| 433 | 233stone/vocotype-cli: VocoType is a privacy-safe voice input tool that runs locally on-device; a hotkey converts speech to text in real time and types it into the current application. Supports speech-to-text MCP, AI... | | Emerging |
| 434 | misyaguziya/VRCT: VRCT (VRChat Chatbox Translator & Transcription) | | Emerging |
| 435 | artibex/piper-http: Creates a docker image that runs the piper http service | | Emerging |
| 436 | Picovoice/leopard: On-device speech-to-text engine powered by deep learning | | Emerging |
| 437 | rhasspy/piper: A fast, local neural text to speech system | | Emerging |
| 438 | vannu07/jarvis: 🤖 Jarvis - AI Voice Assistant with Face Recognition \| Hacktoberfest 2025... | | Emerging |
| 439 | createcandle/voco: Privacy friendly voice control for the Candle Controller / WebThings... | | Emerging |
| 440 | Camb-ai/MARS5-TTS: MARS5 speech model (TTS) from CAMB.AI | | Emerging |
| 441 | alphacep/awesome-russian-speech: Russian speech technology links | | Emerging |
| 442 | asiff00/On-Device-Speech-to-Speech-Conversational-AI: This is an on-CPU real-time conversational system for two-way speech... | | Emerging |
| 443 | Weilbyte/tiktok-tts: Generate TikTok Text-to-Speech voices in your browser | | Emerging |
| 444 | avinashvarna/sanskrit_tts: Sanskrit text to speech | | Emerging |
| 445 | mlalma/MisakiSwift: Swift port of Misaki G2P (grapheme-to-phoneme) library that can be used e.g.... | | Emerging |
| 446 | BuildWithAIs/voicekey: Voice to text, one key to input. | | Emerging |
| 447 | rhasspy/rhasspy: Offline private voice assistant for many human languages | | Emerging |
| 448 | gooofy/zerovox: Zero-shot realtime TTS system, fully offline, free and open source | | Emerging |
| 449 | shashank2122/Local-Voice: A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local... | | Emerging |
| 450 | sanchit-gandhi/whisper-jax: JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. | | Emerging |
| 451 | FENRlR/MB-iSTFT-VITS2: Application of MB-iSTFT-VITS components to vits2_pytorch | | Emerging |
| 452 | Purple-Horizons/openclaw-voice: 🦞 Open-source browser-based voice chat for AI assistants. Self-hosted,... | | Emerging |
| 453 | Ashish-Patnaik/kokoclone: Voice Cloning, Now Inside Kokoro. Generate natural multilingual speech and... | | Emerging |
| 454 | huggingface/distil-whisper: Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller,... | | Emerging |
| 455 | Thiagohgl/ai-pronunciation-trainer: This tool uses AI to evaluate your pronunciation. | | Emerging |
| 456 | ceuk/speech-recognition-aws-polyfill: Polyfill for the SpeechRecognition browser API using AWS Transcribe as a fallback | | Emerging |
| 457 | areebbeigh/winspeech: Speech recognition and synthesis library for Windows - Python 2 and 3. | | Emerging |
| 458 | h5p/h5p-speak-the-words: Create questions answered through speech | | Emerging |
| 459 | adrianlyjak/obsidian-aloud-tts: Obsidian TTS Plugin | | Emerging |
| 460 | OpenVoiceOS/ovos-tts-plugin-cotovia: Galician TTS plugin for OVOS | | Emerging |
| 461 | shhossain/BanglaTTS: BanglaTTS is a text-to-speech (TTS) system for Bangla language that works in... | | Emerging |
| 462 | reazon-research/ReazonSpeech: Massive open Japanese speech corpus | | Emerging |
| 463 | thorstenMueller/Thorsten-Voice: Thorsten-Voice: A free to use, offline working, high quality German TTS... | | Emerging |
| 464 | saharmor/whisper-playground: Build real time speech2text web apps using OpenAI's Whisper... | | Emerging |
| 465 | athena-team/athena: An open-source implementation of sequence-to-sequence based speech processing engine | | Emerging |
| 466 | gotev/android-speech: Android speech recognition and text to speech made easy | | Emerging |
| 467 | i4Ds/whisper-finetune: This repository contains code for fine-tuning the Whisper speech-to-text model. | | Emerging |
| 468 | totalvoice/totalvoice-node: Node.js client for the Totalvoice API | | Emerging |
| 469 | thinhlpg/vixtts-demo: A Vietnamese Voice Cloning Text-to-Speech Model ✨ | | Emerging |
| 470 | petermg/Chatterbox-TTS-Extended: Modified version of Chatterbox that accepts text files as input and no... | | Emerging |
| 471 | zw76859420/ASR_Theory: Speech recognition theory, papers, and PPT slides | | Emerging |
| 472 | MycroftAI/adapt: Adapt Intent Parser | | Emerging |
| 473 | cosin2077/easyVoice: Open-source text-to-speech tool supporting very long texts and multi-character voiceover | | Emerging |
| 474 | gooofy/py-nltools: A collection of basic Python modules for spoken natural language processing | | Emerging |
| 475 | AutoArk/GPA: [AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion... | | Emerging |
| 476 | mutablelogic/go-whisper: Speech-to-Text in golang | | Emerging |
| 477 | tover0314-w/opentypeless: Talk more with Opentypeless. Type with your voice. Anywhere. Talk -... | | Emerging |
| 478 | speechio/chinese_text_normalization: Chinese text normalization for speech processing | | Emerging |
| 479 | react-native-voice/voice: 🎤 React Native Voice Recognition library for iOS and Android... | | Emerging |
| 480 | rse/speechflow: Speech Processing Flow Graph | | Emerging |
| 481 | lifeiteng/vall-e: PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech), Reproduced Demo... | | Emerging |
| 482 | r9y9/deepvoice3_pytorch: PyTorch implementation of convolutional neural networks-based text-to-speech... | | Emerging |
| 483 | NVIDIA/OpenSeq2Seq: Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP | | Emerging |
| 484 | spring-media/TransformerTTS: 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based... | | Emerging |
| 485 | xcmyz/FastSpeech: The Implementation of FastSpeech based on PyTorch. | | Emerging |
| 486 | ggeop/Python-ai-assistant: Python AI assistant 🧠 | | Emerging |
| 487 | soobinseo/Transformer-TTS: A PyTorch Implementation of "Neural Speech Synthesis with Transformer Network" | | Emerging |
| 488 | shhossain/BanglaSpeech2Text: BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla... | | Emerging |
| 489 | Azure-Samples/SpeechToText-WebSockets-Javascript: SDK & Sample to do speech recognition using websockets in JavaScript | | Emerging |
| 490 | google/uis-rnn: This is the library for the Unbounded Interleaved-State Recurrent Neural... | | Emerging |
| 491 | pannous/tensorflow-speech-recognition: 🎙 Speech recognition using the tensorflow deep learning framework,... | | Emerging |
| 492 | Amey-Thakur/DEEPFAKE-AUDIO: 🎙️ Deepfake Audio – A neural voice cloning studio powered by SV2TTS technology. | | Emerging |
| 493 | jaywalnut310/glow-tts: A Generative Flow for Text-to-Speech via Monotonic Alignment Search | | Emerging |
| 494 | bambocher/pocketsphinx-python: Python interface to CMU Sphinxbase and Pocketsphinx libraries | | Emerging |
| 495 | whitphx/streamlit-stt-app: Real time web based Speech-to-Text app with Streamlit | | Emerging |
| 496 | fatchord/WaveRNN: WaveRNN Vocoder + TTS | | Emerging |
| 497 | ArkanDash/Multi-Model-RVC-Inference: RVC Inference with multiple model and huggingface support | | Emerging |
| 498 | alumae/kaldi-gstreamer-server: Real-time full-duplex speech recognition server, based on the Kaldi toolkit... | | Emerging |
| 499 | symblai/getting-started-samples: Code samples to Get started quickly with Symbl's Voice SDK and APIs:... | | Emerging |
| 500 | wildminder/ComfyUI-VibeVoice: ComfyUI custom node for the VibeVoice TTS. Expressive, long-form,... | | Emerging |