All Voice AI Tools
6,981 tools ranked by quality score · Page 6 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 501 |
ai-bot-pro/achatbot
An open source chat bot architecture for voice/vision (and multimodal)... |
|
Established |
| 502 |
Finrandojin/alexandria-audiobook
AI-powered multi-voice audiobook generator — LLM script annotation, voice... |
|
Established |
| 503 |
goodatlas/zeroth
Kaldi-based Korean ASR (한국어 음성인식) open-source project |
|
Established |
| 504 |
dmotz/thing-translator
📷 🗣 Point your camera at things to hear how to say them in a different language |
|
Established |
| 505 |
j3soon/whisper-to-input
An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI... |
|
Established |
| 506 |
jackaduma/CycleGAN-VC2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2 |
|
Established |
| 507 |
moeru-ai/unspeech
🗣️🔊 Your Text-to-Speech Services, All-in-One. |
|
Established |
| 508 |
liuli-moe/to-the-stars
魔法少女小圆 飞向星空 中文翻译 |
|
Established |
| 509 |
hasscc/hass-edge-tts
🗣️ Microsoft Edge TTS for Home Assistant, no need for app_key |
|
Established |
| 510 |
inevolin/DiscordEarsBot
A speech-to-text framework and bot for Discord. Take control of your Discord... |
|
Established |
| 511 |
woheller69/whoBIRD
Identify bird sounds in real time with this Android version of BirdNET. Bird... |
|
Established |
| 512 |
sdsds222/Unitale
一个基于Indextts和Qwen3TTS的 AI 有声书制作工具。利用 LLM 自动拆解剧本与识别情绪,集成多角色 TTS... |
|
Established |
| 513 |
NTT123/vietTTS
Vietnamese Text to Speech library |
|
Established |
| 514 |
Gr122lyBr/voicetag
Speaker identification powered by pyannote and resemblyzer |
|
Established |
| 515 |
SamirPaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in... |
|
Established |
| 516 |
ekwek1/soprano-factory
Soprano-Factory: Train your own 2000x realtime text-to-speech model |
|
Established |
| 517 |
WhisperSpeech/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper. |
|
Established |
| 518 |
Azure-Samples/Cognitive-Services-Voice-Assistant
Welcome to the Microsoft Voice Assistant samples repository! Here you will... |
|
Established |
| 519 |
ZDisket/TensorVox
Desktop application for neural speech synthesis written in C++ |
|
Established |
| 520 |
hirofumi0810/tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC,... |
|
Established |
| 521 |
israelg99/deepvoice
Deep Voice: Real-time Neural Text-to-Speech |
|
Established |
| 522 |
AlexandaJerry/vits-mandarin-biaobei
application of vits on mandarin tts |
|
Established |
| 523 |
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion |
|
Established |
| 524 |
xkeyC/fl_caption
Offline real-time captioning software written in Flutter and Rust, powered... |
|
Established |
| 525 |
vlomme/Multi-Tacotron-Voice-Cloning
Phoneme multilingual(Russian-English) voice cloning based on |
|
Established |
| 526 |
FlashLabs-AI-Corp/FlashLabs-Chroma
Worlds first open-source real-time end-to-end spoken dialogue model with... |
|
Established |
| 527 |
jiaqili3/DualCodec
[Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural... |
|
Established |
| 528 |
iMicknl/azure-podcast-generator
Generate an engaging podcast based on your document using Azure OpenAI and... |
|
Emerging |
| 529 |
ddPn08/rvc-webui
liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project |
|
Emerging |
| 530 |
Gautham495/react-native-speech-recognition-kit
React Native Turbo Module to access Speech Recognition in Android & iOS |
|
Emerging |
| 531 |
litagin02/rvc-tts-webui
Text-to-Speech Gradio webui using RVC and edge-tts |
|
Emerging |
| 532 |
seungwonpark/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2) |
|
Emerging |
| 533 |
voice-cloning-app/Voice-Cloning-App
A Python/Pytorch app for easily synthesising human voices |
|
Emerging |
| 534 |
rakeshvar/rnn_ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with... |
|
Emerging |
| 535 |
jonatasgrosman/asrecognition
ASRecognition: just an easy-to-use library for Automatic Speech Recognition. |
|
Emerging |
| 536 |
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text... |
|
Emerging |
| 537 |
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS |
|
Emerging |
| 538 |
Artrajz/vits-simple-api
A simple VITS HTTP API, developed by extending Moegoe with additional features. |
|
Emerging |
| 539 |
SlapBot/stephanie-va
Stephanie is an open-source platform built specifically for voice-controlled... |
|
Emerging |
| 540 |
dessa-oss/fake-voice-detection
Using temporal convolution to detect Audio Deepfakes |
|
Emerging |
| 541 |
DragonComputer/Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions |
|
Emerging |
| 542 |
santi-pdp/pase
Problem Agnostic Speech Encoder |
|
Emerging |
| 543 |
arghyasur1991/Spark-TTS-Unity
Unity package for using Spark-TTS on-device models. This is a C# port of... |
|
Emerging |
| 544 |
nitaiaharoni1/whisper-speech-to-text
Whisper Speech-to-Text is a JavaScript library for recording and... |
|
Emerging |
| 545 |
pedroetb/tts-api
Text to speech REST API for multiple TTS engines |
|
Emerging |
| 546 |
jeroenterheerdt/pycsspeechtts
Python (py) library to use Microsofts Cognitive Services Speech (csspeech)... |
|
Emerging |
| 547 |
mpaepper/vibevoice
Fast local speech-to-text for any app using faster-whisper |
|
Emerging |
| 548 |
p0p4k/vits2_pytorch
unofficial vits2-TTS implementation in pytorch |
|
Emerging |
| 549 |
jim-schwoebel/voicebook
🗣️ A book and repo to get you started programming voice computing... |
|
Emerging |
| 550 |
analyticsinmotion/werx
🐍📦 Easy-to-use Python package for lightning-fast Word Error Rate (WER) analysis |
|
Emerging |
| 551 |
woheller69/whisperIME
Android Input Method Editor (IME) based on Whisper |
|
Emerging |
| 552 |
gionanide/Speech_Signal_Processing_and_Classification
Front-end speech processing aims at extracting proper features from short-... |
|
Emerging |
| 553 |
junzew/HanTTS
Chinese Text-to-Speech web service |
|
Emerging |
| 554 |
simonw/ospeak
CLI tool for running text through OpenAI Text to speech |
|
Emerging |
| 555 |
C-Loftus/QuickPiperAudiobook
With one command, create a natural-sounding audiobook from a variety of... |
|
Emerging |
| 556 |
modal-labs/quillman
A voice chat app |
|
Emerging |
| 557 |
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model. |
|
Emerging |
| 558 |
OpenVoiceOS/ovos-buildroot
Open Voice Operating System - Buildroot edition is a minimalistic linux OS... |
|
Emerging |
| 559 |
vasistalodagala/whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR)... |
|
Emerging |
| 560 |
thuhcsi/Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS... |
|
Emerging |
| 561 |
juntaosun/ComeCut
「来剪」轻量级视频编辑器。网页版、桌面版等均可免费使用,功能灵感源自 CapCut 等编辑器。A Lightweight Video Editor.... |
|
Emerging |
| 562 |
tugstugi/pytorch-dc-tts
Text to Speech with PyTorch (English and Mongolian) |
|
Emerging |
| 563 |
revdotcom/fstalign
An efficient OpenFST-based tool for calculating WER and aligning two... |
|
Emerging |
| 564 |
Lex-au/Vocalis
Speech-to-speech AI assistant with natural conversation flow, mid-speech... |
|
Emerging |
| 565 |
PriesiaMioShirakana/DragonianVoice
多个SVC/TTS的C++推理库 |
|
Emerging |
| 566 |
savbell/whisper-writer
💬📝 A small dictation app using OpenAI's Whisper speech recognition model. |
|
Emerging |
| 567 |
jhuus/HawkEars1
⚠️ HawkEars 1.0 (obsolete). See HawkEars 2.0 → https://github.com/jhuus/HawkEars |
|
Emerging |
| 568 |
opendilab/CleanS2S
High-quality and streaming Speech-to-Speech interactive agent in a single... |
|
Emerging |
| 569 |
dhruvapte26/B.E.N.J.I.
B.E.N.J.I.- The Impossible Missions Force's digital assistant |
|
Emerging |
| 570 |
belambert/asr-evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word... |
|
Emerging |
| 571 |
vannu07/jarvis
🤖 Jarvis - AI Voice Assistant with Face Recognition | Hacktoberfest 2025... |
|
Emerging |
| 572 |
Poeschl/Hassio-Addons
The repository for my Home Assistant Supervisor Add-ons. |
|
Emerging |
| 573 |
Audio-WestlakeU/VINP
Official PyTorch implementation of 'VINP: Variational Bayesian Inference... |
|
Emerging |
| 574 |
robmsmt/KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation |
|
Emerging |
| 575 |
OpenBMB/UltraEval-Audio
Your faithful, impartial partner for audio evaluation — know yourself, know... |
|
Emerging |
| 576 |
eheikes/tts
Tools to convert text to speech :books::speech_balloon: |
|
Emerging |
| 577 |
google/tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end... |
|
Emerging |
| 578 |
sergenes/runandread-audiobook
🚀 Open-source project for creating high-quality AI TTS-narrated audiobooks... |
|
Emerging |
| 579 |
ARBML/klaam
Arabic speech recognition, classification and text-to-speech. |
|
Emerging |
| 580 |
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis,... |
|
Emerging |
| 581 |
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating... |
|
Emerging |
| 582 |
NevilPatel01/RVC-WebUI-MacOS
Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs... |
|
Emerging |
| 583 |
pnnbao97/sea-g2p
Fast multilingual text-to-phoneme converter for South East Asian languages. |
|
Emerging |
| 584 |
deepgram/deepgram-go-sdk
Official Go SDK for Deepgram. |
|
Emerging |
| 585 |
233stone/vocotype-cli
VocoType 是一款运行在本地端侧的隐私安全语音输入工具,通过快捷键即可将语音实时转换为文字并自动输入到当前应用。支持语音转文字MCP、AI... |
|
Emerging |
| 586 |
scionoftech/DeepAsr
Keras(Tensorflow) implementations of Automatic Speech Recognition |
|
Emerging |
| 587 |
hehehai/voxt
🎙️Voice input and translation app for macOS. Press to talk, release to paste. |
|
Emerging |
| 588 |
alumae/kaldi-offline-transcriber
Offline transcription system for Estonian using Kaldi |
|
Emerging |
| 589 |
mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion... |
|
Emerging |
| 590 |
lucoiso/UEAzSpeech
This plugin integrates Azure Speech Cognitive Services in Unreal Engine. |
|
Emerging |
| 591 |
hegedustibor/htgo-tts
Text to speech package for Golang. |
|
Emerging |
| 592 |
ModelTC/LightTTS
LightTTS is a lightweight TTS inference framework optimized for CosyVoice2... |
|
Emerging |
| 593 |
haolinwang819-boop/ai-video-generation-workflow
AI video generation workflow with script, slides, TTS, subtitles, and FFmpeg... |
|
Emerging |
| 594 |
Kaljurand/dictate.js
A small Javascript library for browser-based real-time speech recognition,... |
|
Emerging |
| 595 |
YaoFANGUK/video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI... |
|
Emerging |
| 596 |
liangstein/Chinese-speech-to-text
Chinese Speech To Text Using Wavenet |
|
Emerging |
| 597 |
TheStageAI/TheWhisper
Optimized Whisper models for streaming and on-device use |
|
Emerging |
| 598 |
ivanvovk/durian-pytorch
Implementation of "Duration Informed Attention Network for Multimodal... |
|
Emerging |
| 599 |
upskyy/Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for... |
|
Emerging |
| 600 |
ActiveNick/HoloBot
HoloBot is a reusable 3D interface that allows HoloLens & VR users to... |
|
Emerging |