All Voice AI Tools

6,981 tools ranked by quality score · Page 6 of 70

Showing 501–600 of 6,981
# Tool Score Tier
501 ai-bot-pro/achatbot

An open source chat bot architecture for voice/vision (and multimodal)...

50
Established
502 Finrandojin/alexandria-audiobook

AI-powered multi-voice audiobook generator — LLM script annotation, voice...

50
Established
503 goodatlas/zeroth

Kaldi-based Korean ASR (한국어 음성인식) open-source project

50
Established
504 dmotz/thing-translator

📷 🗣 Point your camera at things to hear how to say them in a different language

50
Established
505 j3soon/whisper-to-input

An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI...

50
Established
506 jackaduma/CycleGAN-VC2

Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2

50
Established
507 moeru-ai/unspeech

🗣️🔊 Your Text-to-Speech Services, All-in-One.

50
Established
508 liuli-moe/to-the-stars

魔法少女小圆 飞向星空 中文翻译

50
Established
509 hasscc/hass-edge-tts

🗣️ Microsoft Edge TTS for Home Assistant, no need for app_key

50
Established
510 inevolin/DiscordEarsBot

A speech-to-text framework and bot for Discord. Take control of your Discord...

50
Established
511 woheller69/whoBIRD

Identify bird sounds in real time with this Android version of BirdNET. Bird...

50
Established
512 sdsds222/Unitale

一个基于Indextts和Qwen3TTS的 AI 有声书制作工具。利用 LLM 自动拆解剧本与识别情绪,集成多角色 TTS...

50
Established
513 NTT123/vietTTS

Vietnamese Text to Speech library

50
Established
514 Gr122lyBr/voicetag

Speaker identification powered by pyannote and resemblyzer

50
Established
515 SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in...

50
Established
516 ekwek1/soprano-factory

Soprano-Factory: Train your own 2000x realtime text-to-speech model

50
Established
517 WhisperSpeech/WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

50
Established
518 Azure-Samples/Cognitive-Services-Voice-Assistant

Welcome to the Microsoft Voice Assistant samples repository! Here you will...

50
Established
519 ZDisket/TensorVox

Desktop application for neural speech synthesis written in C++

50
Established
520 hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC,...

50
Established
521 israelg99/deepvoice

Deep Voice: Real-time Neural Text-to-Speech

50
Established
522 AlexandaJerry/vits-mandarin-biaobei

application of vits on mandarin tts

50
Established
523 svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

50
Established
524 xkeyC/fl_caption

Offline real-time captioning software written in Flutter and Rust, powered...

50
Established
525 vlomme/Multi-Tacotron-Voice-Cloning

Phoneme multilingual(Russian-English) voice cloning based on

50
Established
526 FlashLabs-AI-Corp/FlashLabs-Chroma

Worlds first open-source real-time end-to-end spoken dialogue model with...

50
Established
527 jiaqili3/DualCodec

[Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural...

50
Established
528 iMicknl/azure-podcast-generator

Generate an engaging podcast based on your document using Azure OpenAI and...

49
Emerging
529 ddPn08/rvc-webui

liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project

49
Emerging
530 Gautham495/react-native-speech-recognition-kit

React Native Turbo Module to access Speech Recognition in Android & iOS

49
Emerging
531 litagin02/rvc-tts-webui

Text-to-Speech Gradio webui using RVC and edge-tts

49
Emerging
532 seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

49
Emerging
533 voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

49
Emerging
534 rakeshvar/rnn_ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with...

49
Emerging
535 jonatasgrosman/asrecognition

ASRecognition: just an easy-to-use library for Automatic Speech Recognition.

49
Emerging
536 mozilla/DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text...

49
Emerging
537 metavoiceio/metavoice-src

Foundational model for human-like, expressive TTS

49
Emerging
538 Artrajz/vits-simple-api

A simple VITS HTTP API, developed by extending Moegoe with additional features.

49
Emerging
539 SlapBot/stephanie-va

Stephanie is an open-source platform built specifically for voice-controlled...

49
Emerging
540 dessa-oss/fake-voice-detection

Using temporal convolution to detect Audio Deepfakes

49
Emerging
541 DragonComputer/Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

49
Emerging
542 santi-pdp/pase

Problem Agnostic Speech Encoder

49
Emerging
543 arghyasur1991/Spark-TTS-Unity

Unity package for using Spark-TTS on-device models. This is a C# port of...

49
Emerging
544 nitaiaharoni1/whisper-speech-to-text

Whisper Speech-to-Text is a JavaScript library for recording and...

49
Emerging
545 pedroetb/tts-api

Text to speech REST API for multiple TTS engines

49
Emerging
546 jeroenterheerdt/pycsspeechtts

Python (py) library to use Microsofts Cognitive Services Speech (csspeech)...

49
Emerging
547 mpaepper/vibevoice

Fast local speech-to-text for any app using faster-whisper

49
Emerging
548 p0p4k/vits2_pytorch

unofficial vits2-TTS implementation in pytorch

49
Emerging
549 jim-schwoebel/voicebook

🗣️ A book and repo to get you started programming voice computing...

49
Emerging
550 analyticsinmotion/werx

🐍📦 Easy-to-use Python package for lightning-fast Word Error Rate (WER) analysis

49
Emerging
551 woheller69/whisperIME

Android Input Method Editor (IME) based on Whisper

49
Emerging
552 gionanide/Speech_Signal_Processing_and_Classification

Front-end speech processing aims at extracting proper features from short-...

49
Emerging
553 junzew/HanTTS

Chinese Text-to-Speech web service

49
Emerging
554 simonw/ospeak

CLI tool for running text through OpenAI Text to speech

49
Emerging
555 C-Loftus/QuickPiperAudiobook

With one command, create a natural-sounding audiobook from a variety of...

49
Emerging
556 modal-labs/quillman

A voice chat app

49
Emerging
557 myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

49
Emerging
558 OpenVoiceOS/ovos-buildroot

Open Voice Operating System - Buildroot edition is a minimalistic linux OS...

49
Emerging
559 vasistalodagala/whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR)...

49
Emerging
560 thuhcsi/Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS...

49
Emerging
561 juntaosun/ComeCut

「来剪」轻量级视频编辑器。网页版、桌面版等均可免费使用,功能灵感源自 CapCut 等编辑器。A Lightweight Video Editor....

49
Emerging
562 tugstugi/pytorch-dc-tts

Text to Speech with PyTorch (English and Mongolian)

49
Emerging
563 revdotcom/fstalign

An efficient OpenFST-based tool for calculating WER and aligning two...

49
Emerging
564 Lex-au/Vocalis

Speech-to-speech AI assistant with natural conversation flow, mid-speech...

49
Emerging
565 PriesiaMioShirakana/DragonianVoice

多个SVC/TTS的C++推理库

49
Emerging
566 savbell/whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

49
Emerging
567 jhuus/HawkEars1

⚠️ HawkEars 1.0 (obsolete). See HawkEars 2.0 → https://github.com/jhuus/HawkEars

49
Emerging
568 opendilab/CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single...

49
Emerging
569 dhruvapte26/B.E.N.J.I.

B.E.N.J.I.- The Impossible Missions Force's digital assistant

49
Emerging
570 belambert/asr-evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word...

49
Emerging
571 vannu07/jarvis

🤖 Jarvis - AI Voice Assistant with Face Recognition | Hacktoberfest 2025...

49
Emerging
572 Poeschl/Hassio-Addons

The repository for my Home Assistant Supervisor Add-ons.

49
Emerging
573 Audio-WestlakeU/VINP

Official PyTorch implementation of 'VINP: Variational Bayesian Inference...

49
Emerging
574 robmsmt/KerasDeepSpeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

49
Emerging
575 OpenBMB/UltraEval-Audio

Your faithful, impartial partner for audio evaluation — know yourself, know...

49
Emerging
576 eheikes/tts

Tools to convert text to speech :books::speech_balloon:

49
Emerging
577 google/tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end...

49
Emerging
578 sergenes/runandread-audiobook

🚀 Open-source project for creating high-quality AI TTS-narrated audiobooks...

49
Emerging
579 ARBML/klaam

Arabic speech recognition, classification and text-to-speech.

49
Emerging
580 zzw922cn/awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis,...

49
Emerging
581 rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating...

49
Emerging
582 NevilPatel01/RVC-WebUI-MacOS

Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs...

49
Emerging
583 pnnbao97/sea-g2p

Fast multilingual text-to-phoneme converter for South East Asian languages.

49
Emerging
584 deepgram/deepgram-go-sdk

Official Go SDK for Deepgram.

49
Emerging
585 233stone/vocotype-cli

VocoType 是一款运行在本地端侧的隐私安全语音输入工具,通过快捷键即可将语音实时转换为文字并自动输入到当前应用。支持语音转文字MCP、AI...

49
Emerging
586 scionoftech/DeepAsr

Keras(Tensorflow) implementations of Automatic Speech Recognition

48
Emerging
587 hehehai/voxt

🎙️Voice input and translation app for macOS. Press to talk, release to paste.

48
Emerging
588 alumae/kaldi-offline-transcriber

Offline transcription system for Estonian using Kaldi

48
Emerging
589 mozilla/TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion...

48
Emerging
590 lucoiso/UEAzSpeech

This plugin integrates Azure Speech Cognitive Services in Unreal Engine.

48
Emerging
591 hegedustibor/htgo-tts

Text to speech package for Golang.

48
Emerging
592 ModelTC/LightTTS

LightTTS is a lightweight TTS inference framework optimized for CosyVoice2...

48
Emerging
593 haolinwang819-boop/ai-video-generation-workflow

AI video generation workflow with script, slides, TTS, subtitles, and FFmpeg...

48
Emerging
594 Kaljurand/dictate.js

A small Javascript library for browser-based real-time speech recognition,...

48
Emerging
595 YaoFANGUK/video-subtitle-extractor

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI...

48
Emerging
596 liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

48
Emerging
597 TheStageAI/TheWhisper

Optimized Whisper models for streaming and on-device use

48
Emerging
598 ivanvovk/durian-pytorch

Implementation of "Duration Informed Attention Network for Multimodal...

48
Emerging
599 upskyy/Squeezeformer

PyTorch implementation of "Squeezeformer: An Efficient Transformer for...

48
Emerging
600 ActiveNick/HoloBot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to...

48
Emerging
« Prev 1 2 3 4 5 6 7 8 68 69 70 Next »