All Voice AI Tools
6,981 tools ranked by quality score · Page 4 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 301 |
mateogon/pdf-narrator
Convert your PDFs and EPUBs into audiobooks effortlessly. Features... |
|
Established |
| 302 |
zai-org/GLM-ASR
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters |
|
Established |
| 303 |
lucasjinreal/Kokoros
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast,... |
|
Established |
| 304 |
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model... |
|
Established |
| 305 |
HumeAI/hume-typescript-sdk
Add Hume AI to any TypeScript project |
|
Established |
| 306 |
frostming/tetos
A unified interface for multiple Text-to-Speech (TTS) providers. |
|
Established |
| 307 |
jpreprocess/jpreprocess
Japanese text preprocessor for Text-to-Speech applications (OpenJTalk... |
|
Established |
| 308 |
codename0og/codename-rvc-fork-4
Codename's rvc fork version 4, based on Applio. |
|
Established |
| 309 |
Blaizzy/mlx-audio-swift
A modular Swift SDK for audio processing with MLX on Apple Silicon |
|
Established |
| 310 |
ArkanDash/Advanced-RVC-Inference
Advanced RVC Inference for quicker and effortless model downloads |
|
Established |
| 311 |
jtCodes/lyrictor
Browser-based lyric video editor built for complex timelines with hundreds... |
|
Established |
| 312 |
stemrollerapp/stemroller
Isolate vocals, drums, bass, and other instrumental stems from any song |
|
Established |
| 313 |
TrevorS/voxtral-mini-realtime-rs
Streaming speech recognition running natively and in the browser. A pure... |
|
Established |
| 314 |
Atm4x/tts-with-rvc
TTS with RVC-module to generate .wav audios |
|
Established |
| 315 |
crlandsc/torch-log-wmse
logWMSE, an audio quality metric & loss function with support for digital... |
|
Established |
| 316 |
revdotcom/revai-python-sdk
Rev AI Python SDK |
|
Established |
| 317 |
RageAgainstThePixel/com.rest.elevenlabs
A non-official Eleven Labs voice synthesis client for Unity (UPM) |
|
Established |
| 318 |
drmfinlay/tts-util-app
TTS Util — Text-to-speech utility Android app for synthesising text into... |
|
Established |
| 319 |
supertone-inc/supertonic-py
Lightning-Fast, On-Device TTS — running natively via ONNX. |
|
Established |
| 320 |
Notely-Voice/NotelyVoice
A 100% private AI voice transcription app that converts speech to text in... |
|
Established |
| 321 |
alphacep/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi... |
|
Established |
| 322 |
PaciStardust/HOSCY
Companion for OSC and Communication |
|
Established |
| 323 |
IhorShevchuk/piper-app
The original Piper, now on iOS and macOS |
|
Established |
| 324 |
Lex-au/Orpheus-FastAPI
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices,... |
|
Established |
| 325 |
LibreSpark/LibreTTS
TTS-文本转语音/文本转语音前端,兼容OpenAI、EdgeTTS等接口 |
|
Established |
| 326 |
emnikhil/Sign-Language-To-Text-Conversion
Sign Language to Text Conversion is a real-time system that uses a camera to... |
|
Established |
| 327 |
taigrr/elevenlabs
ElevenLabs Artificial Voice Synthesis Client |
|
Established |
| 328 |
nullabork/talkbot
Text-to-speech and translation bot for Discord |
|
Established |
| 329 |
feldberlin/timething
Timething is a library for aligning text transcripts with their audio recordings. |
|
Established |
| 330 |
common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset |
|
Established |
| 331 |
gustavostz/whisper-clip
WhisperClip simplifies your life by automatically transcribing audio... |
|
Established |
| 332 |
wxxxcxx/ms-ra-forwarder
免费的在线文本转语音API |
|
Established |
| 333 |
jianchang512/ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface... |
|
Established |
| 334 |
mewmix/nabu
A multi engine TTS & LLM edge computing playground with audio book features... |
|
Established |
| 335 |
ciffelia/koe
Discord 読み上げ Bot |
|
Established |
| 336 |
supersu-man/pyt2s
The Python Text to Speech library you've been looking for. |
|
Established |
| 337 |
hetpandya/youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos |
|
Established |
| 338 |
botbahlul/PyAutoSRT
PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free... |
|
Established |
| 339 |
Aivis-Project/aivmlib
Aivis Voice Model File (.aivm/.aivmx) Utility Library |
|
Established |
| 340 |
deepgram-starters/node-transcription
Get started using Deepgram's Transcription with this Node demo app |
|
Established |
| 341 |
thewh1teagle/pyannote-rs
pyannote audio diarization in rust |
|
Established |
| 342 |
Jaymon/transcribe
Convert images or audio files to plain text on the command line |
|
Established |
| 343 |
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project. |
|
Established |
| 344 |
pot-app/pot-desktop
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition. |
|
Established |
| 345 |
BoltzmannEntropy/MimikaStudio
MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic... |
|
Established |
| 346 |
Henry-23/VideoChat
实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human,... |
|
Established |
| 347 |
rzru/nightingale
Machine learning powered Karaoke app (with scores!) |
|
Established |
| 348 |
Macoron/whisper.unity
Running speech to text model (whisper.cpp) in Unity3d on your local machine. |
|
Established |
| 349 |
hgneng/ekho
Chinese text-to-speech engine |
|
Established |
| 350 |
pnlpal/dictionariez
📚 A customizable dictionary extension that supports double-click lookups in... |
|
Established |
| 351 |
hugobloem/wyoming-microsoft-tts
Wyoming protocol server for Microsoft Azure text-to-speech |
|
Established |
| 352 |
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统 |
|
Established |
| 353 |
primepake/wav2lip_288x288
Wav2Lip version 288 and pipeline to train |
|
Established |
| 354 |
deepgram-starters/node-voice-agent
Get started using Deepgram's Voice Agent with this Node demo app |
|
Established |
| 355 |
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit. |
|
Established |
| 356 |
aedocw/epub2tts
Turn an epub or text file into an audiobook |
|
Established |
| 357 |
solyarisoftware/voskJs
Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server. |
|
Established |
| 358 |
misyaguziya/VRCT
VRCT(VRChat Chatbox Translator & Transcription) |
|
Established |
| 359 |
HeyWillow/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive... |
|
Established |
| 360 |
Thiagohgl/ai-pronunciation-trainer
This tool uses AI to evaluate your pronunciation. |
|
Established |
| 361 |
mgonzs13/audio_common
A PortAudio based audio_common with text to speech for ROS 2 |
|
Established |
| 362 |
Picovoice/leopard
On-device speech-to-text engine powered by deep learning |
|
Established |
| 363 |
OpenVoiceOS/ovos-tts-plugin-espeakNG
espeakNG plugin |
|
Established |
| 364 |
adrianlyjak/obsidian-aloud-tts
Obsidian TTS Plugin |
|
Established |
| 365 |
FENRlR/MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch |
|
Established |
| 366 |
avinashvarna/sanskrit_tts
Sanskrit text to speech |
|
Established |
| 367 |
saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper... |
|
Established |
| 368 |
soniqo/speech-swift
AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and... |
|
Established |
| 369 |
gooofy/zerovox
zero-shot realtime TTS system, fully offline, free and open source |
|
Established |
| 370 |
Weilbyte/tiktok-tts
Generate TikTok Text-to-Speech voices in your browser |
|
Established |
| 371 |
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model |
|
Established |
| 372 |
alphacep/awesome-russian-speech
Russian speech technology links |
|
Established |
| 373 |
zaigie/FunSpeech
开箱即用的本地私有化部署语音服务,快速搭建FunASR与CosyVoice2/3后端 |
|
Established |
| 374 |
thorstenMueller/Thorsten-Voice
Thorsten-Voice: A free to use, offline working, high quality german TTS... |
|
Established |
| 375 |
reazon-research/ReazonSpeech
Massive open Japanese speech corpus |
|
Established |
| 376 |
mlalma/KokoroTestApp
Test application for Kokoro TTS model |
|
Established |
| 377 |
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,... |
|
Established |
| 378 |
manyeyes/ManySpeech
AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment... |
|
Established |
| 379 |
TuananhCR/Dia-Finetuning-Vietnamese
TTS Dia finetuning for Vietnamese |
|
Established |
| 380 |
davidamacey/OpenTranscribe
Self-hosted AI-powered transcription platform with speaker diarization,... |
|
Established |
| 381 |
asiff00/On-Device-Speech-to-Speech-Conversational-AI
This is an on-CPU real-time conversational system for two-way speech... |
|
Established |
| 382 |
pierreaubert/spinorama
A library to display and compare spinorama (speakers measurements) graphs. |
|
Established |
| 383 |
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech... |
|
Established |
| 384 |
mallorbc/whisper_mic
Project that allows one to use a microphone with OpenAI whisper. |
|
Established |
| 385 |
lkuza2/java-speech-api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using... |
|
Established |
| 386 |
spring-media/TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based... |
|
Established |
| 387 |
gokhaneraslan/chatterbox-finetuning
Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports... |
|
Established |
| 388 |
riderodd/react-native-vosk
Speech recognition module for react native using Vosk library |
|
Established |
| 389 |
ekwek1/soprano
Soprano: Instant, Ultra-Realistic Text-to-Speech |
|
Established |
| 390 |
philipperemy/deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System. |
|
Established |
| 391 |
drethage/speech-denoising-wavenet
A neural network for end-to-end speech denoising |
|
Established |
| 392 |
Devansh-47/Sign-Language-To-Text-and-Speech-Conversion
This is a python application which converts american sign language into text... |
|
Established |
| 393 |
alexa-pi/AlexaPi
Alexa client for all your devices! # No active development. PRs welcome #... |
|
Established |
| 394 |
canopyai/Orpheus-TTS
Towards Human-Sounding Speech |
|
Established |
| 395 |
alumae/kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit... |
|
Established |
| 396 |
AI4Bharat/Chitralekha
Chitralekha - A video transcreation platform for Indic languages, supporting... |
|
Established |
| 397 |
speechio/chinese_text_normalization
Chinese text normalization for speech processing |
|
Established |
| 398 |
MycroftAI/adapt
Adapt Intent Parser |
|
Established |
| 399 |
keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with... |
|
Established |
| 400 |
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search |
|
Established |