All Voice AI Tools

6,981 tools ranked by quality score · Page 4 of 70

Showing 301–400 of 6,981
# Tool Score Tier
301 mateogon/pdf-narrator

Convert your PDFs and EPUBs into audiobooks effortlessly. Features...

54
Established
302 zai-org/GLM-ASR

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

54
Established
303 lucasjinreal/Kokoros

🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast,...

54
Established
304 stepfun-ai/Step-Audio-EditX

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model...

54
Established
305 HumeAI/hume-typescript-sdk

Add Hume AI to any TypeScript project

54
Established
306 frostming/tetos

A unified interface for multiple Text-to-Speech (TTS) providers.

54
Established
307 jpreprocess/jpreprocess

Japanese text preprocessor for Text-to-Speech applications (OpenJTalk...

54
Established
308 codename0og/codename-rvc-fork-4

Codename's RVC fork, version 4, based on Applio.

54
Established
309 Blaizzy/mlx-audio-swift

A modular Swift SDK for audio processing with MLX on Apple Silicon

54
Established
310 ArkanDash/Advanced-RVC-Inference

Advanced RVC Inference for quicker and effortless model downloads

54
Established
311 jtCodes/lyrictor

Browser-based lyric video editor built for complex timelines with hundreds...

54
Established
312 stemrollerapp/stemroller

Isolate vocals, drums, bass, and other instrumental stems from any song

54
Established
313 TrevorS/voxtral-mini-realtime-rs

Streaming speech recognition running natively and in the browser. A pure...

54
Established
314 Atm4x/tts-with-rvc

TTS with an RVC module to generate .wav audio files

54
Established
315 crlandsc/torch-log-wmse

logWMSE, an audio quality metric & loss function with support for digital...

54
Established
316 revdotcom/revai-python-sdk

Rev AI Python SDK

54
Established
317 RageAgainstThePixel/com.rest.elevenlabs

A non-official Eleven Labs voice synthesis client for Unity (UPM)

54
Established
318 drmfinlay/tts-util-app

TTS Util — Text-to-speech utility Android app for synthesising text into...

53
Established
319 supertone-inc/supertonic-py

Lightning-Fast, On-Device TTS — running natively via ONNX.

53
Established
320 Notely-Voice/NotelyVoice

A 100% private AI voice transcription app that converts speech to text in...

53
Established
321 alphacep/vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi...

53
Established
322 PaciStardust/HOSCY

Companion for OSC and Communication

53
Established
323 IhorShevchuk/piper-app

The original Piper, now on iOS and macOS

53
Established
324 Lex-au/Orpheus-FastAPI

High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices,...

53
Established
325 LibreSpark/LibreTTS

TTS: text-to-speech and text-to-speech front end, compatible with OpenAI, EdgeTTS, and other interfaces

53
Established
326 emnikhil/Sign-Language-To-Text-Conversion

Sign Language to Text Conversion is a real-time system that uses a camera to...

53
Established
327 taigrr/elevenlabs

ElevenLabs Artificial Voice Synthesis Client

53
Established
328 nullabork/talkbot

Text-to-speech and translation bot for Discord

53
Established
329 feldberlin/timething

Timething is a library for aligning text transcripts with their audio recordings.

53
Established
330 common-voice/cv-dataset

Metadata and versioning details for the Common Voice dataset

53
Established
331 gustavostz/whisper-clip

WhisperClip simplifies your life by automatically transcribing audio...

53
Established
332 wxxxcxx/ms-ra-forwarder

Free online text-to-speech API

53
Established
333 jianchang512/ChatTTS-ui

A simple local web interface that uses ChatTTS to synthesize text into speech and can also expose an API to external clients.

53
Established
334 mewmix/nabu

A multi engine TTS & LLM edge computing playground with audio book features...

53
Established
335 ciffelia/koe

Discord read-aloud (text-to-speech) bot

53
Established
336 supersu-man/pyt2s

The Python Text to Speech library you've been looking for.

53
Established
337 hetpandya/youtube_tts_data_generator

A Python library to generate speech datasets from YouTube videos

53
Established
338 botbahlul/PyAutoSRT

PySimpleGUI-based desktop app to auto-generate subtitle files (using free...

53
Established
339 Aivis-Project/aivmlib

Aivis Voice Model File (.aivm/.aivmx) Utility Library

53
Established
340 deepgram-starters/node-transcription

Get started using Deepgram's Transcription with this Node demo app

53
Established
341 thewh1teagle/pyannote-rs

pyannote audio diarization in Rust

53
Established
342 Jaymon/transcribe

Convert images or audio files to plain text on the command line

53
Established
343 kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

53
Established
344 pot-app/pot-desktop

🌈 A cross-platform app for text-selection translation and OCR.

53
Established
345 BoltzmannEntropy/MimikaStudio

MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic...

53
Established
346 Henry-23/VideoChat

Real-time interactive digital human with customizable appearance and voice, voice cloning support, and conversation latency as low as 3 s.

53
Established
347 rzru/nightingale

Machine learning powered Karaoke app (with scores!)

53
Established
348 Macoron/whisper.unity

Run a speech-to-text model (whisper.cpp) in Unity3D on your local machine.

53
Established
349 hgneng/ekho

Chinese text-to-speech engine

53
Established
350 pnlpal/dictionariez

📚 A customizable dictionary extension that supports double-click lookups in...

53
Established
351 hugobloem/wyoming-microsoft-tts

Wyoming protocol server for Microsoft Azure text-to-speech

53
Established
352 nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

53
Established
353 primepake/wav2lip_288x288

Wav2Lip version 288 and pipeline to train

53
Established
354 deepgram-starters/node-voice-agent

Get started using Deepgram's Voice Agent with this Node demo app

53
Established
355 unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

53
Established
356 aedocw/epub2tts

Turn an epub or text file into an audiobook

53
Established
357 solyarisoftware/voskJs

Vosk offline ASR engine API for Node.js developers, with a simple HTTP ASR server.

53
Established
358 misyaguziya/VRCT

VRCT(VRChat Chatbox Translator & Transcription)

52
Established
359 HeyWillow/willow

Open source, local, and self-hosted Amazon Echo/Google Home competitive...

52
Established
360 Thiagohgl/ai-pronunciation-trainer

This tool uses AI to evaluate your pronunciation.

52
Established
361 mgonzs13/audio_common

A PortAudio based audio_common with text to speech for ROS 2

52
Established
362 Picovoice/leopard

On-device speech-to-text engine powered by deep learning

52
Established
363 OpenVoiceOS/ovos-tts-plugin-espeakNG

espeakNG plugin

52
Established
364 adrianlyjak/obsidian-aloud-tts

Obsidian TTS Plugin

52
Established
365 FENRlR/MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

52
Established
366 avinashvarna/sanskrit_tts

Sanskrit text to speech

52
Established
367 saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper...

52
Established
368 soniqo/speech-swift

AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and...

52
Established
369 gooofy/zerovox

zero-shot realtime TTS system, fully offline, free and open source

52
Established
370 Weilbyte/tiktok-tts

Generate TikTok Text-to-Speech voices in your browser

52
Established
371 FunAudioLLM/SenseVoice

Multilingual Voice Understanding Model

52
Established
372 alphacep/awesome-russian-speech

Russian speech technology links

52
Established
373 zaigie/FunSpeech

Out-of-the-box speech service for local, private deployment; quickly set up FunASR and CosyVoice2/3 backends

52
Established
374 thorstenMueller/Thorsten-Voice

Thorsten-Voice: A free to use, offline working, high-quality German TTS...

52
Established
375 reazon-research/ReazonSpeech

Massive open Japanese speech corpus

52
Established
376 mlalma/KokoroTestApp

Test application for Kokoro TTS model

52
Established
377 abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,...

52
Established
378 manyeyes/ManySpeech

AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment...

52
Established
379 TuananhCR/Dia-Finetuning-Vietnamese

TTS Dia finetuning for Vietnamese

52
Established
380 davidamacey/OpenTranscribe

Self-hosted AI-powered transcription platform with speaker diarization,...

52
Established
381 asiff00/On-Device-Speech-to-Speech-Conversational-AI

This is an on-CPU real-time conversational system for two-way speech...

52
Established
382 pierreaubert/spinorama

A library to display and compare spinorama (speaker measurement) graphs.

51
Established
383 Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech...

51
Established
384 mallorbc/whisper_mic

Project that allows one to use a microphone with OpenAI Whisper.

51
Established
385 lkuza2/java-speech-api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using...

51
Established
386 spring-media/TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...

51
Established
387 gokhaneraslan/chatterbox-finetuning

Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports...

51
Established
388 riderodd/react-native-vosk

Speech recognition module for React Native using the Vosk library

51
Established
389 ekwek1/soprano

Soprano: Instant, Ultra-Realistic Text-to-Speech

51
Established
390 philipperemy/deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

51
Established
391 drethage/speech-denoising-wavenet

A neural network for end-to-end speech denoising

51
Established
392 Devansh-47/Sign-Language-To-Text-and-Speech-Conversion

This is a Python application that converts American Sign Language into text...

51
Established
393 alexa-pi/AlexaPi

Alexa client for all your devices! # No active development. PRs welcome #...

51
Established
394 canopyai/Orpheus-TTS

Towards Human-Sounding Speech

51
Established
395 alumae/kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit...

51
Established
396 AI4Bharat/Chitralekha

Chitralekha - A video transcreation platform for Indic languages, supporting...

51
Established
397 speechio/chinese_text_normalization

Chinese text normalization for speech processing

51
Established
398 MycroftAI/adapt

Adapt Intent Parser

51
Established
399 keithito/tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with...

51
Established
400 jaywalnut310/glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

51
Established