All Voice AI Tools

6,981 tools ranked by quality score · Page 4 of 70

Showing 301–400 of 6,981


#	Tool	Score	Tier	Category	Stars	Language
301	mateogon/pdf-narrator Convert your PDFs and EPUBs into audiobooks effortlessly. Features...	54	Established	ebook-to-audiobook-conversion	167	Python
302	zai-org/GLM-ASR GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters	54	Established	llm-scaling-architecture	759	Python
303	lucasjinreal/Kokoros 🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast,...	54	Established	kokoro-tts-ecosystem	735	Rust
304	stepfun-ai/Step-Audio-EditX A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model...	54	Established	zero-shot-voice-synthesis	884	Python
305	HumeAI/hume-typescript-sdk Add Hume AI to any TypeScript project	54	Established	web-speech-api-libraries	75	TypeScript
306	frostming/tetos A unified interface for multiple Text-to-Speech (TTS) providers.	54	Established	lightweight-tts-libraries	277	Python
307	jpreprocess/jpreprocess Japanese text preprocessor for Text-to-Speech applications (OpenJTalk...	54	Established	rust-tts-libraries	52	Rust
308	codename0og/codename-rvc-fork-4 Codename's rvc fork version 4, based on Applio.	54	Established	voice-cloning-tools	41	Python
309	Blaizzy/mlx-audio-swift A modular Swift SDK for audio processing with MLX on Apple Silicon	54	Established	ios-speech-frameworks	446	Swift
310	ArkanDash/Advanced-RVC-Inference Advanced RVC Inference for quicker and effortless model downloads	54	Established	voice-cloning-tools	68	Python
311	jtCodes/lyrictor Browser-based lyric video editor built for complex timelines with hundreds...	54	Established	text-to-video-generation	52	TypeScript
312	stemrollerapp/stemroller Isolate vocals, drums, bass, and other instrumental stems from any song	54	Established	audio-source-separation	3,052	Svelte
313	TrevorS/voxtral-mini-realtime-rs Streaming speech recognition running natively and in the browser. A pure...	54	Established	rust-speech-recognition	710	Rust
314	Atm4x/tts-with-rvc TTS with RVC-module to generate .wav audios	54	Established	coqui-tts-applications	40	Python
315	crlandsc/torch-log-wmse logWMSE, an audio quality metric & loss function with support for digital...	54	Established	audio-noise-reduction	45	Python
316	revdotcom/revai-python-sdk Rev AI Python SDK	54	Established	voice-ai-sdks	36	Python
317	RageAgainstThePixel/com.rest.elevenlabs A non-official Eleven Labs voice synthesis client for Unity (UPM)	54	Established	elevenlabs-integrations	105	C#
318	drmfinlay/tts-util-app TTS Util — Text-to-speech utility Android app for synthesising text into...	53	Established	android-speech-apps	176	Kotlin
319	supertone-inc/supertonic-py Lightning-Fast, On-Device TTS — running natively via ONNX.	53	Established	lightweight-tts-runtimes	16	Python
320	Notely-Voice/NotelyVoice A 100% private AI voice transcription app that converts speech to text in...	53	Established	local-voice-dictation	629	C++
321	alphacep/vosk-server WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi...	53	Established	vosk-asr-implementations	1,240	Python
322	PaciStardust/HOSCY Companion for OSC and Communication	53	Established	dotnet-tts-libraries	37	C#
323	IhorShevchuk/piper-app The original Piper, now on iOS and macOS	53	Established	piper-tts-ecosystem	35	Swift
324	Lex-au/Orpheus-FastAPI High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices,...	53	Established	text-to-speech-conversion	673	Python
325	LibreSpark/LibreTTS TTS: a text-to-speech frontend compatible with OpenAI, EdgeTTS, and other interfaces	53	Established	edge-tts-implementations	350	JavaScript
326	emnikhil/Sign-Language-To-Text-Conversion Sign Language to Text Conversion is a real-time system that uses a camera to...	53	Established	sign-language-recognition	348	Python
327	taigrr/elevenlabs ElevenLabs Artificial Voice Synthesis Client	53	Established	elevenlabs-integrations	64	Go
328	nullabork/talkbot Text-to-speech and translation bot for Discord	53	Established	discord-tts-bots	31	JavaScript
329	feldberlin/timething Timething is a library for aligning text transcripts with their audio recordings.	53	Established	whisper-subtitle-generation	130	Jupyter Notebook
330	common-voice/cv-dataset Metadata and versioning details for the Common Voice dataset	53	Established	speech-corpora-datasets	168	JavaScript
331	gustavostz/whisper-clip WhisperClip simplifies your life by automatically transcribing audio...	53	Established	speech-to-text-converters	137	Python
332	wxxxcxx/ms-ra-forwarder Free online text-to-speech API	53	Established	google-tts-libraries	1,030	TypeScript
333	jianchang512/ChatTTS-ui A simple local web interface that uses ChatTTS to synthesize text into speech and can also expose an API to external callers.	53	Established	self-hosted-tts-servers	7,521	Python
334	mewmix/nabu A multi engine TTS & LLM edge computing playground with audio book features...	53	Established	voice-assistant-frameworks	43	Kotlin
335	ciffelia/koe Discord read-aloud (text-to-speech) bot	53	Established	discord-tts-bots	43	Rust
336	supersu-man/pyt2s The Python Text to Speech library you've been looking for.	53	Established	lightweight-tts-libraries	36	Python
337	hetpandya/youtube_tts_data_generator A python library to generate speech dataset from Youtube videos	53	Established	tts-dataset-creation	37	Python
338	botbahlul/PyAutoSRT PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free...	53	Established	whisper-subtitle-generation	188	Python
339	Aivis-Project/aivmlib Aivis Voice Model File (.aivm/.aivmx) Utility Library	53	Established	openai-tts-applications	25	Python
340	deepgram-starters/node-transcription Get started using Deepgram's Transcription with this Node demo app	53	Established	deepgram-starter-projects	33	JavaScript
341	thewh1teagle/pyannote-rs pyannote audio diarization in rust	53	Established	parakeet-asr-implementations	108	Rust
342	Jaymon/transcribe Convert images or audio files to plain text on the command line	53	Established	real-time-voice-translation	30	Python
343	kaldi-asr/kaldi kaldi-asr/kaldi is the official location of the Kaldi project.	53	Established	kaldi-asr-ecosystem	15,346	Shell
344	pot-app/pot-desktop 🌈 A cross-platform application for select-to-translate and OCR (text translation and recognition).	53	Established	ios-speech-frameworks	17,383	JavaScript
345	BoltzmannEntropy/MimikaStudio MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic...	53	Established	qwen3-tts-applications	357	Dart
346	Henry-23/VideoChat Real-time interactive digital human with customizable appearance and voice, voice cloning support, and conversation latency as low as 3 s.	53	Established	ai-avatar-platforms	1,223	Python
347	rzru/nightingale Machine learning powered Karaoke app (with scores!)	53	Established	audio-music-learning	548	Rust
348	Macoron/whisper.unity Running speech to text model (whisper.cpp) in Unity3d on your local machine.	53	Established	whisper-framework-ports	704	C#
349	hgneng/ekho Chinese text-to-speech engine	53	Established	lightweight-tts-runtimes	1,202	Lex
350	pnlpal/dictionariez 📚 A customizable dictionary extension that supports double-click lookups in...	53	Established	ai-powered-ereaders	635	JavaScript
351	hugobloem/wyoming-microsoft-tts Wyoming protocol server for Microsoft Azure text-to-speech	53	Established	lightweight-tts-runtimes	25	Python
352	nl8590687/ASRT_SpeechRecognition A Deep-Learning-Based Chinese Speech Recognition System	53	Established	ctc-asr-implementations	8,359	Python
353	primepake/wav2lip_288x288 Wav2Lip version 288 and pipeline to train	53	Established	lip-reading-synthesis	642	Python
354	deepgram-starters/node-voice-agent Get started using Deepgram's Voice Agent with this Node demo app	53	Established	deepgram-starter-projects	31	JavaScript
355	unilight/seq2seq-vc A sequence-to-sequence voice conversion toolkit.	53	Established	zero-shot-voice-synthesis	108	Jupyter Notebook
356	aedocw/epub2tts Turn an epub or text file into an audiobook	53	Established	text-to-speech	903	Python
357	solyarisoftware/voskJs Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.	53	Established	vosk-asr-implementations	56	JavaScript
358	misyaguziya/VRCT VRCT(VRChat Chatbox Translator & Transcription)	52	Established	dotnet-tts-libraries	340	Python
359	HeyWillow/willow Open source, local, and self-hosted Amazon Echo/Google Home competitive...	52	Established	voice-assistant-applications	2,987	C
360	Thiagohgl/ai-pronunciation-trainer This tool uses AI to evaluate your pronunciation.	52	Established	ai-tutoring-platforms	452	Python
361	mgonzs13/audio_common A PortAudio based audio_common with text to speech for ROS 2	52	Established	lightweight-tts-libraries	32	C++
362	Picovoice/leopard On-device speech-to-text engine powered by deep learning	52	Established	funasr-speech-recognition	474	Python
363	OpenVoiceOS/ovos-tts-plugin-espeakNG espeakNG plugin	52	Established	espeak-ng-ecosystem	2	Python
364	adrianlyjak/obsidian-aloud-tts Obsidian TTS Plugin	52	Established	edge-tts-implementations	80	TypeScript
365	FENRlR/MB-iSTFT-VITS2 Application of MB-iSTFT-VITS components to vits2_pytorch	52	Established	vits-tts-implementations	134	Python
366	avinashvarna/sanskrit_tts Sanskrit text to speech	52	Established	lightweight-tts-libraries	33	Python
367	saharmor/whisper-playground Build real time speech2text web apps using OpenAI's Whisper...	52	Established	whisper-transcription-apps	833	Python
368	soniqo/speech-swift AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and...	52	Established	ios-speech-frameworks	417	Swift
369	gooofy/zerovox zero-shot realtime TTS system, fully offline, free and open source	52	Established	text-to-speech-frameworks	51	Python
370	Weilbyte/tiktok-tts Generate TikTok Text-to-Speech voices in your browser	52	Established	telegram-voice-transcription	419	JavaScript
371	FunAudioLLM/SenseVoice Multilingual Voice Understanding Model	52	Established	voice-assistant-devices	7,691	Python
372	alphacep/awesome-russian-speech Russian speech technology links	52	Established	voice-ai-learning-collections	370	—
373	zaigie/FunSpeech An out-of-the-box speech service for local, private deployment that quickly sets up FunASR and CosyVoice2/3 backends	52	Established	funasr-speech-recognition	111	Python
374	thorstenMueller/Thorsten-Voice Thorsten-Voice: A free to use, offline working, high quality german TTS...	52	Established	coqui-tts-applications	705	Python
375	reazon-research/ReazonSpeech Massive open Japanese speech corpus	52	Established	speech-corpora-datasets	373	Python
376	mlalma/KokoroTestApp Test application for Kokoro TTS model	52	Established	text-to-speech-tts	35	Swift
377	abus-aikorea/voice-pro Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS,...	52	Established	gradio-tts-webuis	6,366	Python
378	manyeyes/ManySpeech AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment...	52	Established	funasr-speech-recognition	71	C#
379	TuananhCR/Dia-Finetuning-Vietnamese TTS Dia finetuning for Vietnamese	52	Established	tts-model-finetuning	125	Python
380	davidamacey/OpenTranscribe Self-hosted AI-powered transcription platform with speaker diarization,...	52	Established	audio-transcription-apps	32	Python
381	asiff00/On-Device-Speech-to-Speech-Conversational-AI This is an on-CPU real-time conversational system for two-way speech...	52	Established	local-voice-assistants	242	Python
382	pierreaubert/spinorama A library to display and compare spinorama (speakers measurements) graphs.	51	Established	automatic-speech-recognition	151	Python
383	Kyubyong/tacotron A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech...	51	Established	tacotron-tts-models	1,833	Python
384	mallorbc/whisper_mic Project that allows one to use a microphone with OpenAI whisper.	51	Established	speech-to-text-converters	785	Python
385	lkuza2/java-speech-api The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using...	51	Established	java-tts-libraries	545	Java
386	spring-media/TransformerTTS 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...	51	Established	text-to-speech-frameworks	1,161	Python
387	gokhaneraslan/chatterbox-finetuning Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports...	51	Established	self-hosted-tts-servers	84	Python
388	riderodd/react-native-vosk Speech recognition module for react native using Vosk library	51	Established	react-native-voice-libraries	93	Objective-C++
389	ekwek1/soprano Soprano: Instant, Ultra-Realistic Text-to-Speech	51	Established	lightweight-tts-libraries	1,203	Python
390	philipperemy/deep-speaker Deep Speaker: an End-to-End Neural Speaker Embedding System.	51	Established	speaker-diarization-embedding	939	Python
391	drethage/speech-denoising-wavenet A neural network for end-to-end speech denoising	51	Established	audio-noise-reduction	708	Python
392	Devansh-47/Sign-Language-To-Text-and-Speech-Conversion This is a python application which converts american sign language into text...	51	Established	sign-language-recognition	310	Python
393	alexa-pi/AlexaPi Alexa client for all your devices! # No active development. PRs welcome #...	51	Established	voice-assistant-applications	1,331	Python
394	canopyai/Orpheus-TTS Towards Human-Sounding Speech	51	Established	multimodal-vision-language	6,000	Python
395	alumae/kaldi-gstreamer-server Real-time full-duplex speech recognition server, based on the Kaldi toolkit...	51	Established	kaldi-asr-ecosystem	1,092	Python
396	AI4Bharat/Chitralekha Chitralekha - A video transcreation platform for Indic languages, supporting...	51	Established	video-dubbing-tools	113	—
397	speechio/chinese_text_normalization Chinese text normalization for speech processing	51	Established	text-normalization-engines	722	Python
398	MycroftAI/adapt Adapt Intent Parser	51	Established	speech-ai-coursework	722	Python
399	keithito/tacotron A TensorFlow implementation of Google's Tacotron speech synthesis with...	51	Established	text-to-speech-frameworks	2,988	Python
400	jaywalnut310/glow-tts A Generative Flow for Text-to-Speech via Monotonic Alignment Search	51	Established	text-to-speech-frameworks	704	Python
