All Voice AI Tools

6,981 tools ranked by quality score · Page 9 of 70

Showing 801–900 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
801	yc9701/pansori Tools for ASR Corpus Generation from Online Video	46	Emerging	speech-corpora-datasets	140	Python
802	funcwj/aps A personal toolkit for single/multi-channel speech recognition & enhancement...	46	Emerging	automatic-speech-recognition	145	Python
803	microsoft/SpeechT5 Unified-Modal Speech-Text Pre-Training for Spoken Language Processing	46	Emerging	voice-ai-learning-collections	1,435	Python
804	gfdb/wav2aug A general purpose task-agnostic speech augmentation policy	46	Emerging	speaker-diarization-embedding	16	Python
805	FireRedTeam/FireRedTTS An Open-Sourced LLM-empowered Foundation TTS System	46	Emerging	zero-shot-voice-synthesis	905	Python
806	silversparro/wav2letter.pytorch A fully convolution-network for speech-to-text, built on pytorch.	45	Emerging	wav2vec2-asr-models	126	Python
807	Edresson/YourTTS YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion...	45	Emerging	zero-shot-voice-synthesis	1,052	Jupyter Notebook
808	lucadellalib/focalcodec A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation	45	Emerging	audio-noise-reduction	152	Jupyter Notebook
809	ceuk/speech-recognition-aws-polyfill Polyfill for the SpeechRecognition browser API using AWS Transcribe as a fallback	45	Emerging	web-speech-api-libraries	13	TypeScript
810	Kyubyong/cross_vc Cross-lingual Voice Conversion	45	Emerging	zero-shot-voice-synthesis	97	Python
811	KevinMIN95/StyleSpeech Official implementation of Meta-StyleSpeech and StyleSpeech	45	Emerging	fastspeech-tts-models	252	Python
812	CSTR-Edinburgh/magphase MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.	45	Emerging	fastspeech-tts-models	80	Python
813	baidubce/pie 百度云流式语音识别客户端 SDK	45	Emerging	java-tts-libraries	80	Java
814	Elleo/pied Pied makes it simple to install and manage text-to-speech Piper voices for...	45	Emerging	piper-tts-ecosystem	258	Dart
815	AIGC-Audio/AudioGPT AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head	45	Emerging	voice-chatgpt-interfaces	10,210	Python
816	nipponjo/tts-arabic-pytorch 🎙️ Arabic TTS models (Tacotron2, FastPitch)	45	Emerging	text-to-speech-frameworks	137	Jupyter Notebook
817	OpenVoiceOS/ovos-tts-plugin-cotovia galician tts plugin for OVOS	45	Emerging	espeak-ng-ecosystem	3	Python
818	MysteryPancake/Discord-TTS Text to speech Discord bot using FakeYou	45	Emerging	discord-tts-bots	41	JavaScript
819	chaiyujin/dctts-pytorch The pytorch implementation of DC-TTS	45	Emerging	tacotron-tts-models	76	Python
820	rishikksh20/vae_tacotron2 VAE Tacotron 2, an alternative of GST Tacotron	45	Emerging	tacotron-tts-models	90	Python
821	RapidAI/RapidASR 📣 商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR...	45	Emerging	funasr-speech-recognition	602	C++
822	Cay-Zhang/SwiftSpeech A speech recognition framework designed for SwiftUI.	45	Emerging	ios-speech-frameworks	527	Swift
823	rioharper/VocalForge Your one-stop solution for voice dataset creation	45	Emerging	audio-transcription-apps	130	Python
824	Voine/Bert-VITS2-MNN TTS System Bert-VITS2 Android Ver, powered by alibaba-MNN engine.	45	Emerging	vits-tts-implementations	129	Kotlin
825	ubisoft/ubisoft-laforge-daft-exprt Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis	45	Emerging	zero-shot-voice-synthesis	129	Python
826	zhao-kun/VibeVoiceFusion VibeVoiceFusion is a full-stack, multi-speaker voice generation web system...	45	Emerging	qwen3-tts-applications	453	Python
827	Picovoice/falcon On-device speaker diarization powered by deep learning	45	Emerging	speaker-diarization-embedding	69	Python
828	benjaminwan/ChineseTtsTflite Android Chinese TTS Engine Base On Tensorflow TTS , use for TfLite Models...	45	Emerging	lightweight-tts-runtimes	393	Java
829	danthelion/doc2audiobook Convert text documents to high fidelity audio(books).	45	Emerging	pdf-to-audio-conversion	204	Python
830	Niger-Volta-LTI/yoruba-text Yorùbá language training text for NLP, ASR and TTS tasks	45	Emerging	speech-corpora-datasets	82	Python
831	Oknolaz/vasisualy Vasisualy it's a simple Russian-language voice assistant written on Python...	45	Emerging	general-purpose-voice-assistants	68	Python
832	BernieTv/ElevenLabs-Clone A self-hosted ElevenLabs clone for text-to-speech, voice conversion, and AI...	45	Emerging	elevenlabs-integrations	66	Python
833	mush42/sonata-nvda This add-on implements a speech synthesizer driver for NVDA using neural TTS...	45	Emerging	piper-tts-ecosystem	67	Python
834	h5p/h5p-speak-the-words Create questions answered through speech	45	Emerging	web-speech-api-libraries	9	JavaScript
835	fewieden/MMM-voice Offline Voice Recognition Module for MagicMirror²	45	Emerging	vosk-asr-implementations	80	JavaScript
836	NATSpeech/NATSpeech A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official...	45	Emerging	fastspeech-tts-models	1,006	Python
837	daniilrobnikov/vits2 VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with...	45	Emerging	text-to-speech-frameworks	634	Jupyter Notebook
838	OAID/cortex-m-kws Cortex M KWS example with Tengine Lite.	45	Emerging	wake-word-detection	74	C
839	HAKORADev/VODER Voice Operation and Design Engine with Reproduction capabilities	45	Emerging	neural-vocoder-implementations	116	Python
840	by2101/OpenASR A pytorch based end2end speech recognition system.	45	Emerging	end-to-end-asr-frameworks	114	Python
841	huggingface/distil-whisper Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller,...	45	Emerging	whisper-fine-tuning	4,056	Python
842	rishikksh20/Fre-GAN-pytorch Fre-GAN: Adversarial Frequency-consistent Audio Synthesis	45	Emerging	neural-vocoder-implementations	111	Python
843	ccoreilly/LocalSTT Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech	45	Emerging	android-speech-apps	109	Java
844	synesthesiam/rhasspy Rhasspy voice assistant for offline home automation	45	Emerging	general-purpose-voice-assistants	952	HTML
845	baizeteam/baize-toolbox 白泽工具箱，基于electron+ffmpeg实现的一款功能强大的多媒体工具	45	Emerging	google-tts-libraries	110	TypeScript
846	PrzemyslawSwiderski/python-gradle-plugin Gradle plugin to run Python projects.	45	Emerging	voice-ai-learning-collections	22	Kotlin
847	chrisjp/tts A simple tool to demo text-to-speech using various services' voices. HTML5...	45	Emerging	twitch-chat-tts	107	PHP
848	siva-sub/NekoSpeak Private, offline AI Text-to-Speech for Android with Kokoro, KittenTTS,...	45	Emerging	kokoro-tts-ecosystem	43	Kotlin
849	ArdaGnsrn/elevenlabs-laravel This is an Open Source PHP Laravel package for ElevenLabs Text to Speech API.	45	Emerging	elevenlabs-integrations	21	PHP
850	Kaljurand/Inimesed An Android app that lets you search your contacts by voice. Internet not...	45	Emerging	android-speech-apps	61	Java
851	deepgram-devs/nextjs-text-to-speech Get started using Deepgram's Text-to-Speech with this Next.js demo app	45	Emerging	deepgram-starter-projects	24	TypeScript
852	Mobile-Artificial-Intelligence/babylon Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and...	45	Emerging	voice-cloning-synthesis	30	Python
853	areebbeigh/winspeech Speech recognition and synthesis library for Windows - Python 2 and 3.	45	Emerging	lightweight-tts-libraries	12	Python
854	nl8590687/ASRT_SDK_WinClient An Windows client SDK and Demo software for ASRT speech recognition system....	45	Emerging	java-tts-libraries	71	C#
855	daanzu/deepspeech-websocket-server Server & client for DeepSpeech using WebSockets for real-time speech...	45	Emerging	parakeet-asr-implementations	103	Python
856	spring-media/DeepPhonemizer Grapheme to phoneme conversion with deep learning.	45	Emerging	text-to-speech-frameworks	421	Python
857	Tinkoff/voicekit-examples Examples on how to use Tinkoff Voicekit	45	Emerging	yandex-speechkit-tools	57	C#
858	skit-ai/kaldi-serve Server framework for Kaldi ASR Toolkit	45	Emerging	kaldi-asr-ecosystem	99	C++
859	juliuskunze/speechless Speech-to-text based on wav2letter built for transfer learning	45	Emerging	wav2vec2-asr-models	98	Python
860	RaduBolbo/F5-TTS-Emotional-CFG Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class...	45	Emerging	zero-shot-voice-synthesis	30	Python
861	trldvix/youtube-transcript-api Java library which allows you to retrieve subtitles/transcripts for a single...	45	Emerging	video-transcription-extraction	37	Java
862	Amirrezahmi/SelfTalker Engage in conversation with your virtual self using AI techniques like NLP,...	45	Emerging	ai-chatbot-interfaces	85	Jupyter Notebook
863	GetcharZp/go-speech go-speech 基于 Golang + ONNX 构建的轻量语音库，支持 TTS（文本转语音）与 ASR（语音转文字）。已集成...	45	Emerging	go-tts-libraries	46	Go
864	mlalma/MisakiSwift Swift port of Misaki G2P (grapheme-to-phoneme) library that can be used e.g....	45	Emerging	text-to-speech-tts	20	Swift
865	createcandle/voco Privacy friendly voice control for the Candle Controller / WebThings...	45	Emerging	voice-assistant-frameworks	29	Python
866	IBM/speech-to-text-code-pattern WARNING: This repository is no longer maintained	45	Emerging	google-tts-libraries	46	JavaScript
867	rhasspy/rhasspy Offline private voice assistant for many human languages	45	Emerging	general-purpose-voice-assistants	2,725	Shell
868	yukukotani/pi-voice Headless voice interface for the Pi Coding Agent	45	Emerging	voice-controlled-robotics	46	TypeScript
869	vineeths96/Spoken-Keyword-Spotting In this repository, we explore using a hybrid system consisting of a...	45	Emerging	wake-word-detection	107	Python
870	maum-ai/univnet Unofficial PyTorch Implementation of UnivNet Vocoder...	45	Emerging	text-to-speech-frameworks	282	Python
871	theblackcat102/edgedict Working online speech recognition based on RNN Transducer. ( Trained model...	45	Emerging	end-to-end-asr-frameworks	292	Python
872	nnsvs/nnsvs Neural network-based singing voice synthesis library for research	45	Emerging	text-to-speech-frameworks	742	Python
873	Camb-ai/MARS5-TTS MARS5 speech model (TTS) from CAMB.AI	45	Emerging	voice-cloning-tools	2,814	Jupyter Notebook
874	gabriele-mastrapasqua/qwen3-tts Pure C inference engine for Qwen3-TTS text-to-speech. No Python, no PyTorch...	45	Emerging	qwen3-tts-applications	25	C
875	PraaneshSelvaraj/speech_engine Speech Engine is a Python package that provides a simple interface for...	45	Emerging	lightweight-tts-libraries	3	Python
876	WindQAQ/listen-attend-and-spell Tensorflow implementation of "Listen, Attend and Spell" authored by William...	45	Emerging	conformer-asr-implementations	89	Python
877	livingingroups/animal2vec animal2vec: A self-supervised transformer for rare-event raw audio input	45	Emerging	bioacoustic-species-classification	30	Python
878	yaph/tts-samples This repository provides text-to-speech (TTS) audio samples in MP3 format...	45	Emerging	edge-tts-implementations	45	Python
879	gsssrao/UnityAndroidSpeechRecognition This repository is a Unity plugin for Android Speech Recognition (based on...	45	Emerging	dotnet-tts-libraries	85	Java
880	rhasspy/piper A fast, local neural text to speech system	45	Emerging	piper-tts-ecosystem	10,694	C++
881	Kaljurand/speechutils Android library for speech-to-text and text-to-speech apps	45	Emerging	android-speech-apps	90	Java
882	see2023/Bert-VITS2-ext 基于Bert-VITS2做的表情、动画测试. Animation testing based on Bert-VITS2.	45	Emerging	vits-tts-implementations	539	Python
883	AlexandaJerry/whisper-vits-japanese Vits Japanese with Whisper as data processor (you can train your VITS even...	45	Emerging	vits-tts-implementations	162	Jupyter Notebook
884	georgesterpu/avsr-tf1 Audio-Visual Speech Recognition using Sequence to Sequence Models	45	Emerging	ctc-asr-implementations	83	Python
885	shhossain/BanglaTTS BanglaTTS is a text-to-speech (TTS) system for Bangla language that works in...	45	Emerging	tts-model-finetuning	23	Python
886	szimek/webrtc-translate Highly experimental (read: "barely working") app that uses WebRTC API and...	45	Emerging	live-meeting-translation	75	JavaScript
887	hash2430/pitchtron TTS for pitch-accented language. Korean dialect DB.	45	Emerging	tacotron-tts-models	157	Python
888	sanchit-gandhi/whisper-jax JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.	45	Emerging	whisper-transcription-apps	4,690	Jupyter Notebook
889	Onuronon-lab/Shrutik Open-source voice data collection platform for building inclusive voice...	44	Emerging	audio-transcription-apps	11	Python
890	i4Ds/whisper-finetune This repository contains code for fine-tuning the Whisper speech-to-text model.	44	Emerging	speech-to-text-transcription	22	Jupyter Notebook
891	mozhou-tech/kim-voice-assistant Kim，your personal voice kit for Home Inteligence.	44	Emerging	general-purpose-voice-assistants	80	Python
892	algolia/voice-overlay-android 🗣 An overlay that gets your user’s voice permission and input as text in a...	44	Emerging	android-speech-apps	263	Kotlin
893	Candida18/Virtual-Assistance-For-The-Blind The proposed Voice-based Email System uses AI (voice commands) that will...	44	Emerging	voice-controlled-robotics	40	Python
894	r9y9/ttslearn ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)	44	Emerging	text-to-speech-frameworks	267	Jupyter Notebook
895	inclusionAI/Ming-UniAudio Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing...	44	Emerging	voice-ai-learning-collections	435	Python
896	resemble-ai/resemble-alexa This is sample code for an Alexa skill that uses realistic voice cloning...	44	Emerging	voice-cloning-synthesis	87	Python
897	maum-ai/assem-vc Official Code for Assem-VC @ICASSP2022	44	Emerging	text-to-speech-frameworks	269	Jupyter Notebook
898	CheshireCC/faster-whisper-GUI faster_whisper GUI with PySide6	44	Emerging	speech-to-text-converters	2,911	Python
899	markomijic/TTS-Mod-Vault Cross-platform Tabletop Simulator mod backup & download tool — the modern...	44	Emerging	dotnet-tts-libraries	55	Dart
900	apluka34/Bud500 Bud500: A Comprehensive Vietnamese ASR Dataset	44	Emerging	multilingual-speech-datasets	69	—

« Prev 1 2 3 … 7 8 9 10 11 … 68 69 70 Next »