All Voice AI Tools

6,981 tools ranked by quality score · Page 12 of 70

Showing 1101–1200 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
1101	mastashake08/speech-kit Simplifying the Speech Synthesis and Speech Recognition engines for...	42	Emerging	web-speech-api-libraries	6	JavaScript
1102	ranchlai/mandarin-tts Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 ,...	42	Emerging	fastspeech-tts-models	484	Python
1103	tikhonp/yandex-speechkit-lib-python Python SDK for Yandex Speechkit API.	42	Emerging	yandex-speechkit-tools	54	Python
1104	nipponjo/tts_arabic 🎙️ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format — Python...	42	Emerging	lightweight-tts-runtimes	37	Python
1105	apinge/MeloTTS.cpp A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO,...	42	Emerging	lightweight-tts-runtimes	95	C++
1106	WanderingAstronomer/Vociferous Vociferous captures audio from your microphone, transcribes it in real-time...	42	Emerging	speech-to-text-converters	13	Python
1107	vb000/Waveformer A deep neural network architecture for low-latency audio processing	42	Emerging	audio-noise-reduction	323	Python
1108	skirdey/voicerestore VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration	42	Emerging	audio-classification-transformers	199	Python
1109	atomicoo/PTTS-WebAPP Parallel TTS web demo based on Flask + Vue (Vuetify). 基于 Flask + Vue 的语音合成单网页演示项目。	42	Emerging	web-based-tts-apps	48	Python
1110	fishaudio/docs Official documentation for products, services, and projects by Fish Audio	42	Emerging	openai-tts-applications	3	MDX
1111	SILMA-AI/silma-tts SILMA TTS v1 Official Repo — a Lightweight Open Bilingual Text to Speech Model	42	Emerging	lightweight-tts-libraries	6	Python
1112	tabahi/formantfeatures Extract frequency, power, width and dissonance of formants from wav files	42	Emerging	keyword-speech-recognition	28	Python
1113	aviaryan/voice-writing-electron A real-time, instant dictation desktop application built on Electron that...	42	Emerging	whisper-transcription-apps	59	JavaScript
1114	Gyyyn/OpenWebTTS Open source Speechify alternative. Read PDFs and EPUBs with local models.	42	Emerging	gradio-tts-webuis	40	JavaScript
1115	scart97/thunder-speech A Hackable speech recognition library.	42	Emerging	ctc-asr-implementations	25	Python
1116	CodersCreative/natural-tts A rust crate for easily implementing Text-To-Speech into your rust programs.	42	Emerging	rust-tts-libraries	24	Rust
1117	TigreGotico/phoonnx A Python library for multilingual phonemization and Text-to-Speech (TTS)...	42	Emerging	lightweight-tts-runtimes	20	Python
1118	aahl/qwen-tts2api 🗣️ Qwen TTS to OpenAI Speech API	42	Emerging	qwen3-tts-applications	46	Python
1119	sc0ty/subsync Subtitle Speech Synchronizer	42	Emerging	whisper-subtitle-generation	1,421	C++
1120	showlab/whisperVideo Find out who said what in the video.	42	Emerging	whisper-diarization	138	Jupyter Notebook
1121	Purfview/whisper-standalone-win Whisper & Faster-Whisper standalone executables for those who don't want to...	42	Emerging	speech-to-text-converters	2,921	—
1122	Bebra777228/PolGen-RVC Преобразование голоса на основе VITS. Ориентировано на простоту, качество и...	42	Emerging	voice-cloning-tools	40	Python
1123	nvidia-riva/common Protocol buffers and other common resources.	42	Emerging	voice-ai-sdks	13	Starlark
1124	spotify/basic-pitch-ts A lightweight yet powerful audio-to-MIDI converter with pitch bend detection.	42	Emerging	audio-music-learning	319	TypeScript
1125	jinserk/pytorch-asr ASR with PyTorch	42	Emerging	end-to-end-asr-frameworks	140	Python
1126	lperezmo/real-time-translator A quick app to translate speech in real time using the Whisper API for...	42	Emerging	real-time-voice-translation	43	Python
1127	CiscoDevNet/g2p_seq2seq_pytorch Grapheme to phoneme model for PyTorch	42	Emerging	grapheme-to-phoneme-conversion	43	Python
1128	USStateDept/State-TalentMAP A comprehensive research, bidding, and matching system to match Foreign...	42	Emerging	audio-transcription-apps	33	JavaScript
1129	NateRickard/Xamarin.Cognitive.Speech A client library that makes it easy to work with the Microsoft Cognitive...	42	Emerging	dotnet-tts-libraries	58	C#
1130	SteTR/Emost-Bot Discord Music Bot using Voice Recognition to receive commands.	42	Emerging	discord-tts-bots	36	JavaScript
1131	SlashNephy/SimpleVoiceroid2Proxy VOICEROID 2 を HTTP API で操作できます	42	Emerging	dotnet-tts-libraries	31	C#
1132	rafaballerini/AssistentePessoal Assistente pessoal virtual desenvolvida com Python 🤖	42	Emerging	general-purpose-voice-assistants	412	Python
1133	mailong25/self-supervised-speech-recognition speech to text with self-supervised learning based on wav2vec 2.0 framework	42	Emerging	wav2vec2-asr-models	379	Python
1134	mapbox/mapbox-speech-swift Natural-sounding text-to-speech in Swift or Objective-C on iOS, macOS, tvOS,...	42	Emerging	ios-speech-frameworks	46	Swift
1135	litongjava/whisper-cpp-server whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper...	42	Emerging	whisper-framework-ports	74	HTML
1136	wq2012/SpeakerRecognitionFromScratch Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家	42	Emerging	speaker-diarization-embedding	47	Python
1137	LynxLine/qtspeech QtSpeech is cross-platform library based on Qt to provide common...	42	Emerging	cross-platform-tts-frameworks	47	C++
1138	MyrtleSoftware/deepspeech A PyTorch implementation of DeepSpeech and DeepSpeech2.	42	Emerging	ctc-asr-implementations	50	Python
1139	keonlee9420/Comprehensive-Tacotron2 PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning...	42	Emerging	text-to-speech-frameworks	48	Python
1140	drankush/VoxRad VOXRAD is a voice transcription application for radiologists leveraging...	42	Emerging	audio-transcription-tools	27	Python
1141	overcrash66/OpenTranslator Open Translator: Speech To Speech and Speech to text Translator with voice...	42	Emerging	text-to-speech-tts	14	Python
1142	mobilequickie/AmazonSpeechTranslator End-to-end Solution for Speech Recognition, Text Translation, and...	42	Emerging	ios-speech-frameworks	51	Swift
1143	charlesliucn/awesome-end2end-asr 💬 A list of End-to-End speech recognition, including papers, codes and other...	42	Emerging	end-to-end-asr-frameworks	52	—
1144	keonlee9420/Daft-Exprt PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across...	42	Emerging	fastspeech-tts-models	55	Python
1145	XimilalaXiang/DeLive DeLive is a cross-platform desktop app that captures system audio output and...	42	Emerging	live-caption-generation	22	TypeScript
1146	IBM/BigLittleNet Official repository for Big-Little Net	42	Emerging	speech-ai-coursework	58	Python
1147	spokestack/react-native-spokestack Spokestack: give your React Native app a voice interface!	42	Emerging	react-native-voice-libraries	60	TypeScript
1148	everydaycodings/MimicMania MimicMania is a web application that allows you to generate speech and clone...	42	Emerging	voice-cloning-tools	60	Python
1149	weespin/WillFromAfarDownloader acapellabox pwned.	42	Emerging	dotnet-tts-libraries	106	C#
1150	mush42/optispeech A lightweight end-to-end text-to-speech model	42	Emerging	fastspeech-tts-models	128	Python
1151	fizamusthafa/whisper-app This repository contains a web application for multi-lingual transcription...	42	Emerging	speech-to-text-transcription	32	Python
1152	ActiveNick/Unity-MS-SpeechSDK Sample Unity project used to demonstrate Speech Recognition using the new...	42	Emerging	dotnet-tts-libraries	65	C#
1153	techiaith/pyfestival Amlapiwr Python C ar gyfer hwyluso rhaglennu gyda Festival \| A Python C...	42	Emerging	lightweight-tts-runtimes	10	Python
1154	DanRuta/xVA-Synth Machine learning based speech synthesis Electron app, with voices from...	42	Emerging	text-to-speech-conversion	633	JavaScript
1155	domesticatedviking/TextyMcSpeechy Easily create Piper text-to-speech models in any voice. Make a...	42	Emerging	piper-tts-ecosystem	631	Shell
1156	jackaduma/LAS_Mandarin_PyTorch Listen, attend and spell Model and a Chinese Mandarin Pretrained model ...	42	Emerging	conformer-asr-implementations	123	Python
1157	moshehbenavraham/Voice-Agent-PuPuPlatter Multi-provider voice AI showcase featuring 7 providers (ElevenLabs + Widget,...	42	Emerging	voice-command-assistants	14	TypeScript
1158	NeuralFalconYT/Video-Dubbing Since most video dubbing services are paid, this project explores an...	42	Emerging	video-dubbing-tools	13	Python
1159	patrickmonteiro/quasar-speech-api 🎤 🔉 Projeto de um SPA desenvolvido com Quasar Framework 1.0 + Speech API...	42	Emerging	voice-interactive-games	68	JavaScript
1160	sberdevices/smart_app_framework SmartApp Framework для создания навыков семейства Виртуальных Ассистентов...	42	Emerging	general-purpose-voice-assistants	49	Python
1161	puff-dayo/Kokoro-82M-Android A minimal Android demo app for Kokoro-TTS	42	Emerging	kokoro-tts-ecosystem	49	Kotlin
1162	sksalahuddin2828/AI_Personal_Digital_Assistant AI Personal Voice Assistant Project (Male - Female version)	42	Emerging	voice-assistant-applications	212	Python
1163	voice-engine/make-a-smart-speaker A collection of resources to make a smart speaker	42	Emerging	local-voice-assistants	474	—
1164	astramind-ai/Auralis A Fast TTS Engine	42	Emerging	self-hosted-tts-servers	619	Python
1165	MikeyParton/react-speech-kit React hooks for Speech Recognition and Speech Synthesis	41	Emerging	react-speech-recognition	246	JavaScript
1166	Yuan-ManX/audio-development-tools Audio Development Tools (ADT) is a project for advancing sound, speech, and...	41	Emerging	audio-source-separation	441	—
1167	Pranjalya/tts-tortoise-gradio A Gradio setup for Tortoise TTS.	41	Emerging	gradio-tts-webuis	45	Python
1168	Aivis-Project/AIVM-Generator Aivis Voice Model File (.aivm/.aivmx) Generator / Editor	41	Emerging	openai-tts-applications	15	Vue
1169	Emotional-Text-to-Speech/hmm-for-emo-tts :computer: A repository with comprehensive instructions for using the...	41	Emerging	zero-shot-voice-synthesis	50	CSS
1170	pulijon/Sttcast Transcription from mp3 files to html with or without embedded player	41	Emerging	personal-assistant-rag	25	Jupyter Notebook
1171	rudrankriyam/Glosik Sample project for F5-TTS using MLX Swift	41	Emerging	ios-speech-frameworks	50	Swift
1172	rtzr/Awesome-Korean-Speech-Recognition 한국어 음성인식 STT API 리스트. 각 성능 벤치마크.	41	Emerging	voice-ai-learning-collections	492	—
1173	AkojimaSLP/Beamforming-for-speech-enhancement simple delaysum, MVDR and CGMM-MVDR	41	Emerging	keyword-speech-recognition	279	Python
1174	tuan3w/cnn_vocoder A fast cnn-based vocoder	41	Emerging	neural-vocoder-implementations	78	Python
1175	1neReality/MITSUHA World's First Multilingual Inexpensive Therapeutic Sophisticated...	41	Emerging	gemini-api-applications	272	Python
1176	revdotcom/reverb Open source inference code for Rev's model	41	Emerging	automatic-speech-recognition	435	Python
1177	solaoi/lycoris Real-time speech recognition & AI-powered note-taking app for macOS with...	41	Emerging	local-voice-dictation	73	TypeScript
1178	JoelShine/Jarvis-v2.0 This is a major update of my project JARVIS-The-Ultimate-Project. You can...	41	Emerging	python-voice-assistants	32	Python
1179	TheMorpheus407/OpenAI-Audiobook-Generator This project is a web-based application that converts text into audio,...	41	Emerging	openai-tts-applications	84	JavaScript
1180	ardha27/AI-Waifu-Vtuber AI Vtuber for Streaming on Youtube/Twitch	41	Emerging	interactive-ai-avatars	1,049	Python
1181	pika-online/AESRC2020 a deep accent recognition network	41	Emerging	end-to-end-asr-frameworks	50	Python
1182	1038lab/ComfyUI-SparkTTS ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an...	41	Emerging	comfyui-tts-nodes	124	Python
1183	Edw590/VISOR---A-Voice-Assistant V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!	41	Emerging	voice-assistant-projects	36	Go
1184	lucko515/speech-recognition-neural-network This is the end-to-end Speech Recognition neural network, deployed in Keras....	41	Emerging	speaker-diarization-embedding	190	HTML
1185	hhguo/MSMC-TTS Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS	41	Emerging	text-to-speech-frameworks	169	Python
1186	OpenMOSS/MOSS-Audio-Tokenizer MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on...	41	Emerging	voice-ai-learning-collections	162	Python
1187	jianchang512/fireredasr-ui 一个中文语音转文字项目，封装自FireRedASR	41	Emerging	funasr-speech-recognition	85	Python
1188	tihu-nlp/tihu Persian Text-To-Speech	41	Emerging	persian-speech-ai	85	C++
1189	FontaineRiant/wrAIter AI writing assistant with voiced narrator and characters and an illustrator	41	Emerging	text-to-speech-tts	38	Python
1190	WangHelin1997/SSR-Speech SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis	41	Emerging	zero-shot-voice-synthesis	147	Python
1191	cameronking4/VapiBlocks Vapi Blocks is a library of components & api snips to copy and paste into...	41	Emerging	voice-command-assistants	83	TypeScript
1192	shenbengit/TTSTool 科大讯飞离线语音，Text to Speech，TTS	41	Emerging	android-speech-apps	36	Kotlin
1193	alan890104/sumi Sumi — Free, open-source voice dictation for macOS. Local-first Whisper +...	41	Emerging	audio-transcription-tools	9	Rust
1194	zeropointnine/tts-audiobook-tool Audiobook creation tool with support for multiple TTS models (Qwen3-TTS,...	41	Emerging	ebook-to-audiobook-conversion	81	Python
1195	kokimame/joytan Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel	41	Emerging	ebook-to-audiobook-conversion	139	Python
1196	georgezhao2010/apple_airplayer Make your AirPlay devices as TTS speakers	41	Emerging	home-assistant-tts	136	Python
1197	pth2000/PowerPointReviewer 一个基于PySide6实现的演讲稿朗读审阅工具，使用TTS引擎朗读PPT中的备注部分，从而辅助您进一步完善演讲的内容与措辞，助您顺利完成精彩的PPT演讲与展示。	41	Emerging	lightweight-tts-libraries	17	Python
1198	ssssssilver/sherpa-ncnn-unity 在Unity环境下，借助sherpa-ncnn框架，实现实时并准确的中英双语语音识别功能。	41	Emerging	dotnet-tts-libraries	77	C#
1199	TETYYS/SAPI4 Web interface for Microsoft Sam & friends	41	Emerging	dotnet-tts-libraries	131	C++
1200	MainRo/docker-deepspeech-server A dockerfile to run deepspeech-server	41	Emerging	parakeet-asr-implementations	30	Dockerfile

« Prev 1 2 3 … 10 11 12 13 14 … 68 69 70 Next »