All Voice AI Tools

6,981 tools ranked by quality score · Page 13 of 70

Showing 1201–1300 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
1201	alias454/YATSEE YATSEE - Yet Another Tool for Speech Extraction & Enrichment	41	Emerging	personal-assistant-rag	31	Python
1202	modelscope/FunCodec FunCodec is a research-oriented toolkit for audio quantization and...	41	Emerging	neural-vocoder-implementations	442	Python
1203	CoffeeVampir3/audiocraft-webui Quick webui for audiocraft	41	Emerging	audio-music-learning	169	Python
1204	HenestrosaDev/audiotext A desktop application that transcribes audio from files, microphone input or...	41	Emerging	real-time-voice-translation	345	Python
1205	kahne/SpeechTransProgress Tracking the progress in end-to-end speech translation	41	Emerging	audio-transcription-apps	261	—
1206	TeamAudio/reaspeech Speech recognition for REAPER	41	Emerging	audio-transcription-tools	36	Lua
1207	italankin/samplevoicebot TTS Telegram bot	41	Emerging	telegram-voice-transcription	7	Python
1208	smaranjitghose/AIAudioTranscriber A minimalistic web app to generate transciption for audio built using Python	41	Emerging	real-time-voice-translation	31	Python
1209	arpy8/ESP32_Voice_Assistant This project combines embedded system and AI inference to create an...	41	Emerging	voice-assistant-devices	39	Python
1210	MHaggis/ASRGEN ASR Configurator, Essentials and Atomic Testing	41	Emerging	automatic-speech-recognition	104	Python
1211	finos/greenkey-asrtoolkit A collection of useful tools for handling speech recognition data	41	Emerging	automatic-speech-recognition	30	Python
1212	serpapps/ai-voice-cloner AI Voice Cloning Desktop Application that runs locally on your computer and...	41	Emerging	voice-cloning-tools	55	—
1213	JosefAlbers/e2tts-mlx Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX	41	Emerging	zero-shot-voice-synthesis	29	Python
1214	benmaster82/writher Voice-powered productivity for Windows	41	Emerging	audio-transcription-tools	11	Python
1215	rishikksh20/gmvae_tacotron Gaussian Mixture VAE Tacotron	41	Emerging	tacotron-tts-models	54	Python
1216	r0227n/flutter_whisper_kit 🎤 A Flutter plugin for running WhisperKit speech-to-text models on-device,...	41	Emerging	whisper-speech-transcription	13	Dart
1217	jbelford/Eolian Eolian is a Discord music bot which provide a very powerful API for queuing...	41	Emerging	discord-tts-bots	23	TypeScript
1218	AIFSH/ComfyUI-FishSpeech a custom comfyui node for fish-speech	41	Emerging	comfyui-tts-nodes	49	Python
1219	sotelo/parrot RNN-based generative models for speech.	41	Emerging	next-word-prediction	609	Python
1220	rainygirl/rspeaker 말귀를 알아듣고 뉴스도 요약해 읽어줍니다	41	Emerging	news-audio-bulletins	26	Python
1221	petewarden/spchcat Speech recognition tool to convert audio to text transcripts, for Linux and...	41	Emerging	automatic-speech-recognition	482	C
1222	Kaljurand/Arvutaja An Android app for voice actions in Estonian and English	41	Emerging	android-speech-apps	30	Java
1223	spokestack/spokestack-ios Spokestack: give your iOS app a voice interface!	41	Emerging	ios-speech-frameworks	45	Swift
1224	gdoudeng/react-native-baidu-asr The react-native Baidu voice library provides voice recognition, voice...	41	Emerging	react-native-voice-libraries	34	Java
1225	MerlinCN/kinoko7danmaku 调用TTS来播报哔哩哔哩直播中的弹幕、礼物、舰长等	41	Emerging	gradio-tts-webuis	24	Python
1226	mobassir94/comprehensive-bangla-tts Aiming to achieve ultimate Multilingual TTS pipeline with main focus on...	41	Emerging	tts-model-finetuning	43	Jupyter Notebook
1227	botbahlul/VOSK-Powered-Live-Subtitle-V3 ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using free...	41	Emerging	live-caption-generation	42	Java
1228	declare-lab/speech-adapters Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient...	41	Emerging	end-to-end-asr-frameworks	42	Python
1229	mapluisch/OpenAI-Realtime-API-for-Unity Implementation of OpenAI's Realtime API in Unity. Easily integrate...	41	Emerging	ai-avatar-platforms	31	ShaderLab
1230	solyarisoftware/CoquiSTTJs Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.	41	Emerging	coqui-tts-applications	30	JavaScript
1231	cherts/mspeech Program for speech recognition using the Google Speech API, voice commands,...	41	Emerging	dotnet-tts-libraries	38	Pascal
1232	Andrewcpu/elevenlabs-api 🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs...	41	Emerging	elevenlabs-integrations	38	Java
1233	Frikallo/parakeet.cpp Ultra fast and portable Parakeet implementation for on-device inference in...	41	Emerging	parakeet-asr-implementations	244	C++
1234	Saganaki22/ComfyUI-Step_Audio_EditX_TTS ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice...	41	Emerging	text-to-speech-tts	57	Python
1235	Kini218/speech-to-text Speech to text script on python	41	Emerging	speech-recognition-apis	35	Python
1236	yeahhe365/PageTalk 一个简洁且优秀的描述是：这是一款在任何网页上实现无缝语音转文字的 Chrome 扩展，使用先进的 ASR API。	41	Emerging	browser-tts-extensions	37	JavaScript
1237	Edw590/VISOR---Android-Version-Assistant V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!	41	Emerging	voice-assistant-projects	35	Java
1238	AryanVBW/AiVoiceClonerPRO Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into...	41	Emerging	voice-cloning-synthesis	72	Python
1239	DeeepMaker/subtitle-to-audio A python script to generate .wav audio files for .srt subtitle files	41	Emerging	whisper-subtitle-generation	34	Python
1240	yl4579/HiFTNet HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter...	41	Emerging	text-to-speech-frameworks	247	Python
1241	itsRares/react-native-deepgram Brings Deepgram's capabilities to React Native applications, with a focus on...	41	Emerging	deepgram-starter-projects	6	TypeScript
1242	huckiyang/Voice2Series-Reprogramming ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time...	41	Emerging	text-to-speech-frameworks	73	TypeScript
1243	gladiaio/normalization A lightweight library for normalizing speech transcripts before computing WER	41	Emerging	text-normalization-engines	10	Python
1244	Spac5y/Vocal-Agent A cutting-edge Cascading voice assistant combining real-time speech...	41	Emerging	conversational-chatbot-applications	10	Python
1245	nodef/extra-amazontts Generate speech audio from super long text through machine (via "Amazon...	41	Emerging	aws-polly-tts	5	JavaScript
1246	advanced-media-inc/amivoice-api-client-library AmiVoice API Client Library and the sample programs	41	Emerging	web-speech-api-libraries	15	JavaScript
1247	OvidijusParsiunas/speech-to-element A simple way to add speech to text functionality to your website :microphone:	41	Emerging	web-speech-api-libraries	20	TypeScript
1248	cboard-org/ccboard Cordova wrapper for the Cboard application	41	Emerging	react-native-voice-libraries	5	Shell
1249	ae9is/subtitle-chan Live speech transcription and translation in your browser	41	Emerging	live-meeting-translation	14	TypeScript
1250	botbahlul/pyvosklivesubtitle PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23...	41	Emerging	live-caption-generation	29	Python
1251	holm-aune-bachelor2018/ctc Speech recognition with CTC in Keras with Tensorflow backend	41	Emerging	ctc-asr-implementations	31	Python
1252	nl8590687/ASRT_SDK_Python3 ASRT语音识别系统的Python版SDK	41	Emerging	voice-ai-sdks	54	Python
1253	kapi2800/qwen3-tts-apple-silicon Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning,...	41	Emerging	qwen3-tts-applications	396	Python
1254	TUD-STKS/VocalTractLabBackend-dev The VocalTractLab backend sources and C/C++ API	41	Emerging	lightweight-tts-runtimes	17	C++
1255	jing332/tts-server-go 微软TTS服务转发，以便在阅读APP中通过网络导入方式收听微软TTS / Edge大声朗读	41	Emerging	edge-tts-implementations	411	Go
1256	mattmireles/kokoro-coreml PyTorch → CoreML conversion pipeline for Kokoro TTS. Unlocks fast on-device...	41	Emerging	kokoro-tts-ecosystem	32	Python
1257	Renovamen/Speech-and-Text Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech...	41	Emerging	speech-recognition-apis	341	Python
1258	alexiokay/AriLink Modern ARI-STASI server, built on Asterisk ARI with real-time speech-to-text...	41	Emerging	ai-tutoring-platforms	10	TypeScript
1259	kristofferv98/VoiceProcessingToolkit The VoiceProcessingToolkit is an all-encompassing suite designed for...	40	Emerging	coqui-tts-applications	4	Python
1260	n0name45/node-red-contrib-yandex-station-management Модуль node-red-contrib-yandex-station-management для управления умными...	40	Emerging	yandex-speechkit-tools	29	JavaScript
1261	FedericaPaoli1/stm32-speech-recognition-and-traduction stm32-speech-recognition-and-traduction is a project developed for the...	40	Emerging	wake-word-detection	39	C
1262	kaituoxu/Listen-Attend-Spell A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End...	40	Emerging	conformer-asr-implementations	207	Python
1263	GlobalTechInfo/gspeak Google Text to Speech for Node.js — modern, typed, zero deprecated dependencies.	40	Emerging	google-tts-libraries	1	TypeScript
1264	bnsantoso/sub-to-audio Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS...	40	Emerging	whisper-subtitle-generation	121	Python
1265	satyam9090/Automatic-Indian-Sign-Language-Translator-ISL I created an application which takes in live speech or audio recording as...	40	Emerging	sign-language-translation	131	Python
1266	awslabs/speech-representations Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)	40	Emerging	end-to-end-asr-frameworks	104	Python
1267	fcjr/ltts Quick CLI for local text-to-speech using Qwen3-TTS or Kokoro TTS.	40	Emerging	qwen3-tts-applications	8	Python
1268	holgern/kokorog2p A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS.	40	Emerging	grapheme-to-phoneme-conversion	3	Python
1269	kosich/rxjs-tts RxJS wrapper for Text-to-Speech Web API	40	Emerging	web-speech-api-tts	9	TypeScript
1270	zh217/torch-asg Auto Segmentation Criterion (ASG) implemented in pytorch	40	Emerging	end-to-end-asr-frameworks	51	C++
1271	Nighthawk42/mOrpheus Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.	40	Emerging	audio-transcription-tools	84	Python
1272	ritazh/EchoML 🔉 A web app to play, visualize, and annotate your audio files for machine learning	40	Emerging	audio-music-learning	120	JavaScript
1273	deepgram-starters/flask-text-to-speech Get started using Deepgram's Text-to-Speech with this Flask demo app	40	Emerging	deepgram-starter-projects	15	Python
1274	DangerDaza/Dooms-Enhancement-Suite An immersive RPG enhancement extension for SillyTavern — character tracking,...	40	Emerging	browser-tts-extensions	10	JavaScript
1275	lokkelvin2/tacotron2-tts-GUI Text To Speech (TTS) GUI wrapper for NVIDIA Tacotron 2+Waveglow. For custom...	40	Emerging	tacotron-tts-models	37	Python
1276	shahules786/mayavoz Pytorch based speech enhancement toolkit.	40	Emerging	speaker-diarization-embedding	336	Python
1277	tianbot/rosecho Tianbot Rosecho (Tianecho)，中文语音人机交互模块，支持ROS即插即用	40	Emerging	voice-controlled-robotics	36	C
1278	weimeng23/speech-recognition-learning-resources :white_check_mark: A list of speech recognition learning resources including...	40	Emerging	speaker-diarization-embedding	68	—
1279	nikhilunni/demucs-rs Rust powered waveform source separation	40	Emerging	audio-source-separation	82	Rust
1280	deepgram-starters/flask-voice-agent Flask WebSocket proxy for Deepgram's Voice Agent API	40	Emerging	deepgram-starter-projects	8	Python
1281	smx-smx/KodiSharp Use Kodi python APIs in C#, and write rich addons using the .NET framework/Mono	40	Emerging	dotnet-tts-libraries	31	C#
1282	PhilippeRo/IBus-Speech-To-Text A speech to text IBus engine using VOSK	40	Emerging	vosk-asr-implementations	36	Python
1283	sl5net/SL5-aura-service Your offline, privacy-first voice assistant framework. Transform speech into...	40	Emerging	general-purpose-voice-assistants	9	Python
1284	maum-ai/wavegrad2 Unofficial Pytorch Implementation of WaveGrad2	40	Emerging	audio-noise-reduction	112	Jupyter Notebook
1285	oscie57/tiktok-voice Simple Python script to interact with the TikTok TTS API	40	Emerging	telegram-voice-transcription	599	Python
1286	EndlessReform/fish-speech.rs A Fish Speech implementation in Rust, with Candle.rs	40	Emerging	rust-tts-libraries	110	Rust
1287	qforge-dev/qspeak qSpeak is a powerful voice transcription and AI assistant tool that helps...	40	Emerging	ai-note-taking-apps	62	TypeScript
1288	robmsmt/ASR-Audio-Data-Links A list of publically available audio data that anyone can download for ASR...	40	Emerging	speech-corpora-datasets	231	Shell
1289	loretoparisi/wave2vec-recognize-docker Wave2vec 2.0 Recognize pipeline	40	Emerging	wav2vec2-asr-models	33	Python
1290	FaceOnLive/Spleeter-Android-iOS On-device, Offline Spleeter Solution For Mobile	40	Emerging	audio-source-separation	224	Java
1291	Igorcbraz/Calculadora 📐 Calculadora simples e intuitiva com suporte a comandos de voz e temas...	40	Emerging	voice-controlled-calculators	33	JavaScript
1292	definitio/ha-rhvoice Home Assistant integration for RHVoice - a local text-to-speech engine.	40	Emerging	home-assistant-tts	52	Python
1293	DrewThomasson/ebook2audiobookpiper-tts Converts ebooks into audiobooks with piper-tts	40	Emerging	ebook-to-audiobook-conversion	102	Jupyter Notebook
1294	tugstugi/mongolian-speech-recognition Mongolian speech recognition with PyTorch	40	Emerging	end-to-end-asr-frameworks	138	Python
1295	carleeno/elevenlabs_tts Custom TTS Integration using ElevenLabs API	40	Emerging	elevenlabs-integrations	99	Python
1296	sunshine0523/MNNServer A third-party MNN server supporting external calls, embedding model, TTS...	40	Emerging	llm-docker-deployments	149	C++
1297	keonlee9420/Robust_Fine_Grained_Prosody_Control PyTorch Implementation of Robust and fine-grained prosody control of...	40	Emerging	zero-shot-voice-synthesis	41	Python
1298	QiBowen2008/SuperTextToolBox 一个免费的文字处理工具箱	40	Emerging	dotnet-tts-libraries	57	Rich Text Format
1299	coqui-ai/STT-models Open models for Coqui STT	40	Emerging	voice-cloning-synthesis	152	—
1300	jing332/tts-server-android 这是一个Android系统TTS应用，内置微软演示接口，可自定义HTTP请求，可导入其他本地TTS引擎，以及根据中文双引号的简单旁白/对话识别朗读...	40	Emerging	java-tts-libraries	4,315	Kotlin

« Prev 1 2 3 … 11 12 13 14 15 … 68 69 70 Next »