All Voice AI Tools

6,981 tools ranked by quality score · Page 16 of 70

Showing 1501–1600 of 6,981


#	Tool	Score	Tier	Category	Stars	Language
1501	holgern/pykokoro A Python library for Kokoro TTS (Text-to-Speech) using ONNX runtime.	38	Emerging	kokoro-tts-ecosystem	2	Python
1502	wannaphong/KhanomTan-TTS-v1.0 KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that...	38	Emerging	lightweight-tts-runtimes	43	Python
1503	deepgram-starters/django-transcription Get started using Deepgram's Transcription with this Django demo app	38	Emerging	deepgram-starter-projects	7	Python
1504	p-groarke/wsay Windows "say"	38	Emerging	system-tts-wrappers	170	C++
1505	ibotplus/kbase-media Video, audio, and image content recognition, speech transcription, and speech synthesis / easy convert video audio image to text, and revert...	38	Emerging	java-tts-libraries	24	Java
1506	tochilkinva/tg_bot_stt_tts Telegram bot with voice message recognition and generation. Speech to Text...	38	Emerging	telegram-voice-transcription	68	Python
1507	wdbm/deep_throat speech synthesis program	38	Emerging	lightweight-tts-libraries	21	Python
1508	wxkingstar/TransEcho Real-time simultaneous interpretation for macOS: captures system audio and provides live translated subtitles plus spoken interpretation	38	Emerging	local-voice-dictation	4	Rust
1509	inevolin/DiscordSpeechBot A speech-to-text bot for Discord with music commands and more using NodeJS....	38	Emerging	discord-tts-bots	20	JavaScript
1510	CarrotYuan/openclaw-voice-control A macOS local voice-control companion for OpenClaw with Siri-like wakeword...	38	Emerging	openclaw-voice-assistants	4	Python
1511	aeleraqi/Text-to-Speech-gTTS---Arabic-text Google Text-to-Speech API to convert text input into audio files	38	Emerging	lightweight-tts-libraries	3	Jupyter Notebook
1512	34j/mecab-text-cleaner Simple Python package (CLI/Python API) for getting Japanese readings...	38	Emerging	text-normalization-engines	7	Python
1513	sciforce/phones-las Articulatory features estimation using Listen Attend and Spell architecture.	38	Emerging	conformer-asr-implementations	33	Python
1514	manhph2211/ViSR This repo builds an end-to-end deep learning application that supports...	38	Emerging	end-to-end-asr-frameworks	38	Jupyter Notebook
1515	Troyanovsky/awesome-TTS-Colab Collection of awesome TTS and voice cloning models to run with Google Colab	38	Emerging	tts-model-finetuning	54	Jupyter Notebook
1516	Kyubyong/specAugment Tensor2tensor experiment with SpecAugment	38	Emerging	tacotron-tts-models	46	Python
1517	shijincai/VibeVoice Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup...	38	Emerging	qwen3-tts-applications	27	Python
1518	BlinkTagInc/gtfs-tts Review GTFS stop pronunciations to determine which stops need a tts_stop_name value.	38	Emerging	google-tts-libraries	5	TypeScript
1519	Dostoyewski/django_voice_bot Package for django onpage support bot with speech recognition and voice commands	38	Emerging	voice-chatbot-applications	4	Python
1520	falabrasil/kaldi-br ☕🇧🇷 Kaldi scripts for Brazilian Portuguese	38	Emerging	kaldi-asr-ecosystem	58	Shell
1521	ng-web-apis/speech A library for using Web Speech API with Angular	38	Emerging	web-speech-api-libraries	33	TypeScript
1522	IceFog72/pocket-tts-openapi Fast, local, OpenAI-compatible TTS server with voice cloning support powered...	38	Emerging	self-hosted-tts-servers	10	Python
1523	linagora-labs/ssak SSAK contains helpers and tools to process data and train/infer ASR models.	38	Emerging	automatic-speech-recognition	5	Python
1524	naeruru/mimiuchi a free, customizable, osc capable speech-to-text interface for relaying text...	38	Emerging	dotnet-tts-libraries	60	TypeScript
1525	sskorol/vosk-api-gpu Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GCP	38	Emerging	vosk-asr-implementations	45	Shell
1526	sexfrance/RecaptchaV2-Solver A Python-based solution for solving Google's reCAPTCHA v2 challenges...	38	Emerging	ibm-watson-speech	38	Python
1527	DrDroidLab/voicesummary Open Source AI Database for Voice Agent Transcripts | Call Analysis &...	38	Emerging	voice-agent-applications	23	Python
1528	leduckhai/wav2graph wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech	38	Emerging	graph-database-rag	95	Python
1529	QuantiusBenignus/BlahST Input text from speech in any Linux window, the lean, fast and accurate way,...	38	Emerging	conversational-chatbot-applications	167	Shell
1530	noco-ai/spellbook-docker AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many...	38	Emerging	multi-modal-ai-assistants	168	Shell
1531	husniadil/cc-hooks Audio feedback plugin for Claude Code with TTS announcements, sound effects,...	38	Emerging	voice-enabled-coding-assistants	17	Python
1532	HawkAaron/E2E-ASR PyTorch Implementations for End-to-End Automatic Speech Recognition	38	Emerging	end-to-end-asr-frameworks	127	Python
1533	rxlabz/sytody a Flutter "speech to todo" app example	38	Emerging	educational-voice-apps	82	Dart
1534	keenresearch/KeenASR-Android-PoC A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING:...	38	Emerging	java-tts-libraries	28	Java
1535	HawkAaron/RNN-Transducer MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction...	38	Emerging	end-to-end-asr-frameworks	139	Python
1536	kroko-ai/kroko-onnx Kroko ASR - Speech-to-text	38	Emerging	funasr-speech-recognition	138	C++
1537	CMsmartvoice/One-Shot-Voice-Cloning One Shot Voice Cloning based on Unet-TTS	38	Emerging	voice-cloning-tools	245	Jupyter Notebook
1538	ybouhjira/claude-code-tts 🔊 Text-to-Speech MCP plugin for Claude Code - hear audio feedback while...	38	Emerging	voice-enabled-coding-assistants	7	Go
1539	rishikksh20/UnivNet-pytorch UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators...	38	Emerging	neural-vocoder-implementations	76	Python
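1540	binzhouchn/masr Chinese speech recognition series; readers can use it to quickly train their own Chinese speech recognition models, or test the pretrained models directly.	37	Emerging	text-to-speech-frameworks	285	Python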
1541	persiandataset/PersianSpeech Persian ASR dataset	37	Emerging	persian-speech-ai	42	—
1542	stevenhillis/awesome-asr-contextualization A curated list of awesome papers on contextualizing E2E ASR outputs	37	Emerging	end-to-end-asr-frameworks	80	—
1543	seven-io/home-assistant HACS supporting Home Assistant integration for seven	37	Emerging	home-assistant-tts	3	Python
1544	thinh-vu/ur_audio_sub Generate text captions for audio files & youtube video using OpenAI Whisper...	37	Emerging	video-transcription-extraction	16	Jupyter Notebook
1545	talin190/Qwen3-TTS-Daggr-UI 🎤 Create dynamic voice experiences with Qwen3-TTS-Daggr-UI, a Gradio app for...	37	Emerging	qwen3-tts-applications	3	Python
1546	fqueis/pollinationsai 🔥 TypeScript SDK wrapper for Pollinations AI services	37	Emerging	google-tts-libraries	13	TypeScript
1547	Bunlong/react-webspeech The official WebSpeech for React.	37	Emerging	react-speech-recognition	11	TypeScript
1548	habla-liaa/ser-with-w2v2 Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from...	37	Emerging	speech-emotion-recognition	140	Jupyter Notebook
1549	hypeapps/black-mirror A voice controlled smart mirror powered by Raspberry Pi3 and AndroidThings.	37	Emerging	voice-controlled-robotics	94	Java
1550	LetsPlayNow/Speech_AI Speech to speech bot built with Python	37	Emerging	voice-chatbot-applications	70	Python
1551	aks-devs/mod_openai_asr Freeswitch Speech-To-Text module	37	Emerging	vosk-asr-implementations	15	C
1552	j3soon/speech-to-windows-input Perform speech-to-text (STT/ASR) with Azure speech service and simulate...	37	Emerging	dotnet-tts-libraries	45	C#
1553	audioku/cross-accent-maml-asr Meta-learning model agnostic (MAML) implementation for cross-accented ASR	37	Emerging	end-to-end-asr-frameworks	45	Python
1554	botbahlul/vosk_autosrt A Python command-line utility to auto-generate a subtitle file (using...	37	Emerging	whisper-subtitle-generation	11	Python
1555	tmanderson/ivona-node Ivona Cloud (via Amazon services) client library for Node	37	Emerging	google-tts-libraries	31	JavaScript
1556	yzfly/awesome-voice-agents A curated list of voice AI agent frameworks, tools, resources, and best practices	37	Emerging	voice-agent-applications	20	—
1557	hcy71o/MB-iSTFT-VITS-with-AutoVocoder Incorporating AutoVocoder to MB-iSTFT-VITS	37	Emerging	vits-tts-implementations	48	Python
1558	jaywcjlove/TextSoundSaver Using the TextSoundSaver application, you can convert text into realistic...	37	Emerging	ios-speech-frameworks	85	Swift
1559	shi-gg/Auditional-Text The source code of the Auditional Text Discord bot	37	Emerging	discord-tts-bots	5	TypeScript
1560	hcoles/voices Fast, in-process text to speech for Java	37	Emerging	piper-tts-ecosystem	54	Java
1561	mrf345/flask_gtts A Flask extension to add gTTS Google text to speech	37	Emerging	web-based-tts-apps	9	Python
1562	jianchang512/chatterbox-api A text-to-speech (TTS) service based on Chatterbox-TTS. Provides an OpenAI TTS-compatible API with voice cloning support, plus a simple web UI.	37	Emerging	self-hosted-tts-servers	20	HTML
1563	MartinMashalov/VoiceCloning Generative voice cloning model using TTS synthesis with state-of-the-art...	37	Emerging	voice-cloning-tools	47	Python
1564	johnGettings/LIHQ Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)	37	Emerging	ai-video-generation	262	Python
1565	ismailperim/reportcast Transform reports into podcasts with AI - Nobody reads your reports. But...	37	Emerging	content-to-podcast-converters	4	TypeScript
1566	blip-radar/vatsim-parser Parser for a variety of VATSIM-related file formats	37	Emerging	rust-tts-libraries	4	Rust
1567	VoXera/VoXera An Open-Source Persian Language Techs Toolkit with Python	37	Emerging	self-hosted-tts-servers	5	Python
1568	outspeed-ai/voice-devtools Developer tools to debug and build realtime voice agents. Supports multiple models.	37	Emerging	voice-command-assistants	50	TypeScript
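1569	wangz-code/legado-edge-tts Edge Read Aloud Microsoft TTS service; listen to Microsoft TTS by configuring it as a speech engine in the Legado reader / Edge Read Aloud; if you have no VPS to deploy on, see the reader's built-in...	37	Emerging	edge-tts-implementations	23	Kotlin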
1570	ORI-Muchim/PolyLangVITS Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)	37	Emerging	vits-tts-implementations	75	Python
1571	gianpaj/sexyvoice Voice Cloning, Voice Call and Text to Speech platform. Perfect for content...	37	Emerging	text-to-speech	17	TypeScript
1572	rishikksh20/iSTFT-Avocodo-pytorch Ultrafast GAN based Vocoder for Text to Speech	37	Emerging	neural-vocoder-implementations	50	Python
1573	kaloprojects/KALO-ESP32-Voice-Chat-AI-Friends ESP32-based voice device for chatting with multiple custom AI bots....	37	Emerging	voice-controlled-robotics	59	C++
1574	JstnMcBrd/dectalk-tts API wrapper for the Dectalk TTS system	37	Emerging	dotnet-tts-libraries	1	TypeScript
1575	thewh1teagle/piper-onnx Use piper TTS with onnxruntime	37	Emerging	piper-tts-ecosystem	8	Python
1576	verbio-technologies/python-verbio-speech-center Python integration with the Verbio Speech Center Cloud....	37	Emerging	speech-recognition-apis	8	Python
1577	HordRicJr/HordVoice HordVoice - AI-powered voice assistant built with Flutter and Azure AI...	37	Emerging	educational-voice-apps	10	Dart
1578	SpenserCai/cosyvoice3.rs Python bindings for CosyVoice3 TTS using Candle. Has the characteristics of...	37	Emerging	coqui-tts-applications	9	Rust
1579	Sundy1219/eesen-for-thchs30 ASR for Chinese Mandarin	37	Emerging	end-to-end-asr-frameworks	76	Perl
1580	hipnologo/EchoForge_Studio Multi-LLM writing and voice production workspace built with Streamlit.	37	Emerging	streamlit-tts-apps	17	Python
1581	khakers/go-subgen Automatically generate subtitles for your media using whisper.cpp via...	37	Emerging	whisper-subtitle-generation	68	Go
1582	alamparelli/mcp-claude-say Voice interaction for Claude Code - Talk to Claude and hear responses using...	37	Emerging	voice-enabled-coding-assistants	7	Python
1583	bookbot-kids/speech-recognizer-bahasa-indonesian A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer...	37	Emerging	educational-voice-apps	12	C++
1584	hhguo/SoCodec Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications	37	Emerging	neural-vocoder-implementations	90	Python
1585	mattzzz/rick-voice Give any bot the voice of Rick Sanchez	37	Emerging	discord-tts-bots	14	Python
1586	AlexxIT/FasterWhisper Faster Whisper for Home Assistant - custom integration with a local...	37	Emerging	speech-to-text-converters	97	Python
1587	sljavi/handsfree-for-web-zoom-module Zoom module implementation for Handsfree for web	37	Emerging	web-speech-api-libraries	5	JavaScript
1588	zassou65535/VITS Text-to-speech reader and voice changer using VITS	37	Emerging	vits-tts-implementations	92	Python
1589	NotAbhinavGamerz/emotion-aware-automatic-speech-recognition 🎤 Enhance speech recognition by detecting emotions in spoken language,...	37	Emerging	speech-emotion-recognition	4	Python
1590	zabir-nabil/bangla-tts Bangla text to speech, Multilingual (Bangla, English) real-time speech...	37	Emerging	tts-model-finetuning	91	Python
1591	mravanelli/pySpeechRev This Python code performs an efficient speech reverberation starting from a...	37	Emerging	automatic-speech-recognition	97	Python
1592	tuanh123789/AdaSpeech An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for...	37	Emerging	fastspeech-tts-models	98	Python
1593	OpenASR/idiolect 🎙️ Handsfree Audio Development Interface	37	Emerging	android-voice-assistants	102	Kotlin
1594	anyvoiceai/Barkify Barkify: an unofficial training implementation of Bark TTS by suno-ai	37	Emerging	lightweight-tts-libraries	130	Python
1595	e-c-k-e-r/vall-e An unofficial PyTorch implementation of VALL-E	37	Emerging	tacotron-tts-models	88	Python
1596	XilinJia/Podcini Open source podcast instrument for Android supporting contents from YouTube...	37	Emerging	android-speech-apps	234	Kotlin
1597	soniqo/speech-android On-device speech SDK for Android — ASR, TTS, VAD, and noise cancellation...	37	Emerging	java-tts-libraries	4	C++
1598	erogol/FFTNet FFTNet vocoder implementation	37	Emerging	audio-noise-reduction	81	Jupyter Notebook
1599	GinoShun/Accent-Activation-Steering Official code for "Activation Steering for Accent Adaptation in Speech...	37	Emerging	end-to-end-asr-frameworks	3	Python
1600	zolomohan/speech-recognition-in-javascript Final Code for Speech Recognition in JavaScript tutorial.	37	Emerging	web-speech-api-libraries	54	JavaScript
