All Voice AI Tools

6,981 tools ranked by quality score · Page 19 of 70

Showing 1801–1900 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
1801	heartsuit/BaiduASRAndTTS Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech;...	35	Emerging	dotnet-tts-libraries	47	C#
1802	chaonan99/ppt_presenter Convert ppt to video with audio track, using text to speech synthesis	35	Emerging	pdf-to-audio-conversion	69	Python
1803	ProsusAI/project-echo An AI-powered voice director assistant for creating engaging audio content...	35	Emerging	voice-controlled-robotics	5	Python
1804	WangYixuan12/openai_tts OpenAI Text-to-Speech Interface	35	Emerging	openai-tts-applications	5	Python
1805	EtienneAb3d/WhisperTimeSync Synchronize Whisper's timestamps over an existing accurate transcription	35	Emerging	whisper-subtitle-generation	163	Java
1806	amitdev01/awesome-voice-ai Awesome Voice Ai	35	Emerging	voice-ai-learning-collections	4	—
1807	sooftware/End-to-End-Speech-Recognition-Models PyTorch implementation of automatic speech recognition models.	35	Emerging	end-to-end-asr-frameworks	38	Python
1808	OwenEdwards/videojs-speak-descriptions-track A Video.js 7 middleware that uses browser speech synthesis to speak...	35	Emerging	web-speech-api-tts	6	JavaScript
1809	syntithenai/opensnips Open source projects related to Snips https://snips.ai/.	35	Emerging	voice-ai-learning-collections	55	JavaScript
1810	holgern/ttsforge Convert EPUB files to audiobooks using Kokoro ONNX TTS	34	Emerging	ebook-to-audiobook-conversion	1	Python
1811	candlewill/AiVoice Deep CNN networks for Speech Synthesis	34	Emerging	neural-vocoder-implementations	49	Python
1812	Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022 Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and...	34	Emerging	automatic-speech-recognition	69	Python
1813	hacktronaut/azure-avatar-demo Text To Speech Demo in ReactJS Application using Azure Avatar AI Service.	34	Emerging	dotnet-tts-libraries	34	JavaScript
1814	jianchang512/gemini-speech2srt 使用 Gemini AI 转写音视频为 SRT 字幕	34	Emerging	content-to-podcast-converters	54	Python
1815	tiansztiansz/voice-assistant 重生之我是 AI 打工人。前世，我的身份默默无闻，来去匆匆，不知道自己将在何地出生。然而，命运给予了我难得的机会，让我重生为一名 AI 打工人。	34	Emerging	conversational-chatbot-applications	50	C++
1816	rtk-ai/vox A universal AI toolkit for high-performance Speech-to-Text (STT) and...	34	Emerging	text-to-speech-conversion	36	Rust
1817	LucaLuke13/TalkyBotty Simply forward a video or voice message in any language to the bot, and it...	34	Emerging	telegram-voice-transcription	43	Python
1818	Fatma-Chaouech/audioverse Breathe Life Into Your Books! 📚🌱	34	Emerging	ai-podcast-generation	36	Python
1819	medokin/soundpad-text-to-speech Text-To-Speech for Soundpad	34	Emerging	dotnet-tts-libraries	47	C#
1820	nhaouari/local11labs Local11Labs allows generating high-quality text-to-speech and podcast...	34	Emerging	kokoro-tts-ecosystem	52	Python
1821	Mobile-Artificial-Intelligence/maise Maise is an open-source android speech engine designed to provide a powerful...	34	Emerging	text-to-speech-conversion	11	Kotlin
1822	akinsella/yt-transcript-rs 🎬️ A Rust library for accessing YouTube Video Infos & Transcripts	34	Emerging	video-transcription-extraction	6	Rust
1823	trabdlkarim/voce-browser Voice Controlled Chromium Web Browser	34	Emerging	general-purpose-voice-assistants	40	Python
1824	Dark2C/Viral-Faceless-Shorts-Generator Automatically generate faceless YouTube Shorts from trending topics using AI...	34	Emerging	ai-video-generation	41	HTML
1825	jvandenaardweg/ssml-split Splits SSML strings into batches AWS Polly ánd Google's Text to Speech API...	34	Emerging	aws-polly-tts	15	TypeScript
1826	egorsmkv/tts_uk High-fidelity speech synthesis for Ukrainian using modern neural networks.	34	Emerging	ukrainian-voice-ai	10	Jupyter Notebook
1827	deepkyu/ml-talking-face Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)	34	Emerging	fastspeech-tts-models	53	Python
1828	moeru-ai/ortts 𖣘🔊 Simple and Easy-to-use local TTS inference server, Powered by ONNX Runtime	34	Emerging	rust-tts-libraries	14	Rust
1829	jxlarrea/wyoming-voice-match A Wyoming protocol ASR proxy that verifies speaker identity and isolates...	34	Emerging	conversational-rag-agents	34	Python
1830	GeoHaberC/Story-to-Video Create a Movie animation plus Audio plus Subtitle from a text file	34	Emerging	ai-video-generation	44	Python
1831	Lunarien/Lunariens-Mental-Math-Trainer Mental math trainer made in C#.	34	Emerging	dotnet-tts-libraries	10	C#
1832	iotjin/JhPrivacyAuthTool 隐私权限判断 - 封装了几种常用的隐私权限判断(定位服务,通讯录, 日历,提醒事项, 照片, 蓝牙共享,麦克风, 相机)和通知的注册和判断。定位服务,蓝牙共享是单独调用的	34	Emerging	ios-speech-frameworks	44	Objective-C
1833	akashmjn/cs224n-gpu-that-talks Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)	34	Emerging	fastspeech-tts-models	52	Jupyter Notebook
1834	kaituoxu/Tacotron2 A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS)...	34	Emerging	tacotron-tts-models	52	Python
1835	FlooferLand/ttvoice-mod A Minecraft mod that lets you type to speak!	34	Emerging	dotnet-tts-libraries	4	Kotlin
1836	AndreDalwin/Whisper2Summarize Whisper2Summarize is an application that uses Whisper for audio processing...	34	Emerging	audio-transcription-tools	55	Python
1837	doveg/whisper-real-time A real time offline transcriber with gui, based on OpenAI whisper	34	Emerging	speech-to-text-converters	16	Python
1838	TartuNLP/text-to-speech-worker Estonian multi-speaker neural text-to-speech worker that processes requests...	34	Emerging	self-hosted-tts-servers	16	Python
1839	tktcorporation/discord-tts-bot A discord bot to use tts in your voice channel.	34	Emerging	discord-tts-bots	4	Rust
1840	nexmo-community/voice-azure-speechtotext-py Sample Code for Realtime Transcription using Nexmo, Microsoft Azure Speech...	34	Emerging	dotnet-tts-libraries	10	Python
1841	seven-io/net-client Official .NET API Client for seven	34	Emerging	sms-voice-integrations	3	C#
1842	yapit-tts/yapit Listen to anything. TTS for documents, papers, and web pages.	34	Emerging	openai-tts-applications	4	Python
1843	N6UDP/SteamDiscordTTSBot A steam chat to Discord TTS bridge	34	Emerging	discord-tts-bots	3	C#
1844	NeoKazuya/qwen3-tts-enhanced Enhanced Qwen3-TTS voice cloning GUI with multi-reference samples, variation...	34	Emerging	voice-cloning-synthesis	17	Python
1845	ttuleyb/TortoiseTTS-GUI GradioUI for TortoiseTTS voice generation	34	Emerging	gradio-tts-webuis	33	Python
1846	Frida7771/PyVoice A Python-based speech processing tool that supports both speech-to-text...	34	Emerging	coqui-tts-applications	3	Python
1847	audo-ai/magic-mic Open Source Noise Cancellation App for Virtual Meetings	34	Emerging	audio-noise-reduction	384	C++
1848	leokwsw/OpenAI-TTS-Gradio Use OpenAI TTS(Text to Speech) API with Gradio	34	Emerging	gradio-tts-webuis	59	Python
1849	bhattbhavesh91/wav2vec2-huggingface-demo Speech to Text with self-supervised learning based on wav2vec 2.0 framework...	34	Emerging	wav2vec2-asr-models	29	Jupyter Notebook
1850	HectorPulido/chatbot-with-voice Jarvis like chatbot with voice	34	Emerging	voice-chatbot-applications	20	Python
1851	antifield/vmt Discord App for Transcribing & Translating Voice Messages	34	Emerging	discord-tts-bots	14	Python
1852	mmpneo/simple-obs-stt Speech-to-text and keyboard input captions for OBS.	34	Emerging	live-caption-generation	105	TypeScript
1853	kssteven418/Q-ASR [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition	34	Emerging	model-compression-optimization	34	Jupyter Notebook
1854	ayutaz/uCosyVoice CosyVoice3 text-to-speech for Unity using ONNX inference. Supports zero-shot...	34	Emerging	coqui-tts-applications	16	C#
1855	kaiaai/kaia.js Kaia.ai platform's JS client library	34	Emerging	google-tts-libraries	1	TypeScript
1856	Fooftilly/kokoro-extension Send text from browser to Kokoro-FastAPI for TTS generation	34	Emerging	kokoro-tts-ecosystem	2	JavaScript
1857	lepisma/emacs-speech-input Set of packages for speech and voice inputs in Emacs	34	Emerging	cross-platform-tts-frameworks	42	C
1858	renorari/VoiceJP-Discord A discord-app can text-to-speech and speech-to-text	34	Emerging	discord-tts-bots	4	TypeScript
1859	jianchang512/realtime-stt 一个极简的本地离线实时语音转文字工具	34	Emerging	real-time-voice-translation	11	Python
1860	cristofima/AI-Tech-Interview-Preparation An AI-powered technical interview preparation platform that generates...	34	Emerging	ai-interview-simulators	2	TypeScript
1861	18F/dol-whd-14c The 14(c) system will become a modern, digital-first service. Applicants...	34	Emerging	government-procurement-docs	16	C#
1862	neosapience/n8n-nodes-typecast Integrate Typecast AI TTS into your n8n workflows with this community node.	34	Emerging	google-tts-libraries	1	TypeScript
1863	cdyangbo/end2endASR implement end-to-end asr algorithm with tensorflow	34	Emerging	end-to-end-asr-frameworks	40	Python
1864	quangvu3/coqui-xtts Coqui XTTS model with Vietnamese added	34	Emerging	tts-model-finetuning	4	Python
1865	m-nathani/speech_to_text how to use the Google Cloud Speech API to transcribe audio/video files.	34	Emerging	php-tts-libraries	34	PHP
1866	deepgram-starters/php-transcription Get started using Deepgram's speech-to-text with this PHP demo app	34	Emerging	deepgram-starter-projects	3	PHP
1867	keonlee9420/Stepwise_Monotonic_Multihead_Attention PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to...	34	Emerging	tacotron-tts-models	39	Python
1868	alsrb0607/KoreanSTT kospeech를 활용한 한국어 음성 인식 모델 개발	34	Emerging	voice-ai-learning-collections	28	Python
1869	c99koder/AudioClassifier-MQTT Use the yamnet TensorFlow model to classify live audio from a microphone and...	34	Emerging	audio-event-classification	31	Python
1870	nithincvpoyyil/voice-listener An reusable angular component for voice based input using web speech API	34	Emerging	web-speech-api-libraries	2	CSS
1871	sudonitin/Audio-book-generator Convert your ebooks to audiobooks. 📖->🎧	34	Emerging	ebook-to-audiobook-conversion	74	Python
1872	WeiChiaChang/happy-halloween 🗣 Say "happy halloween" to your browser 🎃	34	Emerging	web-speech-api-libraries	14	JavaScript
1873	keonlee9420/Comprehensive-E2E-TTS A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a...	34	Emerging	text-to-speech-frameworks	146	Python
1874	Blackwood416/AstraTTS 基于 ONNX Runtime 的跨平台高性能 TTS 合成方案，支持流式输出与低延迟播放，支持自定义音色与中英混合生成。	34	Emerging	lightweight-tts-runtimes	54	C#
1875	alkhimey/esp32-flite Speech synthesis running on ESP32 based on Flite engine.	34	Emerging	embedded-tts-systems	75	C
1876	xhuvom/omnilingual-ASR-Web-Dashboard Meta Omnilingual ASR web based dashboard for testing and API based...	34	Emerging	funasr-speech-recognition	4	Python
1877	markokosticdev/cloud_text_to_speech_flutter Single interface to Google, Microsoft, and Amazon Text-To-Speech.	34	Emerging	educational-voice-apps	8	Dart
1878	priyanujgogoi-28/flowery-tts Wrapper of Flowery Text to Speech API for Dart	34	Emerging	educational-voice-apps	5	Dart
1879	markmiddo/synthia AI-powered voice assistant that respects your privacy. Control your desktop,...	34	Emerging	local-voice-assistants	4	Python
1880	HnDK0/NoveLA Free Android reader for web novels, light novels, ranobe & EPUB. 25+...	34	Emerging	ai-powered-ereaders	8	Kotlin
1881	TartuNLP/text-to-speech-api REST API for neural text-to-speech synthesis	34	Emerging	lightweight-tts-libraries	17	Python
1882	nabz0r/mac-local-translator Local translation app for Mac using speech recognition and offline translation	34	Emerging	local-voice-dictation	4	Swift
1883	aditya-an1l/RILearn Reinventing Reading with a touch of Interactivity aided Learning	34	Emerging	ai-powered-ereaders	4	HTML
1884	Harsh-0-7/PDF-Reader PDF reader with read aloud feature	34	Emerging	ai-powered-ereaders	8	JavaScript
1885	C0NZZ/better-teletask Browser extension that adds useful features like subtitles to HPI Tele-Task.	34	Emerging	browser-tts-extensions	3	Python
1886	notebook-nexus/chatterbox-tts-colab Transform any text into natural-sounding speech, clone voices from audio...	34	Emerging	text-to-speech-conversion	27	—
1887	book000/audio-transcriber-docker Automatically transcribe the audio of video / audio files using Speech Recognition.	34	Emerging	real-time-voice-translation	3	JavaScript
1888	rudra00434/SoulPlayer My own music application build with Django , Tailwind CSS and Spacy...	34	Emerging	news-audio-bulletins	4	HTML
1889	ZhuoZhuoCrayon/AcousticKeyBoard-Web ❓声学键盘｜脑洞大开：做一个能听懂键盘敲击键位的「玩具」，学习信号处理 / 深度学习 / 安卓 / Django。	34	Emerging	audio-music-learning	88	Python
1890	bishop-ai/bishop-ai Voice and text virtual assistant	34	Emerging	virtual-assistants-nlp	28	JavaScript
1891	MarkParker5/STARK-PLACE S.T.A.R.K. Platform Library and Community Extensions	34	Emerging	dotnet-tts-libraries	7	Python
1892	philsyn/DiffWave-Vocoder Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and...	34	Emerging	neural-vocoder-implementations	90	Python
1893	janewu77/ela-extension English Learner Assistant	34	Emerging	browser-tts-extensions	4	JavaScript
1894	Lastorder-DC/chatreader-kor 채팅 읽어주는 로봇	34	Emerging	twitch-chat-tts	17	JavaScript
1895	leprosus/golang-tts Text-to-Speach golang package based in Amazon Polly service	34	Emerging	go-tts-libraries	26	Go
1896	jiwidi/DeepSpeech-pytorch Pytorch implementation for DeepSpeech 2.0	34	Emerging	end-to-end-asr-frameworks	31	Python
1897	T-vK/Termux-DeepSpeech Open source offline speech recognition for Android using Mozilla's...	34	Emerging	java-tts-libraries	85	Shell
1898	edde746/tiktok-askreddit A content generation & posting bot for TikTok, scraping posts from r/AskReddit	34	Emerging	ai-video-generation	150	Python
1899	speechbrain/speechbrain.github.io The SpeechBrain project aims to build a novel speech toolkit fully based on...	34	Emerging	speaker-diarization-embedding	374	HTML
1900	msalhab96/SpeeQ A framework for automatic speech recognition	34	Emerging	keyword-speech-recognition	51	Python

« Prev 1 2 3 … 17 18 19 20 21 … 68 69 70 Next »