All Voice AI Tools

6,983 tools ranked by quality score · Page 2 of 70

Showing 101–200 of 6,983


#	Tool	Score	Tier	Category	Stars	Language
101	kurianbenoy/whisper_normalizer A python package for whisper normalizer	60	Established	speech-to-text-converters	76	Jupyter Notebook
102	ieasybooks/tafrigh Transcription and creation of SRT and VTT files using Whisper models and wit.ai technology.	60	Established	whisper-subtitle-generation	141	Python
103	nttcslab-sp/kaldiio A pure python module for reading and writing kaldi ark files	60	Established	kaldi-asr-ecosystem	268	Python
104	PyThaiNLP/pythaiasr Python Thai Automatic Speech Recognition	60	Established	automatic-speech-recognition	77	Python
105	Picovoice/rhino On-device Speech-to-Intent engine powered by deep learning	60	Established	speech-ai-coursework	698	Python
106	cboard-org/cboard Augmentative and Alternative Communication (AAC) system with text-to-speech...	60	Established	react-native-voice-libraries	732	JavaScript
107	Vonage/vonage-php-sdk-core Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech,...	60	Established	sms-voice-integrations	928	PHP
108	ManimCommunity/manim-voiceover Manim plugin for all things voiceover	60	Established	ai-video-generation	280	Python
109	roryeckel/wyoming_openai OpenAI-Compatible Proxy Middleware for the Wyoming Protocol	60	Established	lightweight-tts-runtimes	150	Python
110	PyThaiNLP/PyThaiTTS Open Source Thai Text-to-speech library in Python	60	Established	lightweight-tts-runtimes	58	Jupyter Notebook
111	flashlight/wav2letter Facebook AI Research's Automatic Speech Recognition Toolkit	59	Established	speaker-diarization-embedding	6,446	C++
112	netease-youdao/EmotiVoice EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine	59	Established	text-to-speech-frameworks	8,455	Python
113	OpenMOSS/MOSS-TTS MOSS‑TTS Family is an open‑source speech and sound generation model family...	59	Established	voice-assistant-devices	922	Python
114	lugia19/elevenlabslib Full python wrapper for the elevenlabs API.	59	Established	elevenlabs-integrations	158	Python
115	amicalhq/amical 🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no...	59	Established	local-voice-dictation	1,014	TypeScript
116	Kieirra/murmure Fully local, private and cross platform Speech-to-Text with LLM Post-processing	59	Established	speech-to-text-converters	585	TypeScript
117	r9y9/nnmnkwii Library to build speech synthesis systems designed for easy and fast prototyping.	59	Established	voice-cloning-synthesis	399	Python
118	tabahi/bournemouth-forced-aligner Extract phoneme-level timestamps from speech audio.	59	Established	asr-evaluation-metrics	121	Python
119	Picovoice/cheetah On-device streaming speech-to-text engine powered by deep learning	58	Established	funasr-speech-recognition	661	Python
120	xinjli/allosaurus Allosaurus is a pretrained universal phone recognizer for more than 2000 languages	58	Established	end-to-end-asr-frameworks	715	Python
121	babysor/MockingBird 🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time	58	Established	voice-cloning-synthesis	36,874	Python
122	MainRo/deepspeech-server A testing server for a speech to text service based on coqui.ai	58	Established	parakeet-asr-implementations	219	Python
123	OpenVoiceOS/ovos-tts-server simple flask server to host OpenVoiceOS tts plugins as a service	58	Established	espeak-ng-ecosystem	15	Python
124	aichaos/rivescript-python A RiveScript interpreter for Python. RiveScript is a scripting language for...	58	Established	discord-ai-chatbots	157	Python
125	software-mansion/react-native-executorch Declarative way to run AI models in React Native on device, powered by ExecuTorch.	58	Established	react-native-voice-libraries	1,284	C++
126	chinokikiss/GSV-TTS-Lite GSV-TTS-Lite A high-performance inference engine specifically designed for...	57	Established	vits-tts-implementations	57	Python
127	vilassn/whisper_android Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android	57	Established	whisper-framework-ports	630	C++
128	charleprr/redditube A video generator from Reddit posts and comments	57	Established	ai-video-generation	62	JavaScript
129	altunenes/parakeet-rs very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA...	57	Established	parakeet-asr-implementations	227	Rust
130	wenet-e2e/wenet Production First and Production Ready End-to-End Speech Recognition Toolkit	57	Established	end-to-end-asr-frameworks	5,056	Python
131	GitYCC/g2pW Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022)	57	Established	grapheme-to-phoneme-conversion	382	Python
132	MycroftAI/mycroft-precise A lightweight, simple-to-use, RNN wake word listener	57	Established	wake-word-detection	959	Python
133	Wikidepia/g2p-id Indonesian Grapheme-to-Phoneme (IPA notation)	57	Established	grapheme-to-phoneme-conversion	43	Python
134	n1teshy/yapper-tts offline text to speech and free SOTA LLM APIs to let your programs speak to you	57	Established	lightweight-tts-libraries	46	Python
135	AbdullahHendy/live-translation Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM...	57	Established	speech-to-text-transcription	13	Python
136	Vonage/vonage-node-sdk Vonage API client for Node.js. API support for SMS, Voice, Text-to-Speech,...	57	Established	sms-voice-integrations	396	TypeScript
137	phuc-nt/my-translator Real-time speech translation — macOS & Windows, free TTS, no server, your...	57	Established	ios-speech-frameworks	308	JavaScript
138	haoheliu/voicefixer General Speech Restoration	57	Established	automatic-speech-recognition	1,302	Python
139	Spr-Aachen/Easy-Voice-Toolkit A user-friendly audio toolkit for voice recognition, voice transcription,...	56	Established	voice-ai-learning-collections	875	Python
140	jianchang512/stt Voice Recognition to Text Tool / An offline, local audio/video-to-subtitle tool that outputs JSON, SRT subtitles, and plain text	56	Established	real-time-voice-translation	4,331	Python
141	RVC-Boss/GPT-SoVITS 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)	56	Established	vits-tts-implementations	55,896	Python
142	wq2012/SimpleDER A lightweight library to compute Diarization Error Rate (DER).	56	Established	asr-evaluation-metrics	62	Python
143	astorfi/speechpy 💬 SpeechPy - A Library for Speech Processing and Recognition:...	56	Established	automatic-speech-recognition	886	Python
144	sccn/eegprep EEGPrep is an automated preprocessing tool for human EEG data built on a...	56	Established	automatic-speech-recognition	19	Jupyter Notebook
145	revdotcom/revai-node-sdk Node.js SDK for the Rev AI API	56	Established	google-tts-libraries	21	TypeScript
146	justinsalamon/scaper A library for soundscape synthesis and augmentation	56	Established	audio-source-separation	414	Python
147	MahmoudAshraf97/whisper-diarization Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper	56	Established	whisper-diarization	5,437	Jupyter Notebook
148	XDcobra/react-native-sherpa-onnx React Native TurboModule for Sherpa-ONNX offline on-device Speech Processing...	56	Established	react-native-voice-libraries	9	TypeScript
149	yandexdataschool/speech_course YSDA course in Speech Processing.	56	Established	speech-ai-coursework	319	Jupyter Notebook
150	ahmetoner/whisper-asr-webservice OpenAI Whisper ASR Webservice API	56	Established	speech-to-text-converters	3,202	Python
151	jamsch/expo-speech-recognition Speech Recognition for React Native Expo projects	55	Established	react-native-voice-libraries	566	TypeScript
152	shivammehta25/Matcha-TTS [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching	55	Established	text-to-speech-frameworks	1,259	Jupyter Notebook
153	krillinai/KrillinAI Video translation and dubbing tool powered by LLMs. The video translator...	55	Established	video-dubbing-tools	9,724	Go
154	lucasnewman/f5-tts-mlx Implementation of F5-TTS in MLX	55	Established	zero-shot-voice-synthesis	611	Python
155	echogarden-project/echogarden Cross-platform speech toolset, used from the command-line or as a Node.js...	55	Established	google-tts-libraries	439	TypeScript
156	linto-ai/WebVoiceSDK Building blocks for voice-enabled applications in the browser	55	Established	text-to-speech-conversion	38	JavaScript
157	deepgram/deepgram-js-sdk Official JavaScript SDK for Deepgram.	55	Established	deepgram-starter-projects	248	TypeScript
158	kstonekuan/tambourine-voice Your personal voice interface for any app. Speak naturally and your words...	55	Established	local-voice-dictation	313	Rust
159	ken107/read-aloud An awesome browser extension that reads aloud webpage content with one click	55	Established	browser-tts-extensions	1,639	JavaScript
160	remsky/Kokoro-FastAPI Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX...	55	Established	kokoro-tts-ecosystem	4,585	Python
161	EddyVerbruggen/nativescript-speech-recognition 💬 Speech to text, using the awesome engines readily available...	55	Established	web-speech-api-libraries	91	TypeScript
162	itsmevictor/clean-transcribe A simple CLI to transcribe Youtube videos or local audio/video files and...	54	Established	audio-transcription-tools	23	Python
163	zuoban/tts TTS service	54	Established	system-tts-wrappers	602	TypeScript
164	githubharald/CTCWordBeamSearch Connectionist Temporal Classification (CTC) decoder with dictionary and...	54	Established	ctc-asr-implementations	577	C++
165	NVIDIA-AI-Blueprints/pdf-to-podcast Transform PDFs into AI podcasts for engaging on-the-go audio content.	54	Established	pdf-to-audio-conversion	803	Python
166	dangvansam/viet-asr VietASR - Vietnamese Automatic Speech Recognition	54	Established	end-to-end-asr-frameworks	165	Python
167	OpenMOSS/MOSS-TTSD MOSS-TTSD is a spoken dialogue generation model designed for expressive...	54	Established	voice-assistant-devices	1,202	Python
168	Softcatala/open-dubbing Open dubbing is an AI dubbing system which uses machine learning models to...	54	Established	voice-cloning-synthesis	373	Python
169	met4citizen/HeadTTS HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for...	54	Established	kokoro-tts-ecosystem	112	JavaScript
170	Azure-Samples/Cognitive-Speech-TTS Microsoft Text-to-Speech API sample code in several languages, part of...	54	Established	dotnet-tts-libraries	1,004	C#
171	LokerL/tts-vue 🎤 Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite.	54	Established	google-tts-libraries	6,099	TypeScript
172	kalliope-project/kalliope Kalliope is a framework that will help you to create your own personal assistant.	54	Established	python-voice-assistants	1,754	Python
173	sandrohanea/whisper.net Whisper.net. Speech to text made simple using Whisper Models	54	Established	whisper-framework-ports	894	C#
174	VolcanicArts/VRCOSC A modular node-programming language, program creator, animation system,...	54	Established	dotnet-tts-libraries	502	C#
175	travisvn/edge-tts-universal Use Microsoft Edge's online text-to-speech service in Node.js, browsers, or...	54	Established	edge-tts-implementations	59	TypeScript
176	githubharald/CTCDecoder Connectionist Temporal Classification (CTC) decoding algorithms: best path,...	54	Established	ctc-asr-implementations	835	Python
177	aahl/zai-tts 🗣️ ZAI/GLM TTS to OpenAI Speech API, a free speech synthesis API with voice cloning support, based on Zhipu TTS	54	Established	openai-tts-applications	158	Python
178	peteonrails/voxtype Voice-to-text with push-to-talk for Wayland compositors	54	Established	voice-dictation-typing	510	Rust
179	dlutton/flutter_tts Flutter Text to Speech package	54	Established	educational-voice-apps	732	Dart
180	gunthercox/chatterbot-voice An example of verbal communication using ChatterBot	54	Established	voice-chatbot-applications	112	—
181	pavelzbornik/whisperX-FastAPI FastAPI service on top of WhisperX	54	Established	speech-to-text-converters	174	Python
182	yuga-hashimoto/openclaw-assistant OpenClaw voice assistant app for Android - Wake word activation & system...	54	Established	openclaw-voice-assistants	196	Kotlin
183	dputhier/pygtftk A python package and a set of shell commands to handle GTF files	54	Established	lightweight-tts-libraries	51	Python
184	Oaklight/asr2clip handy cli tool to convert your speech to clipboard text	54	Established	speech-to-text-converters	15	Python
185	royshil/obs-localvocal OBS plugin for local speech recognition and captioning using AI	54	Established	speech-to-text-converters	1,412	C++
186	BryceWG/BiBi-Keyboard 说点啥 (BiBi Keyboard): a Kotlin-based LLM and ASR voice input keyboard app for Android. An LLM ASR...	54	Established	audio-transcription-tools	535	Kotlin
187	stemrollerapp/stemroller Isolate vocals, drums, bass, and other instrumental stems from any song	54	Established	audio-source-separation	3,052	Svelte
188	kishanrajput23/Jarvis-Desktop-Voice-Assistant A python based desktop voice assistant capable of executing system-level...	54	Established	python-voice-assistants	589	Python
189	deepgram/deepgram-python-sdk Official Python SDK for Deepgram.	53	Established	voice-ai-sdks	406	Python
190	stimm-ai/stimm The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI...	53	Established	voice-assistant-frameworks	40	Python
191	zai-org/GLM-ASR GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters	53	Established	llm-scaling-architecture	759	Python
192	JamesBrill/react-speech-recognition 💬Speech recognition for your React app	53	Established	react-speech-recognition	835	JavaScript
193	wannaphong/ttsmms TTS with The Massively Multilingual Speech (MMS) project	53	Established	lightweight-tts-libraries	235	Python
194	ynop/audiomate Python library for handling audio datasets.	53	Established	speech-corpora-datasets	138	Python
195	sdkcarlos/artyom.js A voice control - voice commands - speech recognition and speech synthesis...	53	Established	web-speech-api-libraries	1,268	JavaScript
196	Aivis-Project/aivmlib Aivis Voice Model File (.aivm/.aivmx) Utility Library	53	Established	openai-tts-applications	25	Python
197	hugobloem/wyoming-microsoft-tts Wyoming protocol server for Microsoft Azure text-to-speech	53	Established	lightweight-tts-runtimes	25	Python
198	nl8590687/ASRT_SpeechRecognition A Deep-Learning-Based Chinese Speech Recognition System	53	Established	ctc-asr-implementations	8,359	Python
199	namastexlabs/murmurai 🎙️ Drop-in replacement for paid transcription APIs. Self-hosted,...	53	Established	speech-to-text-converters	39	Python
200	mkiol/dsnote Speech Note Linux app. Note taking, reading and translating with offline...	53	Established	voice-dictation-typing	1,395	C++
