All Voice AI Tools

6,981 tools ranked by quality score · Page 42 of 70

Showing 4101–4200 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
4101	prokhororlov/VoiceCraft Book to MP3 converter. Convert e-books (FB2, EPUB, TXT) to MP3 audiobooks...	21	Experimental	ebook-to-audiobook-conversion	2	TypeScript
4102	aiola-lab/aiola-js-sdk The official JavaScript/TypeScript SDK for the aiOla API	21	Experimental	google-tts-libraries	2	TypeScript
4103	naver/multilingual-distilwhisper This repository contains all the code necessary for running the multilingual...	21	Experimental	whisper-fine-tuning	33	Python
4104	gongouveia/Whisper-Synthetic-ASR-Dataset-Generator This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI...	21	Experimental	speech-to-text-converters	32	Python
4105	dustland/talk IELTS Speaking.	21	Experimental	ai-tutoring-platforms	11	TypeScript
4106	Revocalize/revocalize-python The official Python API for Revocalize AI voice synthesizer platform.	21	Experimental	voice-ai-sdks	9	Python
4107	sandeepmukku12/vocodine 🎙️ VocoDine: Book your table with your voice! Speak your booking details,...	21	Experimental	react-speech-recognition	2	JavaScript
4108	SatyamPote/Ai-Video-Interviewer An AI-powered mock interview platform that simulates a real-time video call...	21	Experimental	ai-interview-simulators	10	JavaScript
4109	dobby-seo/kosr Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)	21	Experimental	end-to-end-asr-frameworks	31	Python
4110	1ytic/edit-distance-papers A curated list of papers dedicated to edit-distance as objective function	21	Experimental	end-to-end-asr-frameworks	53	—
4111	Hariswar8018/Star-Wish-AI-Stories Create Stories with AI, View Stories as well as Scan BarCode to known more...	21	Experimental	image-to-speech-synthesis	6	Dart
4112	Smorodov/kaldi_vosk_win_cmake cmake based kaldi + vosk + microphone speech recognition example	21	Experimental	vosk-asr-implementations	7	C++
4113	abdufelsayed/talkio Talkio — TypeScript voice AI orchestration: STT + LLM + TTS with streaming,...	21	Experimental	ai-tutoring-platforms	2	TypeScript
4114	VARCOVoice/VARCOVoice_UNITYSDK Official Unity SDK for VARCO Voice API. High-quality AI text-to-speech,...	21	Experimental	dotnet-tts-libraries	2	C#
4115	Geguchh024/VocalizeMD A VS Code extension that converts Markdown files to natural-sounding speech...	21	Experimental	ai-powered-ereaders	2	TypeScript
4116	wq2012/VB_diarization VB Diarization with Eigenvoice and HMM Priors, refactored	21	Experimental	funasr-speech-recognition	15	Python
4117	partrita/tts-kokoro-app local app for Kokoro TTS.	21	Experimental	kokoro-tts-ecosystem	2	Python
4118	BluShooz/text-to-video-generator SOTA Text-to-Video Generator with MuseTalk 1.5, LivePortrait, and LTX-Video....	21	Experimental	ai-video-generation	2	Python
4119	Kaljurand/Grammars Grammatical Framework based speech recognition grammars for Estonian,...	21	Experimental	funasr-speech-recognition	10	Grammatical Framework
4120	FairyDevicesRD/mimi.client.kotlin mimi(R) API Client for Kotlin	21	Experimental	android-speech-apps	2	Kotlin
4121	Yangyangii/Tacotron-pytorch Tacotron implementation with pytorch 1.0	21	Experimental	tacotron-tts-models	10	Python
4122	mklement0/speak.awf An Alfred 3 workflow that uses macOS's TTS (text-to-speech) feature to speak...	21	Experimental	system-tts-wrappers	37	Shell
4123	bitgineer/Speakeasy Privacy-first local voice-to-text using Whisper AI. Cross-platform desktop...	21	Experimental	voice-dictation-typing	2	Python
4124	gikonyob/speake3 Speake3 library provides a wrapper around Espeak to easily write efficient...	21	Experimental	espeak-ng-ecosystem	8	Python
4125	funnyzak/xfyun-nls 讯飞云智能语音处理 Node 模块。	21	Experimental	google-tts-libraries	4	JavaScript
4126	ntddk/transcibe A script to transcribe audio files with Google Cloud Speech API.	21	Experimental	real-time-voice-translation	10	Python
4127	NassimaOULDOUALI/Prosody-Control-French-TTS An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control	21	Experimental	zero-shot-voice-synthesis	19	Python
4128	WelkinYang/EMPHASIS-pytorch EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System	21	Experimental	zero-shot-voice-synthesis	15	Python
4129	ORI-Muchim/Grad-TTS 'Grad-TTS' with Multilingual Cleaners	21	Experimental	zero-shot-voice-synthesis	11	Jupyter Notebook
4130	grossstadtmann/elevenbatch Elevenlabs.io API batch creation of text to speach files.	21	Experimental	elevenlabs-integrations	2	Shell
4131	cihanselim/python-codebyvoice talk for programming :loudspeaker: /w google speech recognition	21	Experimental	speech-recognition-apis	11	Python
4132	tltrogl/diaremot2-on DiaRemot2-ON: CPU-only audio intelligence pipeline (Faster-Whisper, ONNX,...	21	Experimental	whisper-diarization	6	Python
4133	m15-ai/TrooperAI Conversational AI, local, low-latency voice assistant for Raspberry Pi 5...	21	Experimental	local-voice-assistants	20	Python
4134	babua/TTSDatasetRecorder A simple app for recording speech datasets.	21	Experimental	tts-dataset-creation	26	Python
4135	QuantiusBenignus/Spoken Joplin text notes and to-dos via OFFLINE speech recognition. To-do reminders...	21	Experimental	voice-dictation-typing	11	Shell
4136	mozilla-ai/speech-to-text Blueprint by Mozilla.ai on how to transcribe audio files	21	Experimental	speech-to-text-converters	23	—
4137	FluxCapacitor2/whisper-asr-webapp A web app for automatic speech recognition using OpenAI's Whisper model...	21	Experimental	speech-to-text-converters	9	Svelte
4138	jik876/hifi-gan-demo Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient...	21	Experimental	neural-vocoder-implementations	10	HTML
4139	dimitriStoidis/GenGAN Repository for the paper: Generating gender-ambiguous voices for...	21	Experimental	neural-vocoder-implementations	8	Python
4140	AndreaLombax/Speech_emotion_recognition In this work is proposed a speech emotion recognition model based on the...	21	Experimental	speech-emotion-recognition	10	Python
4141	aliyzd95/modified-shemo A modification on the Sharif Emotional Speech Database	21	Experimental	speech-emotion-recognition	10	Jupyter Notebook
4142	PrashanthaTP/wav2mov Speech to Facial Animation using GANs	21	Experimental	lip-reading-synthesis	40	Python
4143	Ashmit-Kumar/Assess-AI End-to-end AI interview platform featuring live voice interaction, coding...	21	Experimental	ai-interview-simulators	2	TypeScript
4144	timf34/Article2Audio Convert articles to audio using OpenAI's Text to Speech API via a python...	21	Experimental	openai-tts-applications	10	Go
4145	mvalancy/logitech_bcc950 A talking eyeball on a stick - Logitech BCC950 PTZ camera control scripts	21	Experimental	assistive-vision-ai	2	Python
4146	Sharan-Kumar-R/Talk2Translate The application uses SpeechRecognition, GoogleTranslator, and gTTS to...	21	Experimental	speech-translation-apps	2	Python
4147	taresh18/livekit-kokoro Livekit TTS plugin for kokoro	21	Experimental	kokoro-tts-ecosystem	9	Python
4148	transitive-bullshit/unrealspeech-api TypeScript client for the Unreal Speech TTS API.	21	Experimental	web-speech-api-tts	4	TypeScript
4149	SpringerNLP/Chapter12 Chapter 12: End-to-end Speech Recognition	21	Experimental	end-to-end-asr-frameworks	9	Jupyter Notebook
4150	danvers/medienpaed-asr Understanding ASR	21	Experimental	automatic-speech-recognition	2	Python
4151	RakeshBabuGajula/real-time-voice-translator A real-time voice translator web app built with Streamlit that captures live...	21	Experimental	speech-translation-apps	2	Python
4152	jaju/voissistant Voiss Aceistant - Apple only, with mlx.	21	Experimental	local-voice-dictation	2	Python
4153	PanosAntoniadis/slp-ntua Lab exercises of Speech and Language Processing course in NTUA	21	Experimental	speech-ai-coursework	11	Jupyter Notebook
4154	microsoft/MunTTS-A-Text-to-Speech-System-For-Mundari Official Codebase for "MunTTS: A Text-to-Speech System for Mundari"...	21	Experimental	coqui-tts-applications	8	Python
4155	xulihang/Silhouette An open source computer-aided translation tool for audios and videos	21	Experimental	whisper-subtitle-generation	19	B4X
4156	Mmesek/mUSh Ultrastar Songs Creation/Management helper utils.	21	Experimental	gradio-tts-webuis	2	Python
4157	IAMJOYBO/index-tts Docker镜像自动构建并上传到阿里云	21	Experimental	coqui-tts-applications	4	Dockerfile
4158	speaking-portal-project-team-a/The-Speaking-Portal-Project The objective of the Speaking Portal Project is to design, develop, and...	21	Experimental	web-speech-api-tts	13	TypeScript
4159	burntcarrot/quackspeak Text-to-speech using ducks. 🦆	21	Experimental	web-speech-api-tts	10	JavaScript
4160	lifeCoder123/Speech-to-Text-Converter Speech-to-text converter tool using Google Speech Cloud API to convert...	21	Experimental	web-speech-api-tts	9	JavaScript
4161	aminul-huq/Speech-Command-Classification Speech command classification on Speech-Command v0.02 dataset using PyTorch...	21	Experimental	keyword-speech-recognition	9	Python
4162	narVidhai/Speech-Transcription-Benchmarking Example python scripts to evaluate various ASR methods	21	Experimental	asr-evaluation-metrics	11	Python
4163	malob/article-to-audio-cloud-function Google Cloud Function that takes a url, converts the article at that url to...	21	Experimental	content-to-podcast-converters	23	JavaScript
4164	KuchikiRenji/vall-e Unofficial PyTorch implementation of VALL-E: zero-shot text-to-speech and...	21	Experimental	tacotron-tts-models	2	Python
4165	popcornell/MicRank MicRank is a Learning to Rank neural channel selection framework where a DNN...	21	Experimental	keyword-speech-recognition	22	Python
4166	anicolson/matlab_feat Functions for creating speech features in MATLAB.	21	Experimental	keyword-speech-recognition	14	MATLAB
4167	mvshyvk/KaldiService Service for easy access to speech recognition capabilities of Kaldi using...	21	Experimental	kaldi-asr-ecosystem	5	Java
4168	George0828Zhang/simulst PyTorch toolkit for streaming speech recognition, speech translation and...	21	Experimental	speech-translation-apps	25	Python
4169	FarawaySail/Kaldi_thchs30 媒体与认知语音识别大作业	21	Experimental	kaldi-asr-ecosystem	9	Shell
4170	meichthys/sword_drill Displays Bible verses from parsed microphone input.	21	Experimental	automatic-speech-recognition	11	Python
4171	JunhoKim94/ASR_project This repository created for the NHN ASR hackathon competition.	21	Experimental	automatic-speech-recognition	11	Python
4172	german-asr/nvidia-jasper-german Scripts for training NVIDIA Jasper for German Speech Recognition (ASR).	21	Experimental	automatic-speech-recognition	10	Jupyter Notebook
4173	loryanstrant/HA-ElevenLabs-Custom-TTS An ElevenLabs TTS integration for Home Assistant that allows for creation of...	21	Experimental	elevenlabs-integrations	2	Python
4174	NetherQuartz/TextForSpeechNormalizer A Python library to accentuate Russian text	21	Experimental	text-normalization-engines	11	Python
4175	Rajvardhman05/openwhisper-app Free, open-source voice-to-text for macOS — 100% local, offline...	21	Experimental	local-voice-dictation	2	Swift
4176	davidsuragan/issai-playground A Python toolkit for accessing ISSAI’s AI services — Oylan (LLM), Soyle...	21	Experimental	openai-tts-applications	2	Python
4177	zvadaadam/speech-recognition End to End Speech Recognition with Tensorflow	21	Experimental	ctc-asr-implementations	9	Python
4178	TeaPoly/cat_tensorflow Crf-based Asr Toolkit with TensorFlow implement	21	Experimental	ctc-asr-implementations	8	Python
4179	TeaPoly/warp-ctc-crf An extension of thu-spmi/CAT which contains a full-fledged implementation of...	21	Experimental	ctc-asr-implementations	12	Cuda
4180	upskyy/Automatic-Speech-Recognition-Models End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.	21	Experimental	end-to-end-asr-frameworks	10	Python
4181	Kimosabey/vox-agent-neural Neural Voice Agent core constructs for conversational AI.	21	Experimental	voice-agent-applications	2	TypeScript
4182	yinruiqing/tiny-transducer Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices	21	Experimental	end-to-end-asr-frameworks	26	Python
4183	sunprinceS/MetaASR-CrossAccent Meta-Learning for End-to-End ASR	21	Experimental	end-to-end-asr-frameworks	10	Jupyter Notebook
4184	duc11021102/pyspeech Python Text To Speech Using gTTS @duc11021102	21	Experimental	lightweight-tts-libraries	2	Python
4185	ActiveIntelligentSystemsLab/japanese_tts_ros 日本語テキストを音声として出力するROS node	21	Experimental	lightweight-tts-libraries	12	Python
4186	derpeloper/ostinato giving a voice to the voiceless.	21	Experimental	discord-tts-bots	2	JavaScript
4187	lgpearson1771/openwakeword-trainer Train custom wake word models with openWakeWord. A granular 13-step pipeline...	21	Experimental	wake-word-detection	2	Python
4188	vijethph/violet-speech Violet is a Speech Assistant made using Python	21	Experimental	general-purpose-voice-assistants	2	Python
4189	led-mirage/AivoClip A.I.VOICEでクリップボードに貼り付けられたテキストを読み上げるアプリです。	21	Experimental	clipboard-text-to-speech	2	Python
4190	2tocom/F5-TTS-Vietnamese-Google-Colab Vietnamese TTS, Chuyển văn bản thành giọng nói tiếng Việt, text to speech...	21	Experimental	tts-model-finetuning	6	Python
4191	AssemblyAI/assemblyai-ruby-sdk The AssemblyAI Ruby SDK provides an easy-to-use interface for interacting...	21	Experimental	llm-sdk-packages	10	Ruby
4192	emmanuelinfante/SubtitlesEveryone Transcribe Like a Pro, Without Paying a Penny!	21	Experimental	whisper-transcription-apps	10	Jupyter Notebook
4193	junhoeKu/Jeju-Translation 제주어, 표준어 양방향 음성 번역 모델 생성 프로젝트 (알고리즘 \| 비정형 \| NLP \| 딥러닝 \| 기계번역 \| 음성인식 \| 멀티모달)	21	Experimental	text-translation-tools	11	Jupyter Notebook
4194	BitsofJeremy/WeirDing Audiobook narration engine powered by Qwen3-TTS. Upload documents, pick a...	21	Experimental	qwen3-tts-applications	—	Python
4195	Vatis-Tech/asr-client-js JavaScript SDK client for Vatis Tech ASR services.	21	Experimental	web-speech-api-libraries	4	JavaScript
4196	AssemblyAI/assemblyai-semantic-kernel Transcribe audio using AssemblyAI with Semantic Kernel plugins.	21	Experimental	semantic-kernel-tools	10	C#
4197	marcogenna/epub2audiobook Convert EPUB books to M4B audiobooks with AI-powered TTS (Edge TTS, Kokoro, Piper)	21	Experimental	ebook-to-audiobook-conversion	—	Python
4198	LucaAngioloni/Micchinetta HCI project: an application interface using both face and speech recognition...	21	Experimental	assistive-vision-ai	11	Python
4199	JoshuaCarroll/RepeaterProgrammingUtility N5JLC Repeater Programming Utility	21	Experimental	dotnet-tts-libraries	10	C#
4200	Listening-Lab/Annotator Listening Lab audio analysis and annotation tool. Develop audio...	21	Experimental	data-annotation-tools	11	JavaScript

« Prev 1 2 3 … 40 41 42 43 44 … 68 69 70 Next »