All Voice AI Tools

6,981 tools ranked by quality score · Page 39 of 70

Showing 3801–3900 of 6,981

« Prev Next »

#	Tool	Score	Tier	Category	Stars	Language
3801	sangramsingnk/Audio-Feature-Extraction In sound processing, the mel-frequency cepstrum (MFC) is a representation of...	22	Experimental	keyword-speech-recognition	9	Jupyter Notebook
3802	lucky-bai/wasm-speech-streaming Offline streaming speech-to-text in the browser	22	Experimental	web-speech-api-libraries	25	JavaScript
3803	mhagglun/Speech-Recognition Tensorflow implementation for Speech Recognition using Convolutional Neural...	22	Experimental	keyword-speech-recognition	13	Jupyter Notebook
3804	bacharyehya/outloud Beautiful TUI for text-to-speech. Gemini, OpenAI, or local. One command.	22	Experimental	gradio-tts-webuis	—	Python
3805	Pierillo/hallucination-check Pipeline automatizado que cura, redacta y envía un newsletter diario de IA...	22	Experimental	ai-video-generation	—	Python
3806	verrannt/snn_speechrec Convolutional Spiking Neural Network to recognize speech utterances using...	22	Experimental	keyword-speech-recognition	9	Python
3807	rwightman/pytorch-commands Some PyTorch code for the Kaggle Speech Recognition Challenge	22	Experimental	keyword-speech-recognition	12	Python
3808	lukasjakobi/ha-sync-announcement Broadcast synchronized TTS announcements across multiple media players in...	22	Experimental	home-assistant-tts	1	Python
3809	aleksandarbos/Sound-Recognition-Convo2D-Neural-Network Tools: Python (OpenCV 3.0 + Keras lib-Convolution 2D Neural Network). Desc:...	22	Experimental	keyword-speech-recognition	13	Python
3810	amityalwar/snoofus Generative AI based speech analyzer	22	Experimental	ai-avatar-platforms	7	JavaScript
3811	chameleon82/avatar-ai OpenAI Avatar for real-time api	22	Experimental	ai-avatar-platforms	4	JavaScript
3812	edwindoremi/Asterisk 🎮 Streamline esports tournaments with Asterisk, a real-time management...	22	Experimental	ai-avatar-platforms	—	HTML
3813	shr1324/orpheus-tts-docker 🔊 Deploy Orpheus TTS with ease using Docker, featuring GPU management,...	22	Experimental	self-hosted-tts-servers	—	Python
3814	Ultan-Kearns/GestureBasedUIProject Gesture Based UI Project 4th Year	22	Experimental	assistive-vision-ai	1	C
3815	shitian-ni/speech-recognition-transfer-learning Speech command recognition DenseNet transfer learning from UrbanSound8k in...	22	Experimental	keyword-speech-recognition	17	Python
3816	OzoneAnim/employee-api 🏢 Manage employee data efficiently with this RESTful API featuring full CRUD...	22	Experimental	audio-transcription-apps	—	JavaScript
3817	winccoa/winccoa-ae-ts-text2speech WinCC OA Text-To-Speech Library	22	Experimental	web-speech-api-tts	—	—
3818	easonlai/ms-speech-services-demo-web-tts Microsoft Azure Speech Services (Text-to-Speech, TTS) Web Demo with Node.JS...	22	Experimental	dotnet-tts-libraries	17	HTML
3819	QinHsiu/BiCLTTS Bi-level Cntrastive Learning for Text-to-Speech	22	Experimental	fastspeech-tts-models	1	Python
3820	Dragon745/urdu-roman-dictionary A growing open-source Urdu → Roman Urdu dictionary and lexicon for...	22	Experimental	multilingual-speech-datasets	—	—
3821	JeffWang0325/Microsoft-Azure-Cognitive-Services 🖍️ This project combines multiple operations in Microsoft Azure Cognitive...	22	Experimental	dotnet-tts-libraries	11	C#
3822	huss2342/x_news_station turn x/twitter feed into audio	22	Experimental	news-audio-bulletins	—	Python
3823	furushchev/ros_gtts Text-to-Speech service for ROS using python gTTS library for backend.	22	Experimental	lightweight-tts-libraries	1	Python
3824	Moonbase59/jingle Quickly generate a Jingle using Text-to-Speech	22	Experimental	web-speech-api-tts	9	Shell
3825	ihsacm/ComfyUI-KittenTTS Integrate KittenTTS into ComfyUI to enable fast, lightweight text-to-speech...	22	Experimental	comfyui-tts-nodes	—	Python
3826	shotafujie/asrivia PiP表示でローカル文字起こし結果を表示できます．	22	Experimental	live-caption-generation	2	Python
3827	benfordslaw/vowel-sound-generator Vowel-only speech synthesis of input text using tone.js with formants based...	22	Experimental	web-speech-api-tts	7	JavaScript
3828	mochi-neko/VOICEVOX-API-unity Binds VOICEVOX text to speech API to pure C# on Unity.	22	Experimental	dotnet-tts-libraries	7	C#
3829	edisonneza/image-to-text PWA - Convert Image to Text - A small multi language project built to use...	22	Experimental	ai-image-generation-platforms	15	TypeScript
3830	SARIT42/image-Annotation-Speech Explaining the contents of an image in the form of speech through caption...	22	Experimental	image-to-speech-synthesis	1	Jupyter Notebook
3831	kayrugold/andyai A self-evolving, tri-brain autonomous AI agent featuring local subconscious...	22	Experimental	interactive-ai-avatars	—	Python
3832	29sayantanc/Echo Echo is a privacy-first, offline AI journal and conversational assistant....	22	Experimental	personal-knowledge-management	15	Python
3833	ltphen/martha Free text to speech synthesizer made with coqui-ai/TTS and flask	22	Experimental	coqui-tts-applications	5	HTML
3834	vadimkantorov/discordspeechtotext Discord Speech-To-Text bot in Python using Google Cloud Speech-To-Text API	22	Experimental	discord-tts-bots	22	Python
3835	jibon57/nativescript-azure-cognitiveservices Azure cognitive services implementation for NativeScript.	22	Experimental	dotnet-tts-libraries	1	TypeScript
3836	Zaid440/cosyvoice-docker 🎙️ Deploy a production-ready Text-to-Speech service with voice cloning and a...	22	Experimental	coqui-tts-applications	—	Python
3837	Cyrostar/ITTS-TR An end-to-end, highly optimized Text-to-Speech (TTS) framework based on...	22	Experimental	coqui-tts-applications	—	Python
3838	Yacinewhatchandcode/VoiceCloning 🎙️ Real-Time TTS & Voice Cloning Pipeline — F5-TTS · PyTorch · Gradio · Voice Agent	22	Experimental	voice-cloning-tools	—	Python
3839	smswg/callwg 语音呼叫系统-外呼系统,2026年真正可商用CALLWG语音呼叫系统,语音呼叫系统功能:机器人话术外呼系统\|呼叫中心\|VIP队列\|来电记忆\|ASR语音识别...	22	Experimental	voice-agent-applications	20	Java
3840	01-SayantanI/Assistant This Python Voice Assistant with GUI uses Tkinter to enable users to...	22	Experimental	general-purpose-voice-assistants	18	Python
3841	AlisonGM03/Eva01 Build and interact with an AI that has its own mind, emotions, memory, and a...	22	Experimental	interactive-ai-avatars	—	Python
3842	farjadilyas/MUKALMA MUKALMA is a human-like chatbot which incorporates correct, relevant...	22	Experimental	voice-chatbot-applications	1	Jupyter Notebook
3843	HsiangNianian/funasr-api FunASR API is a FastAPI-based inference gateway that wraps multiple FunASR...	22	Experimental	funasr-speech-recognition	3	Python
3844	beecave-homelab/parakeet_rocm ROCm-optimized NVIDIA NeMo Parakeet ASR implementation with CLI, formatting,...	22	Experimental	parakeet-asr-implementations	3	Python
3845	jm12138/iFLYTEK-MSC-Python-SDK 一个讯飞智能语音平台 MSC 的第三方 Python SDK，支持语音唤醒、语音识别、语音合成、语音评测等功能。A third-party Python...	22	Experimental	voice-ai-sdks	23	Python
3846	bykemalh/S2ST Speech to Speech Translation Python	22	Experimental	speech-recognition-apis	1	Python
3847	kilogramme/nerdpudding Provide live AI video commentary with text-to-speech for any video source,...	22	Experimental	ai-avatar-platforms	—	Python
3848	itsanuragkumarjha/Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access An open-source project that uses cutting-edge NLP models and real-time web...	22	Experimental	voice-chatbot-applications	21	Python
3849	nmstoker/SimpleSpeechLoop A very basic demonstration connecting speech recognition and text-to-speech	22	Experimental	automatic-speech-recognition	20	Python
3850	Tugaytalha/NarraPhon NarraPhon: Advanced Text-to-Speech Conversion Pipeline NarraPhon is a...	22	Experimental	pdf-to-audio-conversion	9	Python
3851	TheM1N9/stella Stella is an intelligent voice assistant built using Python. It leverages...	22	Experimental	streamlit-chatbot-apps	9	Python
3852	Neil-001/audio-to-subtitle-translate Easily convert speech to timed SRT subtitles and translated captions (Colab-ready)	22	Experimental	whisper-subtitle-generation	3	Jupyter Notebook
3853	0x61space/pu-cit371-helicopter-commander Control a helicopter in Grand Theft Auto: San Andreas using speech recognition	22	Experimental	dotnet-tts-libraries	1	C++
3854	ivsergeev/voicer Голосовой ввод, GigaAM v3 e2e, opencode-plugin, русский язык	22	Experimental	dotnet-tts-libraries	—	C#
3855	Noor-khalid/Selena 🚀 Accelerate your .NET applications with Selena, a zero-dependency library...	22	Experimental	dotnet-tts-libraries	—	C#
3856	ShunsukeHayashi/voicebox-tts VOICEVOX音声生成キューイングシステム (Celery + Redis)	22	Experimental	self-hosted-tts-servers	—	Python
3857	oasisnoehub/OsisnoeAISpeech English Text to Speech AI web app: You can better practice your english...	22	Experimental	openai-tts-applications	1	Python
3858	glloydie/flowtts-byok 🔊 Streamline voice synthesis with FlowTTS BYOK, leveraging Tencent's FlowTTS...	22	Experimental	self-hosted-tts-servers	—	Python
3859	ORI-Muchim/BERT-MB-iSTFT-VITS High-quality Multilingual(Korean, Japanese, Chinese, English, French and...	22	Experimental	vits-tts-implementations	7	Python
3860	Nomannazir/f5-tts-fastapi Open-source FastAPI wrapper for F5-TTS. A powerful Text-to-Speech API with...	22	Experimental	self-hosted-tts-servers	—	Python
3861	mk-knight23/37-tool-text-to-speech Production-grade Text-to-Speech utility built with Vue 3 and Web Speech API....	22	Experimental	web-speech-api-tts	—	Vue
3862	rajatgoyal715/Awaaz 🎙 An android project with some features like text to speech, speech to text...	22	Experimental	android-speech-apps	1	Java
3863	Voine/VITS-MNN TTS System VITS Android Ver, powered by alibaba-MNN engine.	22	Experimental	vits-tts-implementations	12	Kotlin
3864	AcTePuKc/Chatterbox-TTS-UI Just an UI for Chatterbox, which uses about 1-2 GB RAM. Double click and...	22	Experimental	self-hosted-tts-servers	20	Python
3865	cmirnow/Google-Cloud-TTS-Rails Using the power of Google Cloud Text-to-Speech API and ruby here is a simple...	22	Experimental	system-tts-wrappers	9	Ruby
3866	KelvinCampelo/open-aiudio-client This Next.js application provides a user interface for interacting with...	22	Experimental	openai-tts-applications	1	TypeScript
3867	zemags/golang-yandex-speech-kit SDK for converting text to audio by Yandex premium voices	22	Experimental	go-tts-libraries	7	Go
3868	nttcslab-sp/torchain WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)	22	Experimental	end-to-end-asr-frameworks	20	Python
3869	Mliviu79/cartesia-go Go SDK for the Cartesia AI API — TTS, STT, voice cloning, agents, WebSocket streaming	22	Experimental	go-tts-libraries	—	Go
3870	loglux/SpeakItAI Convert text to speech using Microsoft Azure Neural Text-to-Speech (TTS) and...	22	Experimental	openai-tts-applications	46	Python
3871	turtlehacks/speechportal (1st place at HopHacks) A dynamic webVR memory palace for speech training,...	22	Experimental	natural-language-task-scheduling	16	JavaScript
3872	richardr1126/KittenTTS-FastAPI High-performance KittenTTS API server with a built-in web UI,...	22	Experimental	self-hosted-tts-servers	3	Python
3873	yauhenipakala/Yandex.SpeechKit.Xamarin Yandex SpeechKit Mobile SDK for Xamarin	22	Experimental	yandex-speechkit-tools	1	C#
3874	neosapience/typecast-python The official Python SDK for the Typecast API.	22	Experimental	system-tts-wrappers	3	Python
3875	Artavazd2009/yandex-speechkit-php Provide easy PHP access to Yandex SpeechKit API for audio transcription,...	22	Experimental	yandex-speechkit-tools	—	PHP
3876	Kourva/TextToSpeechBot Text To Speech Telegram Bot with Brian voice.	22	Experimental	telegram-voice-transcription	17	Python
3877	minhsaco99/VoiceCore Build voice apps fast. Unified API for speech recognition & synthesis with...	22	Experimental	self-hosted-tts-servers	3	Python
3878	dannycrief/python-voice-assistant Sarah Voice Assistant (SVA) is a Python voice assistant project on...	22	Experimental	general-purpose-voice-assistants	1	Python
3879	MarceloSalazarV/Multimodal_Med_Ai_with_Deployment 🩺 Enhance patient care with MediBot 2.0, an AI doctor assistant that...	22	Experimental	multimodal-medical-assistants	—	Python
3880	phith0n/v2srt v2srt 是一个基于人工智能的视频字幕生成工具，为任意视频生成高质量的字幕文件。	22	Experimental	audio-transcription-tools	48	Python
3881	echocatzh/GTCNN Personalized AEC	22	Experimental	audio-noise-reduction	19	HTML
3882	tiefenauer/ip9 Code for my master thesis at FHNW	22	Experimental	speech-ai-coursework	7	Python
3883	Gopi-Durgaprasad/Speech-To-Text End-to-End Speech Recognition	22	Experimental	speech-ai-coursework	12	Jupyter Notebook
3884	laravieira/reddit-to-tiktok This project is a Python rendering and publishing pipeline that takes Reddit...	22	Experimental	ai-video-generation	—	Python
3885	twn39/edgetts-dart A pure Dart implementation of the excellent edge-tts library. Access...	22	Experimental	edge-tts-implementations	3	Dart
3886	loneicewolf/AI-SNN AI SNN - or Artificial Intelligence Stuttering Neural Network - a Project I...	22	Experimental	multimodal-medical-assistants	7	—
3887	collinsuen/Local-Whisper-STT-Windows11-ZH Local GPU-Accelerated Chinese Speech-to-Text for Windows 11 (Whisper-based,...	22	Experimental	speech-to-text-converters	3	—
3888	p337r/Efes Proof of concept demo for a tool that listens for keywords, and records...	22	Experimental	web-speech-api-libraries	11	C#
3889	Bangla-Language-Processing/Katha-Bangla-TTS The first Bangla Text To Speech System for Bangladeshi Bangla (Katha)	22	Experimental	tts-model-finetuning	19	—
3890	ninoish/lwc-web-speech-api-input Implements voice powered input for Lightning Web Component with Web Speech...	22	Experimental	web-speech-api-libraries	9	Apex
3891	voxia-ai/voxia-open Lightweight runtime for building real-time Voice AI applications	22	Experimental	self-hosted-tts-servers	—	Python
3892	bloo-berries/Library-of-the-Blind The world’s largest catalog of Braille, tactile, audio, and multimodal...	22	Experimental	assistive-vision-ai	3	—
3893	zolomohan/speech-recognition-in-javascript-starter Starter Code for Speech Recognition in JavaScript tutorial.	22	Experimental	web-speech-api-libraries	6	JavaScript
3894	Chrisisaac948/RealWonder Generate real-time videos conditioned on physical actions from a single...	22	Experimental	ai-video-generation	—	Python
3895	QXIP/RTPEngine-Speech2Text Simple RTPEngine Speech-to-Text Recording Spooler	22	Experimental	web-speech-api-libraries	17	JavaScript
3896	technicianted/msspeech-gbridge Bridge service to enable using Google Cloud Speech client SDKs with...	22	Experimental	dotnet-tts-libraries	1	C++
3897	Youhai020616/ai-video-pipeline Generate AI short dramas and news videos from Python. Text → Images → Video...	22	Experimental	ai-video-generation	—	Python
3898	kss2002/edge-TTS AI Voice TTS Generator to edge-tts	22	Experimental	edge-tts-implementations	3	PowerShell
3899	theawless/Dict-O-nator A dictation plugin for gedit (the GNOME text editor).	22	Experimental	web-speech-api-libraries	9	Python
3900	Axel-NCHO/ReddTok Generate a TikTok video from a Reddit post	22	Experimental	ai-video-generation	9	C#

« Prev 1 2 3 … 37 38 39 40 41 … 68 69 70 Next »