All Voice AI Tools
6,981 tools ranked by quality score · Page 39 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 3801 |
sangramsingnk/Audio-Feature-Extraction
In sound processing, the mel-frequency cepstrum (MFC) is a representation of... |
|
Experimental |
| 3802 |
lucky-bai/wasm-speech-streaming
Offline streaming speech-to-text in the browser |
|
Experimental |
| 3803 |
mhagglun/Speech-Recognition
Tensorflow implementation for Speech Recognition using Convolutional Neural... |
|
Experimental |
| 3804 |
bacharyehya/outloud
Beautiful TUI for text-to-speech. Gemini, OpenAI, or local. One command. |
|
Experimental |
| 3805 |
Pierillo/hallucination-check
Pipeline automatizado que cura, redacta y envía un newsletter diario de IA... |
|
Experimental |
| 3806 |
verrannt/snn_speechrec
Convolutional Spiking Neural Network to recognize speech utterances using... |
|
Experimental |
| 3807 |
rwightman/pytorch-commands
Some PyTorch code for the Kaggle Speech Recognition Challenge |
|
Experimental |
| 3808 |
lukasjakobi/ha-sync-announcement
Broadcast synchronized TTS announcements across multiple media players in... |
|
Experimental |
| 3809 |
aleksandarbos/Sound-Recognition-Convo2D-Neural-Network
Tools: Python (OpenCV 3.0 + Keras lib-Convolution 2D Neural Network). Desc:... |
|
Experimental |
| 3810 |
amityalwar/snoofus
Generative AI based speech analyzer |
|
Experimental |
| 3811 |
chameleon82/avatar-ai
OpenAI Avatar for real-time api |
|
Experimental |
| 3812 |
edwindoremi/Asterisk
🎮 Streamline esports tournaments with Asterisk, a real-time management... |
|
Experimental |
| 3813 |
shr1324/orpheus-tts-docker
🔊 Deploy Orpheus TTS with ease using Docker, featuring GPU management,... |
|
Experimental |
| 3814 |
Ultan-Kearns/GestureBasedUIProject
Gesture Based UI Project 4th Year |
|
Experimental |
| 3815 |
shitian-ni/speech-recognition-transfer-learning
Speech command recognition DenseNet transfer learning from UrbanSound8k in... |
|
Experimental |
| 3816 |
OzoneAnim/employee-api
🏢 Manage employee data efficiently with this RESTful API featuring full CRUD... |
|
Experimental |
| 3817 |
winccoa/winccoa-ae-ts-text2speech
WinCC OA Text-To-Speech Library |
|
Experimental |
| 3818 |
easonlai/ms-speech-services-demo-web-tts
Microsoft Azure Speech Services (Text-to-Speech, TTS) Web Demo with Node.JS... |
|
Experimental |
| 3819 |
QinHsiu/BiCLTTS
Bi-level Cntrastive Learning for Text-to-Speech |
|
Experimental |
| 3820 |
Dragon745/urdu-roman-dictionary
A growing open-source Urdu → Roman Urdu dictionary and lexicon for... |
|
Experimental |
| 3821 |
JeffWang0325/Microsoft-Azure-Cognitive-Services
🖍️ This project combines multiple operations in Microsoft Azure Cognitive... |
|
Experimental |
| 3822 |
huss2342/x_news_station
turn x/twitter feed into audio |
|
Experimental |
| 3823 |
furushchev/ros_gtts
Text-to-Speech service for ROS using python gTTS library for backend. |
|
Experimental |
| 3824 |
Moonbase59/jingle
Quickly generate a Jingle using Text-to-Speech |
|
Experimental |
| 3825 |
ihsacm/ComfyUI-KittenTTS
Integrate KittenTTS into ComfyUI to enable fast, lightweight text-to-speech... |
|
Experimental |
| 3826 |
shotafujie/asrivia
PiP表示でローカル文字起こし結果を表示できます. |
|
Experimental |
| 3827 |
benfordslaw/vowel-sound-generator
Vowel-only speech synthesis of input text using tone.js with formants based... |
|
Experimental |
| 3828 |
mochi-neko/VOICEVOX-API-unity
Binds VOICEVOX text to speech API to pure C# on Unity. |
|
Experimental |
| 3829 |
edisonneza/image-to-text
PWA - Convert Image to Text - A small multi language project built to use... |
|
Experimental |
| 3830 |
SARIT42/image-Annotation-Speech
Explaining the contents of an image in the form of speech through caption... |
|
Experimental |
| 3831 |
kayrugold/andyai
A self-evolving, tri-brain autonomous AI agent featuring local subconscious... |
|
Experimental |
| 3832 |
29sayantanc/Echo
Echo is a privacy-first, offline AI journal and conversational assistant.... |
|
Experimental |
| 3833 |
ltphen/martha
Free text to speech synthesizer made with coqui-ai/TTS and flask |
|
Experimental |
| 3834 |
vadimkantorov/discordspeechtotext
Discord Speech-To-Text bot in Python using Google Cloud Speech-To-Text API |
|
Experimental |
| 3835 |
jibon57/nativescript-azure-cognitiveservices
Azure cognitive services implementation for NativeScript. |
|
Experimental |
| 3836 |
Zaid440/cosyvoice-docker
🎙️ Deploy a production-ready Text-to-Speech service with voice cloning and a... |
|
Experimental |
| 3837 |
Cyrostar/ITTS-TR
An end-to-end, highly optimized Text-to-Speech (TTS) framework based on... |
|
Experimental |
| 3838 |
Yacinewhatchandcode/VoiceCloning
🎙️ Real-Time TTS & Voice Cloning Pipeline — F5-TTS · PyTorch · Gradio · Voice Agent |
|
Experimental |
| 3839 |
smswg/callwg
语音呼叫系统-外呼系统,2026年真正可商用CALLWG语音呼叫系统,语音呼叫系统功能:机器人话术外呼系统|呼叫中心|VIP队列|来电记忆|ASR语音识别... |
|
Experimental |
| 3840 |
01-SayantanI/Assistant
This Python Voice Assistant with GUI uses Tkinter to enable users to... |
|
Experimental |
| 3841 |
AlisonGM03/Eva01
Build and interact with an AI that has its own mind, emotions, memory, and a... |
|
Experimental |
| 3842 |
farjadilyas/MUKALMA
MUKALMA is a human-like chatbot which incorporates correct, relevant... |
|
Experimental |
| 3843 |
HsiangNianian/funasr-api
FunASR API is a FastAPI-based inference gateway that wraps multiple FunASR... |
|
Experimental |
| 3844 |
beecave-homelab/parakeet_rocm
ROCm-optimized NVIDIA NeMo Parakeet ASR implementation with CLI, formatting,... |
|
Experimental |
| 3845 |
jm12138/iFLYTEK-MSC-Python-SDK
一个讯飞智能语音平台 MSC 的第三方 Python SDK,支持语音唤醒、语音识别、语音合成、语音评测等功能。A third-party Python... |
|
Experimental |
| 3846 |
bykemalh/S2ST
Speech to Speech Translation Python |
|
Experimental |
| 3847 |
kilogramme/nerdpudding
Provide live AI video commentary with text-to-speech for any video source,... |
|
Experimental |
| 3848 |
itsanuragkumarjha/Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access
An open-source project that uses cutting-edge NLP models and real-time web... |
|
Experimental |
| 3849 |
nmstoker/SimpleSpeechLoop
A very basic demonstration connecting speech recognition and text-to-speech |
|
Experimental |
| 3850 |
Tugaytalha/NarraPhon
NarraPhon: Advanced Text-to-Speech Conversion Pipeline NarraPhon is a... |
|
Experimental |
| 3851 |
TheM1N9/stella
Stella is an intelligent voice assistant built using Python. It leverages... |
|
Experimental |
| 3852 |
Neil-001/audio-to-subtitle-translate
Easily convert speech to timed SRT subtitles and translated captions (Colab-ready) |
|
Experimental |
| 3853 |
0x61space/pu-cit371-helicopter-commander
Control a helicopter in Grand Theft Auto: San Andreas using speech recognition |
|
Experimental |
| 3854 |
ivsergeev/voicer
Голосовой ввод, GigaAM v3 e2e, opencode-plugin, русский язык |
|
Experimental |
| 3855 |
Noor-khalid/Selena
🚀 Accelerate your .NET applications with Selena, a zero-dependency library... |
|
Experimental |
| 3856 |
ShunsukeHayashi/voicebox-tts
VOICEVOX音声生成キューイングシステム (Celery + Redis) |
|
Experimental |
| 3857 |
oasisnoehub/OsisnoeAISpeech
English Text to Speech AI web app: You can better practice your english... |
|
Experimental |
| 3858 |
glloydie/flowtts-byok
🔊 Streamline voice synthesis with FlowTTS BYOK, leveraging Tencent's FlowTTS... |
|
Experimental |
| 3859 |
ORI-Muchim/BERT-MB-iSTFT-VITS
High-quality Multilingual(Korean, Japanese, Chinese, English, French and... |
|
Experimental |
| 3860 |
Nomannazir/f5-tts-fastapi
Open-source FastAPI wrapper for F5-TTS. A powerful Text-to-Speech API with... |
|
Experimental |
| 3861 |
mk-knight23/37-tool-text-to-speech
Production-grade Text-to-Speech utility built with Vue 3 and Web Speech API.... |
|
Experimental |
| 3862 |
rajatgoyal715/Awaaz
🎙 An android project with some features like text to speech, speech to text... |
|
Experimental |
| 3863 |
Voine/VITS-MNN
TTS System VITS Android Ver, powered by alibaba-MNN engine. |
|
Experimental |
| 3864 |
AcTePuKc/Chatterbox-TTS-UI
Just an UI for Chatterbox, which uses about 1-2 GB RAM. Double click and... |
|
Experimental |
| 3865 |
cmirnow/Google-Cloud-TTS-Rails
Using the power of Google Cloud Text-to-Speech API and ruby here is a simple... |
|
Experimental |
| 3866 |
KelvinCampelo/open-aiudio-client
This Next.js application provides a user interface for interacting with... |
|
Experimental |
| 3867 |
zemags/golang-yandex-speech-kit
SDK for converting text to audio by Yandex premium voices |
|
Experimental |
| 3868 |
nttcslab-sp/torchain
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI) |
|
Experimental |
| 3869 |
Mliviu79/cartesia-go
Go SDK for the Cartesia AI API — TTS, STT, voice cloning, agents, WebSocket streaming |
|
Experimental |
| 3870 |
loglux/SpeakItAI
Convert text to speech using Microsoft Azure Neural Text-to-Speech (TTS) and... |
|
Experimental |
| 3871 |
turtlehacks/speechportal
(1st place at HopHacks) A dynamic webVR memory palace for speech training,... |
|
Experimental |
| 3872 |
richardr1126/KittenTTS-FastAPI
High-performance KittenTTS API server with a built-in web UI,... |
|
Experimental |
| 3873 |
yauhenipakala/Yandex.SpeechKit.Xamarin
Yandex SpeechKit Mobile SDK for Xamarin |
|
Experimental |
| 3874 |
neosapience/typecast-python
The official Python SDK for the Typecast API. |
|
Experimental |
| 3875 |
Artavazd2009/yandex-speechkit-php
Provide easy PHP access to Yandex SpeechKit API for audio transcription,... |
|
Experimental |
| 3876 |
Kourva/TextToSpeechBot
Text To Speech Telegram Bot with Brian voice. |
|
Experimental |
| 3877 |
minhsaco99/VoiceCore
Build voice apps fast. Unified API for speech recognition & synthesis with... |
|
Experimental |
| 3878 |
dannycrief/python-voice-assistant
Sarah Voice Assistant (SVA) is a Python voice assistant project on... |
|
Experimental |
| 3879 |
MarceloSalazarV/Multimodal_Med_Ai_with_Deployment
🩺 Enhance patient care with MediBot 2.0, an AI doctor assistant that... |
|
Experimental |
| 3880 |
phith0n/v2srt
v2srt 是一个基于人工智能的视频字幕生成工具,为任意视频生成高质量的字幕文件。 |
|
Experimental |
| 3881 |
echocatzh/GTCNN
Personalized AEC |
|
Experimental |
| 3882 |
tiefenauer/ip9
Code for my master thesis at FHNW |
|
Experimental |
| 3883 |
Gopi-Durgaprasad/Speech-To-Text
End-to-End Speech Recognition |
|
Experimental |
| 3884 |
laravieira/reddit-to-tiktok
This project is a Python rendering and publishing pipeline that takes Reddit... |
|
Experimental |
| 3885 |
twn39/edgetts-dart
A pure Dart implementation of the excellent edge-tts library. Access... |
|
Experimental |
| 3886 |
loneicewolf/AI-SNN
AI SNN - or Artificial Intelligence Stuttering Neural Network - a Project I... |
|
Experimental |
| 3887 |
collinsuen/Local-Whisper-STT-Windows11-ZH
Local GPU-Accelerated Chinese Speech-to-Text for Windows 11 (Whisper-based,... |
|
Experimental |
| 3888 |
p337r/Efes
Proof of concept demo for a tool that listens for keywords, and records... |
|
Experimental |
| 3889 |
Bangla-Language-Processing/Katha-Bangla-TTS
The first Bangla Text To Speech System for Bangladeshi Bangla (Katha) |
|
Experimental |
| 3890 |
ninoish/lwc-web-speech-api-input
Implements voice powered input for Lightning Web Component with Web Speech... |
|
Experimental |
| 3891 |
voxia-ai/voxia-open
Lightweight runtime for building real-time Voice AI applications |
|
Experimental |
| 3892 |
bloo-berries/Library-of-the-Blind
The world’s largest catalog of Braille, tactile, audio, and multimodal... |
|
Experimental |
| 3893 |
zolomohan/speech-recognition-in-javascript-starter
Starter Code for Speech Recognition in JavaScript tutorial. |
|
Experimental |
| 3894 |
Chrisisaac948/RealWonder
Generate real-time videos conditioned on physical actions from a single... |
|
Experimental |
| 3895 |
QXIP/RTPEngine-Speech2Text
Simple RTPEngine Speech-to-Text Recording Spooler |
|
Experimental |
| 3896 |
technicianted/msspeech-gbridge
Bridge service to enable using Google Cloud Speech client SDKs with... |
|
Experimental |
| 3897 |
Youhai020616/ai-video-pipeline
Generate AI short dramas and news videos from Python. Text → Images → Video... |
|
Experimental |
| 3898 |
kss2002/edge-TTS
AI Voice TTS Generator to edge-tts |
|
Experimental |
| 3899 |
theawless/Dict-O-nator
A dictation plugin for gedit (the GNOME text editor). |
|
Experimental |
| 3900 |
Axel-NCHO/ReddTok
Generate a TikTok video from a Reddit post |
|
Experimental |