All Voice AI Tools
6,981 tools ranked by quality score · Page 52 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 5101 |
egorsmkv/w2v2-bert-aligner
Aligner for wav2vec2-bert models |
|
Experimental |
| 5102 |
Ronnie-Leon76/Swahili-ASR
This repository contains the code for fine-tuning the XLS-R Wav2Vec2 model... |
|
Experimental |
| 5103 |
ranchlai/wav2vec-2.0
Wav2vec2 English speech recognition in PaddlePaddle |
|
Experimental |
| 5104 |
davidsuragan/tulga-cli
TulgaCLI is a tool that allows you to chat and voice chat with virtual... |
|
Experimental |
| 5105 |
kongju7/my_project6
Personal project 6: Speech Recognition Deep Learning Chatbot -... |
|
Experimental |
| 5106 |
yujiliu/oresta
Oresta is the first voice assistant in the Ukrainian language. |
|
Experimental |
| 5107 |
dvamsidhar2002/Project-VIVA-Personal-Desktop-and-Voice-Assistant
This is a personal desktop assistant that will do a few tasks for you. It is... |
|
Experimental |
| 5108 |
TakumiSenaha/Nreal_IoT
This project aims to visualize the sensor information of the surroundings... |
|
Experimental |
| 5109 |
priyanshpsalian/VISION-THE-BLIND
An all-in-one solution for the safety and security of blind people. Features covered in... |
|
Experimental |
| 5110 |
andrikAV18/Chat_app
💬 Build real-time chat experiences with a modern app that supports user... |
|
Experimental |
| 5111 |
TexasInstrumentsDIY/SpiceRack
Voice controlled turntable using the beaglebone black wireless. |
|
Experimental |
| 5112 |
Simone-Convertini/Speech-Summarization-Demo
A web API written in Go and Gin, capable of performing speech summarization... |
|
Experimental |
| 5113 |
michsethowusu/kasanoma
Offline-first TTS models for African languages |
|
Experimental |
| 5114 |
sudonitin/MediumScraper
Scrapes Medium articles and provides audio versions (📑 to 🔊) using Django |
|
Experimental |
| 5115 |
BiasedToad1/AudiobookMaker
A tool utilizing piper-tts to convert books into audiobooks. |
|
Experimental |
| 5116 |
vantu5z/PyBookReaderTTS
Gtk book reader that uses TTS synthesizers |
|
Experimental |
| 5117 |
sebheron/TikTok-Reddit-Text-To-Speech
Reddit TTS generator designed for TikTok |
|
Experimental |
| 5118 |
chenying99/ttsv2
Fast, lightweight TTS (ZH, EN) |
|
Experimental |
| 5119 |
jmpnop/vdub
vdub — video dubbing and subtitle engine. Rust + MLX. Free local ASR/TTS. |
|
Experimental |
| 5120 |
sanvibyfish/OwlWhisper
Local voice input for macOS — hold a hotkey, speak, release, text appears at... |
|
Experimental |
| 5121 |
gztomas/utter
A Text-to-Speech CLI using ElevenLabs, designed for humans and AI agents. |
|
Experimental |
| 5122 |
jsc2017605097/chatgpt-audio-downloader
A lightweight Chrome/Edge extension to instantly catch and download the... |
|
Experimental |
| 5123 |
suzumushi0/SoundObject_source
SoundObject source code distribution. |
|
Experimental |
| 5124 |
brailcom/singing-computer
Computer singing synthesis |
|
Experimental |
| 5125 |
kolonist/edgetts
Use Microsoft Edge's free online text-to-speech service from Go |
|
Experimental |
| 5126 |
criadacasa/podcastfy-saas
SaaS platform for generating AI podcasts from multimodal content - Built... |
|
Experimental |
| 5127 |
mostafabahaa25/mediguide_MVP
AI-powered accessibility app that helps blind and low-vision users manage... |
|
Experimental |
| 5128 |
yuis-ice/text-to-speech
🎤 VoiceFlow - Modern text-to-speech web application with real-time word... |
|
Experimental |
| 5129 |
awesome-german/pronunciation
Guides, phonetic tools, and speaking exercises to achieve clear and natural... |
|
Experimental |
| 5130 |
nikita-popov/tts-api
Kokoro based TTS API |
|
Experimental |
| 5131 |
MarvinAmine/UDEMY_AWS_PREP_EXAM_COPILOT
A Chrome extension to interact with your Udemy AWS certification prep exams.... |
|
Experimental |
| 5132 |
ZacDair/SER_Platform_AICS
This repository contains the code to create and conduct emotion recognition... |
|
Experimental |
| 5133 |
vinsis/speech-commands-recognition
Single word speech recognition using PyTorch |
|
Experimental |
| 5134 |
Tinker-Twins/NLP_Using_Python
This repository hosts the code snippets used for a small NLP project using... |
|
Experimental |
| 5135 |
sayak119/Express
Express Yourself. |
|
Experimental |
| 5136 |
praneethpj/Unity-Android-Utilities
Open Source Unity-Android Platform Voice Text API and Text To Voice API. |
|
Experimental |
| 5137 |
sergix44/oddcast-tts-php
A PHP interface to the online Oddcast demo API. |
|
Experimental |
| 5138 |
SenalDolage/object-detection-TFJS-ReactNative
A mobile application that identifies nearby objects and gives a voice output... |
|
Experimental |
| 5139 |
Gyvastis/google-speech-tts
A wrapper for Google Translate to generate audio from text. |
|
Experimental |
| 5140 |
HQQHQ/FinetuneSpeechT5-Spanish
This repository hosts the code and resources for fine-tuning a SpeechT5... |
|
Experimental |
| 5141 |
xignoe/videoTranslatorExtenstion
Real-time video translation Chrome extension that automatically generates... |
|
Experimental |
| 5142 |
XxAZVDxX/LLM-Live2D-VRM-AI-Girlfriend-iOS
Let’s start chatting with your Live2D or VRM girlfriend in iOS (Support... |
|
Experimental |
| 5143 |
NormVg/AutoCaptionGenAI
A Python project that extracts audio from video files, transcribes the... |
|
Experimental |
| 5144 |
gathrean/Nebula
Neural Network in Python trained for multi-musical instruments recognition. |
|
Experimental |
| 5145 |
Sh1nr1/mai-ai-assistant-self-hosted
Mai is an emotionally intelligent, voice-enabled AI assistant built with... |
|
Experimental |
| 5146 |
smivv/python-vosk-trial
Vosk Speech Recognition Trial |
|
Experimental |
| 5147 |
vpakarinen/mmaudio-webui
WebUI for MMAudio Video-to-Audio and Text-to-Audio. |
|
Experimental |
| 5148 |
NoNamePro0/Speech
🎙 Yet another Python script that speaks your text |
|
Experimental |
| 5149 |
beltoforion/Synthetischer-Wetterbericht
A Python script for the automated creation of spoken... |
|
Experimental |
| 5150 |
pkubowicz/vocab-tts
Learning vocabulary with text-to-speech and Anki |
|
Experimental |
| 5151 |
Orca0917/Spectrogram-VQ
Unofficial implementation of Spectrogram VQ from DCTTS paper - Vector... |
|
Experimental |
| 5152 |
jianchang512/speech2text-df
API and web UI for converting Eastern-language audio and video to subtitles, based on the Dolphin model |
|
Experimental |
| 5153 |
AbdulGani11/Vocably
Text-to-speech web application built with React, FastAPI, JWT... |
|
Experimental |
| 5154 |
manasmodak/SpeechRecognition
WPF app demonstrating text-to-speech and speech recognition |
|
Experimental |
| 5155 |
FeuZen/Zonos-long-text-to-speech
Takes an input text and transcribes it using zonos-v0.1-hybrid |
|
Experimental |
| 5156 |
atmehedi/Speech-to-text-in-Assamese
Task-oriented dialog system in a native language (Assamese) |
|
Experimental |
| 5157 |
pselvana/VoiceCrafter
Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild |
|
Experimental |
| 5158 |
arda-guler/CodexBabil
Codex Babil - Library of Babel expanded with random writing systems. |
|
Experimental |
| 5159 |
elemarmar/joke-teller
🤖💬 Joke Teller gets random jokes from a third-party API and converts them to... |
|
Experimental |
| 5160 |
beerberidie/Echo
Voice-controlled AI assistant with real-time transcription, natural language... |
|
Experimental |
| 5161 |
carmen-martin/Deep-Keyword-Spotting
A small-footprint implementation of keyword spotting with different architectures. |
|
Experimental |
| 5162 |
anshshah23/nlp-mini-project
This project incorporates a rule-based engine for recognising Gujarati using... |
|
Experimental |
| 5163 |
Prajithp/p5-Google-Cloud-Speech
Google Cloud Speech Client Library for Perl |
|
Experimental |
| 5164 |
0xstackforge/voice-agents-demo
AI-powered outbound calling chatbot built with Twilio, FastAPI, and Pipecat,... |
|
Experimental |
| 5165 |
motazsaad/jsc-news-broadcast
JSC news broadcast (speech corpus) |
|
Experimental |
| 5166 |
AaravK25/NetraSetuV2
For The Visually Impaired. |
|
Experimental |
| 5167 |
FatStinkyPanda/talk2me
A fully offline, self-contained voice interaction system featuring... |
|
Experimental |
| 5168 |
moe-mizrak/laravel-google-text-to-speech
Laravel package for integrating Gemini Text-to-Speech API and Google Cloud... |
|
Experimental |
| 5169 |
jaketae/conformer
PyTorch implementation of Conformer: Convolution-augmented Transformer for... |
|
Experimental |
| 5170 |
iam-smjamilsagar/Speech-Assistant
Today we will learn how to make a speech assistant in Python. |
|
Experimental |
| 5171 |
WelkinYang/Tacotron2-pytorch
Tacotron2 implemented in PyTorch |
|
Experimental |
| 5172 |
dnyanshwalwadkar/SIMHA-Personal-Assistant-using-Artificial-intelligence
The rise of automation, along with increased computational power, novel... |
|
Experimental |
| 5173 |
usubar-eats/voice-button-app
Voice Button (声ボタン) - Text-to-speech app for Japanese that speaks whatever you type |
|
Experimental |
| 5174 |
AndresRJ18/Study-Vault-AWS
Converts text study notes into audio podcasts automatically using AWS... |
|
Experimental |
| 5175 |
contro-projects/speechpad
A simple, lightweight web app that converts your voice into text in... |
|
Experimental |
| 5176 |
QuasarRyan/mlx-audio-bridge
A local REST service built on mlx-audio that provides an OpenAI-compatible bridge layer for TTS / STT audio APIs. |
|
Experimental |
| 5177 |
khizarali07/VoiceForge-AI-Frontend
A complete synthetic media pipeline for high-fidelity TTS and talking-head... |
|
Experimental |
| 5178 |
matin91/Kasko
Kasko is a Talking To-do List app, which allows the user to set up Reminders... |
|
Experimental |
| 5179 |
adarshsingh6622-source/advanced_voice_assistant
An advanced AI-powered voice assistant built using Python, NLP, and speech... |
|
Experimental |
| 5180 |
dtrovato997/SpeechAnalysis
A sample application for on-device offline mobile voice inference using deep... |
|
Experimental |
| 5181 |
OnesAndZer0s/node-dectalk
Node.js module that provides bindings for the DecTalk Text-To-Speech library |
|
Experimental |
| 5182 |
cr2007/cambai-python
Python SDK for the CambAI API |
|
Experimental |
| 5183 |
stillcuriouscat/votype
Global voice typing for Linux — offline ASR, hotkey-triggered, works in any app |
|
Experimental |
| 5184 |
AKAPhilipD/CMTNET_for_SER
CMT-Net: A Collaborative Mamba-Transformer Network with Spatial-Temporal... |
|
Experimental |
| 5185 |
harikanaidu/NLP-health-assistant
An NLP-driven health assistant bot that interacts, asks a series of personal... |
|
Experimental |
| 5186 |
marklubin/kairix
Voice-first AI agent with persistent memory, background reflection, and... |
|
Experimental |
| 5187 |
sglkc/live-translate
🎙️ Translate as you speak using Google Chrome's Web Speech API for speech... |
|
Experimental |
| 5188 |
tonyshawjr/LiveDJ
AI-powered radio DJ display for Plex and Spotify. Shows artist info, album... |
|
Experimental |
| 5189 |
nbr23/gopipertts
A small HTTP API wrapper for piper's text-to-speech |
|
Experimental |
| 5190 |
Mordekai66/Py-Captcha-Generator
PyCaptchaGenerator is a Python file that generates image and audio CAPTCHAs... |
|
Experimental |
| 5191 |
ringabout/scim
[WIP] Speech recognition toolbox written in Nim. Based on Arraymancer. |
|
Experimental |
| 5192 |
parth2152012/murf-voice-agent-hackathon
AI Voice Agent for Techfest IIT Bombay Hackathon - Built using Murf Falcon... |
|
Experimental |
| 5193 |
opensource-spraakherkenning-nl/ASR_NL_results
Results of Dutch ASR models, collected by the community |
|
Experimental |
| 5194 |
Nik-Kras/Live_ASR_Whisper_Gradio
Real Time Speech To Text with corrections powered by Gradio |
|
Experimental |
| 5195 |
happytunesai/EZ-STT-Logger-GUI
Python GUI for real-time Speech-to-Text (STT) using local Whisper, OpenAI... |
|
Experimental |
| 5196 |
Ashmithakur29/Chrome-Extensions
A Chrome extension built to deliver daily jokes with audio support,... |
|
Experimental |
| 5197 |
AssemblyAI-Community/intro-to-espnet
Getting Started with ESPnet | AssemblyAI |
|
Experimental |
| 5198 |
srvk/jsalt-2018-grounded-s2s
Grounded Sequence-to-Sequence Transduction Team at JSALT 2018 |
|
Experimental |
| 5199 |
ashisbehera/Smart_Alarm
This project is a text-to-speech alarm application. |
|
Experimental |
| 5200 |
wanghao15536870732/ChatWithEveryone
🚧 The Internet+ project YiLuYuBan. The project is too messy and has moved to... |
|
Experimental |