All Voice AI Tools

8,165 tools ranked by quality score · Page 72 of 82

Showing 7101–7200 of 8,165
# Tool Score Tier
7101 WilleIshere/KokoroTTSGenerator

Generate high-quality speech from text using the powerful Kokoro TTS...

12
Experimental
7102 alnitak/cartesia_tts

Minimal example of using cartesia tts to produce text-to-speech audio and...

12
Experimental
7103 naren200/speech-llm-speech

A ROS2-based Conversational AI system that processes speech input, interacts...

12
Experimental
7104 jx1100370217/LAS_Tensorflow_jack

TensorFlow implementation of the LAS model

12
Experimental
7105 6Morpheus6/Kokoro-FastAPI

A FastAPI wrapper for KokoroTTS. Integrates with Open-WebUI and other...

12
Experimental
7106 philipperoubert/chloe

Chlo-e: A voice-activated AI friend powered by OpenAI's GPT-3.5 Turbo,...

12
Experimental
7107 taresh18/livekit-kyutai

LiveKit TTS plugin with Kyutai streaming implementation

12
Experimental
7108 PatrickPrakash/LiptoSpeech

Keras implementation of Lip Reading Sentences in the Wild.

12
Experimental
7109 cmirnow/Google-Cloud-Text-to-Speech-Pro

Using the power of the Google Cloud Text-to-Speech API and Ruby, here is a simple...

12
Experimental
7110 0xspringtime/voicevox-anki

Documentation on how to add voicevox support to anki cards

12
Experimental
7111 Pavantext/Speech-Recognition

Speech Recognition in python

12
Experimental
7112 stefan-niedermann/web-reader

This web app lets users have written or pasted text read aloud by the browser.

12
Experimental
7113 wojciechszmelczerczyk/mern-notes-app

Speech-to-text notes app.

12
Experimental
7114 areebaghazal88/Speech-Emotion-Recognition-using-Deep-learning

Detects emotions from speech signals using a Convolutional Neural Network...

12
Experimental
7115 ltoinel/openkarotz-tts-ibm-watson

A bridge for the Openkarotz TTS using IBM Watson TTS API

12
Experimental
7116 Shirajuki/cuddly-journey

Hardsub -> Softsub + TTS = 🍿 (A utility tool for converting hardsub to...

12
Experimental
7117 jmreis/audio-to-text-with-python

Transcribing audio from MP3 format to text using Python

12
Experimental
7118 meemanali/Rose_Voice-Controlled-Windows-App

Rose is an interactive voice-controlled Windows application developed in C#...

12
Experimental
7119 vdfbiz7/AI-Video-Transcriber

Python program able to transcribe a Youtube video to text with the help of AI.

12
Experimental
7120 Rahulpatil512/Audio-and-Text-Processing

GUI based application called Text and Audio Processing Application. Here we...

12
Experimental
7121 stavrosandres44/Animalese-TTS

The genuine high quality TTS of the Animal Crossing Language Animalese

12
Experimental
7122 ymzEmre/spremic

A simple JavaScript speech recognition library.

12
Experimental
7123 HeyHera/Hera

This project presents Hera, an Operating System level voice recognition...

12
Experimental
7124 cuinjune/voice-pdf-viewer

A voice-controlled PDF viewer app

12
Experimental
7125 Sattwikmaiti/CROSS-ORIGIN

Our platform offers a wide range of opportunities for students and...

12
Experimental
7126 Yonet/AysContent

Talks, videos, workshops, and abstracts.

12
Experimental
7127 bhairavmehta95/ASD_VideoApp

An application to help all children (especially those with Autism Spectrum...

12
Experimental
7128 parthshiv/simple-ai-assistant-in-python

A minimal Python-based voice assistant that listens for a wake word and...

12
Experimental
7129 vadash/ProjectScribeRelease

Epub to opus audiobook creator for windows. Tested on EN/RU input

12
Experimental
7130 webis-de/lecture.js

Lecture.js converts a script and slides to a spoken video presentation using...

12
Experimental
7131 legekka/eddiTTS

An interactive AI Voice assistant extension for EDDI

12
Experimental
7132 flo7up/PodcastAnything

Turn anything into an AI-generated podcast conversation using Azure Speech...

12
Experimental
7133 Aylore/Arabic-Voice-Interface-for-City-Operation-Center

A multi-language virtual assistant

12
Experimental
7134 ccoreilly/catalan-speech-recognition-benchmark

A benchmark of speech recognition solutions for the Catalan language

12
Experimental
7135 Meet-Turakhia/Makeat

Makeat is a Recipe Recommendation by Ingredients Detection App (Coming Soon!)

12
Experimental
7136 TufayelLUS/Voice-Based-ChatBot-Using-Google-Bard-And-Speech-Recognition

Voice based ChatBot using google bard unofficial API library and Google...

12
Experimental
7137 EthanLifeGreat/AudioPsyChat

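A web-based voice psychological counseling system that runs locally on a server. Its core is [PsyChat]; we built a web front end for it and wired in ASR and TTS components so that users on the local network...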

12
Experimental
7138 ClearCut3000/SpeechToDo

SwiftUI ToDo app with speech recognition and CoreData persistence

12
Experimental
7139 GDGoC-CAU/Blinder

Blind people can order independently with Blinder

12
Experimental
7140 samigehi/PocketSphinxDemo

Offline Urdu speech recognition toolkit based on PocketSphinx 1.8 for Android devices

12
Experimental
7141 ProjCRys/Realtime-TTS-AI-Assisstant

Its response time is based on how fast an LLM can reply and I used a low...

12
Experimental
7142 danielrosehill/Voice-Training-Script-Generator

Helper utils for generating training data for voice cloning with LLMs

12
Experimental
7143 sumantapani-pm/OpenTeller-Protocol

A privacy-preserving, voice-biometric banking interface for the visually...

12
Experimental
7144 cadia-lvl/althingi-asr

An ASR recipe and speech corpus of Icelandic parliamentary speeches

12
Experimental
7145 ActiveNick/MSSpeechServiceWebSocketConsole

Sample applications (.NET console & UWP app) used to test Speech Recognition...

12
Experimental
7146 THE-DEEPDAS/RealTime-Voice-Assistant

Voice-activated assistant using Groq API, Streamlit UI, speech recognition, and TTS

12
Experimental
7147 641i130/vocaloid-speech-generator

A Python program that creates a Vocaloid save that generates a sequence of...

12
Experimental
7148 AkshataGanbote/Text-Utility-App

Text Utility App is a text to audio converter and can be used to manipulate...

12
Experimental
7149 arsentd/ArmenianTextToSpeech

This library provides Text to Speech functionality for Armenian language....

12
Experimental
7150 mmatlin/formant-encoder

An encoder which compresses audio data based on prominent acoustic features...

12
Experimental
7151 zashin-AI/project

Speech-Recognition STT Project

12
Experimental
7152 ShampooWang/SpeechCLIP_plus

SpeechCLIP+: Self-supervised multi-task representation learning for speech...

12
Experimental
7153 coskundeniz/howcanisay

Multi-language translator using OpenAI APIs and Streamlit

12
Experimental
7154 syoamakase/ASR

Speech Recognition Toolkit

12
Experimental
7155 MlWoo/Tacotron2-PyTorch

TTS

12
Experimental
7156 engageintellect/ai-audio-tool

🎵 Convert .m4a files to .wav in bulk, preserving folder structure and file...

12
Experimental
7157 vistaran/speech-to-type

Speech to type text. Basic python script that continuously listens to your...

12
Experimental
7158 bk-1806/Universal-Access-AI

An AI-powered desktop assistant for the visually impaired. Uses YOLO...

12
Experimental
7159 SilentProgrammer-max/Voice2Code

Voice2Code is a smart AI-based tool that converts your voice commands into...

12
Experimental
7160 mikeesto/kokoro-web

Text-to-speech generated locally in your browser. Using Svelte, Kokoro.js...

12
Experimental
7161 yueyue4359/social-media-voiceover

Generate super realistic voiceover for any social media content using f5-tts

12
Experimental
7162 Einzigartige/voice-unlock-login

A web-based voice authentication system that unlocks a login interface using...

12
Experimental
7163 Anirudh-1606/Snake-Audio-Game

A classic snake game with voice commands. Game is made with javascript and...

12
Experimental
7164 The24thDS/discord-google-tts

Discord TTS bot using Google Cloud Text To Speech

12
Experimental
7165 DragomirBozoki/lipreading-cv-nlp

End-to-end visual speech recognition system using deep learning. Combines...

12
Experimental
7166 olaviinha/NeuralDialogueAudiolizer

Jupyter notebook for turning textual dialogue into voice audio.

12
Experimental
7167 MohadesehMatinkia/focus-dashboard-2026

A Smart Focus Dashboard & Todo List app featuring Voice Input, Glassmorphism...

12
Experimental
7168 umi-AIGC-saas/umi_ai_cms

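A dual-driven intelligent AI system that connects to the mainstream large AI models currently on the market and classifies them algorithmically by their strengths and weaknesses. By combining the strengths of the various large models, 无忧AI智脑 can provide more...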

12
Experimental
7169 murabcd/vibecoder

AI Vibe Coder Built with Tanstack Start and OpenAI

12
Experimental
7170 Xornotor/VocalAssignment-SSCS

Final Undergraduate Project (Trabalho de Conclusão de Curso). Contributions...

12
Experimental
7171 MIHIR2006/Text-Utils

React page

12
Experimental
7172 jackmulligan-ire/altas

Python package to scrape webpages and transcribe video content from a video...

12
Experimental
7173 QuasarRyan/mlx-audio-bridge

A local REST service built on mlx-audio that implements an OpenAI-compatible bridge layer for TTS / STT audio interfaces.

12
Experimental
7174 lukifer23/MacBot

MacBot: pre-release offline AI voice assistant for macOS, featuring an...

12
Experimental
7175 Youssef-Ashraf-Dev/Voice-Agent

Real-time voice agent using LiveKit and Gemini Live API

12
Experimental
7176 speechly/ios-repo-filtering

An example application built with the Speechly iOS client

12
Experimental
7177 MadisoMelese/Voice-to-Text-and-Vice-versa

The Voice-to-Text and Text-to-Voice Converter is an innovative application...

12
Experimental
7178 AbirLOUARD/Virtual-Assistant-Eko

My personal virtual assistant EKO

12
Experimental
7179 KChantal/SignBridge

Building Bridges for Inclusive Communication

12
Experimental
7180 ramadhanssw/signalator

Signalator is an application that makes it easy for people with disabilities...

12
Experimental
7181 ArtaXerxess/Voice-Assistant-Mini-Project

This is a simple voice assistant that does not take any data about the user like...

12
Experimental
7182 madcato/speechtophoneme

This project is developed to create a Deep Learning algorithm able to...

12
Experimental
7183 dharness/sqwak-app

Audio classifying service

12
Experimental
7184 Jayanth-MKV/speech-emotion-recognition-api-using-fastapi

Speech Emotion Recognition api using models trained based on gender using...

12
Experimental
7185 backspacetg/distilAlhubert

code for our paper DistilALHuBERT: A Distilled Parameter Sharing Audio...

12
Experimental
7186 jhj0517/ComfyUI-jhj-Kokoro-Onnx

ComfyUI wrapper for Kokoro (TTS) models

12
Experimental
7187 FOC-SLIIT-Research-Project-2023/Mobile-Base-Sinhala-Book-Reader-for-The-Visually-Impaired-Individuals

Blind people face several challenges when reading books, but the main...

12
Experimental
7188 scionoftech/speaker_diarization

Speaker diarization using spectralcluster and deep learning

12
Experimental
7189 Deratheone/kudumbAIsree

KudumbAISree is an interactive AI-powered conversation simulator that brings...

12
Experimental
7190 lissettecarlr/AutomaticSpeechRecognition

Python wrapper implementations of various speech-to-text backends (paraformer, whisper_online, whisper_offline, funasr), built to serve the kuon repository

12
Experimental
7191 Temerold/TobsTTS

Text to speech, Python 3.7. Swedish and English. bye

12
Experimental
7192 Avinraj01/SHL-Grammar-Scoring-Engine-for-Voice-Samples

This model predicts grammar scores (1–5) from audio files. It uses Whisper...

12
Experimental
7193 joselatines/speech-recognition-text-comparison

This program checks your pronunciation skills by comparing your speech to...

12
Experimental
7194 nipponjo/mixer-tts-pytorch

Mixer-TTS for efficient TTS

12
Experimental
7195 Hauntlight/video_translator

🎥 Translate and dub video audio into another language using AI. Built with...

12
Experimental
7196 YasinEnigma/chatbot

chatbot for mci course

12
Experimental
7197 anhvung/Capstone-Audio-Transcription

Exploring different ASR and language models for audio transcription

12
Experimental
7198 sagnikghoshcr7/Text-to-Voice-Converter

This application converts text to voice

12
Experimental
7199 krishnachaitanya0107/DictionaryApp

A Dictionary App to look up meanings and definitions of words, with the...

12
Experimental
7200 themistocleous/IPA_English

A text-editor that enables users to transcribe text written in Greek...

12
Experimental