All Voice AI Tools

6,981 tools ranked by quality score · Page 52 of 70

Showing 5101–5200 of 6,981
# Tool Score Tier
5101 egorsmkv/w2v2-bert-aligner

Aligner for wav2vec2-bert models

16
Experimental
5102 Ronnie-Leon76/Swahili-ASR

This repository contains the code for fine-tuning the XLS-R Wav2Vec2 model...

16
Experimental
5103 ranchlai/wav2vec-2.0

Wav2vec2 English speech recognition in PaddlePaddle

16
Experimental
5104 davidsuragan/tulga-cli

TulgaCLI is a tool that allows you to chat and voice chat with virtual...

16
Experimental
5105 kongju7/my_project6

Personal project 6: Speech Recognition Deep Learning Chatbot -...

16
Experimental
5106 yujiliu/oresta

Oresta - is the first voice assistant in the Ukrainian language.

16
Experimental
5107 dvamsidhar2002/Project-VIVA-Personal-Desktop-and-Voice-Assistant

This is a personal desktop assistant which will do few tasks for you. It is...

16
Experimental
5108 TakumiSenaha/Nreal_IoT

This project aims to visualize the sensor information of the surroundings...

16
Experimental
5109 priyanshpsalian/VISION-THE-BLIND

An all in one solution for safety and security of blind. Features covered in...

16
Experimental
5110 andrikAV18/Chat_app

💬 Build real-time chat experiences with a modern app that supports user...

16
Experimental
5111 TexasInstrumentsDIY/SpiceRack

Voice controlled turntable using the beaglebone black wireless.

16
Experimental
5112 Simone-Convertini/Speech-Summarization-Demo

A Web Api written using Go and Gin capable to perform Speech Summarization...

16
Experimental
5113 michsethowusu/kasanoma

Offline-first TTS models for African languages

16
Experimental
5114 sudonitin/MediumScraper

Scraping articles of medium and providing audio versions 📑 to 🔊 using django

16
Experimental
5115 BiasedToad1/AudiobookMaker

A tool utilizing piper-tts to convert books into audiobooks.

16
Experimental
5116 vantu5z/PyBookReaderTTS

Читалка для книг на Gtk через синтезаторы TTS

16
Experimental
5117 sebheron/TikTok-Reddit-Text-To-Speech

Reddit TTS generator designed for TikTok

16
Experimental
5118 chenying99/ttsv2

fast tts (ZH EN) lightweight

16
Experimental
5119 jmpnop/vdub

vdub — video dubbing and subtitle engine. Rust + MLX. Free local ASR/TTS.

16
Experimental
5120 sanvibyfish/OwlWhisper

Local voice input for macOS — hold a hotkey, speak, release, text appears at...

16
Experimental
5121 gztomas/utter

A Text-to-Speech CLI using ElevenLabs, designed for humans and AI agents.

16
Experimental
5122 jsc2017605097/chatgpt-audio-downloader

A lightweight Chrome/Edge extension to instantly catch and download the...

16
Experimental
5123 suzumushi0/SoundObject_source

SoundObject source code distribution.

16
Experimental
5124 brailcom/singing-computer

Computer singing synthesis

16
Experimental
5125 kolonist/edgetts

Use free Microsoft Edge's online text-to-speech service from golang

16
Experimental
5126 criadacasa/podcastfy-saas

SaaS platform for generating AI podcasts from multimodal content - Built...

16
Experimental
5127 mostafabahaa25/mediguide_MVP

AI-powered accessibility app that helps blind and low-vision users manage...

16
Experimental
5128 yuis-ice/text-to-speech

🎤 VoiceFlow - Modern text-to-speech web application with real-time word...

16
Experimental
5129 awesome-german/pronunciation

Guides, phonetic tools, and speaking exercises to achieve clear and natural...

15
Experimental
5130 nikita-popov/tts-api

Kokoro based TTS API

15
Experimental
5131 MarvinAmine/UDEMY_AWS_PREP_EXAM_COPILOT

A Chrome extension to interact with your Udemy AWS certification prep exams....

15
Experimental
5132 ZacDair/SER_Platform_AICS

This repository contains the code to create and conduct emotion recognition...

15
Experimental
5133 vinsis/speech-commands-recognition

Single word speech recognition using PyTorch

15
Experimental
5134 Tinker-Twins/NLP_Using_Python

This repository hosts the code snippets used for small NLP project using...

15
Experimental
5135 sayak119/Express

Express Yourself.

15
Experimental
5136 praneethpj/Unity-Android-Utilities

Open Source Unity-Android Platform Voice Text API and Text To Voice API.

15
Experimental
5137 sergix44/oddcast-tts-php

A PHP interface to the online Oddcast demo API.

15
Experimental
5138 SenalDolage/object-detection-TFJS-ReactNative

A mobile application that identifies nearby objects and gives a voice output...

15
Experimental
5139 Gyvastis/google-speech-tts

A wrapper for Google Translate to generate an audio from a text.

15
Experimental
5140 HQQHQ/FinetuneSpeechT5-Spanish

This repository hosts the code and resources for fine-tuning a SpeechT5...

15
Experimental
5141 xignoe/videoTranslatorExtenstion

Real-time video translation Chrome extension that automatically generates...

15
Experimental
5142 XxAZVDxX/LLM-Live2D-VRM-AI-Girlfriend-iOS

Let’s start chatting with your Live2D or VRM girlfriend in iOS (Support...

15
Experimental
5143 NormVg/AutoCaptionGenAI

A Python project that extracts audio from video files, transcribes the...

15
Experimental
5144 gathrean/Nebula

Neural Network in Python trained for multi-musical instruments recognition.

15
Experimental
5145 Sh1nr1/mai-ai-assistant-self-hosted

Mai is an emotionally intelligent, voice-enabled AI assistant built with...

15
Experimental
5146 smivv/python-vosk-trial

Vosk Speech Recognition Trial

15
Experimental
5147 vpakarinen/mmaudio-webui

WebUI for MMAudio Video-to-Audio and Text-to-Audio.

15
Experimental
5148 NoNamePro0/Speech

🎙 Yet another python script that speech your text

15
Experimental
5149 beltoforion/Synthetischer-Wetterbericht

Ein Python-Skript für das automatisierte Erstellen von gesprochenen...

15
Experimental
5150 pkubowicz/vocab-tts

Learning vocabulary with text-to-speech and Anki

15
Experimental
5151 Orca0917/Spectrogram-VQ

Unofficial implementation of Spectrogram VQ from DCTTS paper - Vector...

15
Experimental
5152 jianchang512/speech2text-df

基于Dolphin模型的东方语言音视频转字幕api及webui

15
Experimental
5153 AbdulGani11/Vocably

Text-to-speech web application built with React, FastAPI, JWT...

15
Experimental
5154 manasmodak/SpeechRecognition

WPF App to show text-speech and speech recognition

15
Experimental
5155 FeuZen/Zonos-long-text-to-speech

Takes an input text and transcribes it using zonos-v0.1-hybrid

15
Experimental
5156 atmehedi/Speech-to-text-in-Assamese

TASK ORIENTED DIALOG SYSTEM IN NATIVE LANGUAGE(ASSAMESE)

15
Experimental
5157 pselvana/VoiceCrafter

Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

15
Experimental
5158 arda-guler/CodexBabil

Codex Babil - Library of Babel expanded with random writing systems.

15
Experimental
5159 elemarmar/joke-teller

🤖💬 Joke Teller gets random jokes from third party API and converts them to...

15
Experimental
5160 beerberidie/Echo

Voice-controlled AI assistant with real-time transcription, natural language...

15
Experimental
5161 carmen-martin/Deep-Keyword-Spotting

A Small Footprint implementation of Keyword Spotting with different architectures.

15
Experimental
5162 anshshah23/nlp-mini-project

This project incorporates a rule based engine for recognising Gujarati using...

15
Experimental
5163 Prajithp/p5-Google-Cloud-Speech

Google Cloud Speech Client Library for Perl

15
Experimental
5164 0xstackforge/voice-agents-demo

AI-powered outbound calling chatbot built with Twilio, FastAPI, and Pipecat,...

15
Experimental
5165 motazsaad/jsc-news-broadcast

JSC news broadcast (speech corpus)

15
Experimental
5166 AaravK25/NetraSetuV2

For The Visually Impaired.

15
Experimental
5167 FatStinkyPanda/talk2me

A fully offline, self-contained voice interaction system featuring...

15
Experimental
5168 moe-mizrak/laravel-google-text-to-speech

Laravel package for integrating Gemini Text-to-Speech API and Google Cloud...

15
Experimental
5169 jaketae/conformer

PyTorch implementation of Conformer: Convolution-augmented Transformer for...

15
Experimental
5170 iam-smjamilsagar/Speech-Assistant

Today we will learn how to make speech assistant in Python.

15
Experimental
5171 WelkinYang/Tacotron2-pytorch

Tacotron2 implemented by pytorch

15
Experimental
5172 dnyanshwalwadkar/SIMHA-Personal-Assistant-using-Artificial-intelligence

The rise of automation, along with increased computational power, novel...

15
Experimental
5173 usubar-eats/voice-button-app

声ボタン - 文字を打つだけで話してくれるアプリ / Voice Button - Text-to-Speech App for Japanese

15
Experimental
5174 AndresRJ18/Study-Vault-AWS

Converts text study notes into audio podcasts automatically using AWS...

15
Experimental
5175 contro-projects/speechpad

A simple, lightweight web app that converts your voice into text in...

15
Experimental
5176 QuasarRyan/mlx-audio-bridge

这是一个基于 mlx-audio 的本地 REST 服务,用来实现兼容 OpenAI 的 TTS / STT 音频接口桥接层。

15
Experimental
5177 khizarali07/VoiceForge-AI-Frontend

A complete synthetic media pipeline for high-fidelity TTS and talking-head...

15
Experimental
5178 matin91/Kasko

Kasko is a Talking To-do List app, which allows the user to set up Reminders...

15
Experimental
5179 adarshsingh6622-source/advanced_voice_assistant

An advanced AI-powered voice assistant built using Python, NLP, and speech...

15
Experimental
5180 dtrovato997/SpeechAnalysis

A sample application for on-device offline mobile voice inference using deep...

15
Experimental
5181 OnesAndZer0s/node-dectalk

Node.js module that provides bindings for the DecTalk Text-To-Speech library

15
Experimental
5182 cr2007/cambai-python

Python SDK for the CambAI API

15
Experimental
5183 stillcuriouscat/votype

Global voice typing for Linux — offline ASR, hotkey-triggered, works in any app

15
Experimental
5184 AKAPhilipD/CMTNET_for_SER

CMT-Net: A Collaborative Mamba-Transformer Network with Spatial-Temporal...

15
Experimental
5185 harikanaidu/NLP-health-assistant

An NLP-driven health assistant bot that interacts, asks a series of personal...

15
Experimental
5186 marklubin/kairix

Voice-first AI agent with persistent memory, background reflection, and...

15
Experimental
5187 sglkc/live-translate

🎙️ Translate as you speak using Google Chrome's Web Speech API for speech...

15
Experimental
5188 tonyshawjr/LiveDJ

AI-powered radio DJ display for Plex and Spotify. Shows artist info, album...

15
Experimental
5189 nbr23/gopipertts

A small HTTP API wrapper for piper's texttospeech

15
Experimental
5190 Mordekai66/Py-Captcha-Generator

PyCaptchaGenerator is a Python file that generates image and audio CAPTCHAs...

15
Experimental
5191 ringabout/scim

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

15
Experimental
5192 parth2152012/murf-voice-agent-hackathon

AI Voice Agent for Techfest IIT Bombay Hackathon - Built using Murf Falcon...

15
Experimental
5193 opensource-spraakherkenning-nl/ASR_NL_results

Results of Dutch ASR models, collected by the community

15
Experimental
5194 Nik-Kras/Live_ASR_Whisper_Gradio

Real Time Speech To Text with corrections powered by Gradio

15
Experimental
5195 happytunesai/EZ-STT-Logger-GUI

Python GUI for real-time Speech-to-Text (STT) using local Whisper, OpenAI...

15
Experimental
5196 Ashmithakur29/Chrome-Extensions

A Chrome Extension built to deliver daily jokes with audio support ,...

15
Experimental
5197 AssemblyAI-Community/intro-to-espnet

Getting Started with ESPnet | AssemblyAI

15
Experimental
5198 srvk/jsalt-2018-grounded-s2s

Grounded Sequence-to-Sequence Transduction Team at JSALT 2018

15
Experimental
5199 ashisbehera/Smart_Alarm

This project is based on text to speech alarm application.

15
Experimental
5200 wanghao15536870732/ChatWithEveryone

🚧The Internet + project YiLuYuBan.The project is too messy, has moved to...

15
Experimental
« Prev 1 2 3 50 51 52 53 54 68 69 70 Next »