All Voice AI Tools

6,981 tools ranked by quality score · Page 58 of 70

Showing 5701–5800 of 6,981
# Tool Score Tier
5701 asrajeh/deepspeech-arabic

End-to-End Arabic ASR using DeepSpeech engine

14
Experimental
5702 umjammer/vavi-speech

🗣 Java Text to Speech (JSAPI) engines (google cloud, cocoa, aquestalk(ゆっくり))

14
Experimental
5703 himanshutambuskar/Voice-and-gesture-control

🎤 Enable intuitive voice and gesture control for seamless interaction with...

14
Experimental
5704 coldcasefiles/voice-unlock-login

🎤 Unlock your login with voice authentication using a web-based system that...

14
Experimental
5705 Sam67xsaad/WWW-5

🎉 Kickstart your Web3 journey by showcasing your project from the Women Web3...

14
Experimental
5706 AndrewFarley/OpenTX-Generate-Sounds-Amazon-Polly

A helper script to generate OpenTX sounds based on the language .csv

14
Experimental
5707 Hannes1/react-native-wenet

Wenet speech to text for react native

14
Experimental
5708 wyatt-avilla/discord-tiktok-tts-bot

discord bot that can play tiktok tts in voice

14
Experimental
5709 spokestack/react-native-spokestack-tray

React Native component for adding Spokestack to a React Native app

14
Experimental
5710 Erio-Harrison/rust-g2p

A teaching project for learning the core principles of TTS

14
Experimental
5711 AyushDhanai1419/Vision

An Android Application for Object Detection

14
Experimental
5712 trannhan25/NeMoConformerASR-iOS

🎙️ Enable accurate speech recognition on iOS/macOS using NVIDIA NeMo...

14
Experimental
5713 brotherspear1994/AI_ReadingChildrenTale_PJT

An AI storytelling service that reads children's storybooks aloud using image captioning, TTS, and VC technology.

14
Experimental
5714 shesuyo/isi

Alibaba Intelligent Speech Interaction Go SDK

14
Experimental
5715 asrajeh/kaldi-arabic

HMM-based Arabic ASR using Kaldi engine

14
Experimental
5716 muurakami/momokiki

Open source language learning app — Duolingo alternative with offline...

14
Experimental
5717 Ankuj17/Python--todo-list-

📝 Manage your tasks efficiently with this CLI-based Python todo list...

14
Experimental
5718 imgta/vialect

Streamline your video/audio intake by transforming multimedia content into...

14
Experimental
5719 sruckh/Qwen3-TTS-serverless

Runpod Serverless for the Qwen3-TTS model

14
Experimental
5720 theaifutureguy/Hackathon-Winner-App

Hackathon Winner App, built with Next.js and TypeScript for a frontend,...

14
Experimental
5721 bougnaboy/react-mini-suite

🚀 Build and adapt lightweight freelancing mini-apps in React, designed for...

14
Experimental
5722 vell1/soulx-tts-metal

🎙️ Create high-quality speech synthesis with SoulX-TTS, optimized for Apple...

14
Experimental
5723 andigenesis/brainrot-generator

TikTok-style brainrot video generator — text to video with gameplay backgrounds

14
Experimental
5724 alpereee/SpeakerRecognition

🎙️ Machine-learning-based speaker recognition, emotion analysis from voice, and to text...

13
Experimental
5725 nvjob/rain-voice-note

Rain Voice Note (Speech To Text). CW Frame App. JavaScript.

13
Experimental
5726 kdorichev/text2speech

Text-To-Speech Dataset Preparation and Architecture

13
Experimental
5727 deeplearningcafe/animespeechdataset

Dataset Generation for Language Model Training and Text-to-Speech Synthesis...

13
Experimental
5728 vislupus/Bulgarian-TTS-dataset

LibriVox dataset for Bulgarian language TTS

13
Experimental
5729 Luigi-Pizzolito/English2KanaTransliteration

Convert english phrases into phonetic japanese kana approximations; also...

13
Experimental
5730 gikonyob/speake

Speake library provides a wrapper around Espeak to easily write efficient...

13
Experimental
5731 juancarlospaco/nim-espeak

Nim Espeak NG wrapper, for super easy Voice and Text-To-Speech

13
Experimental
5732 OleksandrZhabenko/mm1

Program that reads Ukrainian text using eSpeak and SoX.

13
Experimental
5733 Ratul345/BanglaSTT

BanglaSTT 🎙️ | Bangla Speech-to-Text using OpenAI Whisper. Fast, accurate,...

13
Experimental
5734 neeraj-nagiri/Assistant-Bro-

Assistant "Bro" is a voice-controlled personal assistant that opens...

13
Experimental
5735 DuyguA/Interspeech2025-Smooth-Operating-LLMs-for-Disfluency

Innovative approach for modelling speech disfluencies with LLaMa and Conformer.

13
Experimental
5736 Victeam-and-Sabourault/VueMe

Home Assistant - VueJS

13
Experimental
5737 Vaibhav-kesarwani/Orion-AI

This is the Orion AI Project made in Python3. It is a virtual assistant and...

13
Experimental
5738 lukaszliniewicz/easy_xtts_trainer

A command line utility to easily finetune XTTS models in a fully automated...

13
Experimental
5739 harveenchadha/Speech-Learning-Resources

Repo containing resources to learn about various verticals of speech: ASR, TTS

13
Experimental
5740 CherokeeLanguage/IMS-Toucan

Cherokee Language TTS

13
Experimental
5741 Kavindu-Rankothge/tiktok-bot

TikTok video generation from scraping Reddit community posts

13
Experimental
5742 bfackland/replica_dialog_generator

🗣 Auto-generate dialog audio files using the Replica Studios 'AI Voices' API...

13
Experimental
5743 jpanged/ItsDisturbing

Identifying potential red flags using Watson NLU

13
Experimental
5744 Dalia-Sher/Speech-Emotion-Recognition-using-BLSTM-with-Attention

We present a study of a neural network based method for speech emotion...

13
Experimental
5745 diver-j/melgan-multi

MelGAN Multi GPU Implementation.

13
Experimental
5746 pschatzmann/TalkiePCM

Platform Independent TTS Library that generates PCM data

13
Experimental
5747 will-rice/diffwave

TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for...

13
Experimental
5748 gnat/text-to-speech-windows

🙊 Text to speech GUI / TTS on Windows, Android, Web. Ideal for speed...

13
Experimental
5749 hd996/material-local

🎬 Material localization

13
Experimental
5750 Hrishikesh-Gavai/NERV-TRANSLATE

Problem Statement: Developing A Software For Dubbing Videos.

13
Experimental
5751 maziac/currah_uspeech_tests

Tests for the ZX Spectrum's speech synthesizer peripheral: Currah uSpeech...

13
Experimental
5752 karthikrshet/text-to-speech

Convert any text into lifelike speech. Choose your language and voice.

13
Experimental
5753 Ralireza/PSDR

Persian spoken digit recognition

13
Experimental
5754 om-arya/T.O.M

A multi-purpose, cat-themed web app created for college students, by college...

13
Experimental
5755 lymcho/story-to-video

Create a fully narrated YouTube audiobook channel in one command. AI...

13
Experimental
5756 keonlee9420/tacotron2_MMI

Another PyTorch implementation of Tacotron2 MMI (with waveglow) which...

13
Experimental
5757 pika-online/Whishow

python based online audio/video player

13
Experimental
5758 voidful/whisper-live-asr-demo

run whisper on CPU/GPU server

13
Experimental
5759 Vagabond-K/Speechabler

A voice project for patients with Lou Gehrig's disease (ALS)

13
Experimental
5760 NotMyMajor/MATLAB-Transcription-with-Python-and-Google

Uses a Python script to transcribe an audio file and turn the transcription...

13
Experimental
5761 Soumo-git-hub/AI-News-Aggregator

An intelligent news aggregator (Python/JS) using spaCy for NLP topic...

13
Experimental
5762 dwil2444/AudioTranscription

Python Scripts which utilize the low latency streaming transcription of the...

13
Experimental
5763 StarxSky/tacotron2-JP

Based on "tacotron2-jpanese", built and modified

13
Experimental
5764 Otosaku/NeMoConformerASR-iOS

On-device speech-to-text for iOS/macOS powered by NVIDIA NeMo Conformer CTC...

13
Experimental
5765 sunilband/doge-transpiler

Hemllo Friemnds.This is a Englimsh to Domge lamnguage tramnslator . Huihuihui

13
Experimental
5766 Skulux/Voicetral

[DEPRECATED] This repository contains an amateur implementation of an...

13
Experimental
5767 arshc0der/Javscript-Mini-Projects

🧩 JavaScript Mini Projects – Beginner-Friendly Practice Projects This...

13
Experimental
5768 charlielito/teachable-machines-audio-demo

An audio model for recognizing a whistle pattern was trained to toggle a...

13
Experimental
5769 fclaeys/nix-nerd-dictation

🎤 Nix flake for offline French speech-to-text with nerd-dictation....

13
Experimental
5770 elllusion/calibre

Add Edge TTS to Calibre on Linux distributions

13
Experimental
5771 dawoodkhatri1/Talk2TextSim

The objective of this project is to design and develop a tool that converts...

13
Experimental
5772 hemangjoshi37a/French_audio_transcription_using_gradio

French audio transcription using gradio

13
Experimental
5773 daisy/tobi

Tobi is a free, open source, multimedia book production authoring tool for...

13
Experimental
5774 erendogan6/Translateify

An interactive English learning app with personalized daily word...

13
Experimental
5775 SirCryptic/cli-sms

use clicksend to send either sms or text to speech to a phone number via the...

13
Experimental
5776 hooshvare/speech2text

A demo of speech to text by google

13
Experimental
5777 pvanand07/BhashiniClient

A Python client library for interacting with Bhashini services, including...

13
Experimental
5778 shreyashghag/OfflineSpeechRecognition

Offline Speech Recognition For Android Library

13
Experimental
5779 egorsmkv/flashlight-ukrainian

The Ukrainian Acoustic Model for Flashlight

13
Experimental
5780 lucylow/Yeezy-Taught-Me

Yeezy Taught Me Text Generation. Training next character predictions RNN...

13
Experimental
5781 broadfield-dev/PyPiperTTS

PyPiperTTS is a Python library that provides a simple and intuitive...

13
Experimental
5782 ShadowLp174/stt-example-bot

A basic discord bot but with voice commands

13
Experimental
5783 Adyaprana/Nexora.ai

A Mistral AI-powered chatbot built with Streamlit for real-time...

13
Experimental
5784 lwdovico/zonos

Basic Zonos setup for seamless integration with multiple sentence inference tasks.

13
Experimental
5785 LucaBallan/wikipedia-aloud-reader

Read aloud wikipedia pages

13
Experimental
5786 Arkueid/Moeroid

Low overhead llm-tts-live2d desktop moe

13
Experimental
5787 natsu90/jjk

JJK Domain Expansions 👹

13
Experimental
5788 Zoomicon/SpeechTurtle

Control Turtle on 2D canvas using Windows or Kinect-based speech recognition

13
Experimental
5789 m-mohsin-ali/closed-captioning-azure-speech-ai

This project demonstrates how to use Azure Cognitive Services with a...

13
Experimental
5790 aliabdm/laravel-ai-showcase

A professional Laravel AI SDK showcase featuring Mock Mode for instant demos...

13
Experimental
5791 HaomingXR/Mandarin-TTS-Unity

Text-to-speech for Chinese (Mandarin) in Unity

13
Experimental
5792 Murlors/VITS_Japanese

VITS implementation of Japanese

13
Experimental
5793 gaelic-ghost/TalkToMeKit

A toolkit for Local Qwen3 TTS on macOS, or embedded within macOS applications.

13
Experimental
5794 Oldes/Rebol-Speak

Rebol text-to-speech extension

13
Experimental
5795 upskaling/voice-keyboard

an interface for nerd-dictation in gtk

13
Experimental
5796 strcoder4007/S2S-Lipsync-UnrealAvatar-Backend

Unreal Metahuman Conversation Speech to Speech backend and frontend.

13
Experimental
5797 LauraKokkarinen/AzureAI.TextToSpeech

A console application for converting long-form plain-text files into speech...

13
Experimental
5798 odnodn/dgx-spark

NVIDIA DGX Spark resources

13
Experimental
5799 InuInu2022/LibSasara

The utility library for CeVIO project file (.ccs / .ccst) and timing label...

13
Experimental
5800 quochuy242/VNAVC

Data Pipeline for Text to Speech Project

13
Experimental