All Voice AI Tools

6,981 tools ranked by quality score · Page 25 of 70

Showing 2401–2500 of 6,981
# Tool Score Tier
2401 mirfan899/CTTS

Cantonese TTS frontend

30
Emerging
2402 Helow19274/aiogTTS

Async Python library to interface with Google Translate's text-to-speech API

30
Emerging
2403 go-restream/zipenhancer-rs

🚀 High-Performance Real-Time Audio Noise Reduction Library - Rust...

30
Emerging
2404 hamzaehsan97/Speech_Recognition_CNN

CNN (Convolutional Neural Networks) Speech Recognition

30
Emerging
2405 Malith-Rukshan/whisper-transcriber-bot

🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI...

30
Emerging
2406 TheCodeTraveler/XamSpeak

An iOS and Android app that will dictate text from a photo. XamSpeak...

30
Emerging
2407 AlimTleuliyev/image-to-audio

Image Captioning and Text-to-Speech

30
Emerging
2408 iGerman00/Pollyduble

An experimental proof-of-concept script to automatically dub videos to...

30
Emerging
2409 Mateusz-Dera/whisperspeech-webui

Simple WhisperSpeech web UI

30
Emerging
2410 01-vyom/End_2_End_Automatic_Speech_Recognition_For_Gujarati

[ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition...

30
Emerging
2411 nuaazs/VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and...

30
Emerging
2412 phineas-pta/speech-synthesis-ngngngan

python script to download & process data to train a speech-synthesis model...

30
Emerging
2413 neurlang/gospeak

A Golang Text to Speech System

30
Emerging
2414 troykelly/live-news-break

An advanced tool designed for creating automated news bulletins. It...

30
Emerging
2415 LonePheasantWarrior/VolcengineTTS

An online TTS Android application based on Volcengine's Doubao speech service...

30
Emerging
2416 PareekshithPalat/AETHER---Personal-Assistant

AETHER is a voice-activated Python personal assistant that responds to...

29
Experimental
2417 MitchellAW/Discord-Bot

My own Discord chat bot built in Python using the discord.py API. Has been...

29
Experimental
2418 ducnt18121997/Viet-Text-Normalization

A Python library for text normalization, specifically designed for...

29
Experimental
2419 LinqLover/simple-openai-tts-playground

Try out the OpenAI Text to Speech API in your browser.

29
Experimental
2420 crimson0829/RecordVoiceView

Audio recording widget for Android, with support for real-time speech-to-text conversion

29
Experimental
2421 arora-r/chatapp-with-voice-and-openai

This project uses OpenAI's GPT-3 model to create a simple assistant that can...

29
Experimental
2422 Langhalsdino/StageMate

StageMate is the smart assistant for your presentation. It will cover all...

29
Experimental
2423 koesan/ReManga_web

ReManga: A user-friendly platform for translating and colorizing manga....

29
Experimental
2424 pschatzmann/arduino-flite

A small fast portable speech synthesis system

29
Experimental
2425 arham-kk/openai-tts

This repository features a Gradio interface designed to leverage the OpenAI...

29
Experimental
2426 jishengpeng/ControlSpeech

[ACL 2025 Main] ControlSpeech: Towards Simultaneous Zero-shot Speaker...

29
Experimental
2427 richlira/MeetingMindAI

AI-powered meeting assistant for iPhone — real-time transcription,...

29
Experimental
2428 erzaozi/vits-plugin

A speech synthesis plugin based on Yunzai

29
Experimental
2429 htn-l/htn-l.github.io

Takes in audio feed from lectures or meetings, performs speech to text...

29
Experimental
2430 syb0rg/Khronos

The open source intelligent personal assistant

29
Experimental
2431 chirag127/WebSpeak-TextToSpeech-Browser-Extension

High-fidelity browser extension leveraging the Web Speech API for precise,...

29
Experimental
2432 supershaneski/openai-chatterbox

A sample Nuxt 3 application that listens to chatter in the background and...

29
Experimental
2433 sayyedrizwan/TextConvertor

Convert text into voice (speech) and speech into text.

29
Experimental
2434 zhang-tuo-pdf/FedAudio

[ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks

29
Experimental
2435 mattt/supertone-swift

A Swift wrapper for the Supertone text-to-speech model

29
Experimental
2436 zhenye234/FlashSpeech

ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis

29
Experimental
2437 AceCentre/TextAloud

iOS app. Built in Swift. Reads out text - sentence by sentence, paragraph by...

29
Experimental
2438 theinlinaung2010/Azure_speech_to_test

Sample code for testing speech recognition (speech-to-text) of Burmese...

29
Experimental
2439 anwar-gazi/ivrworks

Build IVR, run voice campaign, with machine detection, speech recognition...

29
Experimental
2440 hmeutzner/kaldi-avsr

Kaldi-based audio-visual speech recognition

29
Experimental
2441 r9y9/jsut-lab

HTS-style full-context labels for JSUT v1.1

29
Experimental
2442 winedarkmoon/ElevenGUI

A user-friendly interface for ElevenLabs' API with added audio transcription...

29
Experimental
2443 Unovamata/Neopets-Shop-And-Attic-Autobuyer-Cracked

An Auto Item Buyer and Pricer Bot for Neopets.com

29
Experimental
2444 GSA/coe-discovery-bpa

Information on the Discovery BPA for discovery-related work performed by the...

29
Experimental
2445 Ma-Dan/asr-decode

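A lightweight speech recognition decoding and inference framework trimmed down from Kaldi; currently implements MFCC+GMM+Viterbi, with no dependency on libraries such as OpenFST or OpenBLAS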

29
Experimental
2446 tim-gromeyer/VoiceAssistant

Empower Your Voice, Secure Your Privacy - Experience VoiceAssistant, Your...

29
Experimental
2447 m15-ai/Local-Voice

A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local...

29
Experimental
2448 ictnlp/SLED-TTS

Streamable Text-to-Speech model using a language modeling approach, without...

29
Experimental
2449 Mohamed-samy2/Video-Interview-Analysis

PRVIA is an AI-powered system that automates the evaluation of pre-recorded...

29
Experimental
2450 zmeet-ai/asr_demo

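Speech recognition API offering real-time recognition and offline recognition of uploaded long audio; supports real-time transcription and simultaneous interpretation for Chinese, English, and the languages of up to 100 countries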

29
Experimental
2451 edouardpoitras/eva

Open source voice-enabled personal assistant

29
Experimental
2452 heyseth/Piper_TTS

Use Piper TTS in Visual Studio Code

29
Experimental
2453 GoodSpeech/good-speech-web-client

Practice your speech level in any language using speech recognition

29
Experimental
2454 indigane/wyoming-android-tts

Use your Android device's TTS engines in Home Assistant via the Wyoming protocol.

29
Experimental
2455 primepake/learnable-speech

This repo is text to speech with learnable audio encoder without alignment...

29
Experimental
2456 sooftware/jasper

PyTorch implementation of "Jasper: An End-to-End Convolutional Neural...

29
Experimental
2457 lucko515/Speech-commands-recognition

Recognizing common speech commands using Keras and Tensorflow.

29
Experimental
2458 IS2AI/TurkicTTS

A multilingual text-to-speech synthesis system for ten lower-resourced...

29
Experimental
2459 manish-4007/YT-video-Transcription

An AI tool that helps analyze any YouTube video, give the sentiment of...

29
Experimental
2460 Jugendhackt/synthi-tts

Hackathon project to digitize your own voice and have it speak for you!...

29
Experimental
2461 llami-team/wake-me

AI-based React component library that detects clapping sounds or finger...

29
Experimental
2462 tsengia/JSGFKit_Plus_Plus

A C++ library for parsing and manipulating JSGF grammar files.

29
Experimental
2463 deeheber/text-to-speech-converter

A serverless application that converts blobs of text to speech in an audio file

29
Experimental
2464 qkl9527/voice-assistant

A [real-time] AI voice assistant based on Funasr

29
Experimental
2465 chattylabs/conversational-flow

The Conversational Flow combines both native built-in resources and cloud...

29
Experimental
2466 aks-devs/mod_piper_tts

Freeswitch Text-to-Speech module

29
Experimental
2467 ankushbhatia2/django-speech-to-text

A small API for speech to text made in Django.

29
Experimental
2468 dalehumby/openWakeWord-rhasspy

openWakeWord for Rhasspy

29
Experimental
2469 Br3n0k/transcriber

AI-powered transcription for audio & video with Whisper — self-hosted, fast,...

29
Experimental
2470 Oct4Pie/persian-stt

A Speech-To-Text Model Developed Using 🐸STT

29
Experimental
2471 jefflai108/Semi-Supervsied-Spoken-Language-Understanding-PyTorch

Semi-supervised spoken language understanding (SLU) via self-supervised...

29
Experimental
2472 StachePL/ExcelToAmazonPolly

Simple text-to-speech tool combining powers of Excel and Amazon Polly.

29
Experimental
2473 Iiridayn/pico-tts

Android PicoTTS w/C calling application using submodule

29
Experimental
2474 MahtaFetrat/ManaTTS-Persian-Tacotron2-Model

Tacotron2 Persian Text-to-Speech Model trained on ManaTTS, the largest open...

29
Experimental
2475 vasilevp/sam

SAM: Software Automatic Mouth (Ported from https://github.com/vidarh/SAM)

29
Experimental
2476 KoalaV2/K.A.I

Home automation program controlled by your voice.

29
Experimental
2477 poretsky/rulex

Russian pronunciation dictionary

29
Experimental
2478 shervinemami/practice_speechrec_mappings

A game to help design a better character mapping and to learn the mapping...

29
Experimental
2479 playerony/TensorFlowTTS-ts

This project implements TensorflowTTS in Tensorflow.js using Typescript,...

29
Experimental
2480 TejasQ/praise

Do stuff with your voice in the browser.

29
Experimental
2481 alisolphp/EchoTalk

A browser-based language training app using Shadowing technique with...

29
Experimental
2482 csyan5/AttnGAN-Audio-to-image-geneation

CMPT726 Machine Learning Final Project

29
Experimental
2483 bhashini-ai/g2p

Grapheme-to-phoneme (G2P) conversion for Tamil / Kannada languages - a...

29
Experimental
2484 vijethph/Insight

A Flutter app to help blind people.

29
Experimental
2485 X-LANCE/UniCATS-CTX-txt2vec

[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS

29
Experimental
2486 cloudcommunity/Text-to-Speech-Engines

A list of different text to speech engines.

29
Experimental
2487 manab-kb/Voice-Based-Translator

A Voice Based Translator - Speak in English or any of the available selected...

29
Experimental
2488 CodersCreative/faster-whisper-rs

A Rust crate for easily integrating faster-whisper STT into your Rust programs.

29
Experimental
2489 uzbekvoice/UzbekVoiceBot

Current and Live Telegram bot for collecting dataset

29
Experimental
2490 momalekiii/VTT

Extract Speech/Text from Video

29
Experimental
2491 prateekralhan/Speech2Text-for-Long-Audio-Files

Perform SOTA Speech2Text on Long Audio Files with/without diarization Using...

29
Experimental
2492 german-asr/megs

A merged version of multiple open-source German speech datasets.

29
Experimental
2493 greg-kennedy/p5-NRL-TextToPhoneme

Perl implementation of the Naval Research Laboratory text-to-phoneme...

29
Experimental
2494 nate-russell/Scholar2Go

Make MP3 albums out of Academic PDFs. Works by gluing together Grobid and...

29
Experimental
2495 18F/bpa-disaster-data-portal-pilot

The scope of this task is to build a working pilot of a portal that collects...

29
Experimental
2496 h4rm0n1c/NetTTS

A Retro-modern SAPI 4.0 TTS Client with Network Connectivity and custom...

29
Experimental
2497 Zoomicon/SpeechLib

Library for Speech Synthesis and Recognition using Windows.Speech or...

29
Experimental
2498 nheidloff/unity-watson-vr-sample

Virtual Reality Sample using IBM Watson, Unity and Google Cardboard

29
Experimental
2499 asus4/unity-speech-recognizer

iOS Speech Recognizer for Unity

29
Experimental
2500 MycroftAI/ZZZ-RETIRED__openstt

RETIRED - OpenSTT is now retired. If you would like more information on...

29
Experimental