All Voice AI Tools
6,981 tools ranked by quality score · Page 25 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 2401 |
mirfan899/CTTS
Cantonese TTS frontend |
|
Emerging |
| 2402 |
Helow19274/aiogTTS
Async Python library to interface with Google Translate's text-to-speech API |
|
Emerging |
| 2403 |
go-restream/zipenhancer-rs
🚀 High-Performance Real-Time Audio Noise Reduction Library - Rust... |
|
Emerging |
| 2404 |
hamzaehsan97/Speech_Recognition_CNN
CNN (Convolutional Neural Networks) Speech Recognition |
|
Emerging |
| 2405 |
Malith-Rukshan/whisper-transcriber-bot
🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI... |
|
Emerging |
| 2406 |
TheCodeTraveler/XamSpeak
An iOS and Android app that will dictate text from a photo. XamSpeak... |
|
Emerging |
| 2407 |
AlimTleuliyev/image-to-audio
Image Captioning and Text-to-Speech |
|
Emerging |
| 2408 |
iGerman00/Pollyduble
An experimental proof-of-concept script to automatically dub videos to... |
|
Emerging |
| 2409 |
Mateusz-Dera/whisperspeech-webui
Simple WhisperSpeech web UI |
|
Emerging |
| 2410 |
01-vyom/End_2_End_Automatic_Speech_Recognition_For_Gujarati
[ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition... |
|
Emerging |
| 2411 |
nuaazs/VAF_2
Aims to create a comprehensive voice toolkit for training, testing, and... |
|
Emerging |
| 2412 |
phineas-pta/speech-synthesis-ngngngan
python script to download & process data to train a speech-synthesis model... |
|
Emerging |
| 2413 |
neurlang/gospeak
A Golang Text to Speech System |
|
Emerging |
| 2414 |
troykelly/live-news-break
An advanced tool designed for creating automated news bulletins. It... |
|
Emerging |
| 2415 |
LonePheasantWarrior/VolcengineTTS
基于火山引擎豆包语音服务的在线TTS安卓应用 (An online TTS Android application based on the... |
|
Emerging |
| 2416 |
PareekshithPalat/AETHER---Personal-Assistant
AETHER is a voice-activated Python personal assistant that responds to... |
|
Experimental |
| 2417 |
MitchellAW/Discord-Bot
My own Discord chat bot built in Python using the discord.py API. Has been... |
|
Experimental |
| 2418 |
ducnt18121997/Viet-Text-Normalization
A Python library for text normalization, specifically designed for... |
|
Experimental |
| 2419 |
LinqLover/simple-openai-tts-playground
Try out the OpenAI Text to Speech API in your browser. |
|
Experimental |
| 2420 |
crimson0829/RecordVoiceView
录音控件 for Android,支持实时语音转化为文字 |
|
Experimental |
| 2421 |
arora-r/chatapp-with-voice-and-openai
This project uses OpenAI's GPT-3 model to create a simple assistant that can... |
|
Experimental |
| 2422 |
Langhalsdino/StageMate
StageMate is the smart assistant for your presentation. It will cover all... |
|
Experimental |
| 2423 |
koesan/ReManga_web
ReManga: A user-friendly platform for translating and colorizing manga.... |
|
Experimental |
| 2424 |
pschatzmann/arduino-flite
A small fast portable speech synthesis system |
|
Experimental |
| 2425 |
arham-kk/openai-tts
This repository features a Gradio interface designed to leverage the OpenAI... |
|
Experimental |
| 2426 |
jishengpeng/ControlSpeech
[ACL 2025 Main] ControlSpeech: Towards Simultaneous Zero-shot Speaker... |
|
Experimental |
| 2427 |
richlira/MeetingMindAI
AI-powered meeting assistant for iPhone — real-time transcription,... |
|
Experimental |
| 2428 |
erzaozi/vits-plugin
基于 Yunzai 的语音合成插件 |
|
Experimental |
| 2429 |
htn-l/htn-l.github.io
Takes in audio feed from lectures or meetings, performs speech to text... |
|
Experimental |
| 2430 |
syb0rg/Khronos
The open source intelligent personal assistant |
|
Experimental |
| 2431 |
chirag127/WebSpeak-TextToSpeech-Browser-Extension
High-fidelity browser extension leveraging the Web Speech API for precise,... |
|
Experimental |
| 2432 |
supershaneski/openai-chatterbox
A sample Nuxt 3 application that listens to chatter in the background and... |
|
Experimental |
| 2433 |
sayyedrizwan/TextConvertor
Convert Text into Voice(Speech) and Speech into Text.. |
|
Experimental |
| 2434 |
zhang-tuo-pdf/FedAudio
[ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks |
|
Experimental |
| 2435 |
mattt/supertone-swift
A Swift wrapper for the Supertone text-to-speech model |
|
Experimental |
| 2436 |
zhenye234/FlashSpeech
ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis |
|
Experimental |
| 2437 |
AceCentre/TextAloud
iOS app. Built in Swift. Reads out text - sentence by sentence, paragraph by... |
|
Experimental |
| 2438 |
theinlinaung2010/Azure_speech_to_test
Sample code for testing speech recognition (speech-to-text) of Burmese... |
|
Experimental |
| 2439 |
anwar-gazi/ivrworks
Build IVR, run voice campaign, with machine detection, speech recognition... |
|
Experimental |
| 2440 |
hmeutzner/kaldi-avsr
Kaldi-based audio-visual speech recognition |
|
Experimental |
| 2441 |
r9y9/jsut-lab
HTS-style full-context labels for JSUT v1.1 |
|
Experimental |
| 2442 |
winedarkmoon/ElevenGUI
A user-friendly interface for ElevenLabs' API with added audio transcription... |
|
Experimental |
| 2443 |
Unovamata/Neopets-Shop-And-Attic-Autobuyer-Cracked
An Auto Item Buyer and Pricer Bot for Neopets.com |
|
Experimental |
| 2444 |
GSA/coe-discovery-bpa
Information on the Discovery BPA for discovery-related work performed by the... |
|
Experimental |
| 2445 |
Ma-Dan/asr-decode
从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库 |
|
Experimental |
| 2446 |
tim-gromeyer/VoiceAssistant
Empower Your Voice, Secure Your Privacy - Experience VoiceAssistant, Your... |
|
Experimental |
| 2447 |
m15-ai/Local-Voice
A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local... |
|
Experimental |
| 2448 |
ictnlp/SLED-TTS
Streamable Text-to-Speech model using a language modeling approach, without... |
|
Experimental |
| 2449 |
Mohamed-samy2/Video-Interview-Analysis
PRVIA is an AI-powered system that automates the evaluation of pre-recorded... |
|
Experimental |
| 2450 |
zmeet-ai/asr_demo
语音识别API,分实时语音和长语音离线上传识别,支持中英文等多达100个国家的语言实时转写和同声传译 |
|
Experimental |
| 2451 |
edouardpoitras/eva
Open source voice-enabled personal assistant |
|
Experimental |
| 2452 |
heyseth/Piper_TTS
Use Piper TTS in Visual Studio Code |
|
Experimental |
| 2453 |
GoodSpeech/good-speech-web-client
Practice your speech level in any language using speech recognition |
|
Experimental |
| 2454 |
indigane/wyoming-android-tts
Use your Android device's TTS engines in Home Assistant via the Wyoming protocol. |
|
Experimental |
| 2455 |
primepake/learnable-speech
This repo is text to speech with learnable audio encoder without alignment... |
|
Experimental |
| 2456 |
sooftware/jasper
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural... |
|
Experimental |
| 2457 |
lucko515/Speech-commands-recognition
Recognizing common speech commands using Keras and Tensorflow. |
|
Experimental |
| 2458 |
IS2AI/TurkicTTS
A multilingual text-to-speech synthesis system for ten lower-resourced... |
|
Experimental |
| 2459 |
manish-4007/YT-video-Transcription
An AI tools which helps to analyze any YouTube video, give the sentiment of... |
|
Experimental |
| 2460 |
Jugendhackt/synthi-tts
Hackathon project to digitize your own voice and have it speak for you!... |
|
Experimental |
| 2461 |
llami-team/wake-me
AI-based React component library that detects clapping sounds or finger... |
|
Experimental |
| 2462 |
tsengia/JSGFKit_Plus_Plus
A C++ library for parsing and manipulating JSGF grammar files. |
|
Experimental |
| 2463 |
deeheber/text-to-speech-converter
A serverless application that converts blobs of text to speech in an audio file |
|
Experimental |
| 2464 |
qkl9527/voice-assistant
基于Funasr的[实时]AI语音助手 |
|
Experimental |
| 2465 |
chattylabs/conversational-flow
The Conversational Flow combines both native built-in resources and cloud... |
|
Experimental |
| 2466 |
aks-devs/mod_piper_tts
Freeswitch Text-to-Speech module |
|
Experimental |
| 2467 |
ankushbhatia2/django-speech-to-text
A small API for speech to text made in Django. |
|
Experimental |
| 2468 |
dalehumby/openWakeWord-rhasspy
openWakeWord for Rhasspy |
|
Experimental |
| 2469 |
Br3n0k/transcriber
AI-powered transcription for audio & video with Whisper — self-hosted, fast,... |
|
Experimental |
| 2470 |
Oct4Pie/persian-stt
A Text-To-Speech Model Developed Using 🐸STT |
|
Experimental |
| 2471 |
jefflai108/Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised... |
|
Experimental |
| 2472 |
StachePL/ExcelToAmazonPolly
Simple text-to-speech tool combining powers of Excel and Amazon Polly. |
|
Experimental |
| 2473 |
Iiridayn/pico-tts
Android PicoTTS w/C calling application using submodule |
|
Experimental |
| 2474 |
MahtaFetrat/ManaTTS-Persian-Tacotron2-Model
Tacotron2 Persian Text-to-Speech Model trained on ManaTTS, the largest open... |
|
Experimental |
| 2475 |
vasilevp/sam
SAM: Software Automatic Mouth (Ported from https://github.com/vidarh/SAM) |
|
Experimental |
| 2476 |
KoalaV2/K.A.I
Home automation program controlled by your voice. |
|
Experimental |
| 2477 |
poretsky/rulex
Russian pronunciation dictionary |
|
Experimental |
| 2478 |
shervinemami/practice_speechrec_mappings
A game to help design a better character mapping and to learn the mapping... |
|
Experimental |
| 2479 |
playerony/TensorFlowTTS-ts
This project implements TensorflowTTS in Tensorflow.js using Typescript,... |
|
Experimental |
| 2480 |
TejasQ/praise
Do stuff with your voice in the browser. |
|
Experimental |
| 2481 |
alisolphp/EchoTalk
A browser-based language training app using Shadowing technique with... |
|
Experimental |
| 2482 |
csyan5/AttnGAN-Audio-to-image-geneation
CMPT726 Machine Learning Final Project |
|
Experimental |
| 2483 |
bhashini-ai/g2p
Grapheme-to-phoneme (G2P) conversion for Tamil / Kannada languages - a... |
|
Experimental |
| 2484 |
vijethph/Insight
A Flutter app to help blind people. |
|
Experimental |
| 2485 |
X-LANCE/UniCATS-CTX-txt2vec
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS |
|
Experimental |
| 2486 |
cloudcommunity/Text-to-Speech-Engines
A list of different text to speech engines. |
|
Experimental |
| 2487 |
manab-kb/Voice-Based-Translator
A Voice Based Translator - Speak in English or any of the available selected... |
|
Experimental |
| 2488 |
CodersCreative/faster-whisper-rs
a rust crate for easily implementing faster-whisper stt into your rust programs. |
|
Experimental |
| 2489 |
uzbekvoice/UzbekVoiceBot
Current and Live Telegram bot for collecting dataset |
|
Experimental |
| 2490 |
momalekiii/VTT
Extract Speech/Text from Video |
|
Experimental |
| 2491 |
prateekralhan/Speech2Text-for-Long-Audio-Files
Perform SOTA Speech2Text on Long Audio Files with/without diarization Using... |
|
Experimental |
| 2492 |
german-asr/megs
A merged version of multiple open-source German speech datasets. |
|
Experimental |
| 2493 |
greg-kennedy/p5-NRL-TextToPhoneme
Perl implementation of the Naval Research Laboratory text-to-phoneme... |
|
Experimental |
| 2494 |
nate-russell/Scholar2Go
Make MP3 albums out of Academic PDFs. Works by gluing together Grobid and... |
|
Experimental |
| 2495 |
18F/bpa-disaster-data-portal-pilot
The scope of this task is to build a working pilot of a portal that collects... |
|
Experimental |
| 2496 |
h4rm0n1c/NetTTS
A Retro-modern SAPI 4.0 TTS Client with Network Connectivity and custom... |
|
Experimental |
| 2497 |
Zoomicon/SpeechLib
Library for Speech Synthesis and Recognition using Windows.Speech or... |
|
Experimental |
| 2498 |
nheidloff/unity-watson-vr-sample
Virtual Reality Sample using IBM Watson, Unity and Google Cardboard |
|
Experimental |
| 2499 |
asus4/unity-speech-recognizer
iOS Speech Recognizer for Unity |
|
Experimental |
| 2500 |
MycroftAI/ZZZ-RETIRED__openstt
RETIRED - OpenSTT is now retired. If you would like more information on... |
|
Experimental |