All Voice AI Tools
6,981 tools ranked by quality score · Page 48 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 4701 |
thewh1teagle/heb-piper-tts-gemma-g2p-onnx
Text to speech with Hebrew G2P and TTS models based on Piper/Gemma3 |
|
Experimental |
| 4702 |
egorsmkv/radtts-hifigan
RADTTS + HiFiGAN vocoder |
|
Experimental |
| 4703 |
IseduardoRezende/IAParty
Profile/Persona Call using LLM |
|
Experimental |
| 4704 |
p-jacobo2012240/AI-Real-Time-Recognition
Tensorflow app for real-time environment sketching using text-to-speech and GCP |
|
Experimental |
| 4705 |
SKLD-xm/speechy
A text-to-speech synthesizer based on C# that supports SSML |
|
Experimental |
| 4706 |
NAVI-TTS/NAVI-TTS.github.io
The NAVI's Text-To-Speech System for VLSP 2021 |
|
Experimental |
| 4707 |
Vanthink-UED/vanspeak
a plugin for text to speech |
|
Experimental |
| 4708 |
ColsonZhang/ASR-SPN
基于spn网络构建的孤立词语音识别模型,训练前处理过程使用了端点检测、mfcc特征提取和编码压缩以及DTW对齐算法。 |
|
Experimental |
| 4709 |
SnoutBug/csgo-tts
Reading Counter-Strike: Global Offensive in-game chat with a synthesized voice |
|
Experimental |
| 4710 |
vivek-nexus/lizen
A text to speech web application that speaks word, sentences or even reads... |
|
Experimental |
| 4711 |
EmanuelAlogna/Gender-Classification-using-ML
Gender Classification with different Machine Learning models, using the... |
|
Experimental |
| 4712 |
navalnica/be_nlp_speech_resources
Links to Belarusian NLP and Speech resources |
|
Experimental |
| 4713 |
Shangamesh2805/TECH_OCULAR--AI-BASED-AUDIO-TRANSCRIBER-FOR-VISUALLY-IMPAIRED
Smart eye glasses, it is an AI based audio transcriber for visually... |
|
Experimental |
| 4714 |
BrunoHenrique00/ear
Ear is a desktop app that will help you transcribe what is playing on your computer! |
|
Experimental |
| 4715 |
kyopark2014/demo-robo-soulmate
It is a repository to prepare a demo for dansing robot. |
|
Experimental |
| 4716 |
Bangla-Language-Processing/Bangla-Speech-Corpora
Bangla cleaned speech corpus, specially developed for Bangla Text to Speech |
|
Experimental |
| 4717 |
inevolin/DiscordEarsGo
A speech-to-text framework and bot for Discord written in GoLang. Take... |
|
Experimental |
| 4718 |
stephenswetonic/ytpai
AI powered ytp/sentence mixing for audio and video. |
|
Experimental |
| 4719 |
sasharun/awesome-faceless
A curated list of 50+ AI tools for faceless YouTube content creators. Voice,... |
|
Experimental |
| 4720 |
tuanio/deepspeech-ctc
Deepspeech with ctc loss on Vivos Vietnamese Dataset |
|
Experimental |
| 4721 |
YoungloLee/tf2-speech-recognition-las
Tensorflow 2 Speech Recognition Code (LAS) |
|
Experimental |
| 4722 |
harmindersinghnijjar/streamlit-punjabi-ai
Punjabi AI, ChatGPT with translation and Punjabi TTS using Narakeet's API. |
|
Experimental |
| 4723 |
abhinav-bohra/THAT
A webapp to improve the online learning experience of people with hearing impairment. |
|
Experimental |
| 4724 |
ComputerCampaign/contentflow-ai
一个功能强大的Python工具,集成网页图片爬取和博客自动生成功能。支持XPath规则配置、任务ID管理、Selenium动态加载、GitHub图床上传、... |
|
Experimental |
| 4725 |
tarponjargon/clipcast
Convert news articles, blog posts (and more) into audio podcast episodes... |
|
Experimental |
| 4726 |
BraianMendes/Clold-Project
Mobile app using ALTU AI, integrated into an embedded device. Made with... |
|
Experimental |
| 4727 |
europanite/client_side_audio_transcription
A Browser-Based AI Audio Transcription Playground Powered by Whisper. |
|
Experimental |
| 4728 |
klee-repos/dialogflow-voice-streaming
Intent mapping with real-time voice to text stream |
|
Experimental |
| 4729 |
bunyaminergen/awesome-speech-dataset
Awesome Speech Dataset, including download links and a brief explanation for... |
|
Experimental |
| 4730 |
Leonqn/speech-to-text-bot
Speech to text telegram bot. It can convert voice and video note messages to... |
|
Experimental |
| 4731 |
hernanrazo/human-voice-detection
Binary classification problem that aims to classify human voices from audio... |
|
Experimental |
| 4732 |
ybenkirane/AI_Tutor
An LLM-powered automated tutoring program that will converse with you on any... |
|
Experimental |
| 4733 |
sanastasiou/dictation-service
GPU-accelerated speech-to-text service that types what you say, powered by... |
|
Experimental |
| 4734 |
avikantz/Samaritan
Samaritan demo clone for iOS. |
|
Experimental |
| 4735 |
SaiHarsha9992/Leo-Ai-Assistant-2.0
Leo is a 3D interactive AI assistant built with React Three Fiber,... |
|
Experimental |
| 4736 |
Shristirajpoot/CalcVoive
🎙️ Voice-enabled calculator built with React | Supports speech input/output... |
|
Experimental |
| 4737 |
thewh1teagle/zipvoice-onnx
TTS with ZipVoice and onnxruntime |
|
Experimental |
| 4738 |
ParasAvkirkar/MagooshHelper
A simple application based on words published by Magoosh Vocabulary Flash... |
|
Experimental |
| 4739 |
ssumin6/Korean-TTS-Server
Korean text-to-speech |
|
Experimental |
| 4740 |
djleamen/renamer
Utility to rename mp3 files based on speech content |
|
Experimental |
| 4741 |
old-cookie/dr_ai
AI-powered health assistant Flutter app featuring voice interaction, smart... |
|
Experimental |
| 4742 |
oc8/YouTranslate
Chrome extension that reads YouTube video subtitles out loud |
|
Experimental |
| 4743 |
dsidlo/FlexiTTS
A simple powerful workflow for Text to AudioBook creation. Uses realistic AI... |
|
Experimental |
| 4744 |
Better-Than-You/brainrotbot
BrainRotBot is a Python-based Reddit video maker bot that automates creating... |
|
Experimental |
| 4745 |
habibimustafa/VoiceBot
Simple Voice Bot using IBM Watson Service |
|
Experimental |
| 4746 |
chstan/voice-notes
Personal notes transcription (AWS Transcribe) and Notion integration. |
|
Experimental |
| 4747 |
JagratiVerma1408/ObjectDetectionApplication
Andriod app integrating tflite model for object detection |
|
Experimental |
| 4748 |
prabormukherjee/Coursera_Helper_chatbot
A chatbot to help coursera student with their difficulty. |
|
Experimental |
| 4749 |
LexicalStressDetection/lexical-stress-detection
Deep Learning model for lexical stress detection in spoken English |
|
Experimental |
| 4750 |
Sergey004/silero_tts_rvc
A simple extension that allows LLM to speak in any voice, literally, based... |
|
Experimental |
| 4751 |
akashchaudhary-git/android-azure-speech-openai
An integration of Azure Speech Service and Azure OpenAI in Android. This... |
|
Experimental |
| 4752 |
Manokero/face-recognition-and-tts-numbers
En este proyecto se utiliza reconocimiento facial para verificar una persona... |
|
Experimental |
| 4753 |
Erfanafshar/speech-gender-detection
An audio signal processing project that detects speaker gender from recorded... |
|
Experimental |
| 4754 |
jagerzhang/FastTTS
基于edge-tts的简单语音合成服务,支持私有化部署,支持和源阅读APP无缝对接。 |
|
Experimental |
| 4755 |
ZhanpengWang96/pytorch-speech2vec
Pytorch implementation of the paper Speech2Vec: A Sequence-to-Sequence... |
|
Experimental |
| 4756 |
reuAC/reCosyVoiceService
A text-to-speech service built with CosyVoice2, with multi-node concurrency... |
|
Experimental |
| 4757 |
Tim55667757/AudioGenerator
Озвучка русских и иностранных текстов через платформу OpenAI |
|
Experimental |
| 4758 |
florabtw/google-translate-tts
Node library for Google Translate TTS (Text-to-Speech) API |
|
Experimental |
| 4759 |
toshalpatel/AudioSimilarity
When two audio files compared, the result is giving the similar part from... |
|
Experimental |
| 4760 |
umitkacar/transformer-asr-transcription
Real-time transformer-based ASR supporting 100+ languages - Google Cloud... |
|
Experimental |
| 4761 |
tubexchat/interpreter-zh2en-gemini
An interpreter web app between Chinese and English that is powered by Gemini-2.0-fash |
|
Experimental |
| 4762 |
asheghi/text-to-speech
Text to Speech |
|
Experimental |
| 4763 |
IRSPlays/ProjectCortexV2
A $300 wearable that gives visually impaired users real-time scene... |
|
Experimental |
| 4764 |
erogol/TTS_tf
WIP Tensorflow implementation of https://github.com/mozilla/TTS |
|
Experimental |
| 4765 |
sandeepswain54/Yukti-Care
Yukti Care is a mobile app that enables pharmacies, medical distributors,... |
|
Experimental |
| 4766 |
vishal1patidar/TEXT-TO-SPEAK
🔖24 Different Languages voice's Add a text🗨️ in it and listen👂 |
|
Experimental |
| 4767 |
andydowsen/voice-assistant
🏳🌌♨ Simple voice assistant with minimal ai logics includes streamlit web... |
|
Experimental |
| 4768 |
Ani0202/Speech-Translation-with-Python
Translate your speech to many languages using Google Translate API |
|
Experimental |
| 4769 |
Atamyrat2005/text-to-speech
There are several APIs available to convert text to speech in Python. One of... |
|
Experimental |
| 4770 |
charles-forsyth/generate-tts
A professional CLI for Google Gemini's Native 2.5 TTS. Generate... |
|
Experimental |
| 4771 |
Winnie-Fred/text-to-speech
Text-to-speech web-based application using Django and Google Translate... |
|
Experimental |
| 4772 |
EGWeeks/translate_tts_api
AWS Translate & Text to Speech API Javascript Example |
|
Experimental |
| 4773 |
Clebson-Torres/WinVoice
An offline voice assistant for Windows, utilizing local AI (Ollama) and... |
|
Experimental |
| 4774 |
abhijhacodes/PDF_to_AudioBook_converter
Python code that converts any pdf file into audiobook |
|
Experimental |
| 4775 |
dongheehand/Tacotron-PyTorch
PyTorch implementation of Tacotron |
|
Experimental |
| 4776 |
ekdysis/Speech-POC
POC using Apple's Speech framework demonstrating real-time speech... |
|
Experimental |
| 4777 |
atanu20/alan-ai-news-project
Here i build a Conversational Voice Controlled React News Application using... |
|
Experimental |
| 4778 |
clarenceluo78/singer-adaptive-svc
This repository is the implementation of project Converting to Realistic... |
|
Experimental |
| 4779 |
raminnakhli/HMM-DNN-Speech-Recognition
This repository is a Python implementation of HMM-DNN model. |
|
Experimental |
| 4780 |
lliWcWill/maVoice-Linux
🎙️ Lightning-fast voice dictation Desktop Web App powered by Groq's Whisper... |
|
Experimental |
| 4781 |
wujunwei928/go-zero-tts
基于微软edge大声朗读接口开发的语音合成服务, 后端 go-zero, 前端 vuetify |
|
Experimental |
| 4782 |
xujiaao/BezierSpline
Android - Smooth Bézier Spline Through Prescribed Points |
|
Experimental |
| 4783 |
lianabisuna/spelltacular
Random word spelling skills test/practice (Vue.js 2 & Vuetify) |
|
Experimental |
| 4784 |
garconvacher/TextToSpeech_eBook
Un kit de test pour la synthèse vocale eBook (EPUB + Kindle) |
|
Experimental |
| 4785 |
BinkyWong/speech-recognition
Centos 7 based container for speech recognition |
|
Experimental |
| 4786 |
salehsargolzaee/Audio-Signal-Processing-and-Feature-Extraction
Feature extraction from audio signal (explained in Persian) |
|
Experimental |
| 4787 |
dangvansam/deepxi-flask-server
DeepXi with Flask Server |
|
Experimental |
| 4788 |
gillan-krishna/meeting_notes
Hobby project to transcribe audio files from meetings to transcripts with a summary |
|
Experimental |
| 4789 |
Momotoculteur/Keyword-voice-recognition
Créer une reconnaissance vocale de mots clés via des algorithmes... |
|
Experimental |
| 4790 |
joaoalvarenga/voice-assistant
An open-source Alexa-like complete voice assistant system, from speech... |
|
Experimental |
| 4791 |
sindhura-pv/lip-reading
In this project, visual speech recognition has been attempted using 2 major... |
|
Experimental |
| 4792 |
proger/uk
Фонограми та синтагми: інструменти обробки |
|
Experimental |
| 4793 |
harshshirke66/AntarVani
AntarVani – Neural Brain-to-Speech Demo A real-time thought-to-speech... |
|
Experimental |
| 4794 |
FernandoLpz/SpeechRecognition
This repository contains the implementation of an Automatic Speech... |
|
Experimental |
| 4795 |
mbailey/push2type
Turn CAPSLOCK key into Dictation Key |
|
Experimental |
| 4796 |
CrankZ/muyi
本地字幕生成与翻译,支持显卡加速 |
|
Experimental |
| 4797 |
Bacdong/virtual-assistant-v1
Learning build virtual assistant with python and python library support. |
|
Experimental |
| 4798 |
Sxriptor/Whispra-Download
Whispra's Offical Download | AI-powered real-time voice and subtitle... |
|
Experimental |
| 4799 |
baharudin-yusup/salingsapa
A video call apps to enable deaf people to communicate with normal people... |
|
Experimental |
| 4800 |
yiwise/yiwise-asr-demo-java
杭州一知智能科技有限公司自研 ASR Java客户端demo |
|
Experimental |