All Voice AI Tools

6,981 tools ranked by quality score · Page 48 of 70

Showing 4701–4800 of 6,981
# Tool Score Tier
4701 thewh1teagle/heb-piper-tts-gemma-g2p-onnx

Text to speech with Hebrew G2P and TTS models based on Piper/Gemma3

18
Experimental
4702 egorsmkv/radtts-hifigan

RADTTS + HiFiGAN vocoder

18
Experimental
4703 IseduardoRezende/IAParty

Profile/Persona Call using LLM

18
Experimental
4704 p-jacobo2012240/AI-Real-Time-Recognition

Tensorflow app for real-time environment sketching using text-to-speech and GCP

18
Experimental
4705 SKLD-xm/speechy

A text-to-speech synthesizer based on C# that supports SSML

18
Experimental
4706 NAVI-TTS/NAVI-TTS.github.io

The NAVI's Text-To-Speech System for VLSP 2021

18
Experimental
4707 Vanthink-UED/vanspeak

a plugin for text to speech

18
Experimental
4708 ColsonZhang/ASR-SPN

基于spn网络构建的孤立词语音识别模型,训练前处理过程使用了端点检测、mfcc特征提取和编码压缩以及DTW对齐算法。

18
Experimental
4709 SnoutBug/csgo-tts

Reading Counter-Strike: Global Offensive in-game chat with a synthesized voice

18
Experimental
4710 vivek-nexus/lizen

A text to speech web application that speaks word, sentences or even reads...

18
Experimental
4711 EmanuelAlogna/Gender-Classification-using-ML

Gender Classification with different Machine Learning models, using the...

18
Experimental
4712 navalnica/be_nlp_speech_resources

Links to Belarusian NLP and Speech resources

18
Experimental
4713 Shangamesh2805/TECH_OCULAR--AI-BASED-AUDIO-TRANSCRIBER-FOR-VISUALLY-IMPAIRED

Smart eye glasses, it is an AI based audio transcriber for visually...

18
Experimental
4714 BrunoHenrique00/ear

Ear is a desktop app that will help you transcribe what is playing on your computer!

18
Experimental
4715 kyopark2014/demo-robo-soulmate

It is a repository to prepare a demo for dansing robot.

18
Experimental
4716 Bangla-Language-Processing/Bangla-Speech-Corpora

Bangla cleaned speech corpus, specially developed for Bangla Text to Speech

18
Experimental
4717 inevolin/DiscordEarsGo

A speech-to-text framework and bot for Discord written in GoLang. Take...

18
Experimental
4718 stephenswetonic/ytpai

AI powered ytp/sentence mixing for audio and video.

18
Experimental
4719 sasharun/awesome-faceless

A curated list of 50+ AI tools for faceless YouTube content creators. Voice,...

18
Experimental
4720 tuanio/deepspeech-ctc

Deepspeech with ctc loss on Vivos Vietnamese Dataset

18
Experimental
4721 YoungloLee/tf2-speech-recognition-las

Tensorflow 2 Speech Recognition Code (LAS)

18
Experimental
4722 harmindersinghnijjar/streamlit-punjabi-ai

Punjabi AI, ChatGPT with translation and Punjabi TTS using Narakeet's API.

18
Experimental
4723 abhinav-bohra/THAT

A webapp to improve the online learning experience of people with hearing impairment.

18
Experimental
4724 ComputerCampaign/contentflow-ai

一个功能强大的Python工具,集成网页图片爬取和博客自动生成功能。支持XPath规则配置、任务ID管理、Selenium动态加载、GitHub图床上传、...

18
Experimental
4725 tarponjargon/clipcast

Convert news articles, blog posts (and more) into audio podcast episodes...

18
Experimental
4726 BraianMendes/Clold-Project

Mobile app using ALTU AI, integrated into an embedded device. Made with...

18
Experimental
4727 europanite/client_side_audio_transcription

A Browser-Based AI Audio Transcription Playground Powered by Whisper.

18
Experimental
4728 klee-repos/dialogflow-voice-streaming

Intent mapping with real-time voice to text stream

18
Experimental
4729 bunyaminergen/awesome-speech-dataset

Awesome Speech Dataset, including download links and a brief explanation for...

18
Experimental
4730 Leonqn/speech-to-text-bot

Speech to text telegram bot. It can convert voice and video note messages to...

18
Experimental
4731 hernanrazo/human-voice-detection

Binary classification problem that aims to classify human voices from audio...

18
Experimental
4732 ybenkirane/AI_Tutor

An LLM-powered automated tutoring program that will converse with you on any...

18
Experimental
4733 sanastasiou/dictation-service

GPU-accelerated speech-to-text service that types what you say, powered by...

18
Experimental
4734 avikantz/Samaritan

Samaritan demo clone for iOS.

18
Experimental
4735 SaiHarsha9992/Leo-Ai-Assistant-2.0

Leo is a 3D interactive AI assistant built with React Three Fiber,...

18
Experimental
4736 Shristirajpoot/CalcVoive

🎙️ Voice-enabled calculator built with React | Supports speech input/output...

18
Experimental
4737 thewh1teagle/zipvoice-onnx

TTS with ZipVoice and onnxruntime

18
Experimental
4738 ParasAvkirkar/MagooshHelper

A simple application based on words published by Magoosh Vocabulary Flash...

18
Experimental
4739 ssumin6/Korean-TTS-Server

Korean text-to-speech

18
Experimental
4740 djleamen/renamer

Utility to rename mp3 files based on speech content

18
Experimental
4741 old-cookie/dr_ai

AI-powered health assistant Flutter app featuring voice interaction, smart...

18
Experimental
4742 oc8/YouTranslate

Chrome extension that reads YouTube video subtitles out loud

18
Experimental
4743 dsidlo/FlexiTTS

A simple powerful workflow for Text to AudioBook creation. Uses realistic AI...

18
Experimental
4744 Better-Than-You/brainrotbot

BrainRotBot is a Python-based Reddit video maker bot that automates creating...

18
Experimental
4745 habibimustafa/VoiceBot

Simple Voice Bot using IBM Watson Service

18
Experimental
4746 chstan/voice-notes

Personal notes transcription (AWS Transcribe) and Notion integration.

18
Experimental
4747 JagratiVerma1408/ObjectDetectionApplication

Andriod app integrating tflite model for object detection

17
Experimental
4748 prabormukherjee/Coursera_Helper_chatbot

A chatbot to help coursera student with their difficulty.

17
Experimental
4749 LexicalStressDetection/lexical-stress-detection

Deep Learning model for lexical stress detection in spoken English

17
Experimental
4750 Sergey004/silero_tts_rvc

A simple extension that allows LLM to speak in any voice, literally, based...

17
Experimental
4751 akashchaudhary-git/android-azure-speech-openai

An integration of Azure Speech Service and Azure OpenAI in Android. This...

17
Experimental
4752 Manokero/face-recognition-and-tts-numbers

En este proyecto se utiliza reconocimiento facial para verificar una persona...

17
Experimental
4753 Erfanafshar/speech-gender-detection

An audio signal processing project that detects speaker gender from recorded...

17
Experimental
4754 jagerzhang/FastTTS

基于edge-tts的简单语音合成服务,支持私有化部署,支持和源阅读APP无缝对接。

17
Experimental
4755 ZhanpengWang96/pytorch-speech2vec

Pytorch implementation of the paper Speech2Vec: A Sequence-to-Sequence...

17
Experimental
4756 reuAC/reCosyVoiceService

A text-to-speech service built with CosyVoice2, with multi-node concurrency...

17
Experimental
4757 Tim55667757/AudioGenerator

Озвучка русских и иностранных текстов через платформу OpenAI

17
Experimental
4758 florabtw/google-translate-tts

Node library for Google Translate TTS (Text-to-Speech) API

17
Experimental
4759 toshalpatel/AudioSimilarity

When two audio files compared, the result is giving the similar part from...

17
Experimental
4760 umitkacar/transformer-asr-transcription

Real-time transformer-based ASR supporting 100+ languages - Google Cloud...

17
Experimental
4761 tubexchat/interpreter-zh2en-gemini

An interpreter web app between Chinese and English that is powered by Gemini-2.0-fash

17
Experimental
4762 asheghi/text-to-speech

Text to Speech

17
Experimental
4763 IRSPlays/ProjectCortexV2

A $300 wearable that gives visually impaired users real-time scene...

17
Experimental
4764 erogol/TTS_tf

WIP Tensorflow implementation of https://github.com/mozilla/TTS

17
Experimental
4765 sandeepswain54/Yukti-Care

Yukti Care is a mobile app that enables pharmacies, medical distributors,...

17
Experimental
4766 vishal1patidar/TEXT-TO-SPEAK

🔖24 Different Languages voice's Add a text🗨️ in it and listen👂

17
Experimental
4767 andydowsen/voice-assistant

🏳🌌♨ Simple voice assistant with minimal ai logics includes streamlit web...

17
Experimental
4768 Ani0202/Speech-Translation-with-Python

Translate your speech to many languages using Google Translate API

17
Experimental
4769 Atamyrat2005/text-to-speech

There are several APIs available to convert text to speech in Python. One of...

17
Experimental
4770 charles-forsyth/generate-tts

A professional CLI for Google Gemini's Native 2.5 TTS. Generate...

17
Experimental
4771 Winnie-Fred/text-to-speech

Text-to-speech web-based application using Django and Google Translate...

17
Experimental
4772 EGWeeks/translate_tts_api

AWS Translate & Text to Speech API Javascript Example

17
Experimental
4773 Clebson-Torres/WinVoice

An offline voice assistant for Windows, utilizing local AI (Ollama) and...

17
Experimental
4774 abhijhacodes/PDF_to_AudioBook_converter

Python code that converts any pdf file into audiobook

17
Experimental
4775 dongheehand/Tacotron-PyTorch

PyTorch implementation of Tacotron

17
Experimental
4776 ekdysis/Speech-POC

POC using Apple's Speech framework demonstrating real-time speech...

17
Experimental
4777 atanu20/alan-ai-news-project

Here i build a Conversational Voice Controlled React News Application using...

17
Experimental
4778 clarenceluo78/singer-adaptive-svc

This repository is the implementation of project Converting to Realistic...

17
Experimental
4779 raminnakhli/HMM-DNN-Speech-Recognition

This repository is a Python implementation of HMM-DNN model.

17
Experimental
4780 lliWcWill/maVoice-Linux

🎙️ Lightning-fast voice dictation Desktop Web App powered by Groq's Whisper...

17
Experimental
4781 wujunwei928/go-zero-tts

基于微软edge大声朗读接口开发的语音合成服务, 后端 go-zero, 前端 vuetify

17
Experimental
4782 xujiaao/BezierSpline

Android - Smooth Bézier Spline Through Prescribed Points

17
Experimental
4783 lianabisuna/spelltacular

Random word spelling skills test/practice (Vue.js 2 & Vuetify)

17
Experimental
4784 garconvacher/TextToSpeech_eBook

Un kit de test pour la synthèse vocale eBook (EPUB + Kindle)

17
Experimental
4785 BinkyWong/speech-recognition

Centos 7 based container for speech recognition

17
Experimental
4786 salehsargolzaee/Audio-Signal-Processing-and-Feature-Extraction

Feature extraction from audio signal (explained in Persian)

17
Experimental
4787 dangvansam/deepxi-flask-server

DeepXi with Flask Server

17
Experimental
4788 gillan-krishna/meeting_notes

Hobby project to transcribe audio files from meetings to transcripts with a summary

17
Experimental
4789 Momotoculteur/Keyword-voice-recognition

Créer une reconnaissance vocale de mots clés via des algorithmes...

17
Experimental
4790 joaoalvarenga/voice-assistant

An open-source Alexa-like complete voice assistant system, from speech...

17
Experimental
4791 sindhura-pv/lip-reading

In this project, visual speech recognition has been attempted using 2 major...

17
Experimental
4792 proger/uk

Фонограми та синтагми: інструменти обробки

17
Experimental
4793 harshshirke66/AntarVani

AntarVani – Neural Brain-to-Speech Demo A real-time thought-to-speech...

17
Experimental
4794 FernandoLpz/SpeechRecognition

This repository contains the implementation of an Automatic Speech...

17
Experimental
4795 mbailey/push2type

Turn CAPSLOCK key into Dictation Key

17
Experimental
4796 CrankZ/muyi

本地字幕生成与翻译,支持显卡加速

17
Experimental
4797 Bacdong/virtual-assistant-v1

Learning build virtual assistant with python and python library support.

17
Experimental
4798 Sxriptor/Whispra-Download

Whispra's Offical Download | AI-powered real-time voice and subtitle...

17
Experimental
4799 baharudin-yusup/salingsapa

A video call apps to enable deaf people to communicate with normal people...

17
Experimental
4800 yiwise/yiwise-asr-demo-java

杭州一知智能科技有限公司自研 ASR Java客户端demo

17
Experimental
« Prev 1 2 3 46 47 48 49 50 68 69 70 Next »