All Voice AI Tools

6,981 tools ranked by quality score · Page 32 of 70

Showing 3101–3200 of 6,981
# Tool Score Tier
3101 mrmanna/Nvidia_Nemo_FastPitch_TTS_Example

How to Build a High-Quality Text-to-Speech (TTS) System Locally with Nvidia...

25
Experimental
3102 Sumit81107/echo-tts

🔊 Create lifelike speech from text using a multi-speaker model, enhancing...

25
Experimental
3103 rock3125/tts

Simple text to speech server in docker using coqui-ai/TTS

25
Experimental
3104 The-Data-Dilemma/Medibeng-Orpheus-3b-0.1-ft-Fine-Tuning

Medibeng-Orpheus-3b-0.1-ft- A TTS model for bilingual Bengali-English...

25
Experimental
3105 mym-br/gama_tts

Experimental articulatory speech synthesizer derived from Gnuspeech

25
Experimental
3106 railmapgen/rma

Generate the rail announcement from your rmg project!

25
Experimental
3107 MahtaFetrat/VirgoolInformal-Speech-Dataset

A dataset of informal Persian audio and text chunks, along with a fully open...

25
Experimental
3108 AssemblyAI-Community/assemblyai-and-python-in-5-minutes

Repo for hosting tutorial code associated with the "AssemblyAI and Python in...

25
Experimental
3109 Leapward-Koex/Namida-OCR

A purely browser-based OCR tool designed for recognizing, copying, and...

25
Experimental
3110 dpressel/reserve

FastAPI + WebSockets + SSE service to interface with Triton/Riva ASR

25
Experimental
3111 helemanc/ambient-intelligence

Application for Disruptive Situations Detection in public transports through...

25
Experimental
3112 botbahlul/VOSK-Powered-LIVE-SUBTITLE-V2

ANDROID APP that can RECOGNIZE LIVE AUDIO/VIDEO STREAMING (using free VOSK...

25
Experimental
3113 aloproducao/Live-captions-for-broadcast

The Real-Time Speech Recognition System is an innovative tool designed to...

25
Experimental
3114 Baibhav-nag/SER-using-MLP-and-CNN

Speech emotion recognition using MLP and CNN on four benchmark datasets...

25
Experimental
3115 koesan/Evoars

A multi-model AI platform for comics, manga, and videos. It colorizes...

25
Experimental
3116 BobRandomNumber/ComfyUI-KyutaiTTS

A non real-time ComfyUI implementation of Kyutai TTS

25
Experimental
3117 itning/hass-aliyun_bailian_tts

Home Assistant integrates Alibaba Cloud's BaiLian Platform TTS

25
Experimental
3118 AfkaraLP/qwen3-tts-webui

A Simple webui and api for cloning voices with Qwen3-TTS

25
Experimental
3119 tjysdsg/capt-public

Public version of my Computer-Aided Pronunciation Training (CAPT) system (server)

25
Experimental
3120 slayerrr12/WaveSlayer

ai chatbot that uses speech to operate and respond

25
Experimental
3121 gfrangiamone/audiobook-maker

Web tool to convert an EPUB or TXT book into an audiobook via the edge_tts library

25
Experimental
3122 ckaznable/yt-cli-live

Youtube Text Live Streaming in CLI

25
Experimental
3123 zugaldia/speedofsound

Voice typing for the Linux desktop.

25
Experimental
3124 NN-Project-2/Emotion-TTS-Emebddings

This project explores zero-shot emotional speech synthesis using EMOD, a...

25
Experimental
3125 srvk/srvk-eesen-offline-transcriber

Top level code to transcribe English audio/video files into text/subtitles

25
Experimental
3126 arthurxlw/cytonNss

Cyton Online Neural Sentence Segmentation for Simultaneous Interpretation

25
Experimental
3127 netbuffer/android-technology-test

Android technology tests using the Java language: DI, Handler, Hilt, Scheduler, TTS, Log...

25
Experimental
3128 yuvraj108c/ComfyUI-PiperTTS

ComfyUI Piper TTS Custom Node

25
Experimental
3129 DmytroNorth/Automated_Subtitles_Generation-Regex_Java

An automated workflow that generates timestamped subtitles from a video file...

25
Experimental
3130 sanyasamineva0x/govorun-app

Govorun: offline Russian voice input for macOS (GigaAM-v3 + Silero VAD)

25
Experimental
3131 ibelgin/Text-To-Speech-App

This App is Made Using React Native.

25
Experimental
3132 QuantiusBenignus/NoteWhispers

Voice memos recorded from the microphone, transcribed offline to text and...

25
Experimental
3133 FragJage/PicoVoiceCpp

PicoVoiceCpp is a simple TTS (text to speech) class based on picovoice (svox).

25
Experimental
3134 nodef/extra-tts

Generate speech audio from super long text through machine.

25
Experimental
3135 CingZeoi/YDVoiceTTS

Chinese TTS from Yongde screen reader

25
Experimental
3136 jarmitage/tts-cli

Simple CLI app for TTS

25
Experimental
3137 dwain-barnes/chatterbox-streaming-api-docker

Chatterbox with OpenAI-compatible endpoints, streaming support, multiple...

25
Experimental
3138 pnkvalavala/multivoice

Multivoice: Enhance your foreign-language movie and TV show experience with...

25
Experimental
3139 jmdlab/vesper

Therapeutic audio pipeline. Faith meets science. Free, static, open source.

25
Experimental
3140 nilakshdas/ADAGIO

Adversarial Defense for Audio in a Gadget with Interactive Operations

25
Experimental
3141 ramsrk7/AIVoxPlay

AI-powered real-time voice interaction framework for building conversational...

25
Experimental
3142 shreyasnisal/VoiceQuiz-v2

Version 2 of the quiz-app; this is the repository for the voice-based quiz....

25
Experimental
3143 ry-sun/bob-plugin-openai-tts

OpenAI TTS for Bob Plugin is a TTS plugin for Bob, a brilliant translation...

25
Experimental
3144 louiscoetzee/mlx-tts-studio

Native macOS text-to-speech app powered by Qwen3-TTS and Apple Silicon...

25
Experimental
3145 KVarnitZ/Total-Tank-Simulator-UA

Ukrainian localization for TTS that fully adds the language as a separate option (translated from...

25
Experimental
3146 abumubaarak/Wellbeing-Doctor

Doctor management app

25
Experimental
3147 rohanmistry231/Voice-Assistant

A Python-based voice assistant that processes voice commands to perform...

25
Experimental
3148 zhongyuchen/speech-classification

CNN and VGG speech classification with interactive website for testing

25
Experimental
3149 circle-hotaru/talk-boost-ai

A web application that utilizes AI to help you improve your English speaking...

24
Experimental
3150 elie-atia/talk-to-chat-gpt

Enables talking to ChatGPT using voice-to-text (record and recognize the...

24
Experimental
3151 sidphbot/visual-to-audio-aid-for-visually-impaired

A system to process visual input on timed frames to produce sensible audio...

24
Experimental
3152 luciferchase/chase_hospitals

This is a GUI based Python connectivity project on Hospital Management. The...

24
Experimental
3153 Sidra-009/AI-Interview-Coach

AuraCoach is an AI-powered interview coach that generates personalized...

24
Experimental
3154 taeefnajib/Vocazee

A voice cloning and text-to-speech application that can generate speech in any voice.

24
Experimental
3155 SzLeaves/asr-model-ctc

ASR deep learning models (using BiGRU, WaveNet, and CTC), built with TensorFlow 2...

24
Experimental
3156 adi611/The-CheatGPT

Python application that uses GPT-3 language model and Pinecone vector...

24
Experimental
3157 german-asr/kaldi-german

Scripts for training Kaldi for German speech recognition (ASR).

24
Experimental
3158 Iroha-P/MiniBox

Character voice chatbot with GPT-SoVITS TTS + LLM role-playing, supports Web...

24
Experimental
3159 Saga9103/t2yLLM

A voice assistant with local LLM as a backend

24
Experimental
3160 flexhub77/piper-tts-call

🎙️ Generate high-quality audio from text in real-time with Piperin, the...

24
Experimental
3161 developer-mezbah/Mock-Test-UI

Practice IELTS, TOEFL, & PTE speaking online. This web app offers full test...

24
Experimental
3162 binglel/asr_baidu_web_server

asr web server based on flask

24
Experimental
3163 myths-labs/prometheus-avatar

Open-source SDK for driving Live2D & 3D avatars with LLM output. Give your AI a face.

24
Experimental
3164 dangkhoadl/WER-in-cpp

Calculates the word error rate between the reference and hypothesis in ASR,...

24
Experimental
3165 buddheshwarnath/blurtpy

Offline, cross-platform Python text-to-speech and sound notifications....

24
Experimental
3166 mcw519/Brownie

Post processing for speech recognition

24
Experimental
3167 vinbhaskara/Digit-Speech-Recognition

Using MFCC features on Speech Signals to classify Digits after matching...

24
Experimental
3168 hi5/nvda-autohotkey

NVDA and AutoHotkey - Text to Speech (TTS) and Braille from AHK scripts

24
Experimental
3169 nisiddharth/TextToSpeech

A Simple Java based Text to Speech converter made using NetBeans 8.2

24
Experimental
3170 answersolutionsapps/runandread-ios

Ultimate Text-to-Speech and Audiobook Player for iOS, macOS

24
Experimental
3171 naturalDesign/fusion-remote

Chatbot for Autodesk Fusion 360 with speech recognition

24
Experimental
3172 TCL606/Speech-Number-Recognition

Spoken digit recognizer based on digital signal processing

24
Experimental
3173 juan-csv/Alfred-assistant

Assistant in which you can program any type of action in the python...

24
Experimental
3174 marttirandma/tipi

Tipi Web v2

24
Experimental
3175 backpropper/DNN-Activation-Brain

Code repository for Dissecting the DNN Brain for a Better Insight (ICASSP 2016)

24
Experimental
3176 artemnikitin/tts-test-app

Android app for testing Text-to-speech stuff

24
Experimental
3177 mmphego/medium-to-speech

Medium posts as Markdown to Speech.

24
Experimental
3178 egorsmkv/radtts-uk

🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model

24
Experimental
3179 Inviro/Illud

Illud is a smart text analyzer written in pure Java that displays different...

24
Experimental
3180 sidagarwal04/SpeechRecognition-Sphinx-GCP

Speech Recognition on edge using CMU Sphinx and on cloud using Google Cloud...

24
Experimental
3181 auroraapi/aurora-python

Aurora SDK for Python

24
Experimental
3182 ganlvtech/bing-stt

Rust implementation of bing "Search using voice" button speech recognition...

24
Experimental
3183 denizariyan/Real-Time-Auto-Transcriber

Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to...

24
Experimental
3184 stefantaubert/english-text-normalization

Command-line interface (CLI) and library to normalize English texts.

24
Experimental
3185 tigjaw/remyme

ReMyMe - a basic "Read My Messages" Android application (old)

24
Experimental
3186 alfianlosari/flutter_chatbot_inventory

Chatbot Flutter App used to track inventory of product and description using...

24
Experimental
3187 Raj2503/Python-Text-To-Speech-Hindi

Python Hindi Concatenative Based TTS using Phoneme Database

24
Experimental
3188 amelielavender/voicely

A Discord bot that transmits text-to-speech messages directly to voice channels

24
Experimental
3189 jhdeov/armenian-intonation

Repository of question-answer dialogues of Armenian, for an intonation study.

24
Experimental
3190 alihassanml/Voice-Controlled-Agentic-AI-Bot

A real-time voice assistant powered by Ollama, Piper TTS, and...

24
Experimental
3191 Philipinho/ThreadVoice

Source code for https://twitter.com/threadvoice

24
Experimental
3192 RavnOP/Vehicle-Speed-and-Type-Detection-Using-YoloV8

This system uses artificial intelligence to detect vehicles in video...

24
Experimental
3193 Arbazkhan4712/Text-To-Speech

A program that can convert Text into Speech using python

24
Experimental
3194 PiasRoY/Bangla-Spoken-Number-Recognition

recognizing spoken Bangla numbers using MFCCs and CNN.

24
Experimental
3195 JaesungBae/Speech-Command-Recognition-with-Capsule-Network

Speech command recognition with capsule network & various NNs / KWS on...

24
Experimental
3196 mehdichaouch/nabstory

Let your Nabaztag 🐰 read you a story 📖

24
Experimental
3197 franchesoni/s2t

Speech-to-text on key for Linux

24
Experimental
3198 ggh-png/EMOTIBOT

EMOTIBOT: an emotion robot using the GPT-3.5 model

24
Experimental
3199 M-SRIKAR-VARDHAN/speech-to-speech-with-lipsync

End-to-end speech-to-speech translation pipeline with voice cloning (RVC)...

24
Experimental
3200 pigzach/MagicSpeechASR

magicspeech competition recipe

24
Experimental