All Voice AI Tools

6,981 tools ranked by quality score · Page 23 of 70

Showing 2201–2300 of 6,981
# Tool Score Tier
2201 MbBrainz/ttslab

TTSLab is THE place to easily test ANY text to text to speech model on your...

31
Emerging
2202 kapi2800/qwen3-tts-mac

Optimized implementation of Qwen3-TTS for Apple Silicon (M1-M4)

31
Emerging
2203 sayak-brm/espeakng-python

An eSpeak NG TTS binding for Python3.

31
Emerging
2204 GloomyGrave/Sinsy-NG

(discontinued) 🎵The Formant-Based All Language Singing Voice Syntheis...

31
Emerging
2205 OpenVoiceOS/ovos-tts-plugin-beepspeak

experiment adding new r2d2 tts engine for mycroft

31
Emerging
2206 HelloChatterbox/py_responsivevoice

unoficial python api for responsive voice

31
Emerging
2207 gokhaneraslan/tts-dataset-generator

With this tool you can create custom TTS dataset from video or audio.

31
Emerging
2208 diggerdu/pytorch_audio

audio processing module for pytorch:stft, istft

31
Emerging
2209 andi611/CS-Tacotron-Pytorch

Pytorch implementation of CS-Tacotron, a code-switching speech synthesis...

31
Emerging
2210 hkdb/offline-tts

A Chrome extension that reads web pages and PDFs aloud using Supertonic's...

31
Emerging
2211 USSLab/DolphinAttack

Inaudible Voice Commands

31
Emerging
2212 Proteusiq/saa

Making Time Speak! 🎙️

31
Emerging
2213 go-restream/supertts

🎧 Supertonic TTS ONNX Inference Openai Speech REST API

31
Emerging
2214 Sciss/SpeechRecognitionHMM

Exported from...

31
Emerging
2215 aidayang/LatentSync-OneClick

免费视频对口型软件LatentSync一键启动整合包

31
Emerging
2216 AI-TOOLKIT/VoiceBridge

VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit

31
Emerging
2217 npuichigo/ttsflow

tensorflow speech synthesis c++ inference for voicenet

31
Emerging
2218 hkilang/TTS

香港圍頭話及客家話文字轉語音朗讀器

31
Emerging
2219 UFOAlastor/AI-Waifu-Project-LaIN

一个拥有长期记忆, 表情动作, 语音对话/打断/声纹识别, FunctionCall, 多模型支持的AI Waifu客户端.

31
Emerging
2220 Issac-Moses/Beacon

Beacon – A lightweight voice-controlled AI assistant using Whisper.cpp. ...

31
Emerging
2221 wspr-ncsu/robocall-audio-dataset

A dataset of real-world robocall audio recordings

31
Emerging
2222 SEPIA-Framework/sepia-web-audio

Create modular, cross-browser, web audio pipelines to record and process...

31
Emerging
2223 skit-ai/speech-to-intent-dataset

Dataset Release for Intent Classification from Speech

31
Emerging
2224 siddhant-vij/Health-Fitness-Tracker

Health & fitness app with natural language processing, custom...

31
Emerging
2225 scarletcho/prep4kaldi

Data preparation code for building Kaldi ASR system

31
Emerging
2226 krestaino/prankstr

📞 Prank your friends with text-to-speech phone calls powered by Twilio and...

31
Emerging
2227 amirharati/kaldi-alligner

scripts to align a given wave to its transcription using trained models by Kaldi

31
Emerging
2228 hanxiao/mls

MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon

31
Emerging
2229 khanld/Wav2vec2-Pretraining

Wav2vec 2.0 Self-Supervised Pretraining

31
Emerging
2230 IPS-LMU/transcription-portal

A portal that offers a transcription chain for multi upload and processing...

31
Emerging
2231 deepgram-devs/deepgram-demos-rust

Useful demo applications for Deepgram Voice AI APIs, using the Rust language! 🦀

31
Emerging
2232 jopedroliveira/speech_recog_uc

Speech processing ROS-package. Performs speech recognition and estimates the...

31
Emerging
2233 ASR-project/Multilingual-PR

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM....

31
Emerging
2234 karrarkazuya/ArabicTTS

ArabicTTS (TextToSpeech) Android library with a sample

31
Emerging
2235 boudhayan-dev/Blind-Reader-project

A low cost reading device for blind people.

31
Emerging
2236 mozilla/deepspeech-playbook

A crash course for training speech recognition models using DeepSpeech.

31
Emerging
2237 SEPIA-Framework/sepia-docs

Documentation and Wiki for SEPIA. Please post your questions and bug-reports...

31
Emerging
2238 xcmyz/FastSpeech2

The Implementation of FastSpeech2 Based on Pytorch.

31
Emerging
2239 overcrash66/Audio-File-Translator---S2ST

Audio file translator is a multilingual speech to speech and speech to text...

31
Emerging
2240 ayshrv/memento-app

Android App which serves as an AI assistant for human memory

31
Emerging
2241 papercast-dev/papercast

A Python pipeline tool and plugin ecosystem for processing technical...

31
Emerging
2242 shreyanspagariya/sankshep

Video Summarization - Summarized a video lecture and converted it to a...

31
Emerging
2243 ondrejklejch/learning_to_adapt

Coordinate-wise meta-learner for speaker adaptation of ASR models.

31
Emerging
2244 The-Data-Dilemma/ParquetToHuggingFace

ParquetToHuggingFace processes raw audio data, converts it into Parquet...

31
Emerging
2245 suzuran0y/Live2D-LLM-Chat

Live2D + ASR + LLM + TTS → Real-time communication + Offline...

31
Emerging
2246 zalo/OpenAI-Voice

A simple proof of concept for voice-to-voice interaction.

31
Emerging
2247 ericc-ch/edge-tts

Use Microsoft Edge's online text-to-speech service from JS code directly!

31
Emerging
2248 laszukdawid/cracker

Usable GUI for text-to-speech services

31
Emerging
2249 AshutoshDongare/convo

Open source voice bot for Humanoid Robots and virtual digital humans

31
Emerging
2250 X-LANCE/VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient...

31
Emerging
2251 MichalKacprzak99/jarvis

Jarvis is a personal voice assistant inspired by the Marvel movie series

31
Emerging
2252 jenswittmann/CurlyFramework

Tiny Framework for accessibility and sustainability, not only for MODX or Kirby CMS.

31
Emerging
2253 opsdroid/opsdroid-audio

🗣 A companion application for opsdroid which adds hotwords, speech...

31
Emerging
2254 HasnainDarkNet/DarKVoice

DarKVoice is an open-source voice assistant and audio processing tool built...

31
Emerging
2255 upskyy/ContextNet

PyTorch implementation of "ContextNet: Improving Convolutional Neural...

31
Emerging
2256 hug33k/PyTalk-R2D2

Python script for R2D2 text-to-speech

31
Emerging
2257 zmeet-ai/tts-demo

支持各种感情的男女声音,支持实时和离线文本合成tts语音;支持单模特声音变声,语音速率调整,语音音量大小调整;支持自定义语音模型。

31
Emerging
2258 in03/squawk

Automatic subtitles for DaVinci Resolve with OpenAI Whisper

31
Emerging
2259 Ronik22/Voice-Controlled-Email

A python-based voice-controlled email application for visually impaired persons.

31
Emerging
2260 filimo/ReaderTranslator

PDF/WebPages Reader with embedded Google Translate and voice engine on...

31
Emerging
2261 ognistik/alfred-superwhisper

Use Alfred to Control Superwhisper - AI Powered Voice to Text

31
Emerging
2262 JSON2Video/json2video-php-sdk

Video automation with PHP: add watermarks, resize videos, create slideshows,...

31
Emerging
2263 telecombcn-dl/2018-dlsl

UPC Deep Learning for Speech and Language 2018

31
Emerging
2264 azraelkuan/FFTNet

FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

31
Emerging
2265 ckaytev/tgisper

Telegram bot with ASR

31
Emerging
2266 vorojar/VoiceSnap

Open-source offline voice dictation — a free alternative to Typeless. 100%...

31
Emerging
2267 ZeroMirai/Waifu_AI_Vtuber

Waifu_AI_Vtuber is a AI virtual YouTuber chatbot powered by OpenAI GPT-3.5,...

31
Emerging
2268 hanifabd/voice-activity-detection-vad-realtime

Real-time Voice Activity Detection (VAD) with some example use case like...

31
Emerging
2269 hutchresearch/latex2speech

TeX2Speech is an application that turns LaTeX documents into spoken audio.

31
Emerging
2270 PowerBeef/QwenVoice

Native macOS app for Qwen3-TTS with custom voices, voice design, and voice...

31
Emerging
2271 suzumushi0/SoundObject_binary

SoundObject binary distribution.

31
Emerging
2272 HCI-LAB-UGSPEECHDATA/speech_data_ghana_ug

The dataset comprises of 5000 hours speech corpus in Akan, Ewe, Dagbani,...

31
Emerging
2273 kcitlyn/PolyScribe_Desktop

Fully-offline transcription and translator w/ speech-to-text and...

31
Emerging
2274 i4Ds/whisper-prep

Data preparation utility for the finetuning of OpenAI's Whisper model.

31
Emerging
2275 indri-voice/audiotoken

Audio tokenization, in the fastest way possible!

31
Emerging
2276 BraceYourselfGames/UE-BYGTextToSpeech

A plugin that uses the Windows Speech API to speak text in Unreal Engine 4.

31
Emerging
2277 bensonruan/Speech-Command

Speech Command Recognizer using tensorflowjs

31
Emerging
2278 theaifutureguy/Vocal-Agent

A sophisticated real-time voice assistant that seamlessly integrates speech...

31
Emerging
2279 led-mirage/VoivoClip

VOICEVOXでクリップボードに貼り付けられたテキストを読み上げるアプリです。

31
Emerging
2280 masonthemaker/saidwell

Open Source Voice AI Dashboard

31
Emerging
2281 lmangani/docker-rtpengine-speech

OpenSIPS + RTPEngine Recording + Speech Recognition in HEP

31
Emerging
2282 hebbihebb/MBook

EPUB to M4B using Maya1

31
Emerging
2283 gkrsv/split_audio

A rough and ready Python utility which splits audio files based on silence...

31
Emerging
2284 oren-cohen/whatsmybitrate

Whatsmybitrate analyzes audio files for quality metrics such as bit rate,...

31
Emerging
2285 hollygrimm/voice-dataset-creation

Tools to create your own voice dataset for TTS training

30
Emerging
2286 aabdurakhmanov/uzbekcha-gapir

Matnni O'zbek tilida talafuz qiluvchi desktop dastur | Text to speech...

30
Emerging
2287 RapDoodle/Web-Real-Time-Speech-Recognition-with-Azure

An example project that provides a web interface to real-time speech-to-text...

30
Emerging
2288 calinalexandru/pericles

A browser extension offering intuitive text-to-speech functionality, making...

30
Emerging
2289 surajondev/text-to-speech

Conver text into speech

30
Emerging
2290 vectominist/End-to-end-ASR-Pytorch-DLHLP

Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation...

30
Emerging
2291 gokulkarthik/text2speech

Towards Building Text-To-Speech Systems for the Next Billion Users -...

30
Emerging
2292 weespin/RequestifyTF2

Client side commands for mic spamming and more!

30
Emerging
2293 SUNGBEOMCHOI/Korean-Streaming-ASR

Korean Streaming ASR(with Denoiser and Conformer CTC)

30
Emerging
2294 Rongjiehuang/Multiband-WaveRNN

An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio...

30
Emerging
2295 jesseward/azuretexttospeech

A Go library for Azure's Cognitive Services text-to-speech API.

30
Emerging
2296 Vazgen005/discord-virtual-micro

Says everything you type in discord for you using ai (Silero Models)

30
Emerging
2297 betaoverflow/donna

Transform your smart devices to intelligent communicators.

30
Emerging
2298 CMsmartvoice/Unet-TTS

One-shot TTS with Improved Unseen Speaker and Style Transfer

30
Emerging
2299 mishrababhishek/chatbot

AI Chatbot answers students' queries about their college program using...

30
Emerging
2300 gokhaneraslan/XTTS_V2-finetuning

Training XTTS V2 and PEFT LORA Text-to-Speech (TTS)

30
Emerging
« Prev 1 2 3 21 22 23 24 25 68 69 70 Next »