All Voice AI Tools
6,981 tools ranked by quality score · Page 19 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 1801 |
heartsuit/BaiduASRAndTTS
Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech;... |
|
Emerging |
| 1802 |
chaonan99/ppt_presenter
Convert ppt to video with audio track, using text to speech synthesis |
|
Emerging |
| 1803 |
ProsusAI/project-echo
An AI-powered voice director assistant for creating engaging audio content... |
|
Emerging |
| 1804 |
WangYixuan12/openai_tts
OpenAI Text-to-Speech Interface |
|
Emerging |
| 1805 |
EtienneAb3d/WhisperTimeSync
Synchronize Whisper's timestamps over an existing accurate transcription |
|
Emerging |
| 1806 |
amitdev01/awesome-voice-ai
Awesome Voice Ai |
|
Emerging |
| 1807 |
sooftware/End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models. |
|
Emerging |
| 1808 |
OwenEdwards/videojs-speak-descriptions-track
A Video.js 7 middleware that uses browser speech synthesis to speak... |
|
Emerging |
| 1809 |
syntithenai/opensnips
Open source projects related to Snips https://snips.ai/. |
|
Emerging |
| 1810 |
holgern/ttsforge
Convert EPUB files to audiobooks using Kokoro ONNX TTS |
|
Emerging |
| 1811 |
candlewill/AiVoice
Deep CNN networks for Speech Synthesis |
|
Emerging |
| 1812 |
Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and... |
|
Emerging |
| 1813 |
hacktronaut/azure-avatar-demo
Text To Speech Demo in ReactJS Application using Azure Avatar AI Service. |
|
Emerging |
| 1814 |
jianchang512/gemini-speech2srt
使用 Gemini AI 转写音视频为 SRT 字幕 |
|
Emerging |
| 1815 |
tiansztiansz/voice-assistant
重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。 |
|
Emerging |
| 1816 |
rtk-ai/vox
A universal AI toolkit for high-performance Speech-to-Text (STT) and... |
|
Emerging |
| 1817 |
LucaLuke13/TalkyBotty
Simply forward a video or voice message in any language to the bot, and it... |
|
Emerging |
| 1818 |
Fatma-Chaouech/audioverse
Breathe Life Into Your Books! 📚🌱 |
|
Emerging |
| 1819 |
medokin/soundpad-text-to-speech
Text-To-Speech for Soundpad |
|
Emerging |
| 1820 |
nhaouari/local11labs
Local11Labs allows generating high-quality text-to-speech and podcast... |
|
Emerging |
| 1821 |
Mobile-Artificial-Intelligence/maise
Maise is an open-source android speech engine designed to provide a powerful... |
|
Emerging |
| 1822 |
akinsella/yt-transcript-rs
🎬️ A Rust library for accessing YouTube Video Infos & Transcripts |
|
Emerging |
| 1823 |
trabdlkarim/voce-browser
Voice Controlled Chromium Web Browser |
|
Emerging |
| 1824 |
Dark2C/Viral-Faceless-Shorts-Generator
Automatically generate faceless YouTube Shorts from trending topics using AI... |
|
Emerging |
| 1825 |
jvandenaardweg/ssml-split
Splits SSML strings into batches AWS Polly ánd Google's Text to Speech API... |
|
Emerging |
| 1826 |
egorsmkv/tts_uk
High-fidelity speech synthesis for Ukrainian using modern neural networks. |
|
Emerging |
| 1827 |
deepkyu/ml-talking-face
Cloned repository from Hugging Face Spaces (CVPR 2022 Demo) |
|
Emerging |
| 1828 |
moeru-ai/ortts
𖣘🔊 Simple and Easy-to-use local TTS inference server, Powered by ONNX Runtime |
|
Emerging |
| 1829 |
jxlarrea/wyoming-voice-match
A Wyoming protocol ASR proxy that verifies speaker identity and isolates... |
|
Emerging |
| 1830 |
GeoHaberC/Story-to-Video
Create a Movie animation plus Audio plus Subtitle from a text file |
|
Emerging |
| 1831 |
Lunarien/Lunariens-Mental-Math-Trainer
Mental math trainer made in C#. |
|
Emerging |
| 1832 |
iotjin/JhPrivacyAuthTool
隐私权限判断 - 封装了几种常用的隐私权限判断(定位服务,通讯录, 日历,提醒事项, 照片, 蓝牙共享,麦克风, 相机)和通知的注册和判断。定位服务,蓝牙共享是单独调用的 |
|
Emerging |
| 1833 |
akashmjn/cs224n-gpu-that-talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18) |
|
Emerging |
| 1834 |
kaituoxu/Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS)... |
|
Emerging |
| 1835 |
FlooferLand/ttvoice-mod
A Minecraft mod that lets you type to speak! |
|
Emerging |
| 1836 |
AndreDalwin/Whisper2Summarize
Whisper2Summarize is an application that uses Whisper for audio processing... |
|
Emerging |
| 1837 |
doveg/whisper-real-time
A real time offline transcriber with gui, based on OpenAI whisper |
|
Emerging |
| 1838 |
TartuNLP/text-to-speech-worker
Estonian multi-speaker neural text-to-speech worker that processes requests... |
|
Emerging |
| 1839 |
tktcorporation/discord-tts-bot
A discord bot to use tts in your voice channel. |
|
Emerging |
| 1840 |
nexmo-community/voice-azure-speechtotext-py
Sample Code for Realtime Transcription using Nexmo, Microsoft Azure Speech... |
|
Emerging |
| 1841 |
seven-io/net-client
Official .NET API Client for seven |
|
Emerging |
| 1842 |
yapit-tts/yapit
Listen to anything. TTS for documents, papers, and web pages. |
|
Emerging |
| 1843 |
N6UDP/SteamDiscordTTSBot
A steam chat to Discord TTS bridge |
|
Emerging |
| 1844 |
NeoKazuya/qwen3-tts-enhanced
Enhanced Qwen3-TTS voice cloning GUI with multi-reference samples, variation... |
|
Emerging |
| 1845 |
ttuleyb/TortoiseTTS-GUI
GradioUI for TortoiseTTS voice generation |
|
Emerging |
| 1846 |
Frida7771/PyVoice
A Python-based speech processing tool that supports both speech-to-text... |
|
Emerging |
| 1847 |
audo-ai/magic-mic
Open Source Noise Cancellation App for Virtual Meetings |
|
Emerging |
| 1848 |
leokwsw/OpenAI-TTS-Gradio
Use OpenAI TTS(Text to Speech) API with Gradio |
|
Emerging |
| 1849 |
bhattbhavesh91/wav2vec2-huggingface-demo
Speech to Text with self-supervised learning based on wav2vec 2.0 framework... |
|
Emerging |
| 1850 |
HectorPulido/chatbot-with-voice
Jarvis like chatbot with voice |
|
Emerging |
| 1851 |
antifield/vmt
Discord App for Transcribing & Translating Voice Messages |
|
Emerging |
| 1852 |
mmpneo/simple-obs-stt
Speech-to-text and keyboard input captions for OBS. |
|
Emerging |
| 1853 |
kssteven418/Q-ASR
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition |
|
Emerging |
| 1854 |
ayutaz/uCosyVoice
CosyVoice3 text-to-speech for Unity using ONNX inference. Supports zero-shot... |
|
Emerging |
| 1855 |
kaiaai/kaia.js
Kaia.ai platform's JS client library |
|
Emerging |
| 1856 |
Fooftilly/kokoro-extension
Send text from browser to Kokoro-FastAPI for TTS generation |
|
Emerging |
| 1857 |
lepisma/emacs-speech-input
Set of packages for speech and voice inputs in Emacs |
|
Emerging |
| 1858 |
renorari/VoiceJP-Discord
A discord-app can text-to-speech and speech-to-text |
|
Emerging |
| 1859 |
jianchang512/realtime-stt
一个极简的本地离线实时语音转文字工具 |
|
Emerging |
| 1860 |
cristofima/AI-Tech-Interview-Preparation
An AI-powered technical interview preparation platform that generates... |
|
Emerging |
| 1861 |
18F/dol-whd-14c
The 14(c) system will become a modern, digital-first service. Applicants... |
|
Emerging |
| 1862 |
neosapience/n8n-nodes-typecast
Integrate Typecast AI TTS into your n8n workflows with this community node. |
|
Emerging |
| 1863 |
cdyangbo/end2endASR
implement end-to-end asr algorithm with tensorflow |
|
Emerging |
| 1864 |
quangvu3/coqui-xtts
Coqui XTTS model with Vietnamese added |
|
Emerging |
| 1865 |
m-nathani/speech_to_text
how to use the Google Cloud Speech API to transcribe audio/video files. |
|
Emerging |
| 1866 |
deepgram-starters/php-transcription
Get started using Deepgram's speech-to-text with this PHP demo app |
|
Emerging |
| 1867 |
keonlee9420/Stepwise_Monotonic_Multihead_Attention
PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to... |
|
Emerging |
| 1868 |
alsrb0607/KoreanSTT
kospeech를 활용한 한국어 음성 인식 모델 개발 |
|
Emerging |
| 1869 |
c99koder/AudioClassifier-MQTT
Use the yamnet TensorFlow model to classify live audio from a microphone and... |
|
Emerging |
| 1870 |
nithincvpoyyil/voice-listener
An reusable angular component for voice based input using web speech API |
|
Emerging |
| 1871 |
sudonitin/Audio-book-generator
Convert your ebooks to audiobooks. 📖->🎧 |
|
Emerging |
| 1872 |
WeiChiaChang/happy-halloween
🗣 Say "happy halloween" to your browser 🎃 |
|
Emerging |
| 1873 |
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a... |
|
Emerging |
| 1874 |
Blackwood416/AstraTTS
基于 ONNX Runtime 的跨平台高性能 TTS 合成方案,支持流式输出与低延迟播放,支持自定义音色与中英混合生成。 |
|
Emerging |
| 1875 |
alkhimey/esp32-flite
Speech synthesis running on ESP32 based on Flite engine. |
|
Emerging |
| 1876 |
xhuvom/omnilingual-ASR-Web-Dashboard
Meta Omnilingual ASR web based dashboard for testing and API based... |
|
Emerging |
| 1877 |
markokosticdev/cloud_text_to_speech_flutter
Single interface to Google, Microsoft, and Amazon Text-To-Speech. |
|
Emerging |
| 1878 |
priyanujgogoi-28/flowery-tts
Wrapper of Flowery Text to Speech API for Dart |
|
Emerging |
| 1879 |
markmiddo/synthia
AI-powered voice assistant that respects your privacy. Control your desktop,... |
|
Emerging |
| 1880 |
HnDK0/NoveLA
Free Android reader for web novels, light novels, ranobe & EPUB. 25+... |
|
Emerging |
| 1881 |
TartuNLP/text-to-speech-api
REST API for neural text-to-speech synthesis |
|
Emerging |
| 1882 |
nabz0r/mac-local-translator
Local translation app for Mac using speech recognition and offline translation |
|
Emerging |
| 1883 |
aditya-an1l/RILearn
Reinventing Reading with a touch of Interactivity aided Learning |
|
Emerging |
| 1884 |
Harsh-0-7/PDF-Reader
PDF reader with read aloud feature |
|
Emerging |
| 1885 |
C0NZZ/better-teletask
Browser extension that adds useful features like subtitles to HPI Tele-Task. |
|
Emerging |
| 1886 |
notebook-nexus/chatterbox-tts-colab
Transform any text into natural-sounding speech, clone voices from audio... |
|
Emerging |
| 1887 |
book000/audio-transcriber-docker
Automatically transcribe the audio of video / audio files using Speech Recognition. |
|
Emerging |
| 1888 |
rudra00434/SoulPlayer
My own music application build with Django , Tailwind CSS and Spacy... |
|
Emerging |
| 1889 |
ZhuoZhuoCrayon/AcousticKeyBoard-Web
❓声学键盘|脑洞大开:做一个能听懂键盘敲击键位的「玩具」,学习信号处理 / 深度学习 / 安卓 / Django。 |
|
Emerging |
| 1890 |
bishop-ai/bishop-ai
Voice and text virtual assistant |
|
Emerging |
| 1891 |
MarkParker5/STARK-PLACE
S.T.A.R.K. Platform Library and Community Extensions |
|
Emerging |
| 1892 |
philsyn/DiffWave-Vocoder
Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and... |
|
Emerging |
| 1893 |
janewu77/ela-extension
English Learner Assistant |
|
Emerging |
| 1894 |
Lastorder-DC/chatreader-kor
채팅 읽어주는 로봇 |
|
Emerging |
| 1895 |
leprosus/golang-tts
Text-to-Speach golang package based in Amazon Polly service |
|
Emerging |
| 1896 |
jiwidi/DeepSpeech-pytorch
Pytorch implementation for DeepSpeech 2.0 |
|
Emerging |
| 1897 |
T-vK/Termux-DeepSpeech
Open source offline speech recognition for Android using Mozilla's... |
|
Emerging |
| 1898 |
edde746/tiktok-askreddit
A content generation & posting bot for TikTok, scraping posts from r/AskReddit |
|
Emerging |
| 1899 |
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on... |
|
Emerging |
| 1900 |
msalhab96/SpeeQ
A framework for automatic speech recognition |
|
Emerging |