All Voice AI Tools
6,981 tools ranked by quality score · Page 20 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 1901 |
Saganaki22/ComfyUI-KugelAudio
🗣️ ComfyUI nodes for KugelAudi- Open-source text-to-speech with voice... |
|
Emerging |
| 1902 |
rohanprichard/fastrtc-demo
A simple POC of FastRTC, a framework to use voice mode in python! |
|
Emerging |
| 1903 |
Aman22sharma/Python-AI-Virtual-Assistant
This is python AI Virtual Assistant. |
|
Emerging |
| 1904 |
m1el/nemotron-asr.cpp
Nemotron ASR rewrite to GGML |
|
Emerging |
| 1905 |
dsfsi/dsfsi-datasets
Official DSFSI Public Datasets Registry - Comprehensive catalog of 50+... |
|
Emerging |
| 1906 |
pevers/parkiet
Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS) |
|
Emerging |
| 1907 |
sandy1990418/ChineseTaiwaneseWhisper
This repository focuses on leveraging OpenAI's Whisper model for speech... |
|
Emerging |
| 1908 |
jhermann/kopfkino
Syntactic sugar sprinkled on top of MoviePy and AI components to allow... |
|
Emerging |
| 1909 |
seven-io/node-red
The official Node-RED collection by seven. |
|
Emerging |
| 1910 |
ontypehq/mlx-swift-asr
On-device speech recognition for Apple Silicon, powered by MLX. |
|
Emerging |
| 1911 |
slp-rl/HebTTS
The official implementation of "A Language Modeling Approach to... |
|
Emerging |
| 1912 |
A-Jacobson/tacotron2
pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf |
|
Emerging |
| 1913 |
kwebby/Qwen3-TTS-Voice-Studio
A Text to Speech App for Qwen3-TTS Family Models to create custom voices,... |
|
Emerging |
| 1914 |
rishiskhare/parrot
A free, offline, private AI text-to-speech desktop app built on Rust 🦜 |
|
Emerging |
| 1915 |
RoyNkem/SwiftUI-AI-Voice-Assistant
A multi-platform app for voice-based interactions built using SwiftUI with... |
|
Emerging |
| 1916 |
boochow/TFLite_Micro_MicroSpeech_M5Stack
M5Stack (ESP32) port of TensorFlow Lite for Microcontrollers demo "Micro Speech" |
|
Emerging |
| 1917 |
yufan-aslp/AliMeeting
The project is associated with the recently-launched ICASSP 2022... |
|
Emerging |
| 1918 |
soheil-mp/Speech-Recognition
End-to-End Speech Recognition using Neural Networks. |
|
Emerging |
| 1919 |
khuangaf/ITRI-speech-recognition-dataset-generation
Automatic Speech Recognition Dataset Generation |
|
Emerging |
| 1920 |
rishikksh20/TalkNet2-pytorch
TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for... |
|
Emerging |
| 1921 |
totalvoice/totalvoice-php
Client em PHP para API da Totalvoice |
|
Emerging |
| 1922 |
gladchinda/web-speech-demo
Learn how to build a simple text-to-speech voice app for the web using the... |
|
Emerging |
| 1923 |
henryhale/ttspeech
🔊 A fully basic voice synthesizer in vanillaJS |
|
Emerging |
| 1924 |
TheDeathDragon/LiveTranslate
Real-time audio translation overlay for Windows — captures system audio +... |
|
Emerging |
| 1925 |
GuangChen2333/FindUrVoicesPJSK
《世界计划 : 缤纷舞台》单角色语音数据集一键获取小工具 | 无需手动打标 | wav无压缩 | A simple tool for obtaining... |
|
Emerging |
| 1926 |
yousefkotp/Egyptian-Arabic-ASR-and-Diarization
The official submission from Speech Squad team for the MTC-AIC 2 competition... |
|
Emerging |
| 1927 |
Allan-Nava/fakeyou.go
A powerful golang sdk library for interacting with the FakeYouAPI easily |
|
Emerging |
| 1928 |
deepgram-starters/csharp-voice-agent
Get started using Deepgram's Voice Agent with this C# demo app |
|
Emerging |
| 1929 |
inboxpraveen/Speech-Annotation-Tool
Review, correct, and export ASR transcripts at scale. Web-based ASR accuracy... |
|
Emerging |
| 1930 |
b7s/whisper-php
State-of-the-art speech recognition to your PHP/Laravel applications |
|
Emerging |
| 1931 |
surfaceyu/edge-tts-go
Use Microsoft Edge's online text-to-speech service from golang WITHOUT... |
|
Emerging |
| 1932 |
wongfei/UEHMI
Unreal Engine Human Machine Interface |
|
Emerging |
| 1933 |
lucascamillomd/anki-tts
A free, open-source app for Anki text-to-speech in MacOS. |
|
Emerging |
| 1934 |
ethicalabs-ai/Kurtis-E1-MLX-Voice-Agent
A lightweight voice companion, optimized for macOS. |
|
Emerging |
| 1935 |
VirtualZer0/StreamTalkerClient
Cross-platform desktop app that reads Twitch and VK Play chat aloud using AI... |
|
Emerging |
| 1936 |
minseok0809/robotic-process-automation
File Management, School Automation, Text Automation, Web Crawler, Web... |
|
Emerging |
| 1937 |
ddlBoJack/MT4SSL
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL:... |
|
Emerging |
| 1938 |
jindongwang/EasyEspnet
Making Espnet easier to use |
|
Emerging |
| 1939 |
renaudjenny/swift-tts
A straightforward package containing version for Swift modern concurrency,... |
|
Emerging |
| 1940 |
Alex-Tremayne/LaTeXt
Python package for converting LaTeX to text which can be read by text to... |
|
Emerging |
| 1941 |
Madhur215/Chatbot-cum-voice-Assistant
An AI chatbot with features like conversation through voice, fetching events... |
|
Emerging |
| 1942 |
second-state/gsv_tts
Streaming TTS API server written in Rust |
|
Emerging |
| 1943 |
Alenkar/kairos-asr
Адаптированный ASR pipeline для удобной интеграции в другие приложения на... |
|
Emerging |
| 1944 |
sp-squared/Turkic-Languages-Audio-to-Text-Transcription
Open-source Automatic Speech Recognition (ASR) pipeline for Bashkir... |
|
Emerging |
| 1945 |
nchudleigh/sc2-ultra
Voice-controlled StarCraft II - command Zerg, Protoss, or Terran using... |
|
Emerging |
| 1946 |
kanttouchthis/text_generation_webui_xtts
XTTSv2 Extension for oobabooga text-generation-webui |
|
Emerging |
| 1947 |
andi611/TTS-Tacotron-Pytorch
Pytorch implementation of Tacotron, a speech synthesis end-to-end generative... |
|
Emerging |
| 1948 |
skshadan/WhisCall
A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo,... |
|
Emerging |
| 1949 |
williamxhero/ttsmaker
TTSMaker: A Python library for interacting with the TTSMaker API to easily... |
|
Emerging |
| 1950 |
umbertocappellazzo/Omni-AVSR
Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal... |
|
Emerging |
| 1951 |
lucasnewman/descript-mlx
Implementation of the Descript Audio Codec in MLX |
|
Emerging |
| 1952 |
hanxi/epub2mp3
这是一个使用 Microsoft Edge TTS 服务将 EPUB 电子书转换为 MP3 音频文件的工具。 |
|
Emerging |
| 1953 |
user3301/ssml_builder
:sound: a general SSML(Speech Synthesis Markup Language) builder |
|
Emerging |
| 1954 |
dmatekenya/Chichewa-Speech2Text
Automated Speech Recognition for Chichewa. |
|
Emerging |
| 1955 |
Tinkoff/asterisk-voicekit-modules
Non-blocking Asterisk modules for accessing VoiceKit services for speech... |
|
Emerging |
| 1956 |
warisqr007/vocos
Causal version of Vocos (neural vocoders for high-quality audio synthesis)... |
|
Emerging |
| 1957 |
aydinnyunus/LinuxVoiceAssistant
Linux Voice Assistant for to Make Your Work Easier |
|
Emerging |
| 1958 |
tomik395/ESP32-AI
Speak to your ESP32 and it speaks back! Your new personal assistance is... |
|
Emerging |
| 1959 |
Ishan7390/Jarvis_AI
This is my attempt at building a not so much of an AI, Jarvis |
|
Emerging |
| 1960 |
huytd/speech
A tool to practice English speaking |
|
Emerging |
| 1961 |
lang-uk/ukrainian-tts-preprocessing
Tools and models for Ukrainian phonemization and lexical stress prediction |
|
Emerging |
| 1962 |
codename0og/codename-rvc-fork-3
Codename's rvc fork version 3, based on Applio. |
|
Emerging |
| 1963 |
nowickam/facial-animation
Audio-driven facial animation generator with BiLSTM used for transcribing... |
|
Emerging |
| 1964 |
felivalencia3/RealVoiceGPT
RealVoiceGPT is a web application that lets you have voice conversations... |
|
Emerging |
| 1965 |
seungwonpark/awesome-tts-samples
Awesome list of TTS papers with audio samples |
|
Emerging |
| 1966 |
ehtisham91/Django-Speech-to-text-Chat
This App allows users to convert their speech into text and send that text... |
|
Emerging |
| 1967 |
Shyguy99/Whatsapp-bot
A simple WhatsApp Bot made using open-wa library with some additional features. |
|
Emerging |
| 1968 |
victor369basu/End2EndAutomaticSpeechRecognition
In this repository, I have developed an end to end Automatic speech... |
|
Emerging |
| 1969 |
naschorr/hawking
The retro text-to-speech bot for Discord |
|
Emerging |
| 1970 |
EvilFreelancer/docker-fish-speech-server
OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model. |
|
Emerging |
| 1971 |
aks-devs/mod_google_asr
Freeswitch Speech-to-Text module |
|
Emerging |
| 1972 |
saadbutt32/Conversion-of-Pakistan-Sign-Languag-into-Text-and-Speech-using-OpenPose-and-Machine-Learning
Real-time translation of Pakistan sign language into text and speech using... |
|
Emerging |
| 1973 |
Jdreioe/Wingmate
A project to make people who cannot speak, speak! |
|
Emerging |
| 1974 |
atharva-again/indic-asr-onnx
Helper package for using quantized versions of the Indic ASR Model by AI4Bharat. |
|
Emerging |
| 1975 |
soundhound/houndify-sdk-go
The official Houndify SDK for Go |
|
Emerging |
| 1976 |
RF5/transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models. |
|
Emerging |
| 1977 |
stellarloop/bitbat.ai
My father, a journalist, used to painstakingly transcribe interviews from a... |
|
Emerging |
| 1978 |
matt-goldman/AI-Panelist
An AI Panelist participating in Beer Driven Devs Live 2026 |
|
Emerging |
| 1979 |
MrAliHasan/Sophia-AI-Assistant
Sophia AI Assistant is a Python-based desktop AI that performs a variety of... |
|
Emerging |
| 1980 |
HerbertHe/edge-tts-server
Server for edge-tts |
|
Emerging |
| 1981 |
Umbaji/NMTMD
Official repository for the Opensource Textdataset for NMT for local langues... |
|
Emerging |
| 1982 |
dokuniev/claude-voice
Hear which Claude Code session needs you — speaks the repo and branch name out loud |
|
Emerging |
| 1983 |
AdamHolwerda/bloom-cli
A command line utitlity to create a multipage static website from Ulysses export |
|
Emerging |
| 1984 |
matthijsvk/TIMITspeech
Speech recognition on the TIMIT (or any other) dataset |
|
Emerging |
| 1985 |
jekyll2014/VoiceAssistant
Locally hosted voice assistant with plugin extension feature |
|
Emerging |
| 1986 |
Rubiksman78/RenAI-Chat
VN Like Interface for Chatbots |
|
Emerging |
| 1987 |
rhulha/Speech2Speech
A web application that converts speech to speech 100% private |
|
Emerging |
| 1988 |
The-Swarm-Corporation/Voice-Agents
Voice-Agents is a production-ready Python library for building... |
|
Emerging |
| 1989 |
Enforcer03/voice-cloning
Voice cloning with tortoise-tts |
|
Emerging |
| 1990 |
marytts/gradle-marytts-voicebuilding-plugin
A replacement for the legacy VoiceImportTools in MaryTTS |
|
Emerging |
| 1991 |
charstorm/vilberta
Voice chatbot with voice+screen output to show that "not everything needs to... |
|
Emerging |
| 1992 |
pinch-eng/pinch-python-sdk
Real-time voice translation SDK |
|
Emerging |
| 1993 |
eazhary/dctts2
Deep Convolution Text to Speech |
|
Emerging |
| 1994 |
deepgram-devs/flask-live-chatgpt-text-to-speech
Get started using Deepgram's Live ChatGPT Text-to-Speech with this Flask demo app |
|
Emerging |
| 1995 |
pnkvalavala/digitaltwin
Using a single image and just 10 seconds of sample audio, our project... |
|
Emerging |
| 1996 |
tristan-mcinnis/Multimodal-voice-assistant
This project is a multi-modal AI voice assistant that uses LM Studio, OpenAI... |
|
Emerging |
| 1997 |
lpalbou/VoiceLLM
A modular Python library for voice interactions with AI systems, featuring... |
|
Emerging |
| 1998 |
kromme/Teams-Notetaker
Let AI create the notes of your Teams Meeting |
|
Emerging |
| 1999 |
hwRG/End-to-End-TTS-Fine-Tune
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis. |
|
Emerging |
| 2000 |
MotazSabri/Hanami-release
Live translator that captures any audio that comes from a WINDOWS speaker or... |
|
Emerging |