All Voice AI Tools

6,981 tools ranked by quality score · Page 21 of 70

Showing 2001–2100 of 6,981
# Tool Score Tier
2001 MaxMax2016/Grad-TTS-Chinese

Huawei Grad-TTS for Chinese

33
Emerging
2002 tabahi/WebSpeechAnalyzer

JS speech analyzer for fast speech analysis and labeling

33
Emerging
2003 rapidaai/rapida-go

Open-source Golang SDK for Rapida to build real-time, observable Voice AI...

33
Emerging
2004 AcTePuKc/Kokoro-Local-Gui

Hyper-fast, local, high-quality TTS based on Kokoro-82M. PySide6 GUI included.

33
Emerging
2005 dsi-icl/do-voice-interaction

The goal of this project is to provide a voice assistant to the Data...

33
Emerging
2006 AASHISHAG/DeepSpeech-API

This code lets users run Mozilla's DeepSpeech model from a web browser.

33
Emerging
2007 ibm-self-serve-assets/Watson-Speech

This collection demonstrates how to quickly embed Watson Speech...

33
Emerging
2008 ajaygujja/Kahani-Storytelling-App-For-Children-With-Hearing-Impairment

Storytelling App For Children With Hearing Impairment

33
Emerging
2009 thewh1teagle/vad-rs

Speech detection using silero vad in Rust

33
Emerging
2010 madzadev/voice-cue

📣 Find sentiments, tags, entities, and actions in your voice recordings instantly

33
Emerging
2011 yyaadet/autosrt_page

AutoSRT is a macOS app that automatically generates dual-language subtitles...

33
Emerging
2012 rryam/SakuraKit

Swift SDK for Prototyping AI Speech Generation

33
Emerging
2013 twn39/EdgeTTS.DotNet

EdgeTTS.DotNet is a C# (.NET) library that allows you to use Microsoft...

33
Emerging
2014 muhammadGagah/native-speech-generation

NVDA add-on for converting text into natural-sounding speech with Google Gemini AI.

33
Emerging
2015 small-cactus/Jarvis-ChatGPT-VoiceAssistant

Jarvis powered by GPT-3.5/GPT-4

33
Emerging
2016 atakanakin/TutunSabri

He is not our hero. He is a silent guardian. A watchful protector.

33
Emerging
2017 ywatanabe1989/scitex-notification

Give your AI agents a voice — TTS, phone calls, SMS, email, webhooks. One...

33
Emerging
2018 eminemahjoub/pdf-voice-reader

"PDF Reader: A Python application for seamless PDF viewing with enhanced...

33
Emerging
2019 rt400/ReversoTTS-HA

ReversoTTS component for HomeAssistant

33
Emerging
2020 fquirin/speech-recognition-experiments

Experiments to test different speech recognition systems for SEPIA Framework

33
Emerging
2021 eellak/gsoc2019-sphinx

Creation of an online Greek mail dictation system, using Sphinx and...

33
Emerging
2022 aishoot/Multi-Hotword_Spotting

Won't it be cool to build a speech assistant like Alexa or Siri yourself...

33
Emerging
2023 gheyret/uyghur-asr-ctc

Speech Recognition for Uyghur using deep learning

33
Emerging
2024 FlorianEagox/WeeaBlind

A program to dub non-English media with modern AI speech synthesis,...

33
Emerging
2025 vdutts7/ai-rapper

Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise...

33
Emerging
2026 eellak/gsoc2021-audio-annotation-tool

Creation of a multi-user, audio-first annotation tool - GSoC 2021

33
Emerging
2027 Harshit-shrivastav/TikTok-TTS-Bot

A Python Telegram bot that generates TikTok text-to-speech audio.

33
Emerging
2028 vroomai/vst

🎹 Generate sounds from words. Directly in your DAW.

33
Emerging
2029 0xPD33/sonori

Sonori is a fully local STT app for Linux (Wayland).

33
Emerging
2030 Gust4voSales/Marvin-VirtualAssistent

A dynamic virtual assistant made with Python; you can easily add more voice...

33
Emerging
2031 shawnrushefsky/talky-talky

MCP server for Audio Generation and Analysis with a Variety of Open Models.

33
Emerging
2032 mramshaw/Speech-Recognition

Speech recognition with Python

33
Emerging
2033 xinjli/ucla-phonetic-corpus

Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH...

33
Emerging
2034 akku2005/VocalInk

Next-gen open-source voice-to-blog platform with AI, TTS, gamification, and...

33
Emerging
2035 lesleyrs/clipboard-narrator

Turn any web page into an audiobook, works in the background on desktop!

33
Emerging
2036 lucasnewman/vocos-mlx

Implementation of 'Vocos: Closing the gap between time-domain and...

33
Emerging
2037 aks-devs/mod_openai_tts

Freeswitch Speech-To-Text module

33
Emerging
2038 speechly/ios-client

The iOS client library for Speechly API

33
Emerging
2039 wwdok/faster-whisper-webui-cn

Cloned from https://huggingface.co/spaces/aadnk/faster-whisper-webui, and...

33
Emerging
2040 jimbobbennett/SpeechToTextSamples

Sample code showing how to use the Azure Speech to Text service from Python 🗣

33
Emerging
2041 DojoCodingLabs/remotion-superpowers

🎬 Claude Code plugin — full video production studio for Remotion. AI...

33
Emerging
2042 royshil/obs-squawk

Real-time text-to-speech AI engine built into OBS, integrated and intuitive

33
Emerging
2043 lucadellalib/audiocodecs

A collection of audio codecs with a standardized API

33
Emerging
2044 ShawnPi233/SynParaSpeech

Official Repository of Paper: "SynParaSpeech: Automated Synthesis of...

33
Emerging
2045 mravanelli/pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained...

33
Emerging
2046 pviotti/sayit

A text-to-speech command line tool backed by Azure Cognitive Services.

33
Emerging
2047 HeyHeyChicken/NOVA-Python

NOVA is a customizable voice assistant made with Python.

33
Emerging
2048 ORI-Muchim/One-Click-MB-iSTFT-VITS2

MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making...

33
Emerging
2049 prathamsolanki/gender-recognition-by-voice

Identify a voice as male or female.

33
Emerging
2050 yui-mhcp/text_to_speech

(Multi Speaker) Text-To-Speech (TTS) project

33
Emerging
2051 daisy/obi

Obi is an open source audio book production tool that produces digital...

32
Emerging
2052 r1di/neutts-fastapi

OpenAI-compatible Text-to-Speech API server powered by NeuTTS. Drop-in...

32
Emerging
2053 ga642381/Taiwanese-Whisper

Fine-tune the Whisper model for Taiwanese speech recognition

32
Emerging
2054 Citadawn/VoiceDAO

语道 (VoiceDAO) - an Android app focused on text-to-speech

32
Emerging
2055 taresh18/orpheus-streaming

Orpheus TTS Server with streaming support (TTFB ~160ms)

32
Emerging
2056 ye-kyaw-thu/myG2P

Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary...

32
Emerging
2057 thevickypedia/Jarvis_UI

Lightweight UI to interact with Jarvis via API calls

32
Emerging
2058 saky-semicolon/Emotion-Aware-AI-Support-System

A smart AI-powered platform that detects emotions from student voice input,...

32
Emerging
2059 jianchang512/kokoro-uiapi

Web UI and OpenAI-compatible API for Kokoro TTS

32
Emerging
2060 poretsky/ru_tts

Compact and portable Russian speech synthesizer

32
Emerging
2061 yanghaha0908/FastHuBERT

Official implementation for Fast-HuBERT: An Efficient Training Framework for...

32
Emerging
2062 arunk140/serve-piper-tts

Go Lang API Wrapper around Piper TTS - Supports TTS Inference and List of Voices

32
Emerging
2063 susilnem/American-sign-Language

A CNN based human computer interface for American Sign Language recognition...

32
Emerging
2064 esoyeon/KoreanTTS

Korean Text To Speech Project: Using Tacotron1, Tacotron2, Wavenet and Melgan

32
Emerging
2065 SCRN-VRC/Voice-Recognition-Shader

Audio detection with visemes in a fragment shader

32
Emerging
2066 rcdalj/speech2speech

Full speech-to-speech workflow (can be customized to user's requirements)

32
Emerging
2067 manascb1344/zonos-api

Production-ready FastAPI wrapper for Zonos TTS models with GPU acceleration,...

32
Emerging
2068 unza-speech-lab/zambezi-voice

Repository for multilingual speech data resources for native languages of Zambia.

32
Emerging
2069 biyoml/End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

32
Emerging
2070 jonaro00/wallace-minion

🔨🙂 Discord Bot for my private friend server

32
Emerging
2071 lcraver/ProxiTalk

This is the repo for ProxiTalk OS. ProxiTalk is a custom operating system...

32
Emerging
2072 30stomercury/Automatic-Speech-Recognition

End-to-End Speech Recognition Using Tensorflow

32
Emerging
2073 phineas-pta/fine-tune-whisper-vi

Jupyter notebooks to fine-tune Whisper models on Vietnamese using Colab...

32
Emerging
2074 LedoKun/028-simple-queue-system

A real-time, responsive queue calling system designed for TV displays,...

32
Emerging
2075 ivanvovk/compressed-tacotron2-pytorch

Compressed version of Tacotron 2 using Tensor Train + Waveglow.

32
Emerging
2076 SiddhantSadangi/st_deepgram_playground

API playground for Deepgram built with Streamlit

32
Emerging
2077 DataXujing/ASR-paper

🔥 ASR tutorial: https://dataxujing.github.io/ASR-paper/

32
Emerging
2078 vani-voice/vani

Open protocol & middleware for Indian language voice agents — STT→LLM→TTS in...

32
Emerging
2079 Ephrem-ETH/E2E-KWS

End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM

32
Emerging
2080 Aditya-ds-1806/dictpress-tts

TTS plugin for dictpress

32
Emerging
2081 sberdevices/smartspeech

SmartSpeech is a service for speech synthesis and recognition

32
Emerging
2082 daymade/chattts-seed-example

A ChatTTS audio repository containing different voice timbres generated with different seeds, so you can easily pick the seed you like.

32
Emerging
2083 stefantaubert/mean-opinion-score

Python library for calculating the mean opinion score and 95% confidence...

32
Emerging
2084 thewh1teagle/israwave

Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet

32
Emerging
2085 funway/audible-epub3-maker

Generate audiobooks from plain EPUB files in EPUB 3 Media Overlays format...

32
Emerging
2086 Deimos-M/DL-Virtual-Assistant

A virtual assistant for the visually impaired which includes models like...

32
Emerging
2087 Yangyangii/TPGST-Tacotron

Google's TPGST reimplementation.

32
Emerging
2088 taikun114/VOICEVOX-TTS-for-Home-Assistant

Custom integration for Japanese TTS using VOICEVOX in Home Assistant.

32
Emerging
2089 mike-nott/smart-announcements

Intelligent context-aware voice announcements for Home Assistant....

32
Emerging
2090 AkshathRaghav/tinyspeech

Code release for "TinySpeech: Attention Condensers for Deep Speech...

32
Emerging
2091 OpenTSLab/BELLE

Official implementation of BELLE "Bayesian Speech Synthesizers Can Learn...

32
Emerging
2092 souvikg544/TTS_Data_Maker

Text to speech is an emerging zone of AI. This repository helps to create a...

32
Emerging
2093 ih3xcode/h3xassist

Meeting assistant that records, transcribes, and summarizes online meetings...

32
Emerging
2094 brewusinc/Edge-TTS

Edge-TTS is a Swift implementation of Microsoft Edge's Text-to-Speech (TTS)...

32
Emerging
2095 samuelbradshaw/text-to-timestamps

Python and command-line utility for aligning audio to a transcript.

32
Emerging
2096 georgesterpu/Taris

Transformer-based online speech recognition system with TensorFlow 2

32
Emerging
2097 wahyd4/say-it

TTS in command line -- Pronounce the Chinese and English words you typed in.

32
Emerging
2098 art1415926535/yandex_speech

Generation of speech using Yandex SpeechKit.

32
Emerging
2099 oleges1/quartznet-pytorch

QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]

32
Emerging
2100 mazzasaverio/youtube-auto-dub

Automated voice dubbing for YouTube videos using Docker, OpenVoice, and...

32
Emerging