All Voice AI Tools

6,981 tools ranked by quality score · Page 20 of 70

Showing 1901–2000 of 6,981
# Tool Score Tier
1901 Saganaki22/ComfyUI-KugelAudio

🗣️ ComfyUI nodes for KugelAudi- Open-source text-to-speech with voice...

34
Emerging
1902 rohanprichard/fastrtc-demo

A simple POC of FastRTC, a framework to use voice mode in python!

34
Emerging
1903 Aman22sharma/Python-AI-Virtual-Assistant

This is python AI Virtual Assistant.

34
Emerging
1904 m1el/nemotron-asr.cpp

Nemotron ASR rewrite to GGML

34
Emerging
1905 dsfsi/dsfsi-datasets

Official DSFSI Public Datasets Registry - Comprehensive catalog of 50+...

34
Emerging
1906 pevers/parkiet

Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)

34
Emerging
1907 sandy1990418/ChineseTaiwaneseWhisper

This repository focuses on leveraging OpenAI's Whisper model for speech...

34
Emerging
1908 jhermann/kopfkino

Syntactic sugar sprinkled on top of MoviePy and AI components to allow...

34
Emerging
1909 seven-io/node-red

The official Node-RED collection by seven.

34
Emerging
1910 ontypehq/mlx-swift-asr

On-device speech recognition for Apple Silicon, powered by MLX.

34
Emerging
1911 slp-rl/HebTTS

The official implementation of "A Language Modeling Approach to...

34
Emerging
1912 A-Jacobson/tacotron2

pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf

34
Emerging
1913 kwebby/Qwen3-TTS-Voice-Studio

A Text to Speech App for Qwen3-TTS Family Models to create custom voices,...

34
Emerging
1914 rishiskhare/parrot

A free, offline, private AI text-to-speech desktop app built on Rust 🦜

34
Emerging
1915 RoyNkem/SwiftUI-AI-Voice-Assistant

A multi-platform app for voice-based interactions built using SwiftUI with...

34
Emerging
1916 boochow/TFLite_Micro_MicroSpeech_M5Stack

M5Stack (ESP32) port of TensorFlow Lite for Microcontrollers demo "Micro Speech"

34
Emerging
1917 yufan-aslp/AliMeeting

The project is associated with the recently-launched ICASSP 2022...

34
Emerging
1918 soheil-mp/Speech-Recognition

End-to-End Speech Recognition using Neural Networks.

34
Emerging
1919 khuangaf/ITRI-speech-recognition-dataset-generation

Automatic Speech Recognition Dataset Generation

34
Emerging
1920 rishikksh20/TalkNet2-pytorch

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for...

34
Emerging
1921 totalvoice/totalvoice-php

Client em PHP para API da Totalvoice

34
Emerging
1922 gladchinda/web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the...

33
Emerging
1923 henryhale/ttspeech

🔊 A fully basic voice synthesizer in vanillaJS

33
Emerging
1924 TheDeathDragon/LiveTranslate

Real-time audio translation overlay for Windows — captures system audio +...

33
Emerging
1925 GuangChen2333/FindUrVoicesPJSK

《世界计划 : 缤纷舞台》单角色语音数据集一键获取小工具 | 无需手动打标 | wav无压缩 | A simple tool for obtaining...

33
Emerging
1926 yousefkotp/Egyptian-Arabic-ASR-and-Diarization

The official submission from Speech Squad team for the MTC-AIC 2 competition...

33
Emerging
1927 Allan-Nava/fakeyou.go

A powerful golang sdk library for interacting with the FakeYouAPI easily

33
Emerging
1928 deepgram-starters/csharp-voice-agent

Get started using Deepgram's Voice Agent with this C# demo app

33
Emerging
1929 inboxpraveen/Speech-Annotation-Tool

Review, correct, and export ASR transcripts at scale. Web-based ASR accuracy...

33
Emerging
1930 b7s/whisper-php

State-of-the-art speech recognition to your PHP/Laravel applications

33
Emerging
1931 surfaceyu/edge-tts-go

Use Microsoft Edge's online text-to-speech service from golang WITHOUT...

33
Emerging
1932 wongfei/UEHMI

Unreal Engine Human Machine Interface

33
Emerging
1933 lucascamillomd/anki-tts

A free, open-source app for Anki text-to-speech in MacOS.

33
Emerging
1934 ethicalabs-ai/Kurtis-E1-MLX-Voice-Agent

A lightweight voice companion, optimized for macOS.

33
Emerging
1935 VirtualZer0/StreamTalkerClient

Cross-platform desktop app that reads Twitch and VK Play chat aloud using AI...

33
Emerging
1936 minseok0809/robotic-process-automation

File Management, School Automation, Text Automation, Web Crawler, Web...

33
Emerging
1937 ddlBoJack/MT4SSL

[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL:...

33
Emerging
1938 jindongwang/EasyEspnet

Making Espnet easier to use

33
Emerging
1939 renaudjenny/swift-tts

A straightforward package containing version for Swift modern concurrency,...

33
Emerging
1940 Alex-Tremayne/LaTeXt

Python package for converting LaTeX to text which can be read by text to...

33
Emerging
1941 Madhur215/Chatbot-cum-voice-Assistant

An AI chatbot with features like conversation through voice, fetching events...

33
Emerging
1942 second-state/gsv_tts

Streaming TTS API server written in Rust

33
Emerging
1943 Alenkar/kairos-asr

Адаптированный ASR pipeline для удобной интеграции в другие приложения на...

33
Emerging
1944 sp-squared/Turkic-Languages-Audio-to-Text-Transcription

Open-source Automatic Speech Recognition (ASR) pipeline for Bashkir...

33
Emerging
1945 nchudleigh/sc2-ultra

Voice-controlled StarCraft II - command Zerg, Protoss, or Terran using...

33
Emerging
1946 kanttouchthis/text_generation_webui_xtts

XTTSv2 Extension for oobabooga text-generation-webui

33
Emerging
1947 andi611/TTS-Tacotron-Pytorch

Pytorch implementation of Tacotron, a speech synthesis end-to-end generative...

33
Emerging
1948 skshadan/WhisCall

A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo,...

33
Emerging
1949 williamxhero/ttsmaker

TTSMaker: A Python library for interacting with the TTSMaker API to easily...

33
Emerging
1950 umbertocappellazzo/Omni-AVSR

Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal...

33
Emerging
1951 lucasnewman/descript-mlx

Implementation of the Descript Audio Codec in MLX

33
Emerging
1952 hanxi/epub2mp3

这是一个使用 Microsoft Edge TTS 服务将 EPUB 电子书转换为 MP3 音频文件的工具。

33
Emerging
1953 user3301/ssml_builder

:sound: a general SSML(Speech Synthesis Markup Language) builder

33
Emerging
1954 dmatekenya/Chichewa-Speech2Text

Automated Speech Recognition for Chichewa.

33
Emerging
1955 Tinkoff/asterisk-voicekit-modules

Non-blocking Asterisk modules for accessing VoiceKit services for speech...

33
Emerging
1956 warisqr007/vocos

Causal version of Vocos (neural vocoders for high-quality audio synthesis)...

33
Emerging
1957 aydinnyunus/LinuxVoiceAssistant

Linux Voice Assistant for to Make Your Work Easier

33
Emerging
1958 tomik395/ESP32-AI

Speak to your ESP32 and it speaks back! Your new personal assistance is...

33
Emerging
1959 Ishan7390/Jarvis_AI

This is my attempt at building a not so much of an AI, Jarvis

33
Emerging
1960 huytd/speech

A tool to practice English speaking

33
Emerging
1961 lang-uk/ukrainian-tts-preprocessing

Tools and models for Ukrainian phonemization and lexical stress prediction

33
Emerging
1962 codename0og/codename-rvc-fork-3

Codename's rvc fork version 3, based on Applio.

33
Emerging
1963 nowickam/facial-animation

Audio-driven facial animation generator with BiLSTM used for transcribing...

33
Emerging
1964 felivalencia3/RealVoiceGPT

RealVoiceGPT is a web application that lets you have voice conversations...

33
Emerging
1965 seungwonpark/awesome-tts-samples

Awesome list of TTS papers with audio samples

33
Emerging
1966 ehtisham91/Django-Speech-to-text-Chat

This App allows users to convert their speech into text and send that text...

33
Emerging
1967 Shyguy99/Whatsapp-bot

A simple WhatsApp Bot made using open-wa library with some additional features.

33
Emerging
1968 victor369basu/End2EndAutomaticSpeechRecognition

In this repository, I have developed an end to end Automatic speech...

33
Emerging
1969 naschorr/hawking

The retro text-to-speech bot for Discord

33
Emerging
1970 EvilFreelancer/docker-fish-speech-server

OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.

33
Emerging
1971 aks-devs/mod_google_asr

Freeswitch Speech-to-Text module

33
Emerging
1972 saadbutt32/Conversion-of-Pakistan-Sign-Languag-into-Text-and-Speech-using-OpenPose-and-Machine-Learning

Real-time translation of Pakistan sign language into text and speech using...

33
Emerging
1973 Jdreioe/Wingmate

A project to make people who cannot speak, speak!

33
Emerging
1974 atharva-again/indic-asr-onnx

Helper package for using quantized versions of the Indic ASR Model by AI4Bharat.

33
Emerging
1975 soundhound/houndify-sdk-go

The official Houndify SDK for Go

33
Emerging
1976 RF5/transfusion-asr

Transcribing Speech with Multinomial Diffusion, training code and models.

33
Emerging
1977 stellarloop/bitbat.ai

My father, a journalist, used to painstakingly transcribe interviews from a...

33
Emerging
1978 matt-goldman/AI-Panelist

An AI Panelist participating in Beer Driven Devs Live 2026

33
Emerging
1979 MrAliHasan/Sophia-AI-Assistant

Sophia AI Assistant is a Python-based desktop AI that performs a variety of...

33
Emerging
1980 HerbertHe/edge-tts-server

Server for edge-tts

33
Emerging
1981 Umbaji/NMTMD

Official repository for the Opensource Textdataset for NMT for local langues...

33
Emerging
1982 dokuniev/claude-voice

Hear which Claude Code session needs you — speaks the repo and branch name out loud

33
Emerging
1983 AdamHolwerda/bloom-cli

A command line utitlity to create a multipage static website from Ulysses export

33
Emerging
1984 matthijsvk/TIMITspeech

Speech recognition on the TIMIT (or any other) dataset

33
Emerging
1985 jekyll2014/VoiceAssistant

Locally hosted voice assistant with plugin extension feature

33
Emerging
1986 Rubiksman78/RenAI-Chat

VN Like Interface for Chatbots

33
Emerging
1987 rhulha/Speech2Speech

A web application that converts speech to speech 100% private

33
Emerging
1988 The-Swarm-Corporation/Voice-Agents

Voice-Agents is a production-ready Python library for building...

33
Emerging
1989 Enforcer03/voice-cloning

Voice cloning with tortoise-tts

33
Emerging
1990 marytts/gradle-marytts-voicebuilding-plugin

A replacement for the legacy VoiceImportTools in MaryTTS

33
Emerging
1991 charstorm/vilberta

Voice chatbot with voice+screen output to show that "not everything needs to...

33
Emerging
1992 pinch-eng/pinch-python-sdk

Real-time voice translation SDK

33
Emerging
1993 eazhary/dctts2

Deep Convolution Text to Speech

33
Emerging
1994 deepgram-devs/flask-live-chatgpt-text-to-speech

Get started using Deepgram's Live ChatGPT Text-to-Speech with this Flask demo app

33
Emerging
1995 pnkvalavala/digitaltwin

Using a single image and just 10 seconds of sample audio, our project...

33
Emerging
1996 tristan-mcinnis/Multimodal-voice-assistant

This project is a multi-modal AI voice assistant that uses LM Studio, OpenAI...

33
Emerging
1997 lpalbou/VoiceLLM

A modular Python library for voice interactions with AI systems, featuring...

33
Emerging
1998 kromme/Teams-Notetaker

Let AI create the notes of your Teams Meeting

33
Emerging
1999 hwRG/End-to-End-TTS-Fine-Tune

Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.

33
Emerging
2000 MotazSabri/Hanami-release

Live translator that captures any audio that comes from a WINDOWS speaker or...

33
Emerging
« Prev 1 2 3 18 19 20 21 22 68 69 70 Next »