All Voice AI Tools

6,981 tools ranked by quality score · Page 16 of 70

Showing 1501–1600 of 6,981
# Tool Score Tier
1501 holgern/pykokoro

A Python library for Kokoro TTS (Text-to-Speech) using ONNX runtime.

38
Emerging
1502 wannaphong/KhanomTan-TTS-v1.0

KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that...

38
Emerging
1503 deepgram-starters/django-transcription

Get started using Deepgram's Transcription with this Django demo app

38
Emerging
1504 p-groarke/wsay

Windows "say"

38
Emerging
1505 ibotplus/kbase-media

视频、音频、图片内容识别、语音转写、语音合成 / easy convert video audio image to text, and revert...

38
Emerging
1506 tochilkinva/tg_bot_stt_tts

Telegram bot with voice message recognition and generation. Speech to Text...

38
Emerging
1507 wdbm/deep_throat

speech synthesis program

38
Emerging
1508 wxkingstar/TransEcho

macOS 实时同声传译 - 捕获系统音频,实时翻译字幕 + 语音同传 | Real-time simultaneous interpretation for macOS

38
Emerging
1509 inevolin/DiscordSpeechBot

A speech-to-text bot for discord with music commands and more using NodeJS....

38
Emerging
1510 CarrotYuan/openclaw-voice-control

A macOS local voice-control companion for OpenClaw with Siri-like wakeword...

38
Emerging
1511 aeleraqi/Text-to-Speech-gTTS---Arabic-text

Google Text-to-Speech API to convert text input into audio files

38
Emerging
1512 34j/mecab-text-cleaner

Simple Python package (CLI/Python API) for getting japanese readings...

38
Emerging
1513 sciforce/phones-las

Articulatory features estimation using Listen Attend and Spell architecture.

38
Emerging
1514 manhph2211/ViSR

This repo builds an end-to-end deep learning application that supports...

38
Emerging
1515 Troyanovsky/awesome-TTS-Colab

Collection of awesome TTS and voice cloning models to run with Google Colab

38
Emerging
1516 Kyubyong/specAugment

Tensor2tensor experiment with SpecAugment

38
Emerging
1517 shijincai/VibeVoice

Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup...

38
Emerging
1518 BlinkTagInc/gtfs-tts

Review GTFS stop pronunciations to determine which stops need a tts_stop_name value.

38
Emerging
1519 Dostoyewski/django_voice_bot

Package for django onpage support bot with speech recognition and voice commands

38
Emerging
1520 falabrasil/kaldi-br

☕🇧🇷 Scripts para o Kaldi em Português Brasileiro

38
Emerging
1521 ng-web-apis/speech

A library for using Web Speech API with Angular

38
Emerging
1522 IceFog72/pocket-tts-openapi

Fast, local, OpenAI-compatible TTS server with voice cloning support powered...

38
Emerging
1523 linagora-labs/ssak

SSAK contains helpers and tools to process data and train/infer ASR models.

38
Emerging
1524 naeruru/mimiuchi

a free, customizable, osc capable speech-to-text interface for relaying text...

38
Emerging
1525 sskorol/vosk-api-gpu

Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC

38
Emerging
1526 sexfrance/RecaptchaV2-Solver

A Python-based solution for solving Google's reCAPTCHA v2 challenges...

38
Emerging
1527 DrDroidLab/voicesummary

Open Source AI Database for Voice Agent Transcripts | Call Analysis &...

38
Emerging
1528 leduckhai/wav2graph

wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech

38
Emerging
1529 QuantiusBenignus/BlahST

Input text from speech in any Linux window, the lean, fast and accurate way,...

38
Emerging
1530 noco-ai/spellbook-docker

AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many...

38
Emerging
1531 husniadil/cc-hooks

Audio feedback plugin for Claude Code with TTS announcements, sound effects,...

38
Emerging
1532 HawkAaron/E2E-ASR

PyTorch Implementations for End-to-End Automatic Speech Recognition

38
Emerging
1533 rxlabz/sytody

a Flutter "speech to todo" app example

38
Emerging
1534 keenresearch/KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING:...

38
Emerging
1535 HawkAaron/RNN-Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction...

38
Emerging
1536 kroko-ai/kroko-onnx

Kroko ASR - Speech-to-text

38
Emerging
1537 CMsmartvoice/One-Shot-Voice-Cloning

:relaxed: One Shot Voice Cloning base on Unet-TTS

38
Emerging
1538 ybouhjira/claude-code-tts

🔊 Text-to-Speech MCP plugin for Claude Code - hear audio feedback while...

38
Emerging
1539 rishikksh20/UnivNet-pytorch

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators...

38
Emerging
1540 binzhouchn/masr

中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。

37
Emerging
1541 persiandataset/PersianSpeech

Persian ASR dataset

37
Emerging
1542 stevenhillis/awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

37
Emerging
1543 seven-io/home-assistant

HACS supporting Home Assistant integration for seven

37
Emerging
1544 thinh-vu/ur_audio_sub

Generate text captions for audio files & youtube video using OpenAI Whisper...

37
Emerging
1545 talin190/Qwen3-TTS-Daggr-UI

🎤 Create dynamic voice experiences with Qwen3-TTS-Daggr-UI, a Gradio app for...

37
Emerging
1546 fqueis/pollinationsai

🔥 TypeScript SDK wrapper for Pollinations AI services

37
Emerging
1547 Bunlong/react-webspeech

The official WebSpeech for React.

37
Emerging
1548 habla-liaa/ser-with-w2v2

Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from...

37
Emerging
1549 hypeapps/black-mirror

A voice controlled smart mirror powered by Raspberry Pi3 and AndroidThings.

37
Emerging
1550 LetsPlayNow/Speech_AI

Speech to speech bot built with Python

37
Emerging
1551 aks-devs/mod_openai_asr

Freeswitch Speech-To-Text module

37
Emerging
1552 j3soon/speech-to-windows-input

Perform speech-to-text (STT/ASR) with Azure speech service and simulate...

37
Emerging
1553 audioku/cross-accent-maml-asr

Meta-learning model agnostic (MAML) implementation for cross-accented ASR

37
Emerging
1554 botbahlul/vosk_autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using...

37
Emerging
1555 tmanderson/ivona-node

Ivona Cloud (via Amazon services) client library for Node

37
Emerging
1556 yzfly/awesome-voice-agents

A curated list of voice AI agent frameworks, tools, resources, and best practices

37
Emerging
1557 hcy71o/MB-iSTFT-VITS-with-AutoVocoder

Incorporating AutoVocoder to MB-iSTFT-VITS

37
Emerging
1558 jaywcjlove/TextSoundSaver

Using the TextSoundSaver application, you can convert text into realistic...

37
Emerging
1559 shi-gg/Auditional-Text

The source code of the Auditional Text discord Boat

37
Emerging
1560 hcoles/voices

Fast, in-process text to speech for Java

37
Emerging
1561 mrf345/flask_gtts

A Flask extension to add gTTS Google text to speech

37
Emerging
1562 jianchang512/chatterbox-api

一个基于 Chatterbox-TTS的文字转语音(TTS)服务。提供与 OpenAI TTS 兼容的 API 接口并支持声音克隆,附带简洁的 Web 用户界面。

37
Emerging
1563 MartinMashalov/VoiceCloning

Generative voice cloning model using TTS synthesis with state-of-the-art...

37
Emerging
1564 johnGettings/LIHQ

Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)

37
Emerging
1565 ismailperim/reportcast

Transform reports into podcasts with AI - Nobody reads your reports. But...

37
Emerging
1566 blip-radar/vatsim-parser

Parser for a variety of VATSIM-related file formats

37
Emerging
1567 VoXera/VoXera

An Open-Source Persian Language Techs Toolkit with Python

37
Emerging
1568 outspeed-ai/voice-devtools

Developer tools to debug and build realtime voice agents. Supports multiple models.

37
Emerging
1569 wangz-code/legado-edge-tts

edge大声朗读微软TTS服务, 在阅读legado中配置语音引擎方式收听微软TTS / Edge大声朗读, 如果没有 vps 部署可以看看阅读内置...

37
Emerging
1570 ORI-Muchim/PolyLangVITS

Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)

37
Emerging
1571 gianpaj/sexyvoice

Voice Cloning, Voice Call and Text to Speech platform. Perfect for content...

37
Emerging
1572 rishikksh20/iSTFT-Avocodo-pytorch

Ultrafast GAN based Vocoder for Text to Speech

37
Emerging
1573 kaloprojects/KALO-ESP32-Voice-Chat-AI-Friends

ESP32-based voice device for chatting with multiple custom AI bots....

37
Emerging
1574 JstnMcBrd/dectalk-tts

API wrapper for the Dectalk TTS system

37
Emerging
1575 thewh1teagle/piper-onnx

Use piper TTS with onnxruntime

37
Emerging
1576 verbio-technologies/python-verbio-speech-center

Python integration with the Verbio Speech Center Cloud....

37
Emerging
1577 HordRicJr/HordVoice

HordVoice - AI-powered voice assistant built with Flutter and Azure AI...

37
Emerging
1578 SpenserCai/cosyvoice3.rs

Python bindings for CosyVoice3 TTS using Candle. Has the characteristics of...

37
Emerging
1579 Sundy1219/eesen-for-thchs30

ASR for Chinese Mandarin

37
Emerging
1580 hipnologo/EchoForge_Studio

Multi-LLM writing and voice production workspace built with Streamlit.

37
Emerging
1581 khakers/go-subgen

Automatically generate subtitles for your media using whisper.cpp via...

37
Emerging
1582 alamparelli/mcp-claude-say

Voice interaction for Claude Code - Talk to Claude and hear responses using...

37
Emerging
1583 bookbot-kids/speech-recognizer-bahasa-indonesian

A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer...

37
Emerging
1584 hhguo/SoCodec

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

37
Emerging
1585 mattzzz/rick-voice

Give any bot the voice of Rick Sanchez

37
Emerging
1586 AlexxIT/FasterWhisper

Faster Whisper for Home Assistant - custom integration with a local...

37
Emerging
1587 sljavi/handsfree-for-web-zoom-module

Zoom module implementation for Handsfree for web

37
Emerging
1588 zassou65535/VITS

VITSによるテキスト読み上げ器&ボイスチェンジャー

37
Emerging
1589 NotAbhinavGamerz/emotion-aware-automatic-speech-recognition

🎤 Enhance speech recognition by detecting emotions in spoken language,...

37
Emerging
1590 zabir-nabil/bangla-tts

Bangla text to speech, Multilingual (Bangla, English) real-time speech...

37
Emerging
1591 mravanelli/pySpeechRev

This python code performs an efficient speech reverberation starting from a...

37
Emerging
1592 tuanh123789/AdaSpeech

An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for...

37
Emerging
1593 OpenASR/idiolect

🎙️ Handsfree Audio Development Interface

37
Emerging
1594 anyvoiceai/Barkify

Barkify: an unoffical training implementation of Bark TTS by suno-ai

37
Emerging
1595 e-c-k-e-r/vall-e

An unofficial PyTorch implementation of VALL-E

37
Emerging
1596 XilinJia/Podcini

Open source podcast instrument for Android supporting contents from YouTube...

37
Emerging
1597 soniqo/speech-android

On-device speech SDK for Android — ASR, TTS, VAD, and noise cancellation...

37
Emerging
1598 erogol/FFTNet

FFTNet vocoder implementation

37
Emerging
1599 GinoShun/Accent-Activation-Steering

Official code for "Activation Steering for Accent Adaptation in Speech...

37
Emerging
1600 zolomohan/speech-recognition-in-javascript

Final Code for Speech Recognition in JavaScript tutorial.

37
Emerging
« Prev 1 2 3 14 15 16 17 18 68 69 70 Next »