All Voice AI Tools

6,981 tools ranked by quality score · Page 19 of 70

Showing 1801–1900 of 6,981
# Tool Score Tier
1801 heartsuit/BaiduASRAndTTS

Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech;...

35
Emerging
1802 chaonan99/ppt_presenter

Convert ppt to video with audio track, using text to speech synthesis

35
Emerging
1803 ProsusAI/project-echo

An AI-powered voice director assistant for creating engaging audio content...

35
Emerging
1804 WangYixuan12/openai_tts

OpenAI Text-to-Speech Interface

35
Emerging
1805 EtienneAb3d/WhisperTimeSync

Synchronize Whisper's timestamps over an existing accurate transcription

35
Emerging
1806 amitdev01/awesome-voice-ai

Awesome Voice Ai

35
Emerging
1807 sooftware/End-to-End-Speech-Recognition-Models

PyTorch implementation of automatic speech recognition models.

35
Emerging
1808 OwenEdwards/videojs-speak-descriptions-track

A Video.js 7 middleware that uses browser speech synthesis to speak...

35
Emerging
1809 syntithenai/opensnips

Open source projects related to Snips https://snips.ai/.

35
Emerging
1810 holgern/ttsforge

Convert EPUB files to audiobooks using Kokoro ONNX TTS

34
Emerging
1811 candlewill/AiVoice

Deep CNN networks for Speech Synthesis

34
Emerging
1812 Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and...

34
Emerging
1813 hacktronaut/azure-avatar-demo

Text To Speech Demo in ReactJS Application using Azure Avatar AI Service.

34
Emerging
1814 jianchang512/gemini-speech2srt

使用 Gemini AI 转写音视频为 SRT 字幕

34
Emerging
1815 tiansztiansz/voice-assistant

重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。

34
Emerging
1816 rtk-ai/vox

A universal AI toolkit for high-performance Speech-to-Text (STT) and...

34
Emerging
1817 LucaLuke13/TalkyBotty

Simply forward a video or voice message in any language to the bot, and it...

34
Emerging
1818 Fatma-Chaouech/audioverse

Breathe Life Into Your Books! 📚🌱

34
Emerging
1819 medokin/soundpad-text-to-speech

Text-To-Speech for Soundpad

34
Emerging
1820 nhaouari/local11labs

Local11Labs allows generating high-quality text-to-speech and podcast...

34
Emerging
1821 Mobile-Artificial-Intelligence/maise

Maise is an open-source android speech engine designed to provide a powerful...

34
Emerging
1822 akinsella/yt-transcript-rs

🎬️ A Rust library for accessing YouTube Video Infos & Transcripts

34
Emerging
1823 trabdlkarim/voce-browser

Voice Controlled Chromium Web Browser

34
Emerging
1824 Dark2C/Viral-Faceless-Shorts-Generator

Automatically generate faceless YouTube Shorts from trending topics using AI...

34
Emerging
1825 jvandenaardweg/ssml-split

Splits SSML strings into batches AWS Polly ánd Google's Text to Speech API...

34
Emerging
1826 egorsmkv/tts_uk

High-fidelity speech synthesis for Ukrainian using modern neural networks.

34
Emerging
1827 deepkyu/ml-talking-face

Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)

34
Emerging
1828 moeru-ai/ortts

𖣘🔊 Simple and Easy-to-use local TTS inference server, Powered by ONNX Runtime

34
Emerging
1829 jxlarrea/wyoming-voice-match

A Wyoming protocol ASR proxy that verifies speaker identity and isolates...

34
Emerging
1830 GeoHaberC/Story-to-Video

Create a Movie animation plus Audio plus Subtitle from a text file

34
Emerging
1831 Lunarien/Lunariens-Mental-Math-Trainer

Mental math trainer made in C#.

34
Emerging
1832 iotjin/JhPrivacyAuthTool

隐私权限判断 - 封装了几种常用的隐私权限判断(定位服务,通讯录, 日历,提醒事项, 照片, 蓝牙共享,麦克风, 相机)和通知的注册和判断。定位服务,蓝牙共享是单独调用的

34
Emerging
1833 akashmjn/cs224n-gpu-that-talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

34
Emerging
1834 kaituoxu/Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS)...

34
Emerging
1835 FlooferLand/ttvoice-mod

A Minecraft mod that lets you type to speak!

34
Emerging
1836 AndreDalwin/Whisper2Summarize

Whisper2Summarize is an application that uses Whisper for audio processing...

34
Emerging
1837 doveg/whisper-real-time

A real time offline transcriber with gui, based on OpenAI whisper

34
Emerging
1838 TartuNLP/text-to-speech-worker

Estonian multi-speaker neural text-to-speech worker that processes requests...

34
Emerging
1839 tktcorporation/discord-tts-bot

A discord bot to use tts in your voice channel.

34
Emerging
1840 nexmo-community/voice-azure-speechtotext-py

Sample Code for Realtime Transcription using Nexmo, Microsoft Azure Speech...

34
Emerging
1841 seven-io/net-client

Official .NET API Client for seven

34
Emerging
1842 yapit-tts/yapit

Listen to anything. TTS for documents, papers, and web pages.

34
Emerging
1843 N6UDP/SteamDiscordTTSBot

A steam chat to Discord TTS bridge

34
Emerging
1844 NeoKazuya/qwen3-tts-enhanced

Enhanced Qwen3-TTS voice cloning GUI with multi-reference samples, variation...

34
Emerging
1845 ttuleyb/TortoiseTTS-GUI

GradioUI for TortoiseTTS voice generation

34
Emerging
1846 Frida7771/PyVoice

A Python-based speech processing tool that supports both speech-to-text...

34
Emerging
1847 audo-ai/magic-mic

Open Source Noise Cancellation App for Virtual Meetings

34
Emerging
1848 leokwsw/OpenAI-TTS-Gradio

Use OpenAI TTS(Text to Speech) API with Gradio

34
Emerging
1849 bhattbhavesh91/wav2vec2-huggingface-demo

Speech to Text with self-supervised learning based on wav2vec 2.0 framework...

34
Emerging
1850 HectorPulido/chatbot-with-voice

Jarvis like chatbot with voice

34
Emerging
1851 antifield/vmt

Discord App for Transcribing & Translating Voice Messages

34
Emerging
1852 mmpneo/simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

34
Emerging
1853 kssteven418/Q-ASR

[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition

34
Emerging
1854 ayutaz/uCosyVoice

CosyVoice3 text-to-speech for Unity using ONNX inference. Supports zero-shot...

34
Emerging
1855 kaiaai/kaia.js

Kaia.ai platform's JS client library

34
Emerging
1856 Fooftilly/kokoro-extension

Send text from browser to Kokoro-FastAPI for TTS generation

34
Emerging
1857 lepisma/emacs-speech-input

Set of packages for speech and voice inputs in Emacs

34
Emerging
1858 renorari/VoiceJP-Discord

A discord-app can text-to-speech and speech-to-text

34
Emerging
1859 jianchang512/realtime-stt

一个极简的本地离线实时语音转文字工具

34
Emerging
1860 cristofima/AI-Tech-Interview-Preparation

An AI-powered technical interview preparation platform that generates...

34
Emerging
1861 18F/dol-whd-14c

The 14(c) system will become a modern, digital-first service. Applicants...

34
Emerging
1862 neosapience/n8n-nodes-typecast

Integrate Typecast AI TTS into your n8n workflows with this community node.

34
Emerging
1863 cdyangbo/end2endASR

implement end-to-end asr algorithm with tensorflow

34
Emerging
1864 quangvu3/coqui-xtts

Coqui XTTS model with Vietnamese added

34
Emerging
1865 m-nathani/speech_to_text

how to use the Google Cloud Speech API to transcribe audio/video files.

34
Emerging
1866 deepgram-starters/php-transcription

Get started using Deepgram's speech-to-text with this PHP demo app

34
Emerging
1867 keonlee9420/Stepwise_Monotonic_Multihead_Attention

PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to...

34
Emerging
1868 alsrb0607/KoreanSTT

kospeech를 활용한 한국어 음성 인식 모델 개발

34
Emerging
1869 c99koder/AudioClassifier-MQTT

Use the yamnet TensorFlow model to classify live audio from a microphone and...

34
Emerging
1870 nithincvpoyyil/voice-listener

An reusable angular component for voice based input using web speech API

34
Emerging
1871 sudonitin/Audio-book-generator

Convert your ebooks to audiobooks. 📖->🎧

34
Emerging
1872 WeiChiaChang/happy-halloween

🗣 Say "happy halloween" to your browser 🎃

34
Emerging
1873 keonlee9420/Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a...

34
Emerging
1874 Blackwood416/AstraTTS

基于 ONNX Runtime 的跨平台高性能 TTS 合成方案,支持流式输出与低延迟播放,支持自定义音色与中英混合生成。

34
Emerging
1875 alkhimey/esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

34
Emerging
1876 xhuvom/omnilingual-ASR-Web-Dashboard

Meta Omnilingual ASR web based dashboard for testing and API based...

34
Emerging
1877 markokosticdev/cloud_text_to_speech_flutter

Single interface to Google, Microsoft, and Amazon Text-To-Speech.

34
Emerging
1878 priyanujgogoi-28/flowery-tts

Wrapper of Flowery Text to Speech API for Dart

34
Emerging
1879 markmiddo/synthia

AI-powered voice assistant that respects your privacy. Control your desktop,...

34
Emerging
1880 HnDK0/NoveLA

Free Android reader for web novels, light novels, ranobe & EPUB. 25+...

34
Emerging
1881 TartuNLP/text-to-speech-api

REST API for neural text-to-speech synthesis

34
Emerging
1882 nabz0r/mac-local-translator

Local translation app for Mac using speech recognition and offline translation

34
Emerging
1883 aditya-an1l/RILearn

Reinventing Reading with a touch of Interactivity aided Learning

34
Emerging
1884 Harsh-0-7/PDF-Reader

PDF reader with read aloud feature

34
Emerging
1885 C0NZZ/better-teletask

Browser extension that adds useful features like subtitles to HPI Tele-Task.

34
Emerging
1886 notebook-nexus/chatterbox-tts-colab

Transform any text into natural-sounding speech, clone voices from audio...

34
Emerging
1887 book000/audio-transcriber-docker

Automatically transcribe the audio of video / audio files using Speech Recognition.

34
Emerging
1888 rudra00434/SoulPlayer

My own music application build with Django , Tailwind CSS and Spacy...

34
Emerging
1889 ZhuoZhuoCrayon/AcousticKeyBoard-Web

❓声学键盘|脑洞大开:做一个能听懂键盘敲击键位的「玩具」,学习信号处理 / 深度学习 / 安卓 / Django。

34
Emerging
1890 bishop-ai/bishop-ai

Voice and text virtual assistant

34
Emerging
1891 MarkParker5/STARK-PLACE

S.T.A.R.K. Platform Library and Community Extensions

34
Emerging
1892 philsyn/DiffWave-Vocoder

Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and...

34
Emerging
1893 janewu77/ela-extension

English Learner Assistant

34
Emerging
1894 Lastorder-DC/chatreader-kor

채팅 읽어주는 로봇

34
Emerging
1895 leprosus/golang-tts

Text-to-Speach golang package based in Amazon Polly service

34
Emerging
1896 jiwidi/DeepSpeech-pytorch

Pytorch implementation for DeepSpeech 2.0

34
Emerging
1897 T-vK/Termux-DeepSpeech

Open source offline speech recognition for Android using Mozilla's...

34
Emerging
1898 edde746/tiktok-askreddit

A content generation & posting bot for TikTok, scraping posts from r/AskReddit

34
Emerging
1899 speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on...

34
Emerging
1900 msalhab96/SpeeQ

A framework for automatic speech recognition

34
Emerging
« Prev 1 2 3 17 18 19 20 21 68 69 70 Next »