All Voice AI Tools

6,981 tools ranked by quality score · Page 36 of 70

Showing 3501–3600 of 6,981
# Tool Score Tier
3501 seyedsaleh/persian-speech-recognition

Simple word recognition using CNN on Raspberry Pi board 🗣

23
Experimental
3502 trungd/speech-recognition

experimental speech recognition library in tensorflow

23
Experimental
3503 orbxball/DSP

2016 Autumn (105-1) -- Fundamentals of Digital Speech Signal Processing

23
Experimental
3504 Aalwattar/ParrotInk

Professional-grade, real-time voice-to-text for Windows. Stream your voice...

23
Experimental
3505 common-voice/our-voices-model-competition

Our Voices Competition

23
Experimental
3506 eGroupAI/speech-integration-starter

Public-safe starter kit for Whisper integration

23
Experimental
3507 Hayder-IRAQ/SubLab

🎬 Auto-generate & translate video subtitles using Whisper AI — offline,...

23
Experimental
3508 jailuthra/asr

Kaldi ASR wrapper scripts

23
Experimental
3509 mjmammoth/murmur

Real-time speech transcription. Privacy-first.

23
Experimental
3510 R3ner/Barrel-Timer

Advanced voice-controlled cooldown tracker for League of Legends. Tracking...

23
Experimental
3511 mkiol/papago

Papago repeats what you say but in different language

23
Experimental
3512 anshulgupta0803/ASSR

ASSR: Automatic Stuttered Speech Recognition

23
Experimental
3513 AASHISHAG/asr-german

Automatic Speech Recognition (ASR) - German

23
Experimental
3514 Mierdoso87/Step-Audio-R1.1

🎧 Unlock audio insights with Step-Audio-R1.1, the first model that scales...

23
Experimental
3515 cavemansatyn-design/AI-Multilingual-Speech-Chatbot-for-Healthcare-Education-Reception

Developed a multilingual speech-based chatbot capable of handling user...

23
Experimental
3516 brenomfviana/rita

RITA (Rapid Interaction Assistant for Tasks) is a voice-controlled virtual...

23
Experimental
3517 chizkidd/igbo-asr-tonal-evaluation

Systematic evaluation of tonal fidelity in facebook/omniASR-CTC-1B when...

23
Experimental
3518 shessam/DSR

Throughout history, Altough there has been significant research in the field...

23
Experimental
3519 TeaPoly/CE-OptimizedLoss

Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss...

23
Experimental
3520 tushar-prabhu/Multilingual-Voice-Transcriber-and-Translator

A Python-based application that records voice, transcribes spoken text,...

23
Experimental
3521 ServerSideHannes/las

tf 2.0 implementation of Listen, attend and spell

23
Experimental
3522 miaubonito/subsync

🎥 Transcribe and translate YouTube subtitles quickly with SubSync, a Python...

23
Experimental
3523 yxngrbree/text-to-speech

Nano weight TTS

23
Experimental
3524 lohriialo/texttospeech

Google's Speech Synthesis, Text to speech conversion powered by machine learning

23
Experimental
3525 francescodisalvo05/smart-surveillance-raspberrypi

Smart Surveillance System on RaspberryPi

23
Experimental
3526 RW128k/VCIDE

A simple text editor for writing Python using your voice.

23
Experimental
3527 Ryan-M-Smith/Quinton-VoiceAssistant

A simple voice assistant

23
Experimental
3528 brailcom/tts-api-provider

Common interface to speech synthesis

23
Experimental
3529 DevTae/SpeechFeedback

Docker, 음성인식 AI, FastAPI 기반 한국어 발음 교정 시스템

23
Experimental
3530 bjorand/go-speech

Simple speech recognition proof of concept

23
Experimental
3531 dictto-app/dictto

Voice-to-text for Windows — hold a hotkey, speak, release. Clean text...

23
Experimental
3532 Kaljurand/things-k6nelelauncher

An Android Things app that launches Kõnele

23
Experimental
3533 YannJY02/AutoTranscribe

🎙 Automated offline video transcription for macOS — FunASR + speaker...

23
Experimental
3534 RK2521/swift-toml

📄 Parse TOML configuration files easily with this robust Swift...

23
Experimental
3535 Haseeb69420/randora

🎲 Generate random values effortlessly with Randora, a lightweight utility...

23
Experimental
3536 voskii-pug/whisper-php

🎙️ Transform audio to text effortlessly with Whisper for PHP, a fast, local...

23
Experimental
3537 caimari/vtts

Continuous batching for TTS — like vLLM, but for voice. Serve 10+...

23
Experimental
3538 scrappylabsai/scrappy-radio

AI-powered radio station — generates original music, DJ commentary, and...

23
Experimental
3539 Lynxgsm/vibevoice-realtime-0.5b-local

Local runner for Microsoft VibeVoice Realtime TTS demo. Run the Colab...

23
Experimental
3540 vivek-541/vani-tts

Lightweight on-device Hindi TTS for Android & iOS — fine-tuned on AI4Bharat...

23
Experimental
3541 Sim3-14159/Jeremy

Jeremy is an AI-powered robot; it can see its surroundings, talk, process...

23
Experimental
3542 Jayden-X-L/lobster-radio-skill

个性化qwen3本地模型驱动的资讯电台生成服务 - OpenClaw Skill

23
Experimental
3543 VirtualZer0/StreamTalkerServer

AI text-to-speech server powered by Qwen3-TTS with voice cloning, batch...

23
Experimental
3544 niradler/vocal

Generic Speech AI Platform - Ollama for Voice Models

23
Experimental
3545 deepgram-starters/fastapi-text-to-speech

Get started using Deepgram's Text-to-Speech with this FastAPI demo app

23
Experimental
3546 tozalia/pocket-tts-openapi-gpu

🎤 Clone voices locally with Pocket TTS OpenAPI - GPU. Enjoy free,...

23
Experimental
3547 fardin-sabid/NeuTTS-Studio

On-Device Text-to-Speech · Voice Cloning · Real-Time Streaming

23
Experimental
3548 basicScandal/arbiter

Live AI judge agent — watches hackathon demos in real-time via Gemini Live...

23
Experimental
3549 Echoshard/DiscordBotOpenAI_TTS

A simple discord bot that can produces mp3's using Open AI's TTS API.

23
Experimental
3550 tim-hellhake/google-home-adapter

Uses your Google Home device to speak to you

23
Experimental
3551 celanthe/clarion

Your agents have things to say. Now they have a voice to say them with.

23
Experimental
3552 davealaw/kokoro-electron

Kokoro TTS GUI - a user-friendly Electron application for local neural...

23
Experimental
3553 itscooleric/yap

Local-first speech I/O stack — privacy-preserving transcription, synthesis,...

23
Experimental
3554 skystone011/migpt-tts-api

让小爱音箱「按需播报」,openclaw可以说话了——通过简单的 HTTP API 触发播报

23
Experimental
3555 Harsh-GitHup/voice-assistant

Voice Assistant App

23
Experimental
3556 schnorea/post_AWS_transcribe

AWS Transcribe does a reasonable job doing speach to text with speaker id...

23
Experimental
3557 OVOSHatchery/ovos-tts-plugin-voicerss

VoiceRSS TTS plugin for mycroft

23
Experimental
3558 EricBatlle/UnityAndroidNativeToolkit

🧰 Native Android functionalities for Unity in one unique plugin!

23
Experimental
3559 tahababou12/RECOG

The project involves developing an Android app that automatically captures...

23
Experimental
3560 svarlamov/aws-polly-node-typescript-demo

Demo of how to use AWS Polly text-to-speech in a web app using NodeJS,...

23
Experimental
3561 poretsky/freespeech

English text preprocessor for MBROLA speech synthesizer

23
Experimental
3562 seanghay/wav2vec2-khmer-openslr

Wav2Vec2 with OpenSLR 42 (Khmer language)

23
Experimental
3563 gunarakulangunaretnam/hate-speech-analysis-with-happytransformer

An artificial intelligence based tool for analyzing hate speeches in...

23
Experimental
3564 Kowalski1024/Virtual-Assistant

A voice command assistant service, it can recognise human speech, talk to...

23
Experimental
3565 Ali-Assare/auto-recaptcha

A Selenium Bot Written in Python for Automating Google reCAPTCHA Using IBM...

23
Experimental
3566 kaiidams/voice100-runtime

Voice100 runtime. Voice100 includes neural TTS/ASR models. Inference of...

23
Experimental
3567 TheMindhouse/memospeak

Memorize any text with voice recognition

23
Experimental
3568 conqueror62821/VoiceGPT

A Personal Assistant that uses LangChain + ChatGPT Whisper + Google TTS

23
Experimental
3569 yangr0/speakify

[ Speech to Text ]

23
Experimental
3570 dalmoon15/styletts2-dataset-toolkit

🎤 Streamline voice cloning with the StyleTTS2 Dataset Toolkit, a...

23
Experimental
3571 hackzilla/SpeechRecognition

A simple yet powerful SwiftUI app for iOS that demonstrates speech...

23
Experimental
3572 epfluegel/TalkMaths

A Vocola 2 (DNS) extension for creating and editing mathematics (in LaTeX)...

23
Experimental
3573 TheJoin95/pwa-audio-transcript

A speech to text PWA to transcript your whatsapp/telegram audio to text

23
Experimental
3574 huaxiaozhong1/Tensorflow-SparkFunEdge-FullLifeCycel-for-SequenceModel

An "AI on-device" project for sequence model. Based at Tensorflow Lite for...

23
Experimental
3575 bartbilliet/LiveTranslate.App

Generate translated subtitles for any audio source (Xamarin mobile app)

23
Experimental
3576 marcominerva/CustomCommands

A sample that shows how to integrate Custom Commands in a real system, with...

23
Experimental
3577 stitchng/adonis-infobip

An addon/plugin package to provide InfoBip single/bulk SMS/Voice services in...

23
Experimental
3578 xVc323/raspai

A voice assistant powered by Google’s Gemini AI, designed for Raspberry Pi....

23
Experimental
3579 oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation

This repository contains scripts to prune Wav2vec2 using a...

23
Experimental
3580 Soft/epub2audio

Tool for automatically converting EPub ebooks to audiobooks using TTS.

23
Experimental
3581 Taijul007/VieNeu-TTS

🎤 Generate realistic Vietnamese speech with VieNeu-TTS, an advanced...

23
Experimental
3582 BlankOnTheHub/Audiopub

🎧 Transform EPUBs into high-fidelity audiobooks locally with Audiopub, using...

23
Experimental
3583 KevinBonnoron/sirene

Self-hosted text-to-speech platform with multi-backend support, voice...

23
Experimental
3584 7rajatgupta/react-text-to-speech

react library using the speech syntesizer API to convert text to speech in real time

23
Experimental
3585 senigami/audiobook-studio

Professional local-first AI production pipeline for long-form narration....

23
Experimental
3586 jamie-bear/luminaudio

An open-source web frontend for generating and downloading high-quality,...

23
Experimental
3587 sarumaj/bing-wallpaper-changer

Fetch newest bing wallpaper and set it as background. Use NLP and...

23
Experimental
3588 MrWong99/Glyphoxa

AI-Powered Voice NPCs for Tabletop RPGs — real-time voice AI framework written in Go

23
Experimental
3589 x-phone/xbridge

Self-hosted voice gateway — WebSocket audio streaming and REST call control....

23
Experimental
3590 techmo-pl/tts-client

Techmo Text-To-Speech (TTS) gRPC client

23
Experimental
3591 thaispalmer/talkify-tts-api

Library to generate TTS directly from Talkify.net APIs

23
Experimental
3592 licavalentin/reddit-video-creator

✨📼Create Reddit Videos with JavaScript📼✨

23
Experimental
3593 paulfears/vbs

allows python to access visual basic functions including: text to speech,...

23
Experimental
3594 gtsopus/SoftEng-SoftDev2-UoI-Projects

University project for the "Software Engineering" course made in...

23
Experimental
3595 CodingWithEnjoy/Speech-To-Text-HTML-CSS-JS

متن به صدا | Text To Speech 😊🤩

23
Experimental
3596 Nikya/voicify

To generate spoken notification

23
Experimental
3597 mohaimenulislamshawon/text-to-voice-speech-converter

The program is created based on google text to speech or voice converter...

23
Experimental
3598 DevStranger/NoteWriter

NoteWriter - aplikacja do sporządzania notatek ze zdalnych spotkań

23
Experimental
3599 timkrebs/VoiceDetection

Speech Recognition implementation with MFCC and HMM

23
Experimental
3600 PezCoder/ai-chatbot

Bot who can listen & talk.

23
Experimental
« Prev 1 2 3 34 35 36 37 38 68 69 70 Next »