All Voice AI Tools

6,981 tools ranked by quality score · Page 61 of 70

Showing 6001–6100 of 6,981
# Tool Score Tier
6001 msalhab96/Listen-Attend-and-Spell

PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper

12
Experimental
6002 tuanio/conformer-rnnt

Conformer RNN-Transducer

12
Experimental
6003 zyascend/End-to-End-Speech-Recognition-Learning

ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别

12
Experimental
6004 upskyy/RNN-Transducer

PyTorch Implementation of RNN-Transducer

12
Experimental
6005 khaykingleb/automatic-speech-recognition

QuartzNet and DeepSpeech implementation for ASR

12
Experimental
6006 avrtt/MoE-speech-recognition

Mixture of experts architecture for speech-to-text and language...

12
Experimental
6007 yandex-cloud-examples/yc-speechkit-async-recognizer

SpeechKit Asynchronous Batch Recognizer.

12
Experimental
6008 markus-m-u-e-l-l-e-r/CTC.ISL

ISL Speech Recognition Toolkit for training neural networks with the CTC...

12
Experimental
6009 SrujanHR/Happy-AI-Voice-Assistant

Happy is a Python-based personal voice assistant for Windows. It responds to...

12
Experimental
6010 yehuohan/ln-asr

Automatic Speech Recognition

12
Experimental
6011 Omitg24/IIS-ASR

Repositorio para Administración de Sistemas y Redes (ASR), asignatura del...

12
Experimental
6012 subuhana2303/VaaniRakshak_Offline-Emergency-Voice-Assistant

VaaniRakshak is an offline voice assistant built for disaster scenarios,...

12
Experimental
6013 sofiahernandes/speech-sci-calculator

A smart scientific calculator app with speech recognition, built in Python...

12
Experimental
6014 AathifZahir/WhisprSplit

A powerful, local speech-to-text transcription system that combines OpenAI's...

12
Experimental
6015 DanteVela/Python-Voice-Assistant

A repository of a speech-driven virtual assistant powered by Speech...

12
Experimental
6016 Brooklyn-Dev/Ultron-AI

Voice-controlled AI gaming assistant for Marvel Rivals.

12
Experimental
6017 Manan-49/SRT-GENERATOR

Offline desktop application for generating accurate subtitles (SRT) from...

12
Experimental
6018 asiff00/TTS-Training-Blueprint

Intuitive understanding of Autoregressive TTS Models

12
Experimental
6019 brandonviaje/echo

voice assistant discord bot

12
Experimental
6020 Clats97/ClatScribe

ClatScribe is a speech-to-text tool that captures real-time audio,...

12
Experimental
6021 zayedalbloushi/AI-Transcription

Stream audio from the browser, transcribe it in real time, and get live...

12
Experimental
6022 msadeqsirjani/SubtitleGenerator

🎬 AI-powered subtitle generator using OpenAI Whisper. Multi-language...

12
Experimental
6023 tuannho0802/PDFvert-TextToSpeech

A web-based application for seamless PDF/DOCX conversion and natural...

12
Experimental
6024 MrFlapstaart/GameOCRTTS

Speak out text balloons in games without voice acting to use OCR on the...

12
Experimental
6025 taeefnajib/Aximos

Aximos is an innovative AI-powered tool that transforms your content into...

12
Experimental
6026 noAbbreviation/approxima

A command line program to loudly tell time (in chunks of 5 minutes).

12
Experimental
6027 LiZeC123/legado-tts-tencent

Tencent TTS for Legado Reader 基于腾讯语音合成API的Legado(开源阅读)TTS服务.

12
Experimental
6028 Aavache/pdf2speech

Reading PDF files and converting them to audio tracks.

12
Experimental
6029 10809104/taigi-speech-to-text

台語語音轉文字訓練資料集,資料來源:教育部《臺灣閩南語常用詞辭典》。

12
Experimental
6030 benda1989/qwen3-tts

qwen3-tts train multi-speaker emotion control

12
Experimental
6031 Prathuvj/spectrolingua

🎵 Audio Processing Studio - A comprehensive Django API with Streamlit...

12
Experimental
6032 alam025/AI-voice-assistant-with-RAG-powered-customer-support

Enterprise-grade AI voice assistant with RAG-powered customer support,...

12
Experimental
6033 PedritoGMG/GMG-FunMenu

Client-side commands for microphone interactions, sound effects, and more,...

12
Experimental
6034 shaikhsaif72/Jarvis-Voice-Assistant

A voice-activated virtual assistant using Python and OpenAI.

12
Experimental
6035 yigitaliayyildiz/SmartSEE

Android object detection app using YOLOv8 (TFLite) with Turkish TTS feedback.

12
Experimental
6036 AapseMatlb/Pickasso-Speech

Speech Interaction Subsystem for Pickasso Autonomous Robot Enables wake word...

12
Experimental
6037 Swathi-88/JARVIS-AI

A voice-controlled desktop AI assistant for Windows featuring OpenAI...

12
Experimental
6038 AbhaySingh71/Multimodal-Agentic-Assistant-Clara

Clara: An agentic multimodal AI assistant that can see through your webcam,...

12
Experimental
6039 isbendiyarovanezrin/SpeechDetection

Speech Detection 💬

12
Experimental
6040 masonintokyo/voicevox-srt-to-speak

VOICEVOX Engine APIを使ってSubRipファイルから各セリフ時間内に収まるように音声合成します。

12
Experimental
6041 Madh93/whisper

🎙️ My Whisper stuff

12
Experimental
6042 YoungloLee/tf2-speech-recognition-transformer

Tensorflow 2 Speech Recognition Code (Transformer)

12
Experimental
6043 jmrashed/ai-desktop-assistant

A Python-based AI desktop assistant designed to perform various tasks like...

12
Experimental
6044 dannis999/trained_SpeechRecognition

此项目用于备份一个完整的中文语音识别环境,包括环境配置和预训练模型,以方便直接使用

12
Experimental
6045 Masihtabaei/reswhis

A lightweight, WebSocket-based server for real-time, remote audio...

12
Experimental
6046 MSAbhishek22/Veronica_Chatbot

🤖 AI Chatbot with Voice Interface - A Flask web app featuring Groq-powered...

12
Experimental
6047 Hhhpraise/auto-subtitler

a python based app that generates subtitles , and can also be translated ,...

12
Experimental
6048 kevin30205/Media-Transcribe

Media Transcribe: Seamlessly generate transcripts from your video and audio...

12
Experimental
6049 wazeerc/voxie

Voxie, Let Your Notes Speak

12
Experimental
6050 parula-app/assistant

Parula - Digital assistant - Running entirely on your own device

12
Experimental
6051 CSFelix/audio-to-text

🔊 Extract Text from Audios 🔊

12
Experimental
6052 Renamekk/Voice-Assistant

A simple and customizable voice assistant written in Python. Supports adding...

12
Experimental
6053 Akshitha0118/Akshitha-Voice-AI-Voice-Powered-YouTube-Assistant

An AI-powered Voice Assistant built using Python and Streamlit that listens...

12
Experimental
6054 dudarev/speechdown

CLI tool to transcribe your spoken audio notes into timestamped,...

12
Experimental
6055 druellan/ED-AI-Companion

A Python script to monitor the Elite Dangerous journal files and provide...

12
Experimental
6056 GlobussBiogestion/text-to-signals-and-voice

This API works 100% in HTML with Javascipt so it is very light and easy to...

12
Experimental
6057 jetfontanilla/browser-text-to-speech

a demo of what a browser is currently capable of in text-to-speech

12
Experimental
6058 passion-27/openai-whisper-api

A sample speech transcription app implementing OpenAI Text to Speech API...

12
Experimental
6059 13shivam/yt-agent

Offline-friendly backend POC to transcribe YouTube videos and chat with...

12
Experimental
6060 Eng-M-Abdrabbou/Sonix

A high-speed speech processing engine that captures and converts spoken...

12
Experimental
6061 kbhujbal/J.A.R.V.I.S-AI-Assistant

🤖 Voice-controlled AI assistant with speech recognition, Wikipedia search,...

12
Experimental
6062 mavleo96/whisper-accent

Conditioning via Adaptive Layer Norm for accented speech recognition

12
Experimental
6063 5ekastanx/Text-To-Speech

This Django project allows converting text to audio files and saving...

12
Experimental
6064 Aavtic/ena

A video generation program using GIFS.

12
Experimental
6065 sruckh/VibeVoice-finetune-easy

Simplified scripts for fine-tuning VibeVoice speech synthesis models with...

12
Experimental
6066 gas/pronunza-tts-galego-onnx-colab

Caderno de Colab para síntese de voz (TTS) en galego usando o modelo ONNX de Celtia

12
Experimental
6067 nmanikiran/ionic-allinone

This is to give a demo of each feature that are there in ionic and ionic-native

12
Experimental
6068 tb0hdan/voiceplay

Client-side first music centered voice controlled player

12
Experimental
6069 zefie/multi-tts

Docker for multiple TTS Engines with a GRadio interface

12
Experimental
6070 Zuellni/XTTS-Server

XTTS Server for SillyTavern.

12
Experimental
6071 EllangoK/gpt-voice-companion

Small, simple chatbot using GPT and ElevenLabs TTS

12
Experimental
6072 vpakarinen2/text-voice-chatterbox

Text-to-speech and voice cloning using Chatterbox Turbo.

12
Experimental
6073 ExplainableML/ZerAuCap

[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning...

12
Experimental
6074 jmaczan/asr-dysarthria

Research on Automatic Speech Recognition for dysarthric speech

12
Experimental
6075 Temerold/TobsTTS

Text to speech, Python 3.7. Swedish and English. bye

12
Experimental
6076 SSusantAchary/AI_Resources

Have read and collected few Interesting Papers , Projects

12
Experimental
6077 ryanp3343/LiveScreenTranslator

LiveScreenTranslator utilizes OCR and translation services to provide...

12
Experimental
6078 Vidyut/vidyut-tts

Streamlit frontend for Coqui-tts

12
Experimental
6079 twers1/telegram-bot-audio

Telegram bot text-to-speech and speech-to-text

12
Experimental
6080 khaykingleb/research-playground

Efficient ML/DL implementations across multiple domains with K3s multi-node...

12
Experimental
6081 michaelmior/ha-silero

Text-to-speech for Home Assistant using Silero

12
Experimental
6082 CaydendW/Cashew

A python based virtual assistant

12
Experimental
6083 kuanyshbakytuly/camera-text-speech

Blind Text-Assistance

12
Experimental
6084 kunal2812/Programmophone

It is a tool to program with speech and is intended to be used by sightless...

12
Experimental
6085 Joyeah/videomaker

批量图片生成视频

12
Experimental
6086 lingualogic/speech-react

Speech-React SDK

12
Experimental
6087 TejasQ/react-praise

A React binding for Praise.

12
Experimental
6088 ponchotitlan/google_text-to-speech_prompt_maker

Utility for Google Text-To-Speech batch audio files generator. Ideal for...

12
Experimental
6089 willwade/TTS-Dataset

A workflow to create a dataset of all TTS voices/languages available on...

12
Experimental
6090 kaka-lin/rpi-voice-kit-app

Using app to control Voice Kit(smart speaker)

12
Experimental
6091 Rumeysakeskin/Speech-Datasets-for-ASR

Download speech datasets (English and non-English) for Automatic Speech Recognition

12
Experimental
6092 arjunbazinga/speak

Select any text and have it read out loud

12
Experimental
6093 OVOSHatchery/ovos-tts-plugin-responsivevoice

responsive voice TTS plugin for mycroft

12
Experimental
6094 koth/kokoro.cpp

kokoro tts in cpp

12
Experimental
6095 Erio-Harrison/kokorotts_service

A TTS service that deploys Kokoro model inference

12
Experimental
6096 robauto/bibli3.0

BiBli 3.0 for Raspberry Pi - Swarm Robotics and IoT Operating System - AI -...

12
Experimental
6097 Thukyd/OpenAI-Spechify-Your-Docs

OpenAI-Spechify-Your-Docs is a Python project that converts text from...

12
Experimental
6098 ReadieFur/Stream-Tools

A stream chat tool that features AWS text to speech, voice commands, chat...

12
Experimental
6099 zguesmi/image2speech

Ethereum ready Dapp to speak your images.

12
Experimental
6100 PeterTakahashi/openai-tts

OpenAI Text to Speech

12
Experimental
« Prev 1 2 3 59 60 61 62 63 68 69 70 Next »