All Voice AI Tools
6,981 tools ranked by quality score · Page 27 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 2601 | jazzqi/openclaw-mimo-tts: OpenClaw TTS Provider for Xiaomi MiMo (mimo-v2-tts) | | Experimental |
| 2602 | manhph2211/ML-Deployment: Pushing Deep Learning models into production using torchserve, kubernetes... | | Experimental |
| 2603 | tuhinpal/text-to-speech: Text to Speech using Google's Library (Made for Fun) | | Experimental |
| 2604 | nhut-ngnn/Voice-Based-Age-and-Gender-Recogniton: [ICTC'24] - "Voice-Based Age and Gender Recognition: A Comparative Study of... | | Experimental |
| 2605 | Kaljurand/Diktofon: An Android app, a dictaphone with Estonian speech-to-text | | Experimental |
| 2606 | ikarago/Talkinator: Talkinator is an easy-to-use text-to-speech app for Windows 10 devices | | Experimental |
| 2607 | xuchennlp/S2T: The project for speech translation | | Experimental |
| 2608 | hikari-tadashi/Sapphire: A free and open source replacement for Google Assistant on Android devices,... | | Experimental |
| 2609 | thotnd173389/SpeechCommand: The project aims to use keyword spotting streaming in a real-time offline... | | Experimental |
| 2610 | mascotbot/elevenlabs-avatar: Open-source example for integrating ElevenLabs conversational AI with... | | Experimental |
| 2611 | agentvoiceresponse/avr-asr-vosk: This repository provides a real-time speech-to-text transcription service... | | Experimental |
| 2612 | Shashwat-Akhilesh-Shukla/Cognitive-AI: CognitiveAI is a production-grade conversational AI with persistent memory,... | | Experimental |
| 2613 | koesan/Auto_Dubbing_And_Subtitle: Auto video dubbing and subtitle generation with AI-powered voice synthesis,... | | Experimental |
| 2614 | kostas2370/Video-Creator: This project automates video creation. | | Experimental |
| 2615 | 18F/tts-buy-datagov-technical-support-services: Solicitation documents for obtaining professional services to support Data.gov. | | Experimental |
| 2616 | jaoafa/ChatWatcher: 🗣 Discord voice-chat speech recognition | | Experimental |
| 2617 | TharanaBope/whisper-v3-diarization: Production-ready audio transcription & speaker diarization CLI & GUI using... | | Experimental |
| 2618 | alexykn/TorchTS: A modern text to speech frontend for Kokoro-82M | | Experimental |
| 2619 | mehnoorsiddiqui/whatsapp-voice-transcriber: WhatsApp voice transcriber is an audio message transcriber app created with... | | Experimental |
| 2620 | atomicoo/Tacotron2-PyTorch: PyTorch implementation of Tacotron-2. | | Experimental |
| 2621 | Syduan0921/Muliti-Role_Cosyvoice2: 🤖 One-click deployment that uses TTS and an LLM to turn long-form novels into multi-character audio/video. | | Experimental |
| 2622 | 18F/tts-buy-challengegov-ideation: Market research documents related to the Challenge.gov Ideation Platform. | | Experimental |
| 2623 | GetProjectsIdea/Convert-Text-to-Speech-in-Python: Text to speech is a process to convert any text into voice. Text to speech... | | Experimental |
| 2624 | candlewill/Ossian: Ossian: A simple language-independent Text-to-speech frontend | | Experimental |
| 2625 | JTylerH/unifi-aihorn-dynamic-tts: This project hosts a lightweight Node.js web app that connects to your UniFi... | | Experimental |
| 2626 | Garden-Tree/yomi-KAI: yomi-KAI is a bot that reads text sent to a Discord text channel aloud in a voice channel. | | Experimental |
| 2627 | Issac-Moses/liebea: AI voice-activated girlfriend assistant with wake word detection, speech... | | Experimental |
| 2628 | BullShark/JSpeak: A Text to Speech Reader Front-end that Reads from the Clipboard and with... | | Experimental |
| 2629 | alitahir4024/Text-To-Speach-Javascript: A creative project to give voice to your words. | | Experimental |
| 2630 | SingAvi/SpeechToText: Simple Python script to convert live speech or any audio file to text using... | | Experimental |
| 2631 | BenLubar/espeak: Package espeak is a wrapper around espeak-ng that works both natively and in... | | Experimental |
| 2632 | jim11662418/General_Instrument_CTS256_SP0256_Speech_Synthesizer: Vintage General Instrument Speech Synthesizer CTS256 with SP0256 | | Experimental |
| 2633 | JustinGOSSES/spoken-floodplain: Website that verbally tells users when they enter or leave a floodplain in... | | Experimental |
| 2634 | ThetaOne-AI/HiKE: Hierarchical Korean-English Code-Switching Speech Recognition Benchmark... | | Experimental |
| 2635 | zerospeech/benchmarks: A command line tool that helps use the "Zero Resource Challenge" benchmarks | | Experimental |
| 2636 | Lhx94As/Awesome-Spoken-Language-Identification: An awesome spoken LID repository. (Work in progress) | | Experimental |
| 2637 | SupernovifieD/FreeSpeechToText: A Python program that extracts text from audio files - .mp3 or .wav - for free! | | Experimental |
| 2638 | Aculeasis/rhvoice-proxy: High-level interface for RHVoice library | | Experimental |
| 2639 | akshatg-721/JanSamvaad-ResolveOS: JanSamvaad ResolveOS: A voice-first AI governance system that converts... | | Experimental |
| 2640 | Mokkapps/parents-soundboard: A soundboard developed for parents to be able to play often needed phrases like "No" | | Experimental |
| 2641 | hwk06023/SONATA: SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced... | | Experimental |
| 2642 | obtic-sorbonne/Toolbox-site: Pandore offers a set of tools that facilitate the most common corpus... | | Experimental |
| 2643 | botbahlul/Live-Subtitle: ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free... | | Experimental |
| 2644 | orianemartin/WhispGrid: A Whisper to TextGrid script that I use to automate Corpus Annotation on... | | Experimental |
| 2645 | HuuHuy227/XphoneBert_Vits2: VITS2 extended with XPhoneBERT encoder | | Experimental |
| 2646 | mo7amedaliEbaid/run-tracker: A Flutter run tracker app - clean architecture | | Experimental |
| 2647 | nay-cat/LiveKit-PiperTTS-Plugin: Quick integration of Piper TTS (super lightweight, high-quality model) with LiveKit | | Experimental |
| 2648 | Zuellni/LLaSA-WebUI: LLaSA WebUI using ExLlamaV2 and FastAPI. | | Experimental |
| 2649 | shreyasnisal/SpeechProgrammer: The Speech Programmer writes code based on voice commands. Right now it only... | | Experimental |
| 2650 | dcavar/ELAN2split: Split ELAN Annotation Files and corresponding speech files into a corpus... | | Experimental |
| 2651 | madushan1000/voxcpm_rs: Rust (using burn) implementation of VoxCPM | | Experimental |
| 2652 | golemfactory/g-flite: g-flite: flite app distributed over Golem Network | | Experimental |
| 2653 | Fractionbeyondseam/soundpad-download-plus-subscription: Get Soundpad Download Plus on GitHub: a complete, high-performance toolkit... | | Experimental |
| 2654 | revdotcom/revai-java-sdk: Rev.ai Java SDK | | Experimental |
| 2655 | miuda-ai/sensevoice-cli: Tool for speech recognition using sensevoice-small | | Experimental |
| 2656 | tabahi/Mel-Spectrum-Analyzer: Online web based mel-spectrum, power spectrum, FFT analyzer for speech and... | | Experimental |
| 2657 | buddyeorl/deep-talk: Deep-speech React app to test trained models, to visualize the speech to text... | | Experimental |
| 2658 | DragonDiffusionbyBoyo/Boyonodes: A set of ComfyUI nodes | | Experimental |
| 2659 | upskyy/Paper-Review: Paper Review about Speech Recognition · NLP | | Experimental |
| 2660 | OpenVoiceOS/status: Open Voice OS Server Status Page | | Experimental |
| 2661 | Drakonis96/whispad: WhisPad is a note management tool where you can write or dictate your notes... | | Experimental |
| 2662 | Supremolink81/TTSCeleb: A TTS app where you can clone the voices of any person you wish. | | Experimental |
| 2663 | luongnv89/voice-cast: Your words, any voice. Voice cloning and text-to-speech with multiple TTS... | | Experimental |
| 2664 | ThisModernDay/f5-tts: F5-TTS is a web application that allows users to clone voices and generate... | | Experimental |
| 2665 | savg92/voice-cloning: This project provides a comprehensive testing and comparison platform for... | | Experimental |
| 2666 | techiaith/docker-marytts: Welsh concatenative synthetic voices with MaryTTS and Docker // Welsh... | | Experimental |
| 2667 | FNBUBBLES420-ORG/Speech-to-Text-Application: 🎙️ Welcome to the Speech to Text Application! 📝 This tool converts your... | | Experimental |
| 2668 | yuyq96/pyshengyun: A Python converter for Chinese Pinyin and Shengyun (initials and finals) | | Experimental |
| 2669 | otonomee/streamstem: Implements ML audio separation algorithm on audio from YouTube or Spotify... | | Experimental |
| 2670 | botbahlul/Live-Subtitle-V2: ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free... | | Experimental |
| 2671 | mmahdibarghi/finglish-dataset: Persian to Finglish dataset with all the sentences voice for TTS dataset... | | Experimental |
| 2672 | brailcom/festival-freebsoft-utils: Festival extensions and utilities, focused on interaction with Speech Dispatcher | | Experimental |
| 2673 | Pallas1303/FestPB: FestPB is a project that aims to provide support for Brazilian Portuguese... | | Experimental |
| 2674 | rezkyatinnov/capetangjs: A JavaScript library for text to speech and vice versa using the Web Speech API | | Experimental |
| 2675 | De-Technocrats/simple-text-to-speech-javascript: Simple text to speech with JavaScript. | | Experimental |
| 2676 | ArdaGnsrn/elevenlabs-js: This is an Open Source NodeJS package for ElevenLabs Text to Speech API. | | Experimental |
| 2677 | spandan114/AI-realtime-voice-agent: A Python-based real-time voice-to-voice conversation system that lets you... | | Experimental |
| 2678 | lxpio/omnigram: Omnigram is a Flutter-based file reader and audiobook. It accommodates ... | | Experimental |
| 2679 | wzhd/vosk-rs: Cloud-free speech recognition. See https://fars.ee/F9-b.mp4 | | Experimental |
| 2680 | rsxdalv/bark-speaker-directory: Site for sharing Bark voices | | Experimental |
| 2681 | adeepak7/Speech-To-Code: Speech To Code is a Google Chrome extension to convert speech into code. | | Experimental |
| 2682 | Pzc-Neo/vue-web-reader: 城墨 web novel reader (reads novels aloud on the web) | | Experimental |
| 2683 | kaiidams/Voice100AndroidApp: Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and... | | Experimental |
| 2684 | jzmzhong/Automatic-Prosody-Annotator-with-SSWP-CLAP: An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS). | | Experimental |
| 2685 | KickerMix/Discord-Local-LLM-VoiceChat-Bot: Saya Voice Assistant for Discord AI voice bot: listens, detects keywords,... | | Experimental |
| 2686 | hari-huynh/viVQA-voice-assistant: Voice assistant using Multimodal LLMs - LLaVA-NeXT (Mistral 7B) finetuned &... | | Experimental |
| 2687 | kaiidams/Kokoro-Speech-Dataset: A public domain single speaker Japanese speech dataset | | Experimental |
| 2688 | fengredrum/finetune-whisper-lora: Fine-Tune Whisper with Transformers and PEFT | | Experimental |
| 2689 | stgloorious/stm32-speech-recognition: Speech Recognition using STM32 and Machine Learning | | Experimental |
| 2690 | grammatek/simaromur: Icelandic TTS (text-to-speech) service for Android | | Experimental |
| 2691 | MichaelGrafnetter/defender-asr-admx: Administrative Template (ADMX) for Microsoft Defender Attack Surface Reduction (ASR) | | Experimental |
| 2692 | Mindinventory/AutoHighlightTTS: AutoHighlightTTS is a simple, powerful solution for Android Text to Speech,... | | Experimental |
| 2693 | msjsc001/Anki-TTS-Edge: A modern text-to-speech tool powered by Microsoft Edge TTS. Creates Anki... | | Experimental |
| 2694 | messiaen/full-lattice-search: Full Text Search Over Probabilistic Lattices with Elasticsearch! | | Experimental |
| 2695 | nvmoyar/aind2-speech-recognition: Some approaches based on deep learning to build the acoustic model for an... | | Experimental |
| 2696 | daveshap/keras_asr: ASR experiment using Google's Universal Sentence Encoder | | Experimental |
| 2697 | KilianB/GoogleTranslatorTTS: Converts a string of text to mp3 files utilizing the google translator text... | | Experimental |
| 2698 | berk76/words: Voice vocabulary :gb: :de: :fr: :es: :ru: :jp: :cn: ... | | Experimental |
| 2699 | LuluW8071/VocalMind: Automatic Speech Recognition using Conformer with Speech Sentiment Analysis... | | Experimental |
| 2700 | tuanio/nextformer: PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For... | | Experimental |