All Voice AI Tools

6,981 tools ranked by quality score · Page 27 of 70

Showing 2601–2700 of 6,981
# Tool Score Tier
2601 jazzqi/openclaw-mimo-tts

OpenClaw TTS Provider for Xiaomi MiMo (mimo-v2-tts)

28
Experimental
2602 manhph2211/ML-Deployment

Pushing Deep Learning models into production using torchserve, kubernetes...

28
Experimental
2603 tuhinpal/text-to-speech

Text to Speech using Google's Library (Made for Fun)

28
Experimental
2604 nhut-ngnn/Voice-Based-Age-and-Gender-Recogniton

[ICTC'24] - "Voice-Based Age and Gender Recognition: A Comparative Study of...

28
Experimental
2605 Kaljurand/Diktofon

An Android app, a dictaphone with Estonian speech-to-text

28
Experimental
2606 ikarago/Talkinator

Talkinator is an easy to use text-to-speech-app for Windows 10-devices

28
Experimental
2607 xuchennlp/S2T

The project for speech translation

28
Experimental
2608 hikari-tadashi/Sapphire

A free and open source replacement for Google Assistant on Android devices,...

28
Experimental
2609 thotnd173389/SpeechCommand

The project aims to use keyword spotting streaming in a real-time offline...

28
Experimental
2610 mascotbot/elevenlabs-avatar

Open-source example for integrating ElevenLabs conversational AI with...

28
Experimental
2611 agentvoiceresponse/avr-asr-vosk

This repository provides a real-time speech-to-text transcription service...

28
Experimental
2612 Shashwat-Akhilesh-Shukla/Cognitive-AI

CognitiveAI is a production-grade conversational AI with persistent memory,...

28
Experimental
2613 koesan/Auto_Dubbing_And_Subtitle

Auto video dubbing and subtitle generation with AI-powered voice synthesis,...

28
Experimental
2614 kostas2370/Video-Creator

This project is to automate the video creation.

28
Experimental
2615 18F/tts-buy-datagov-technical-support-services

Solicitation documents for obtaining professional services to support Data.gov.

28
Experimental
2616 jaoafa/ChatWatcher

🗣 Discord voice-chat speech recognition

28
Experimental
2617 TharanaBope/whisper-v3-diarization

Production-ready audio transcription & speaker diarization CLI & GUI using...

28
Experimental
2618 alexykn/TorchTS

A modern text to speech frontend for Kokoro-82M

28
Experimental
2619 mehnoorsiddiqui/whatsapp-voice-transcriber

WhatsApp voice transcriber is an audio message transcriber app created with...

28
Experimental
2620 atomicoo/Tacotron2-PyTorch

PyTorch implementation of Tacotron-2. Tacotron-2 的 PyTorch 实现。

28
Experimental
2621 Syduan0921/Muliti-Role_Cosyvoice2

🤖一键部署,利用TTS与LLM将长文本小说转化为多角色音/视频。

28
Experimental
2622 18F/tts-buy-challengegov-ideation

Market research documents related to the Challenge.gov Ideation Platform.

28
Experimental
2623 GetProjectsIdea/Convert-Text-to-Speech-in-Python

Text to speech is a process to convert any text into voice. Text to speech...

28
Experimental
2624 candlewill/Ossian

Ossian: A simple language-independent Text-to-speech frontend

28
Experimental
2625 JTylerH/unifi-aihorn-dynamic-tts

This project hosts a lightweight Node.js web app that connects to your UniFi...

28
Experimental
2626 Garden-Tree/yomi-KAI

yomi-KAIはDiscordのテキストチャンネルに送られた文章をボイスチャンネルで読み上げるbotです。

28
Experimental
2627 Issac-Moses/liebea

AI voice-activated girlfriend assistant with wake word detection, speech...

28
Experimental
2628 BullShark/JSpeak

A Text to Speech Reader Front-end that Reads from the Clipboard and with...

28
Experimental
2629 alitahir4024/Text-To-Speach-Javascript

A creative project to give voice to your words.

28
Experimental
2630 SingAvi/SpeechToText

Simple python script to convert live speech or any audio file to text using...

28
Experimental
2631 BenLubar/espeak

Package espeak is a wrapper around espeak-ng that works both natively and in...

28
Experimental
2632 jim11662418/General_Instrument_CTS256_SP0256_Speech_Synthesizer

Vintage General Instrument Speech Synthesizer CTS256 with SP0256

28
Experimental
2633 JustinGOSSES/spoken-floodplain

Website that verbally tells users when they enter or leave a floodplain in...

28
Experimental
2634 ThetaOne-AI/HiKE

Hierarchical Korean-English Code-Switching Speech Recognition Benchmark...

28
Experimental
2635 zerospeech/benchmarks

A command line tool that helps use the "Zero Ressource Challenge" benchmarks

28
Experimental
2636 Lhx94As/Awesome-Spoken-Language-Identification

An awesome spoken LID repository. (Working in progress

28
Experimental
2637 SupernovifieD/FreeSpeechToText

A python program that extracts text from audio files - .mp3 or .wav - for free!

28
Experimental
2638 Aculeasis/rhvoice-proxy

High-level interface for RHVoice library

28
Experimental
2639 akshatg-721/JanSamvaad-ResolveOS

JanSamvaad ResolveOS — A voice-first AI governance system that converts...

28
Experimental
2640 Mokkapps/parents-soundboard

A soundboard developed for parents to be able to play often needed phrases like "No"

28
Experimental
2641 hwk06023/SONATA

SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced...

28
Experimental
2642 obtic-sorbonne/Toolbox-site

Pandore offers a set of tools that facilitate the most common corpus...

28
Experimental
2643 botbahlul/Live-Subtitle

ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free...

28
Experimental
2644 orianemartin/WhispGrid

A Whisper to TextGrid script that I use to automatize Corpus Annotation on...

28
Experimental
2645 HuuHuy227/XphoneBert_Vits2

VITS2 extended with XPhoneBERT encoder

28
Experimental
2646 mo7amedaliEbaid/run-tracker

A flutter run tracker app - clean architecture

28
Experimental
2647 nay-cat/LiveKit-PiperTTS-Plugin

Quick integration of Piper TTS (super lightweight, high-quality model) with LiveKit

28
Experimental
2648 Zuellni/LLaSA-WebUI

LLaSA WebUI using ExLlamaV2 and FastAPI.

28
Experimental
2649 shreyasnisal/SpeechProgrammer

The Speech Programmer writes code based on voice commands. Right now it only...

28
Experimental
2650 dcavar/ELAN2split

Split ELAN Annotation Files and corresponding speech files into a corpus...

28
Experimental
2651 madushan1000/voxcpm_rs

Rust (using burn) implementation of VoxCPM

28
Experimental
2652 golemfactory/g-flite

g-flite: flite app distributed over Golem Network

28
Experimental
2653 Fractionbeyondseam/soundpad-download-plus-subscription

Get Soundpad Download Plus on GitHub: a complete, high-performance toolkit...

28
Experimental
2654 revdotcom/revai-java-sdk

Rev.ai Java SDK

28
Experimental
2655 miuda-ai/sensevoice-cli

Tool for speech recognition using sensevoice-small

27
Experimental
2656 tabahi/Mel-Spectrum-Analyzer

Online web based mel-spectrum, power spectrum, FFT analyzer for speech and...

27
Experimental
2657 buddyeorl/deep-talk

Deep-speech react app to test trained models,to visualize the speech to text...

27
Experimental
2658 DragonDiffusionbyBoyo/Boyonodes

A set of Comfyui nodes

27
Experimental
2659 upskyy/Paper-Review

Paper Review about Speech Recognition · NLP

27
Experimental
2660 OpenVoiceOS/status

Open Voice OS Server Status Page

27
Experimental
2661 Drakonis96/whispad

WhisPad is a note management tool where you can write or dictate your notes...

27
Experimental
2662 Supremolink81/TTSCeleb

A TTS app where you can clone the voices of any person you wish.

27
Experimental
2663 luongnv89/voice-cast

Your words, any voice. Voice cloning and text-to-speech with multiple TTS...

27
Experimental
2664 ThisModernDay/f5-tts

F5-TTS is a web application that allows users to clone voices and generate...

27
Experimental
2665 savg92/voice-cloning

This project provides a comprehensive testing and comparison platform for...

27
Experimental
2666 techiaith/docker-marytts

Lleisiau synthetig cadwynedig Cymraeg gyda MaryTTS a Docker // Welsh...

27
Experimental
2667 FNBUBBLES420-ORG/Speech-to-Text-Application

🎙️ Welcome to the Speech to Text Application! 📝 This tool converts your...

27
Experimental
2668 yuyq96/pyshengyun

A Python converter for Chinese Pinyin and Shengyun (initials and finals)

27
Experimental
2669 otonomee/streamstem

Implements ML audio separation algorithm on audio from YouTube or Spotify...

27
Experimental
2670 botbahlul/Live-Subtitle-V2

ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free...

27
Experimental
2671 mmahdibarghi/finglish-dataset

Persian to Finglish dataset with all the sentences voice for TTS dataset...

27
Experimental
2672 brailcom/festival-freebsoft-utils

Festival extensions and utilities, focused on interaction with Speech Dispatcher

27
Experimental
2673 Pallas1303/FestPB

FestPB é um projeto com objetivo de oferecer suporte ao Português Brasileiro...

27
Experimental
2674 rezkyatinnov/capetangjs

A JavaScript library for text to speech vice versa using Web Speech API

27
Experimental
2675 De-Technocrats/simple-text-to-speech-javascript

Simple text to speech with javascript.

27
Experimental
2676 ArdaGnsrn/elevenlabs-js

This is an Open Source NodeJS package for ElevenLabs Text to Speech API.

27
Experimental
2677 spandan114/AI-realtime-voice-agent

A Python-based real-time voice-to-voice conversation system that lets you...

27
Experimental
2678 lxpio/omnigram

Omnigram is a Flutter-based file reader and audiobook . It accommodates ...

27
Experimental
2679 wzhd/vosk-rs

Cloud-free speech recognition. See https://fars.ee/F9-b.mp4

27
Experimental
2680 rsxdalv/bark-speaker-directory

Site for sharing Bark voices

27
Experimental
2681 adeepak7/Speech-To-Code

Speech To Code is Google Chrome Extension to convert Speech into Code.

27
Experimental
2682 Pzc-Neo/vue-web-reader

城墨网页小说朗读 ( Novel read aloud on web. )

27
Experimental
2683 kaiidams/Voice100AndroidApp

Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and...

27
Experimental
2684 jzmzhong/Automatic-Prosody-Annotator-with-SSWP-CLAP

An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).

27
Experimental
2685 KickerMix/Discord-Local-LLM-VoiceChat-Bot

Saya Voice Assistant for Discord AI voice bot: listens, detects keywords,...

27
Experimental
2686 hari-huynh/viVQA-voice-assistant

Voice assistant using Multimodal LLMs - LLaVA-NeXT (Mistral 7B) finetuned &...

27
Experimental
2687 kaiidams/Kokoro-Speech-Dataset

A public domain single speaker Japanese speech dataset

27
Experimental
2688 fengredrum/finetune-whisper-lora

Fine-Tune Whisper with Transformers and PEFT

27
Experimental
2689 stgloorious/stm32-speech-recognition

Speech Recognition using STM32 and Machine Learning

27
Experimental
2690 grammatek/simaromur

Icelandic TTS (text-to-speech) service for Android

27
Experimental
2691 MichaelGrafnetter/defender-asr-admx

Administrative Template (ADMX) for Microsoft Defender Attack Surface Reduction (ASR)

27
Experimental
2692 Mindinventory/AutoHighlightTTS

AutoHighlightTTS is a simple, powerful solution for Android Text to Speech,...

27
Experimental
2693 msjsc001/Anki-TTS-Edge

A modern text-to-speech tool powered by Microsoft Edge TTS. Creates Anki...

27
Experimental
2694 messiaen/full-lattice-search

Full Text Search Over Probabilistic Lattices with Elasticsearch!

27
Experimental
2695 nvmoyar/aind2-speech-recognition

Some approaches based on deep learning to build the acoustic model for an...

27
Experimental
2696 daveshap/keras_asr

ASR experiment using Google's Universal Sentence Encoder

27
Experimental
2697 KilianB/GoogleTranslatorTTS

Converts a string of text to mp3 files utilizing the google translator text...

27
Experimental
2698 berk76/words

Voice vocabulary :gb: :de: :fr: :es: :ru: :jp: :cn: ...

27
Experimental
2699 LuluW8071/VocalMind

Automatic Speech Recognition using Conformer with Speech Sentiment Analysis...

27
Experimental
2700 tuanio/nextformer

PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For...

27
Experimental
« Prev 1 2 3 25 26 27 28 29 68 69 70 Next »