All Voice AI Tools

6,981 tools ranked by quality score · Page 42 of 70

Showing 4101–4200 of 6,981
# Tool Score Tier
4101 prokhororlov/VoiceCraft

Book to MP3 converter. Convert e-books (FB2, EPUB, TXT) to MP3 audiobooks...

21
Experimental
4102 aiola-lab/aiola-js-sdk

The official JavaScript/TypeScript SDK for the aiOla API

21
Experimental
4103 naver/multilingual-distilwhisper

This repository contains all the code necessary for running the multilingual...

21
Experimental
4104 gongouveia/Whisper-Synthetic-ASR-Dataset-Generator

This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI...

21
Experimental
4105 dustland/talk

IELTS Speaking.

21
Experimental
4106 Revocalize/revocalize-python

The official Python API for Revocalize AI voice synthesizer platform.

21
Experimental
4107 sandeepmukku12/vocodine

🎙️ VocoDine: Book your table with your voice! Speak your booking details,...

21
Experimental
4108 SatyamPote/Ai-Video-Interviewer

An AI-powered mock interview platform that simulates a real-time video call...

21
Experimental
4109 dobby-seo/kosr

Transformer-based Korean speech recognition

21
Experimental
4110 1ytic/edit-distance-papers

A curated list of papers dedicated to edit-distance as objective function

21
Experimental
4111 Hariswar8018/Star-Wish-AI-Stories

Create stories with AI, view stories, and scan barcodes to know more...

21
Experimental
4112 Smorodov/kaldi_vosk_win_cmake

CMake-based Kaldi + Vosk + microphone speech recognition example

21
Experimental
4113 abdufelsayed/talkio

Talkio — TypeScript voice AI orchestration: STT + LLM + TTS with streaming,...

21
Experimental
4114 VARCOVoice/VARCOVoice_UNITYSDK

Official Unity SDK for VARCO Voice API. High-quality AI text-to-speech,...

21
Experimental
4115 Geguchh024/VocalizeMD

A VS Code extension that converts Markdown files to natural-sounding speech...

21
Experimental
4116 wq2012/VB_diarization

VB Diarization with Eigenvoice and HMM Priors, refactored

21
Experimental
4117 partrita/tts-kokoro-app

local app for Kokoro TTS.

21
Experimental
4118 BluShooz/text-to-video-generator

SOTA Text-to-Video Generator with MuseTalk 1.5, LivePortrait, and LTX-Video....

21
Experimental
4119 Kaljurand/Grammars

Grammatical Framework based speech recognition grammars for Estonian,...

21
Experimental
4120 FairyDevicesRD/mimi.client.kotlin

mimi(R) API Client for Kotlin

21
Experimental
4121 Yangyangii/Tacotron-pytorch

Tacotron implementation with pytorch 1.0

21
Experimental
4122 mklement0/speak.awf

An Alfred 3 workflow that uses macOS's TTS (text-to-speech) feature to speak...

21
Experimental
4123 bitgineer/Speakeasy

Privacy-first local voice-to-text using Whisper AI. Cross-platform desktop...

21
Experimental
4124 gikonyob/speake3

Speake3 library provides a wrapper around Espeak to easily write efficient...

21
Experimental
4125 funnyzak/xfyun-nls

Node module for iFlytek Cloud intelligent speech processing.

21
Experimental
4126 ntddk/transcibe

A script to transcribe audio files with Google Cloud Speech API.

21
Experimental
4127 NassimaOULDOUALI/Prosody-Control-French-TTS

An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control

21
Experimental
4128 WelkinYang/EMPHASIS-pytorch

EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System

21
Experimental
4129 ORI-Muchim/Grad-TTS

'Grad-TTS' with Multilingual Cleaners

21
Experimental
4130 grossstadtmann/elevenbatch

Batch creation of text-to-speech files with the Elevenlabs.io API.

21
Experimental
4131 cihanselim/python-codebyvoice

Talk for programming 📢 with Google speech recognition

21
Experimental
4132 tltrogl/diaremot2-on

DiaRemot2-ON: CPU-only audio intelligence pipeline (Faster-Whisper, ONNX,...

21
Experimental
4133 m15-ai/TrooperAI

Conversational AI, local, low-latency voice assistant for Raspberry Pi 5...

21
Experimental
4134 babua/TTSDatasetRecorder

A simple app for recording speech datasets.

21
Experimental
4135 QuantiusBenignus/Spoken

Joplin text notes and to-dos via OFFLINE speech recognition. To-do reminders...

21
Experimental
4136 mozilla-ai/speech-to-text

Blueprint by Mozilla.ai on how to transcribe audio files

21
Experimental
4137 FluxCapacitor2/whisper-asr-webapp

A web app for automatic speech recognition using OpenAI's Whisper model...

21
Experimental
4138 jik876/hifi-gan-demo

Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient...

21
Experimental
4139 dimitriStoidis/GenGAN

Repository for the paper: Generating gender-ambiguous voices for...

21
Experimental
4140 AndreaLombax/Speech_emotion_recognition

This work proposes a speech emotion recognition model based on the...

21
Experimental
4141 aliyzd95/modified-shemo

A modification of the Sharif Emotional Speech Database

21
Experimental
4142 PrashanthaTP/wav2mov

Speech to Facial Animation using GANs

21
Experimental
4143 Ashmit-Kumar/Assess-AI

End-to-end AI interview platform featuring live voice interaction, coding...

21
Experimental
4144 timf34/Article2Audio

Convert articles to audio using OpenAI's Text to Speech API via a python...

21
Experimental
4145 mvalancy/logitech_bcc950

A talking eyeball on a stick - Logitech BCC950 PTZ camera control scripts

21
Experimental
4146 Sharan-Kumar-R/Talk2Translate

The application uses SpeechRecognition, GoogleTranslator, and gTTS to...

21
Experimental
4147 taresh18/livekit-kokoro

Livekit TTS plugin for kokoro

21
Experimental
4148 transitive-bullshit/unrealspeech-api

TypeScript client for the Unreal Speech TTS API.

21
Experimental
4149 SpringerNLP/Chapter12

Chapter 12: End-to-end Speech Recognition

21
Experimental
4150 danvers/medienpaed-asr

Understanding ASR

21
Experimental
4151 RakeshBabuGajula/real-time-voice-translator

A real-time voice translator web app built with Streamlit that captures live...

21
Experimental
4152 jaju/voissistant

Voiss Aceistant - Apple only, with mlx.

21
Experimental
4153 PanosAntoniadis/slp-ntua

Lab exercises of Speech and Language Processing course in NTUA

21
Experimental
4154 microsoft/MunTTS-A-Text-to-Speech-System-For-Mundari

Official Codebase for "MunTTS: A Text-to-Speech System for Mundari"...

21
Experimental
4155 xulihang/Silhouette

An open source computer-aided translation tool for audios and videos

21
Experimental
4156 Mmesek/mUSh

Ultrastar Songs Creation/Management helper utils.

21
Experimental
4157 IAMJOYBO/index-tts

Automatically builds Docker images and uploads them to Alibaba Cloud

21
Experimental
4158 speaking-portal-project-team-a/The-Speaking-Portal-Project

The objective of the Speaking Portal Project is to design, develop, and...

21
Experimental
4159 burntcarrot/quackspeak

Text-to-speech using ducks. 🦆

21
Experimental
4160 lifeCoder123/Speech-to-Text-Converter

Speech-to-text converter tool using Google Speech Cloud API to convert...

21
Experimental
4161 aminul-huq/Speech-Command-Classification

Speech command classification on Speech-Command v0.02 dataset using PyTorch...

21
Experimental
4162 narVidhai/Speech-Transcription-Benchmarking

Example python scripts to evaluate various ASR methods

21
Experimental
4163 malob/article-to-audio-cloud-function

Google Cloud Function that takes a url, converts the article at that url to...

21
Experimental
4164 KuchikiRenji/vall-e

Unofficial PyTorch implementation of VALL-E: zero-shot text-to-speech and...

21
Experimental
4165 popcornell/MicRank

MicRank is a Learning to Rank neural channel selection framework where a DNN...

21
Experimental
4166 anicolson/matlab_feat

Functions for creating speech features in MATLAB.

21
Experimental
4167 mvshyvk/KaldiService

Service for easy access to speech recognition capabilities of Kaldi using...

21
Experimental
4168 George0828Zhang/simulst

PyTorch toolkit for streaming speech recognition, speech translation and...

21
Experimental
4169 FarawaySail/Kaldi_thchs30

Speech recognition course project for "Media and Cognition"

21
Experimental
4170 meichthys/sword_drill

Displays Bible verses from parsed microphone input.

21
Experimental
4171 JunhoKim94/ASR_project

This repository was created for the NHN ASR hackathon competition.

21
Experimental
4172 german-asr/nvidia-jasper-german

Scripts for training NVIDIA Jasper for German Speech Recognition (ASR).

21
Experimental
4173 loryanstrant/HA-ElevenLabs-Custom-TTS

An ElevenLabs TTS integration for Home Assistant that allows for creation of...

21
Experimental
4174 NetherQuartz/TextForSpeechNormalizer

A Python library to accentuate Russian text

21
Experimental
4175 Rajvardhman05/openwhisper-app

Free, open-source voice-to-text for macOS — 100% local, offline...

21
Experimental
4176 davidsuragan/issai-playground

A Python toolkit for accessing ISSAI’s AI services — Oylan (LLM), Soyle...

21
Experimental
4177 zvadaadam/speech-recognition

End-to-End Speech Recognition with TensorFlow

21
Experimental
4178 TeaPoly/cat_tensorflow

CRF-based ASR toolkit implemented in TensorFlow

21
Experimental
4179 TeaPoly/warp-ctc-crf

An extension of thu-spmi/CAT which contains a full-fledged implementation of...

21
Experimental
4180 upskyy/Automatic-Speech-Recognition-Models

End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

21
Experimental
4181 Kimosabey/vox-agent-neural

Neural Voice Agent core constructs for conversational AI.

21
Experimental
4182 yinruiqing/tiny-transducer

Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices

21
Experimental
4183 sunprinceS/MetaASR-CrossAccent

Meta-Learning for End-to-End ASR

21
Experimental
4184 duc11021102/pyspeech

Python Text To Speech Using gTTS @duc11021102

21
Experimental
4185 ActiveIntelligentSystemsLab/japanese_tts_ros

ROS node that outputs Japanese text as speech

21
Experimental
4186 derpeloper/ostinato

giving a voice to the voiceless.

21
Experimental
4187 lgpearson1771/openwakeword-trainer

Train custom wake word models with openWakeWord. A granular 13-step pipeline...

21
Experimental
4188 vijethph/violet-speech

Violet is a Speech Assistant made using Python

21
Experimental
4189 led-mirage/AivoClip

An app that uses A.I.VOICE to read aloud text copied to the clipboard.

21
Experimental
4190 2tocom/F5-TTS-Vietnamese-Google-Colab

Vietnamese TTS, convert text to Vietnamese speech, text to speech...

21
Experimental
4191 AssemblyAI/assemblyai-ruby-sdk

The AssemblyAI Ruby SDK provides an easy-to-use interface for interacting...

21
Experimental
4192 emmanuelinfante/SubtitlesEveryone

Transcribe Like a Pro, Without Paying a Penny!

21
Experimental
4193 junhoeKu/Jeju-Translation

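Project to build a bidirectional speech translation model between the Jeju language and standard Korean (algorithms | unstructured data | NLP | deep learning | machine translation | speech recognition | multimodal)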

21
Experimental
4194 BitsofJeremy/WeirDing

Audiobook narration engine powered by Qwen3-TTS. Upload documents, pick a...

21
Experimental
4195 Vatis-Tech/asr-client-js

JavaScript SDK client for Vatis Tech ASR services.

21
Experimental
4196 AssemblyAI/assemblyai-semantic-kernel

Transcribe audio using AssemblyAI with Semantic Kernel plugins.

21
Experimental
4197 marcogenna/epub2audiobook

Convert EPUB books to M4B audiobooks with AI-powered TTS (Edge TTS, Kokoro, Piper)

21
Experimental
4198 LucaAngioloni/Micchinetta

HCI project: an application interface using both face and speech recognition...

21
Experimental
4199 JoshuaCarroll/RepeaterProgrammingUtility

N5JLC Repeater Programming Utility

21
Experimental
4200 Listening-Lab/Annotator

Listening Lab audio analysis and annotation tool. Develop audio...

21
Experimental