All Voice AI Tools

6,981 tools ranked by quality score · Page 31 of 70

Showing 3001–3100 of 6,981
# Tool Score Tier
3001 AndroidWithRossyn/AIVoiceTextTranslator

🌎 If you are looking for a translator that quickly and accurately translate...

25
Experimental
3002 gunarakulangunaretnam/ai-customer-analyzer

An artificial intelligence system that analyzes customers using computer...

25
Experimental
3003 quentinmay/discord-voice-assistant

Discord Voice Assistant is a Discord bot built using discord.js and Python...

25
Experimental
3004 KiLJ4EdeN/Persian_Speech_To_Text

Simple speech-to-text prototype using the Google API

25
Experimental
3005 aiy-voice-assistant/hungry-student-app

Food management home assistant app for Google Assistant or Cloud...

25
Experimental
3006 codejs-kr/stt.js

Speech To Text library for browser 🎤

25
Experimental
3007 headlessripper/NectarSTT

NectarSTT (Nectar Speech To Text) is a Python-based speech recognition...

25
Experimental
3008 yousefhany77/tts-ai

The Text-to-Speech Library provides a simple unified interface for...

25
Experimental
3009 answersolutionsapps/runandread-android

Ultimate Text-to-Speech and Audiobook Player for Android

25
Experimental
3010 korniichuk/google-speech

QuickStart. Google Cloud Speech-to-Text API with Python

25
Experimental
3011 Ashutosh-kv/Karen-Mk-1-

A personal assistant written in python for fun!

25
Experimental
3012 ThaaoBlues/Blue

An open-source voice assistant for Windows and Linux. Made to be upgraded...

25
Experimental
3013 Medvedu/Yandex-Speech-API

Text-to-speech translation. Supports the following languages: English, Turkish,...

25
Experimental
3014 JN513/Ana

Assistant built in Python using Speech_recognition and Google APIs

25
Experimental
3015 Miihir79/Messaging_app

This is an advanced messaging app which has smart login options, smart...

25
Experimental
3016 karkranikhil/voice-notes

Voice Note taking app using Svelte.

25
Experimental
3017 Forne/ha-yandexcloudtts

Yandex.Cloud SpeechKit for Home Assistant

25
Experimental
3018 mecparts/CTS256-exceptions

An exception-word EPROM generator for the CTS25AL2 text-to-speech IC.

25
Experimental
3019 cyrta/broadcast-news-videos-dataset

Collection of broadcast news video clips

25
Experimental
3020 saikrishnarallabandi/Festival-Speech-Synthesis-System

This repo contains current changes I am making to Festival and Clustergen

25
Experimental
3021 UserBeingOfficial/ai-dictionary-koreader

📖 Enhance your reading experience with AI Dictionary, a KOReader plugin that...

25
Experimental
3022 Vicopem01/srttossml

Using AWS Polly requires SSML files for a better optimised text to speech...

25
Experimental
3023 IIP-Sogang/olkavs-avspeech

The Introduction of the OLKAVS Dataset

25
Experimental
3024 ErnestAroozoo/GPT-Discord-Chatbot

Discord chatbot powered by OpenAI and ElevenLabs that enables natural and...

25
Experimental
3025 moxeeem/ASR-pronunciation-correction

This project presents a system for automatic pronunciation correction...

25
Experimental
3026 Chelsea486MHz/debat-politique-ia

Automatic generation of political debates by AI. Audio + video.

25
Experimental
3027 thanhtvt/conformer

TensorFlow implementation of Conformer - Transformer-based model for Speech...

25
Experimental
3028 will-rice/conformer-ctc

TensorFlow 2.0 Implementation of Conformer CTC

25
Experimental
3029 aalto-speech/speechbrain-cl

Implementation of different curriculum learning (CL) methods for...

25
Experimental
3030 PigeonDan1/ps-slm

TASU: A New Style of Alignment of Speech LLM with only Text Training Data,...

25
Experimental
3031 csikasote/bigc

This repository contains the data resources for the LacunaFund supported...

25
Experimental
3032 mohdali/Arabic-Phonetic-Dictionary

Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition...

25
Experimental
3033 stefantaubert/tts-mos-test-mturk

Command-line interface (CLI) and Python library to evaluate text-to-speech...

25
Experimental
3034 kemsta/macloop

https://pypi.org/project/macloop/

25
Experimental
3035 riedemannai/parakeet-mlx-server

OpenAI-compatible FastAPI server for German neurology and neuro-oncology...

25
Experimental
3036 ilyaizen/CopySpeak

🔊 CopySpeak – A lightweight tool for quick AI text-to-speech

25
Experimental
3037 fardinsabid/NeuTTS-Studio

On-Device Text-to-Speech · Voice Cloning · Real-Time Streaming

25
Experimental
3038 pranayjoshi/speech_to_text

This is a speech_to_text script by Pranay Joshi

25
Experimental
3039 CoffreLv/ASR_CNN_CTC

Building a CNN+CTC-based speech recognition system from scratch.

25
Experimental
3040 m1n1v1rus/futuristic-calculator

A futuristic, AI-powered advanced calculator with voice control, graph...

25
Experimental
3041 nikkiw/realtime_translator

Python tool for real-time voice recognition and multilingual translation

25
Experimental
3042 zobayerdev/Convertix_QR_PDF_Maker_Scanner

This application provides QR code creation, barcode creation, PDF maker & scanner...

25
Experimental
3043 lpkpaco/Bocchi-The-Rock-GPT-SoVITS-Models

Contains voice models based on the GPT-SoVITS architecture of different...

25
Experimental
3044 aiyu-ayaan/tts-engine

The TTS-Engine is a simple and efficient library that provides...

25
Experimental
3045 alfianlosari/flutter_cloud_text_to_speech

Flutter project that uses the Google Cloud Text to Speech API to synthesize...

25
Experimental
3046 KinglittleQ/Tacotron

An implementation of Tacotron with PyTorch 0.4

25
Experimental
3047 lifeiteng/NotebookTTS

Text-To-Speech for NotebookLM

25
Experimental
3048 97jamie/public-police-footage

Code for Constructing Datasets From Public Police Body Camera Footage (ICASSP 2025)

25
Experimental
3049 hesamhadadi/realtime-speech-to-text

Realtime speech-to-text library for Persian and English built with TypeScript

25
Experimental
3050 alienx5499/MoodLyft-Mirror-RealTime-Emotion-Analyzer

🎯 An AI-powered tool that analyzes your webcam feed in real-time, delivering...

25
Experimental
3051 idiap/FiniteStateTransducers.jl

Play with Weighted Finite State Transducers (WFST) in the Julia language.

25
Experimental
3052 ynop/spych

Scripts/Tools used for working with automatic speech recognition.

25
Experimental
3053 debelopumento/phaser-test

A voice controlled runner game for Chrome

25
Experimental
3054 vahnxu/doubao-asr

Agent Skill: Transcribe audio files via ByteDance Volcengine Seed-ASR 2.0...

25
Experimental
3055 TheVoxProject/calcvox

Accessible and open-source talking calculator for everyone.

25
Experimental
3056 edholmes2232/Speech2Touch

Voice Input to USB HID Output. Based on STM32WB55

25
Experimental
3057 linagora-labs/asr_benchmark

Toolkit to benchmark various speech recognition APIs (NeMo, Whisper...) and...

25
Experimental
3058 anilkeshwani/speech-text-alignment

Functionality for speech data processing including time alignment, encoding...

25
Experimental
3059 lmk123/cvox

Get spoken alerts when Claude Code needs permission or finishes a task — so...

25
Experimental
3060 rizkyyanuark/PrediksiDepression-DataSpeech

This repository contains a project for early detection of depression using MFCC and CNN...

25
Experimental
3061 symbiont-ai/docent

Docent — Human-AI Symbiotic Loop from Research to Understanding

25
Experimental
3062 wbrisett/linguatrain

A modular recall-based language training engine powered by structured data packs.

25
Experimental
3063 ZoraizQ/urdu-speech-recognition

Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs...

25
Experimental
3064 1935417243/GrillMind

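智面 · AI technical interview simulator: upload your résumé, choose a position, and the AI interviewer comes online immediately. Supports text chat and real-time voice calls, and automatically generates an evaluation report when the interview ends.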

25
Experimental
3065 SagarBiswas-MultiHAT/Speech2Speech-AIAssistant

Speech2Speech-AIAssistant: a lightweight, offline-capable voice assistant...

25
Experimental
3066 6Morpheus6/alltalk-tts

[NVIDIA ONLY] AllTalk-TTS is a unified UI for F5-TTS, XTTS, Vite TTS, Piper...

25
Experimental
3067 ShoYamanishi/AndroidMFCC

26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON...

25
Experimental
3068 royangkr/BabyReady

CNN to predict the reason why a baby is crying

25
Experimental
3069 DavidBradbury/tts-assistant

TTS Assistant: A front-end app utilizing OpenAI's TTS API. Easily input text...

25
Experimental
3070 KernelOverseer/caLLMe

Realtime voice conversation with llm models using an asynchronous Voice to...

25
Experimental
3071 YizheZhang-Ervin/AI_FinTech

Artificial Intelligence (React+Flask RESTful+Sqlite+Antd+Echarts)

25
Experimental
3072 praweshd/speech_emotion_recognition

In this project, the performance of speech emotion recognition is compared...

25
Experimental
3073 kaiidams/voice100

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost...

25
Experimental
3074 super13/tensorflow-speech-recognition-pai

Speech recognition using TensorFlow on Aliyun PAI.

25
Experimental
3075 ace19-dev/tensorflow-speech-recognition-challenge

Kaggle Competitions: TensorFlow Speech Recognition Challenge

25
Experimental
3076 analyticsinmotion/micstream

Cross-platform microphone audio capture for Node.js with pre-built...

25
Experimental
3077 Goblincomet/digitaltwin

Using a single image and just 10 seconds of sample audio, our project...

25
Experimental
3078 diewland/ttsstt-th-demo

Web Speech API Thai demo, Speech-to-text & Text-to-speech

25
Experimental
3079 tomaarsen/TTSTextNormalization

Convert English text from written expressions into spoken forms

25
Experimental
3080 dsrivastavv/Android-Continuous-SpeechRecognition

Code to continuously detect spoken language and convert to text using Google...

25
Experimental
3081 Snesnopic/Morser

SwiftUI recreation of my UIKit Morse Code experiment

25
Experimental
3082 SnappsiSnappes/Jarvis-free-bingGPT-voice-assistant

Voice assistant: chat with bingGPT / Bard (in Russian) / ChatGPT 3.5 for...

25
Experimental
3083 JuJu2181/Automatic-Nepali-Speech-Recognition-and-Summarizer

A system capable of converting Nepali speech to text and generating a summary of the text

25
Experimental
3084 yandex-cloud-examples/yc-speechkit-web-ui

SpeechKit Web UI Example

25
Experimental
3085 arthurfortes/speech2text_keras

This repository reports how to build a speech to text model to recognize...

25
Experimental
3086 omatheusribeiro/facial-recognition

The Facial Recognition project is a powerful system designed to detect...

25
Experimental
3087 amanda-emerick/guess-the-animal

🐵 Guess the Animal 🐸 is a didactic game developed for...

25
Experimental
3088 pannous/angle

⦠ Angle: new speakable syntax for python 💡

25
Experimental
3089 GeopJr/action-accessibility

Programming is for everyone. No matter what. This action helps achieve that....

25
Experimental
3090 ShihabYasin/Isolated-Bengali-Word-and-Speaker-Recognition.

Isolated Bengali word and speaker recognition.

25
Experimental
3091 bougieL/tts-fluent

Text to speech

25
Experimental
3092 RonanDavalan/vosk-cli-dictation

A real-time, offline, and customizable command-line (CLI) dictation tool for...

25
Experimental
3093 Wookie-VUI/Wokiee

Cross-platform Voice User Interface for your Desktop

25
Experimental
3094 vero-code/gemini-tales

AI storyteller that turns screen time into active adventure. Puck — a live...

25
Experimental
3095 visaoenhance/livekit-debug-playground

LiveKit voice app validation skill. Use when building, debugging, or...

25
Experimental
3096 elizabethfuentes12/ray-ban-ai-agent-sample-for-aws-agentcore

Voice AI agent for Ray-Ban Meta glasses using Amazon Bedrock AgentCore and...

25
Experimental
3097 dalyanalytics/counselor

👑 voice-powered code review tool for R developers

25
Experimental
3098 rohansx/convox

Open-source voice AI orchestration platform for India. Build production...

25
Experimental
3099 olami-developers/olami-api-quickstart-php-samples

OLAMI API Quickstart PHP Samples

25
Experimental
3100 rpsthecoder/js-speech-synthesis

Add Text to Speech feature to webpages using JavaScript's Web Speech API

25
Experimental