All Voice AI Tools

6,981 tools ranked by quality score · Page 34 of 70

Showing 3301–3400 of 6,981
# Tool Score Tier
3301 tongjinle123/speech-transformer-pytorch_lightning

ASR project with pytorch-lightning

24
Experimental
3302 leaxer-ai/leaxer-qwen3-tts

C++ implementation of Qwen3-TTS running on top of ONNX Runtime.

24
Experimental
3303 AppleHolic/FastSpeech2

Refactored version of https://github.com/ming024/FastSpeech2

24
Experimental
3304 ictnlp/ComSpeech

Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct...

24
Experimental
3305 synesthesiam/pt-synesthesiam

CMU Sphinx acoustic model for Portugese (pt-br)

24
Experimental
3306 angelinekeke/claude-awake-speak

让你的 Claude Code 会说话 — 自动语音朗读中文内容,8种微软官方音色可选,实时切换,免费无需API Key,跨平台支持

24
Experimental
3307 hyperloop-modules/titanium-speech

Use the iOS 10 SFSpeechRecognizer API in JavaScript with Appcelerator Hyperloop.

24
Experimental
3308 umjammer/vavi-speech2

🗣 Java Text to Speech (JSAPI2) engines (google cloud, cocoa, open jtalk,...

24
Experimental
3309 Niger-Volta-LTI/urhobo-asr-spoken-digits

URH-DIGITS is a connected digits speech recognition task

24
Experimental
3310 jonsafari/buckeye_dict

Buckeye Pronunciation Dictionary

24
Experimental
3311 AI-TOOLKIT/VoiceBridgeProjects

Example projects for VoiceBridge - an AI-TOOLKIT Open Source C++ Speech...

24
Experimental
3312 BBC-Esq/Elegant-Audio-Transcriber

Extremely fast and accurate audio transcrbier surpassing Whisper. Optimized...

24
Experimental
3313 vectominist/rspin

Official inference code for NAACL 2024 paper "R-Spin: Efficient Speaker and...

24
Experimental
3314 Alan-6666/chinese_asr

a demo of chinese asr

24
Experimental
3315 speechly/api

Speechly public API definitions and generated code

24
Experimental
3316 dyankov91/a2pod

Convert articles into podcast-quality audio on Apple Silicon. Local TTS, LLM...

24
Experimental
3317 amolgorithm/speech-gpt

What if ChatGPT had its own voice? What if you could speak to it with your...

24
Experimental
3318 deepgram-starters/django-text-to-speech

Get started using Deepgram's Transcription with this Django demo app

24
Experimental
3319 ansh-info/SpeechSense

This powerful toolkit combines real-time speech recognition with NLP to...

24
Experimental
3320 Flux9665/ArticulatoryTextFrontend

This is a text-processing frontend that converts graphemes to phonemes and...

24
Experimental
3321 lifeiteng/Rabbit

Explore Text-To-Speech

24
Experimental
3322 neurlang/whipstr

Whipstr ASR/STT System

24
Experimental
3323 marvinborner/CTC-LSTM

Spoken word recognition using CTC LSTMs for SWR2 Tübingen

24
Experimental
3324 probablyagoodusername/vesper

Therapeutic audio pipeline. Faith meets science. Free, static, open source.

24
Experimental
3325 xDoritox/Voice-Clone-Studio

🔊 Clone and design voices easily with Voice Clone Studio, a web UI powered...

24
Experimental
3326 vishalnagda1/text-to-speech

Python program to convert text to speech.

24
Experimental
3327 cniweb/podcast_generator

Vollautomatisierter Podcast-Generator: Erstellt komplette Episoden (Audio &...

24
Experimental
3328 myned-ai/interactive-website-navigator

An interactive 3D Gaussian Splatting avatar that guides website visitors...

24
Experimental
3329 rust-han/han-speech

汉语发音系统

24
Experimental
3330 smcantab/speak11

Select text, press ⌥⇧/, hear it read aloud. macOS text-to-speech powered by...

24
Experimental
3331 loglux/FlexAudioPrint

FlexAudioPrint is a Python-based app for transcribing audio to text using...

24
Experimental
3332 visu123s/MimicKit

🤖 Learn motion imitation with MimicKit, a framework offering advanced...

24
Experimental
3333 slanglabs-projects/asr-wer-bench

Workbench for benchmarking Word Error Rate (WER) of Automatic Speech...

24
Experimental
3334 baochuquan/ios-vad

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN,...

24
Experimental
3335 mpoyraz/ngram-lm-wiki

Scripts to train a n-gram language models on Wikipedia articles

24
Experimental
3336 awetomate/text-to-speech-streamlit

Text-to-Speech solution using Google's Cloud TTS API and a Streamlit front end

24
Experimental
3337 idiap/TIDIGITSRecipe.jl

A Julia recipe for training an ASR system using the TIDIGITS database

24
Experimental
3338 codassassin/voice-assistant

This is a very simple CLI based voice assistant which does various work on...

24
Experimental
3339 linto-ai/linto-punctuation

LinTO Platform punctuation service.

24
Experimental
3340 Ryan5453/lyricscribe

Automated Lyric Transcription Research

24
Experimental
3341 good-boy01/Quki

A virtual assistant that helps with everyday tasks , Quki is still in the...

24
Experimental
3342 HawksLab/narratify

e-book to audiobook convertor

24
Experimental
3343 andrew-fennell/CogNative

Translated vocal synthesis - Clone a voice and output speech in another language

24
Experimental
3344 findtharun/Railway_bot

Interactive Railway Reservation - BuildIng a ChatBot for a railway...

24
Experimental
3345 guglielmocamporese/learning_invariances_in_speech_recognition

In this work I investigate the speech command task developing and analyzing...

24
Experimental
3346 SoCXin/ASR1606

L4 R2: ASR 624MHz Cortex-R5 Cat.1 SoC (ASR1606/ASR1602)

24
Experimental
3347 SoCXin/ASR1601

L4 R3: ASR Cortex-R5 LTE Cat.1 SoC (ASR1601/ASR1603/ASR3601)

24
Experimental
3348 benank/bento

Your AI cooking companion🍱 Utilizes OpenAI's ChatGPT & Whisper APIs +...

24
Experimental
3349 dacson/Demo-of-Text-to-Speech-based-on-Deep-Learning

text to speech for mandarin,

24
Experimental
3350 WindowsNT/SpeechRec

Continuous Dictation Speech Recognition and Speech Synthesis in Win32

24
Experimental
3351 mklement0/voices

macOS CLI for changing the default TTS (text-to-speech) voice and printing...

24
Experimental
3352 motelian/NutriSmart

NutriSmart is an AI-based calorie and macro tracking app equipped with NLP...

24
Experimental
3353 burrmill/sph2pipe

sph2pipe v2.5. We do not maintain this, and/or accept pull requests; just...

24
Experimental
3354 MItCHeLPL/Discord-AISupBOT

Discord AI Chat Bot with GPT-3

24
Experimental
3355 mc095/luma

Personal Voice Agentic AI powered by Agno

24
Experimental
3356 mayank-kumar-giri/Speech-Recognizer-cum-Voice-Typing-Editor

Speech Recognizer cum text editor that facilitates voice typing using Google...

24
Experimental
3357 baocin/hugging_face_example_STT_api

Demonstration of Hugging Face's (https://huggingface.co/) newly released...

24
Experimental
3358 Scisaga/qwen3-asr-openai

自托管 ASR 推理服务

24
Experimental
3359 aallaguly01/Diplom

Multimodal Python framework for hand gesture and voice control with cursor...

24
Experimental
3360 jxlarrea/homeassistant-voice-recipes

GPU/CUDA-accelerated voice control stack for Home Assistant. Runs on x86/x64...

24
Experimental
3361 Jopex1/real-time-voice-translator

🌍 Capture speech, translate it instantly, and playback audio in a selected...

24
Experimental
3362 HyxiaoGe/ai-audio-assistant-ui

面向音视频内容理解的 AI 助手,支持上传与 YouTube 链接, ASR 转写、结构化摘要与实时进度。

24
Experimental
3363 prakharjadaun/Voice-Assistant

Created a Voice Assistant with the help of pyttsx3 library. Also, I have...

24
Experimental
3364 aidayang/Faster-whisper-OneClick

Faster-whisper一键启动整合包带GUI界面

24
Experimental
3365 persanix-llc/chatrpi-app

Chat for Raspberry Pi (Chatrpi) is a voice assistant for the Raspberry Pi...

24
Experimental
3366 mariangle/taskify

AI powered task manager app with speech recognition, twitter-like input...

24
Experimental
3367 goodmike31/pl-asr-speech-data-survey

Survey of available speech datasets for Polish ASR development

24
Experimental
3368 Michaelrace/awesome-voice-agents

🗣️ Explore a curated list of voice AI agents, frameworks, tools, and best...

24
Experimental
3369 Rishabh1925/voiceforge

AI-powered voice automation platform with text-to-speech and automated...

24
Experimental
3370 cycle-sync-ai/livekit-voice-ai-agent-setup

This is the guide to show the method to build your own AI-Powered voice...

24
Experimental
3371 ayutaz/openjtalk-native

Cross-platform OpenJTalk native shared library — Japanese text-to-phoneme C...

24
Experimental
3372 Userdev1213/h3xassist

🤖 Automate your online meetings with H3xAssist to record, transcribe, and...

24
Experimental
3373 vpdl-sys/vpdl-public

Proprietary AI Voice Script Writer for turning written text into natural,...

24
Experimental
3374 pkprajapati7402/Darvin-Chatbot

Darvin is a Python-based voice-activated chatbot that interacts with users...

24
Experimental
3375 SethiPawandeep/kaldi-for-dummies

This is the repository for my version of Kaldi for Dummies example.

24
Experimental
3376 yuhanwang14/ASR-Pipeline

Local GPU-accelerated speech transcription pipeline with speaker diarization...

24
Experimental
3377 tfm000/diana

Locally hosted Text-to-Speech Document Converter

24
Experimental
3378 charlescao460/SpeechRecognitionByGoogleCloud

A .NET program that captures local audio and recognizes speech

24
Experimental
3379 Sariel2018/audio-srt-aligner

Dual-mode subtitle tool: transcript-aware alignment and audio-only auto...

24
Experimental
3380 kofemann/streetguide

An Android app to discover where you drive

24
Experimental
3381 techiaith/seilwaith

Offer hwyluso creu Adnabod Lleferydd Cymraeg gyda HTK, IRSTLM, Julius a...

24
Experimental
3382 theawless/sr-lib

Automatic Speech Recognition library for my BTech Project.

24
Experimental
3383 coo-quack/talkative-lobster

Desktop voice conversation app — speak into your mic and an AI responds out loud

24
Experimental
3384 deepgram/deepgram-js-captions

This package is the JavaScript implementation of Deepgram's WebVTT and SRT...

24
Experimental
3385 HxnDev/Convert-Text-To-Speech

This project uses Google Text to Speech to convert the written text into any...

24
Experimental
3386 Phe0nix/Speech-Email-Sender

Send email with speech recognition means just start talking and send emails....

24
Experimental
3387 tongplw/ASR-web-based-restaurant

🍔 Foody, a smart voice-assistant web-based restaurant using Kaldi, React, and WebRTC

24
Experimental
3388 ldl805/QuickSpeechPi

Very, very lightweight and simple text to speech (TTS) program that outputs...

24
Experimental
3389 NEURASCOPE/neurascreen

Automate product tour videos with JSON scenarios. Real browser recording, AI...

24
Experimental
3390 aks-devs/mod_whisper_asr

Freeswitch ASR module

24
Experimental
3391 Anthonyiswhy/blind_navigation_aid

Raspberry Pi + ESP32 system for blind assistance using LiDAR, OpenCV, YOLO,...

24
Experimental
3392 pystorage/pyspeechkit

Library for working with a range of technologies for speech recognition and...

24
Experimental
3393 Jithsaavvy/Deploying-an-end-to-end-keyword-spotting-model-into-cloud-server-by-integrating-CI-CD-pipeline

The project is a concoction of research (audio signal processing, keyword...

24
Experimental
3394 Bashvalencia724/xiaomusic

🎶 Stream music effortlessly with XiaoMusic, enhancing your Xiao AI speaker...

24
Experimental
3395 Kaljurand/net-speech-api

Java API for the online speech recognition services provided by phon.ioc.ee

24
Experimental
3396 ParthPipermintwala/Personal-Assistant

🎙 Voice-controlled AI desktop assistant built with Python. Supports voice...

24
Experimental
3397 nikhilkumarsingh/Wit-Speech-API-Wrapper

A python client for interacting with Wit Speech Recognition API

24
Experimental
3398 mahimairaja/vapiserve

A to Z Vapi's custom tools | All your voice agent needs

24
Experimental
3399 abhaymathur21/Aura

A Personal Voice Assistant that performs a multitude of tasks for you on...

24
Experimental
3400 alefiury/SE-R-2022-SER-Track

Code for the winning solution in the SE&R 2022 Challenge - SER track.

24
Experimental
« Prev 1 2 3 32 33 34 35 36 68 69 70 Next »