All Voice AI Tools

6,981 tools ranked by quality score · Page 39 of 70

Showing 3801–3900 of 6,981
# Tool Score Tier
3801 sangramsingnk/Audio-Feature-Extraction

In sound processing, the mel-frequency cepstrum (MFC) is a representation of...

22
Experimental
3802 lucky-bai/wasm-speech-streaming

Offline streaming speech-to-text in the browser

22
Experimental
3803 mhagglun/Speech-Recognition

TensorFlow implementation for Speech Recognition using Convolutional Neural...

22
Experimental
3804 bacharyehya/outloud

Beautiful TUI for text-to-speech. Gemini, OpenAI, or local. One command.

22
Experimental
3805 Pierillo/hallucination-check

Automated pipeline that curates, writes, and sends a daily AI newsletter...

22
Experimental
3806 verrannt/snn_speechrec

Convolutional Spiking Neural Network to recognize speech utterances using...

22
Experimental
3807 rwightman/pytorch-commands

Some PyTorch code for the Kaggle Speech Recognition Challenge

22
Experimental
3808 lukasjakobi/ha-sync-announcement

Broadcast synchronized TTS announcements across multiple media players in...

22
Experimental
3809 aleksandarbos/Sound-Recognition-Convo2D-Neural-Network

Tools: Python (OpenCV 3.0 + Keras lib-Convolution 2D Neural Network). Desc:...

22
Experimental
3810 amityalwar/snoofus

Generative AI based speech analyzer

22
Experimental
3811 chameleon82/avatar-ai

OpenAI Avatar for real-time api

22
Experimental
3812 edwindoremi/Asterisk

🎮 Streamline esports tournaments with Asterisk, a real-time management...

22
Experimental
3813 shr1324/orpheus-tts-docker

🔊 Deploy Orpheus TTS with ease using Docker, featuring GPU management,...

22
Experimental
3814 Ultan-Kearns/GestureBasedUIProject

Gesture Based UI Project 4th Year

22
Experimental
3815 shitian-ni/speech-recognition-transfer-learning

Speech command recognition DenseNet transfer learning from UrbanSound8k in...

22
Experimental
3816 OzoneAnim/employee-api

🏢 Manage employee data efficiently with this RESTful API featuring full CRUD...

22
Experimental
3817 winccoa/winccoa-ae-ts-text2speech

WinCC OA Text-To-Speech Library

22
Experimental
3818 easonlai/ms-speech-services-demo-web-tts

Microsoft Azure Speech Services (Text-to-Speech, TTS) Web Demo with Node.JS...

22
Experimental
3819 QinHsiu/BiCLTTS

Bi-level Contrastive Learning for Text-to-Speech

22
Experimental
3820 Dragon745/urdu-roman-dictionary

A growing open-source Urdu → Roman Urdu dictionary and lexicon for...

22
Experimental
3821 JeffWang0325/Microsoft-Azure-Cognitive-Services

🖍️ This project combines multiple operations in Microsoft Azure Cognitive...

22
Experimental
3822 huss2342/x_news_station

Turn an X/Twitter feed into audio

22
Experimental
3823 furushchev/ros_gtts

Text-to-Speech service for ROS using python gTTS library for backend.

22
Experimental
3824 Moonbase59/jingle

Quickly generate a Jingle using Text-to-Speech

22
Experimental
3825 ihsacm/ComfyUI-KittenTTS

Integrate KittenTTS into ComfyUI to enable fast, lightweight text-to-speech...

22
Experimental
3826 shotafujie/asrivia

Displays local transcription results in a picture-in-picture (PiP) view.

22
Experimental
3827 benfordslaw/vowel-sound-generator

Vowel-only speech synthesis of input text using tone.js with formants based...

22
Experimental
3828 mochi-neko/VOICEVOX-API-unity

Binds VOICEVOX text to speech API to pure C# on Unity.

22
Experimental
3829 edisonneza/image-to-text

PWA - Convert Image to Text - A small multi language project built to use...

22
Experimental
3830 SARIT42/image-Annotation-Speech

Explaining the contents of an image in the form of speech through caption...

22
Experimental
3831 kayrugold/andyai

A self-evolving, tri-brain autonomous AI agent featuring local subconscious...

22
Experimental
3832 29sayantanc/Echo

Echo is a privacy-first, offline AI journal and conversational assistant....

22
Experimental
3833 ltphen/martha

Free text to speech synthesizer made with coqui-ai/TTS and flask

22
Experimental
3834 vadimkantorov/discordspeechtotext

Discord Speech-To-Text bot in Python using Google Cloud Speech-To-Text API

22
Experimental
3835 jibon57/nativescript-azure-cognitiveservices

Azure cognitive services implementation for NativeScript.

22
Experimental
3836 Zaid440/cosyvoice-docker

🎙️ Deploy a production-ready Text-to-Speech service with voice cloning and a...

22
Experimental
3837 Cyrostar/ITTS-TR

An end-to-end, highly optimized Text-to-Speech (TTS) framework based on...

22
Experimental
3838 Yacinewhatchandcode/VoiceCloning

🎙️ Real-Time TTS & Voice Cloning Pipeline — F5-TTS · PyTorch · Gradio · Voice Agent

22
Experimental
3839 smswg/callwg

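Voice calling system (outbound calling system): CALLWG, a genuinely commercial-ready voice calling system for 2026. Features: scripted robot outbound calling | call center | VIP queue | caller memory | ASR speech recognition...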

22
Experimental
3840 01-SayantanI/Assistant

This Python Voice Assistant with GUI uses Tkinter to enable users to...

22
Experimental
3841 AlisonGM03/Eva01

Build and interact with an AI that has its own mind, emotions, memory, and a...

22
Experimental
3842 farjadilyas/MUKALMA

MUKALMA is a human-like chatbot which incorporates correct, relevant...

22
Experimental
3843 HsiangNianian/funasr-api

FunASR API is a FastAPI-based inference gateway that wraps multiple FunASR...

22
Experimental
3844 beecave-homelab/parakeet_rocm

ROCm-optimized NVIDIA NeMo Parakeet ASR implementation with CLI, formatting,...

22
Experimental
3845 jm12138/iFLYTEK-MSC-Python-SDK

A third-party Python SDK for the iFLYTEK MSC intelligent speech platform, supporting voice wake-up, speech recognition, speech synthesis, speech evaluation, and more...

22
Experimental
3846 bykemalh/S2ST

Speech to Speech Translation Python

22
Experimental
3847 kilogramme/nerdpudding

Provide live AI video commentary with text-to-speech for any video source,...

22
Experimental
3848 itsanuragkumarjha/Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access

An open-source project that uses cutting-edge NLP models and real-time web...

22
Experimental
3849 nmstoker/SimpleSpeechLoop

A very basic demonstration connecting speech recognition and text-to-speech

22
Experimental
3850 Tugaytalha/NarraPhon

NarraPhon: Advanced Text-to-Speech Conversion Pipeline NarraPhon is a...

22
Experimental
3851 TheM1N9/stella

Stella is an intelligent voice assistant built using Python. It leverages...

22
Experimental
3852 Neil-001/audio-to-subtitle-translate

Easily convert speech to timed SRT subtitles and translated captions (Colab-ready)

22
Experimental
3853 0x61space/pu-cit371-helicopter-commander

Control a helicopter in Grand Theft Auto: San Andreas using speech recognition

22
Experimental
3854 ivsergeev/voicer

Voice input, GigaAM v3 e2e, opencode-plugin, Russian language

22
Experimental
3855 Noor-khalid/Selena

🚀 Accelerate your .NET applications with Selena, a zero-dependency library...

22
Experimental
3856 ShunsukeHayashi/voicebox-tts

VOICEVOX speech generation queuing system (Celery + Redis)

22
Experimental
3857 oasisnoehub/OsisnoeAISpeech

English Text to Speech AI web app: You can better practice your English...

22
Experimental
3858 glloydie/flowtts-byok

🔊 Streamline voice synthesis with FlowTTS BYOK, leveraging Tencent's FlowTTS...

22
Experimental
3859 ORI-Muchim/BERT-MB-iSTFT-VITS

High-quality Multilingual (Korean, Japanese, Chinese, English, French and...

22
Experimental
3860 Nomannazir/f5-tts-fastapi

Open-source FastAPI wrapper for F5-TTS. A powerful Text-to-Speech API with...

22
Experimental
3861 mk-knight23/37-tool-text-to-speech

Production-grade Text-to-Speech utility built with Vue 3 and Web Speech API....

22
Experimental
3862 rajatgoyal715/Awaaz

🎙 An android project with some features like text to speech, speech to text...

22
Experimental
3863 Voine/VITS-MNN

TTS System VITS Android Ver, powered by alibaba-MNN engine.

22
Experimental
3864 AcTePuKc/Chatterbox-TTS-UI

Just a UI for Chatterbox, which uses about 1-2 GB RAM. Double click and...

22
Experimental
3865 cmirnow/Google-Cloud-TTS-Rails

Using the power of the Google Cloud Text-to-Speech API and Ruby, here is a simple...

22
Experimental
3866 KelvinCampelo/open-aiudio-client

This Next.js application provides a user interface for interacting with...

22
Experimental
3867 zemags/golang-yandex-speech-kit

SDK for converting text to audio by Yandex premium voices

22
Experimental
3868 nttcslab-sp/torchain

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

22
Experimental
3869 Mliviu79/cartesia-go

Go SDK for the Cartesia AI API — TTS, STT, voice cloning, agents, WebSocket streaming

22
Experimental
3870 loglux/SpeakItAI

Convert text to speech using Microsoft Azure Neural Text-to-Speech (TTS) and...

22
Experimental
3871 turtlehacks/speechportal

(1st place at HopHacks) A dynamic webVR memory palace for speech training,...

22
Experimental
3872 richardr1126/KittenTTS-FastAPI

High-performance KittenTTS API server with a built-in web UI,...

22
Experimental
3873 yauhenipakala/Yandex.SpeechKit.Xamarin

Yandex SpeechKit Mobile SDK for Xamarin

22
Experimental
3874 neosapience/typecast-python

The official Python SDK for the Typecast API.

22
Experimental
3875 Artavazd2009/yandex-speechkit-php

Provide easy PHP access to Yandex SpeechKit API for audio transcription,...

22
Experimental
3876 Kourva/TextToSpeechBot

Text To Speech Telegram Bot with Brian voice.

22
Experimental
3877 minhsaco99/VoiceCore

Build voice apps fast. Unified API for speech recognition & synthesis with...

22
Experimental
3878 dannycrief/python-voice-assistant

Sarah Voice Assistant (SVA) is a Python voice assistant project on...

22
Experimental
3879 MarceloSalazarV/Multimodal_Med_Ai_with_Deployment

🩺 Enhance patient care with MediBot 2.0, an AI doctor assistant that...

22
Experimental
3880 phith0n/v2srt

v2srt is an AI-based video subtitle generation tool that produces high-quality subtitle files for any video.

22
Experimental
3881 echocatzh/GTCNN

Personalized AEC

22
Experimental
3882 tiefenauer/ip9

Code for my master's thesis at FHNW

22
Experimental
3883 Gopi-Durgaprasad/Speech-To-Text

End-to-End Speech Recognition

22
Experimental
3884 laravieira/reddit-to-tiktok

This project is a Python rendering and publishing pipeline that takes Reddit...

22
Experimental
3885 twn39/edgetts-dart

A pure Dart implementation of the excellent edge-tts library. Access...

22
Experimental
3886 loneicewolf/AI-SNN

AI SNN - or Artificial Intelligence Stuttering Neural Network - a Project I...

22
Experimental
3887 collinsuen/Local-Whisper-STT-Windows11-ZH

Local GPU-Accelerated Chinese Speech-to-Text for Windows 11 (Whisper-based,...

22
Experimental
3888 p337r/Efes

Proof of concept demo for a tool that listens for keywords, and records...

22
Experimental
3889 Bangla-Language-Processing/Katha-Bangla-TTS

The first Bangla Text To Speech System for Bangladeshi Bangla (Katha)

22
Experimental
3890 ninoish/lwc-web-speech-api-input

Implements voice powered input for Lightning Web Component with Web Speech...

22
Experimental
3891 voxia-ai/voxia-open

Lightweight runtime for building real-time Voice AI applications

22
Experimental
3892 bloo-berries/Library-of-the-Blind

The world’s largest catalog of Braille, tactile, audio, and multimodal...

22
Experimental
3893 zolomohan/speech-recognition-in-javascript-starter

Starter Code for Speech Recognition in JavaScript tutorial.

22
Experimental
3894 Chrisisaac948/RealWonder

Generate real-time videos conditioned on physical actions from a single...

22
Experimental
3895 QXIP/RTPEngine-Speech2Text

Simple RTPEngine Speech-to-Text Recording Spooler

22
Experimental
3896 technicianted/msspeech-gbridge

Bridge service to enable using Google Cloud Speech client SDKs with...

22
Experimental
3897 Youhai020616/ai-video-pipeline

Generate AI short dramas and news videos from Python. Text → Images → Video...

22
Experimental
3898 kss2002/edge-TTS

AI Voice TTS Generator to edge-tts

22
Experimental
3899 theawless/Dict-O-nator

A dictation plugin for gedit (the GNOME text editor).

22
Experimental
3900 Axel-NCHO/ReddTok

Generate a TikTok video from a Reddit post

22
Experimental