All Voice AI Tools

6,981 tools ranked by quality score · Page 30 of 70

Showing 2901–3000 of 6,981
# Tool Score Tier
2901 spacefarers/Transcryb

Fully Local Push-to-Transcribe

26
Experimental
2902 WWWWxp/M3-TTS

Pytorch Implementation of the paper "M3-TTS: Multi-modal DiT Alignment &...

26
Experimental
2903 HCID274/JianYan

基于 SenseVoice 的 Windows 本地语音转文字工具,支持 OpenAI 格式 API 润色,低延迟,高精度。

26
Experimental
2904 MysteryPancake/Discord-Lyrebird

[DEPRECATED] Text to speech Discord bot using the Lyrebird API

26
Experimental
2905 igorbezsmertnyi/speech

speech recognition and speech synthesis

26
Experimental
2906 tometoproject/tometo

:zzz: A text to speech social network. [mirror]

26
Experimental
2907 gunarakulangunaretnam/voice-typer

A voice recognition based typing tool for English, Tamil, Sinhala languages.

26
Experimental
2908 fvarrui/PowerPointToVideo

:clapper: PowerPoint to MP4 converter with synthesized interlocutor voice.

26
Experimental
2909 Yashkapure06/TextToSpeech-ChromeExtension

Text To Speech - Chrome Extension

26
Experimental
2910 34j/awesome-vits

List of repositories relevant to VITS.

26
Experimental
2911 fishaudio/fish-audio-n8n

The official n8n node for the Fish Audio API.

26
Experimental
2912 creafz/kaggle-speech-recognition

Solution for TensorFlow Speech Recognition Challenge on Kaggle (125th place, top 10%)

26
Experimental
2913 jiwidi/las-pytorch

Listen, Attend and spell model for E2E ASR. Implementation in Pytorch

26
Experimental
2914 18F/tts-buy-sites-challenge

Solicitation documents related to the purchase of hosting services for...

26
Experimental
2915 Hassi34/NLP-Hub

The NLP Hub consists of multiple NLP services, each providing specific...

26
Experimental
2916 HelloChatterbox/text2speech

Chatterbox TTS engines

26
Experimental
2917 Serkali-sudo/auto-subtitle-generator

An Android app that automatically generates subtitles for videos locally,...

26
Experimental
2918 va-kiet/Voice-Assistant-wake-word-detection-model

Build a Wake Word Detection model for Voice Assistant using PyTorch

26
Experimental
2919 6-robot/xfyun_waterplus

A xfyun ros package for Waterplus Robots

26
Experimental
2920 CypherousSkies/reading-for-listeners

A deep-learning powered accessibility application which turns pdfs into...

26
Experimental
2921 kaiidams/Voice100Sharp

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost...

26
Experimental
2922 ignabelitzky/easy-subber

A Python-based tool that that takes video files and generates .srt subtitle...

26
Experimental
2923 KathyReid/opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in...

26
Experimental
2924 VolgaGerm/PocketTTS.cpp

Single-file C++ TTS runtime for Pocket TTS with ONNX Runtime — voice...

26
Experimental
2925 TranHuuDat2004/tts-flask-app

Text-to-Speech Generator Powered by Python, Flask, and Piper TTS

26
Experimental
2926 gogyzzz/beamformit_matlab

A MATLAB implementation of CHiME4 baseline Beamformit

26
Experimental
2927 LiaTemplates/Speech-Recognition-Quiz

Create quizzes that check spoken text

26
Experimental
2928 Ahmed5attab/Qaf-QuranSearchAndMemorization

iOS Islamic application for the holy Quran, helps the Muslims to have the...

26
Experimental
2929 Unicorn-Commander/Unicorn-Orator

🦄 Text-to-Speech offloaded to iGPU and/or NPU

26
Experimental
2930 winstxnhdw/CapGen

A fast CPU-first video/audio transcriber for generating caption files with...

26
Experimental
2931 HristovB/Speech_Recognition_Macedonian

Speech recognition model for recognising Macedonian spoken language.

26
Experimental
2932 Pooventhiran/VSR

Speaker-Independent Speech Recognition using Visual Features

26
Experimental
2933 speechly/slu-client

Interact with Speechly SLU API from the command line

26
Experimental
2934 m0wer/aibot

Telegram bot powered by Ollama, capable of handling text and voice messages,...

26
Experimental
2935 charslab/Home-Assistant

Home assistant inspired by Amazon Echo, based on wit.ai with speech recognition

26
Experimental
2936 IbrokhimN/IJAI

IJAI is a modular AI assistant that supports text and voice interactions...

26
Experimental
2937 emonosuke/emoASR

End-to-end MOdeling of ASR (Automatic Speech Recognition)

26
Experimental
2938 bhattbhavesh91/speech-python-demos

pyttsx3 is a text-to-speech conversion library in Python. Its a Python-based...

26
Experimental
2939 habitual69/speakify

Speakify is a web application that uses Edge TTS to convert text to speech...

26
Experimental
2940 MikeChongCan/AITK

Artificial Intelligence Toolkit, a powerful tool that makes your life better.

26
Experimental
2941 osteele/speech-provider

A unified TypeScript interface for browser speech synthesis and Eleven Labs...

26
Experimental
2942 SALT-Research/SHALLOW

SHALLOW, the first hallucination benchmark for ASR models

26
Experimental
2943 n0an/VivaDicta

Voice Transcription, Reimagined

26
Experimental
2944 Zuellni/Orpheus-GGUF

Orpheus-TTS inference.

26
Experimental
2945 jakob-stoeck/speechToText

iOS speech recognition app for voice messages and general audio files

26
Experimental
2946 sanwecn/telegram-offline-voice

🎙️ 本地生成 Telegram 语音消息,无需 API Token。Edge-TTS + FFmpeg,零成本,无限制。

26
Experimental
2947 oovz/expo-edge-speech

Microsoft Edge text-to-speech for Expo and React Native

26
Experimental
2948 akukerang/StudySurfer

Subway Surfer TikTok Study Tool

25
Experimental
2949 raomaster/read-me-a-book

Read me a book with python using TTS (local modelas)

25
Experimental
2950 piercecohen1/AI-TTS

Listen to anything with AI voices

25
Experimental
2951 aws-samples/seq2seq-asr-misbehaves

Artifacts for the paper "Attentional Speech Recognition Models Misbehave on...

25
Experimental
2952 csikasote/bembaspeech-exps

Bemba ASR model obtained by fine-tuning a well performing DeepSpeech English...

25
Experimental
2953 kubo/ruby-flite

a small speech synthesis library for ruby using CMU Flite(http://cmuflite.org)

25
Experimental
2954 JingShing-Python/Python-Voice-Order

An project that can transfer your voice order into word command.

25
Experimental
2955 Sgvkamalakar/Azure_AI_Speech_Services

This repository contains a Streamlit-based application that leverages Azure...

25
Experimental
2956 152334H/CTN-webapp

Refactored ControllableTalkNet with Flask/uwsgi

25
Experimental
2957 NullEnt1ty/GCloudSpeech

Transcribe voice data to text using Google Cloud Speech-to-Text

25
Experimental
2958 JesusGautamah/chatgpt_assistant

ChatGPT Virtual Assistant to Telegram and Discord with Voice Recognition

25
Experimental
2959 MERLIN2-ARCH/text_to_speech

Text to speech for ROS 2

25
Experimental
2960 dgnsrekt/Discorgeous

Discord + GTTS = a discord bot that sends google text to speech voice...

25
Experimental
2961 XOREngine/Marvin4000

Real-time audio translation using Whisper + SeamlessM4T / NLLB-200

25
Experimental
2962 Cosmos-Break/asr

沪语(上海话)ASR(语音识别)模型

25
Experimental
2963 SharkyRawr/go-tiktok-tts

Go library for TikToks Text2Speech engine

25
Experimental
2964 xcorpio/FriendlyARM6410

基于FriendlyARM6410平台的嵌入式Qt程序:实时天气信息,远程vnc控制,远程监视摄像头,语音控制,语音输出TTS

25
Experimental
2965 shinchanat/Py

Pyreader is a python project created for reading pdf and text files by applying tts.

25
Experimental
2966 rik079/Speasier

Speak easier in Speasy. A.k.a Sock's Speaking Slave.

25
Experimental
2967 xuliang2024/video_skills

Cursor Skills 合集:一句话生成短视频。包含 Tumblr 风格视频、知识讲解视频、Lottie 动画视频等多种 AI 视频制作技能。

25
Experimental
2968 gabrimatic/local-whisper

On-device voice transcription, grammar correction, and text-to-speech for...

25
Experimental
2969 kanweiwei/speekium

Smart voice assistant with pluggable LLM backends

25
Experimental
2970 florijanqosja/Albanian-ASR

This project is an AI-based transcription tool for the Albanian language....

25
Experimental
2971 whiteSHADOW1234/WhisperTranscriber

🎙️ Effortlessly transcribe YouTube videos, MP4, and MP3 files to text using...

25
Experimental
2972 kaiidams/NeMoOnnxSharp

Text-to-speech and speech recognition, VAD with NVIDIA NeMo and ONNX Runtime...

25
Experimental
2973 NickEinstein1/TUNDA

Empathetic CARE_SOL_AI

25
Experimental
2974 EX3exp/MiriVoice

Open-Free TTS Platform For All

25
Experimental
2975 speechpro/speechpro-cloud-asr-examples

Примеры использования Beta-версии gRPC API потокового распознавания речи в ЦРТ Облаке

25
Experimental
2976 Jor02/DectalkNET

Use the Dectalk voice sythesizer directly in .NET applications

25
Experimental
2977 turinaf/Sagalee

Automatic Speech Recognition Dataset for Oromo Language

25
Experimental
2978 isthistechsupport/tts_for_discord

Using Discord.py and the Azure Cognitive Services Python SDK to bring Azure...

25
Experimental
2979 iChochy/mimo-tts-chat

MiMo TTS Chat

25
Experimental
2980 birros/pico2wave.js

JS port of pico2wave (Emscripten)

25
Experimental
2981 chase-west/VocaSpanish

Python app using tts and speech recognition to memorize spanish vocabulary

25
Experimental
2982 analyticsinmotion/wake-word

Hands-free voice activation for VS Code, Cursor, and compatible editors....

25
Experimental
2983 arpabot/ohno-bot

Discord Japanese text-to-speech bot

25
Experimental
2984 y52en/aquestalk.js

AquesTalkをWebAssembly(v86)環境で動かし、ブラウザやNode.jsで簡単に利用できるようにしたライブラリです DEMO : ...

25
Experimental
2985 Loatchi/Tiktok-TTS-java

A program to transform a text to a vocal message using Tiktok voice template.

25
Experimental
2986 ArthurBabkin/Parimate

A Telegram bot for validating audio and video content using CV models, SR...

25
Experimental
2987 snaraya7/Ok_Eclipse

CSC 510 Software Engineering (Spring 2018) project - Group 'O'

25
Experimental
2988 Aadv1k/reddit-tts-gui

A GUI to auto-generate TTS videos from reddit posts and comments

25
Experimental
2989 mskian/pronounce-and-speech

Pronounce and Speech Text - Enter Word and Get the Pronunciation and Speech Text.

25
Experimental
2990 aminul-huq/Adversarial-Examples-For-Audio-Data

Repo for papers to read on adversarial attack and defense techniques in the...

25
Experimental
2991 Afnanksalal/MediTech

MediTech is an innovative AI-driven Electronic Medical Record (EMR) system...

25
Experimental
2992 paradocx96/Text-to-Speech-Application

Text-to-Speech Application build with Electron JS

25
Experimental
2993 EnjiRouz/Habr-Reader-Extension

Простое расширение-читалка для Chrome/Opera, позволяющее воспроизводить...

25
Experimental
2994 it-beard/podcast-tts

Text-to-speach Python scripts for podcasting

25
Experimental
2995 Kowalski1024/Mi-Go

Mi-Go is an open-source test framework designed to evaluate and compare the...

25
Experimental
2996 KennethanCeyer/awesome-audio-speech

Awesome list of Audio, Speech, and DSP(Digital signal processing)

25
Experimental
2997 kehlawicode/audiblez

🎧 Create high-quality audiobooks from e-books with ease using Audiblez,...

25
Experimental
2998 pajeronda/microsoft

Microsoft Text-to-Speech (TTS) for Home Assitant with streaming support.

25
Experimental
2999 tpc3/Kotone-DiVE

TTS bot for Discord, Re-written with golang

25
Experimental
3000 mush42/leanspeech

Unofficial pytorch implementation of LeanSpeech: The Microsoft Lightweight...

25
Experimental
« Prev 1 2 3 28 29 30 31 32 68 69 70 Next »