All Voice AI Tools
6,981 tools ranked by quality score · Page 30 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 2901 |
spacefarers/Transcryb
Fully Local Push-to-Transcribe |
|
Experimental |
| 2902 |
WWWWxp/M3-TTS
Pytorch Implementation of the paper "M3-TTS: Multi-modal DiT Alignment &... |
|
Experimental |
| 2903 |
HCID274/JianYan
基于 SenseVoice 的 Windows 本地语音转文字工具,支持 OpenAI 格式 API 润色,低延迟,高精度。 |
|
Experimental |
| 2904 |
MysteryPancake/Discord-Lyrebird
[DEPRECATED] Text to speech Discord bot using the Lyrebird API |
|
Experimental |
| 2905 |
igorbezsmertnyi/speech
speech recognition and speech synthesis |
|
Experimental |
| 2906 |
tometoproject/tometo
:zzz: A text to speech social network. [mirror] |
|
Experimental |
| 2907 |
gunarakulangunaretnam/voice-typer
A voice recognition based typing tool for English, Tamil, Sinhala languages. |
|
Experimental |
| 2908 |
fvarrui/PowerPointToVideo
:clapper: PowerPoint to MP4 converter with synthesized interlocutor voice. |
|
Experimental |
| 2909 |
Yashkapure06/TextToSpeech-ChromeExtension
Text To Speech - Chrome Extension |
|
Experimental |
| 2910 |
34j/awesome-vits
List of repositories relevant to VITS. |
|
Experimental |
| 2911 |
fishaudio/fish-audio-n8n
The official n8n node for the Fish Audio API. |
|
Experimental |
| 2912 |
creafz/kaggle-speech-recognition
Solution for TensorFlow Speech Recognition Challenge on Kaggle (125th place, top 10%) |
|
Experimental |
| 2913 |
jiwidi/las-pytorch
Listen, Attend and spell model for E2E ASR. Implementation in Pytorch |
|
Experimental |
| 2914 |
18F/tts-buy-sites-challenge
Solicitation documents related to the purchase of hosting services for... |
|
Experimental |
| 2915 |
Hassi34/NLP-Hub
The NLP Hub consists of multiple NLP services, each providing specific... |
|
Experimental |
| 2916 |
HelloChatterbox/text2speech
Chatterbox TTS engines |
|
Experimental |
| 2917 |
Serkali-sudo/auto-subtitle-generator
An Android app that automatically generates subtitles for videos locally,... |
|
Experimental |
| 2918 |
va-kiet/Voice-Assistant-wake-word-detection-model
Build a Wake Word Detection model for Voice Assistant using PyTorch |
|
Experimental |
| 2919 |
6-robot/xfyun_waterplus
A xfyun ros package for Waterplus Robots |
|
Experimental |
| 2920 |
CypherousSkies/reading-for-listeners
A deep-learning powered accessibility application which turns pdfs into... |
|
Experimental |
| 2921 |
kaiidams/Voice100Sharp
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost... |
|
Experimental |
| 2922 |
ignabelitzky/easy-subber
A Python-based tool that that takes video files and generates .srt subtitle... |
|
Experimental |
| 2923 |
KathyReid/opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in... |
|
Experimental |
| 2924 |
VolgaGerm/PocketTTS.cpp
Single-file C++ TTS runtime for Pocket TTS with ONNX Runtime — voice... |
|
Experimental |
| 2925 |
TranHuuDat2004/tts-flask-app
Text-to-Speech Generator Powered by Python, Flask, and Piper TTS |
|
Experimental |
| 2926 |
gogyzzz/beamformit_matlab
A MATLAB implementation of CHiME4 baseline Beamformit |
|
Experimental |
| 2927 |
LiaTemplates/Speech-Recognition-Quiz
Create quizzes that check spoken text |
|
Experimental |
| 2928 |
Ahmed5attab/Qaf-QuranSearchAndMemorization
iOS Islamic application for the holy Quran, helps the Muslims to have the... |
|
Experimental |
| 2929 |
Unicorn-Commander/Unicorn-Orator
🦄 Text-to-Speech offloaded to iGPU and/or NPU |
|
Experimental |
| 2930 |
winstxnhdw/CapGen
A fast CPU-first video/audio transcriber for generating caption files with... |
|
Experimental |
| 2931 |
HristovB/Speech_Recognition_Macedonian
Speech recognition model for recognising Macedonian spoken language. |
|
Experimental |
| 2932 |
Pooventhiran/VSR
Speaker-Independent Speech Recognition using Visual Features |
|
Experimental |
| 2933 |
speechly/slu-client
Interact with Speechly SLU API from the command line |
|
Experimental |
| 2934 |
m0wer/aibot
Telegram bot powered by Ollama, capable of handling text and voice messages,... |
|
Experimental |
| 2935 |
charslab/Home-Assistant
Home assistant inspired by Amazon Echo, based on wit.ai with speech recognition |
|
Experimental |
| 2936 |
IbrokhimN/IJAI
IJAI is a modular AI assistant that supports text and voice interactions... |
|
Experimental |
| 2937 |
emonosuke/emoASR
End-to-end MOdeling of ASR (Automatic Speech Recognition) |
|
Experimental |
| 2938 |
bhattbhavesh91/speech-python-demos
pyttsx3 is a text-to-speech conversion library in Python. Its a Python-based... |
|
Experimental |
| 2939 |
habitual69/speakify
Speakify is a web application that uses Edge TTS to convert text to speech... |
|
Experimental |
| 2940 |
MikeChongCan/AITK
Artificial Intelligence Toolkit, a powerful tool that makes your life better. |
|
Experimental |
| 2941 |
osteele/speech-provider
A unified TypeScript interface for browser speech synthesis and Eleven Labs... |
|
Experimental |
| 2942 |
SALT-Research/SHALLOW
SHALLOW, the first hallucination benchmark for ASR models |
|
Experimental |
| 2943 |
n0an/VivaDicta
Voice Transcription, Reimagined |
|
Experimental |
| 2944 |
Zuellni/Orpheus-GGUF
Orpheus-TTS inference. |
|
Experimental |
| 2945 |
jakob-stoeck/speechToText
iOS speech recognition app for voice messages and general audio files |
|
Experimental |
| 2946 |
sanwecn/telegram-offline-voice
🎙️ 本地生成 Telegram 语音消息,无需 API Token。Edge-TTS + FFmpeg,零成本,无限制。 |
|
Experimental |
| 2947 |
oovz/expo-edge-speech
Microsoft Edge text-to-speech for Expo and React Native |
|
Experimental |
| 2948 |
akukerang/StudySurfer
Subway Surfer TikTok Study Tool |
|
Experimental |
| 2949 |
raomaster/read-me-a-book
Read me a book with python using TTS (local modelas) |
|
Experimental |
| 2950 |
piercecohen1/AI-TTS
Listen to anything with AI voices |
|
Experimental |
| 2951 |
aws-samples/seq2seq-asr-misbehaves
Artifacts for the paper "Attentional Speech Recognition Models Misbehave on... |
|
Experimental |
| 2952 |
csikasote/bembaspeech-exps
Bemba ASR model obtained by fine-tuning a well performing DeepSpeech English... |
|
Experimental |
| 2953 |
kubo/ruby-flite
a small speech synthesis library for ruby using CMU Flite(http://cmuflite.org) |
|
Experimental |
| 2954 |
JingShing-Python/Python-Voice-Order
An project that can transfer your voice order into word command. |
|
Experimental |
| 2955 |
Sgvkamalakar/Azure_AI_Speech_Services
This repository contains a Streamlit-based application that leverages Azure... |
|
Experimental |
| 2956 |
152334H/CTN-webapp
Refactored ControllableTalkNet with Flask/uwsgi |
|
Experimental |
| 2957 |
NullEnt1ty/GCloudSpeech
Transcribe voice data to text using Google Cloud Speech-to-Text |
|
Experimental |
| 2958 |
JesusGautamah/chatgpt_assistant
ChatGPT Virtual Assistant to Telegram and Discord with Voice Recognition |
|
Experimental |
| 2959 |
MERLIN2-ARCH/text_to_speech
Text to speech for ROS 2 |
|
Experimental |
| 2960 |
dgnsrekt/Discorgeous
Discord + GTTS = a discord bot that sends google text to speech voice... |
|
Experimental |
| 2961 |
XOREngine/Marvin4000
Real-time audio translation using Whisper + SeamlessM4T / NLLB-200 |
|
Experimental |
| 2962 |
Cosmos-Break/asr
沪语(上海话)ASR(语音识别)模型 |
|
Experimental |
| 2963 |
SharkyRawr/go-tiktok-tts
Go library for TikToks Text2Speech engine |
|
Experimental |
| 2964 |
xcorpio/FriendlyARM6410
基于FriendlyARM6410平台的嵌入式Qt程序:实时天气信息,远程vnc控制,远程监视摄像头,语音控制,语音输出TTS |
|
Experimental |
| 2965 |
shinchanat/Py
Pyreader is a python project created for reading pdf and text files by applying tts. |
|
Experimental |
| 2966 |
rik079/Speasier
Speak easier in Speasy. A.k.a Sock's Speaking Slave. |
|
Experimental |
| 2967 |
xuliang2024/video_skills
Cursor Skills 合集:一句话生成短视频。包含 Tumblr 风格视频、知识讲解视频、Lottie 动画视频等多种 AI 视频制作技能。 |
|
Experimental |
| 2968 |
gabrimatic/local-whisper
On-device voice transcription, grammar correction, and text-to-speech for... |
|
Experimental |
| 2969 |
kanweiwei/speekium
Smart voice assistant with pluggable LLM backends |
|
Experimental |
| 2970 |
florijanqosja/Albanian-ASR
This project is an AI-based transcription tool for the Albanian language.... |
|
Experimental |
| 2971 |
whiteSHADOW1234/WhisperTranscriber
🎙️ Effortlessly transcribe YouTube videos, MP4, and MP3 files to text using... |
|
Experimental |
| 2972 |
kaiidams/NeMoOnnxSharp
Text-to-speech and speech recognition, VAD with NVIDIA NeMo and ONNX Runtime... |
|
Experimental |
| 2973 |
NickEinstein1/TUNDA
Empathetic CARE_SOL_AI |
|
Experimental |
| 2974 |
EX3exp/MiriVoice
Open-Free TTS Platform For All |
|
Experimental |
| 2975 |
speechpro/speechpro-cloud-asr-examples
Примеры использования Beta-версии gRPC API потокового распознавания речи в ЦРТ Облаке |
|
Experimental |
| 2976 |
Jor02/DectalkNET
Use the Dectalk voice sythesizer directly in .NET applications |
|
Experimental |
| 2977 |
turinaf/Sagalee
Automatic Speech Recognition Dataset for Oromo Language |
|
Experimental |
| 2978 |
isthistechsupport/tts_for_discord
Using Discord.py and the Azure Cognitive Services Python SDK to bring Azure... |
|
Experimental |
| 2979 |
iChochy/mimo-tts-chat
MiMo TTS Chat |
|
Experimental |
| 2980 |
birros/pico2wave.js
JS port of pico2wave (Emscripten) |
|
Experimental |
| 2981 |
chase-west/VocaSpanish
Python app using tts and speech recognition to memorize spanish vocabulary |
|
Experimental |
| 2982 |
analyticsinmotion/wake-word
Hands-free voice activation for VS Code, Cursor, and compatible editors.... |
|
Experimental |
| 2983 |
arpabot/ohno-bot
Discord Japanese text-to-speech bot |
|
Experimental |
| 2984 |
y52en/aquestalk.js
AquesTalkをWebAssembly(v86)環境で動かし、ブラウザやNode.jsで簡単に利用できるようにしたライブラリです DEMO : ... |
|
Experimental |
| 2985 |
Loatchi/Tiktok-TTS-java
A program to transform a text to a vocal message using Tiktok voice template. |
|
Experimental |
| 2986 |
ArthurBabkin/Parimate
A Telegram bot for validating audio and video content using CV models, SR... |
|
Experimental |
| 2987 |
snaraya7/Ok_Eclipse
CSC 510 Software Engineering (Spring 2018) project - Group 'O' |
|
Experimental |
| 2988 |
Aadv1k/reddit-tts-gui
A GUI to auto-generate TTS videos from reddit posts and comments |
|
Experimental |
| 2989 |
mskian/pronounce-and-speech
Pronounce and Speech Text - Enter Word and Get the Pronunciation and Speech Text. |
|
Experimental |
| 2990 |
aminul-huq/Adversarial-Examples-For-Audio-Data
Repo for papers to read on adversarial attack and defense techniques in the... |
|
Experimental |
| 2991 |
Afnanksalal/MediTech
MediTech is an innovative AI-driven Electronic Medical Record (EMR) system... |
|
Experimental |
| 2992 |
paradocx96/Text-to-Speech-Application
Text-to-Speech Application build with Electron JS |
|
Experimental |
| 2993 |
EnjiRouz/Habr-Reader-Extension
Простое расширение-читалка для Chrome/Opera, позволяющее воспроизводить... |
|
Experimental |
| 2994 |
it-beard/podcast-tts
Text-to-speach Python scripts for podcasting |
|
Experimental |
| 2995 |
Kowalski1024/Mi-Go
Mi-Go is an open-source test framework designed to evaluate and compare the... |
|
Experimental |
| 2996 |
KennethanCeyer/awesome-audio-speech
Awesome list of Audio, Speech, and DSP(Digital signal processing) |
|
Experimental |
| 2997 |
kehlawicode/audiblez
🎧 Create high-quality audiobooks from e-books with ease using Audiblez,... |
|
Experimental |
| 2998 |
pajeronda/microsoft
Microsoft Text-to-Speech (TTS) for Home Assitant with streaming support. |
|
Experimental |
| 2999 |
tpc3/Kotone-DiVE
TTS bot for Discord, Re-written with golang |
|
Experimental |
| 3000 |
mush42/leanspeech
Unofficial pytorch implementation of LeanSpeech: The Microsoft Lightweight... |
|
Experimental |