All Voice AI Tools
6,981 tools ranked by quality score · Page 26 of 70
| # | Tool | Score | Tier |
|---|---|---|---|
| 2501 |
mpoyraz/wav2vec2-turkish
Turkish Speech Recognition using Facebook's Wav2vec 2.0 models |
|
Experimental |
| 2502 |
6Morpheus6/Chattered
All in one Gradio interface for chatterbox. Voice cloning from uploaded... |
|
Experimental |
| 2503 |
hay/audio2text
Python command line utility wrappers for Whispercpp and other speech-to-text... |
|
Experimental |
| 2504 |
parzibyte/tts-js
Demostración de speechSynthesis con JavaScript: TTS o Síntesis de habla |
|
Experimental |
| 2505 |
robinhad/voice-recognition-ua
Training scripts for Speech-To-Text models for Ukrainian language |
|
Experimental |
| 2506 |
Hamahmi/kaldi-tut
This is a Kaldi tutorial for beginners |
|
Experimental |
| 2507 |
ARK018/multi-voice-sdk
A universal Text-to-Speech (TTS) SDK . Easily generate and manage audio... |
|
Experimental |
| 2508 |
marcominerva/TranslatorService
A lightweight library that uses Cognitive Translator Service for text... |
|
Experimental |
| 2509 |
DillionLowry/NeuralCodecs
Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia |
|
Experimental |
| 2510 |
javichur/fitness-voice
AI voice-controlled trainer in your web browser, using NLP (wit.ai), body... |
|
Experimental |
| 2511 |
lord-lethris/ComfyUI-lethris-dia2
ComfyUI custom nodes for the Dia2 TTS model — generate speech, timestamps,... |
|
Experimental |
| 2512 |
GuruCharan94/az-podcast-transcriber
A podcast transcription service built on Azure that transcribes any new... |
|
Experimental |
| 2513 |
themaxdit1175/soundpad-download-plus-subscription
Get Soundpad Download Plus on GitHub: a complete, high-performance toolkit... |
|
Experimental |
| 2514 |
aflr-archive/apiaudio-python
api.audio Python SDK |
|
Experimental |
| 2515 |
speechly/browser-client-example
A demo app showcasing Speechly browser-client and detailed api responses. |
|
Experimental |
| 2516 |
ElsebaiyMohamed/Modablag
This project presents a comprehensive study on video dubbing techniques and... |
|
Experimental |
| 2517 |
techiaith/docker-huggingface-stt-cy
Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech... |
|
Experimental |
| 2518 |
morfeusys/porfir
Голосовой ассистент Порфирьевич |
|
Experimental |
| 2519 |
azu/vscode-read-aloud-text
VSCode extension that read aloud text like Markdown and text etc... |
|
Experimental |
| 2520 |
StanGirard/quivr-whisper
Talk to your second brain personal assistant using speech 🧠 |
|
Experimental |
| 2521 |
LianjiaTech/bella-whisper
bella-whisper是一系列基于OpenAI... |
|
Experimental |
| 2522 |
Fraunhofer-AISEC/towards-resistant-audio-adversarial-examples
Generation tool for offset-resistant audio adversarial examples against Deepspeech |
|
Experimental |
| 2523 |
alecokas/BiLatticeRNN-Confidence
Confidence Estimation for Black Box Automatic Speech Recognition Systems... |
|
Experimental |
| 2524 |
mikex86/DeepSpeech-Java-Bindings
Java Bindings for the C++ library DeepSpeech |
|
Experimental |
| 2525 |
prohetamine/tor-speech
🔉 Yandex & Google + Tor |
|
Experimental |
| 2526 |
vietai/ASR
End-to-End Vietnamese Speech Recognition using wav2vec 2.0 |
|
Experimental |
| 2527 |
nidi3/swiss-wowbagger
Let yourself be insulted in swiss german. Schöner fluchen auf Berndeutsch. |
|
Experimental |
| 2528 |
simalexan/speechy
Voice command tool for an easy web speech recognition for your web... |
|
Experimental |
| 2529 |
amitpatil321/VoiceForm
Voice Controlled Form, Which can be filled, cleared, submitted using only... |
|
Experimental |
| 2530 |
nalbion/whisper-server
streaming speech to text server using Whisper |
|
Experimental |
| 2531 |
FS-17/SpeechDataBuilder
Browser-based open-source tool for creating high-quality TTS/STT datasets.... |
|
Experimental |
| 2532 |
AceCentre/pasco
Phrase Auditory Scanning COmmunicator - AAC App for iOS and the Web |
|
Experimental |
| 2533 |
taeyoun811/Whisfusion
Whisfusion: Parallel ASR Decoding via a Diffusion Transformer |
|
Experimental |
| 2534 |
yc9701/pansori-tedxkr-corpus
Korean ASR Corpus generated from TEDx talks |
|
Experimental |
| 2535 |
heyfoz/python-youtube-transcription
This repository contains Python scripts and a local Flask web application... |
|
Experimental |
| 2536 |
jianchang512/parakeet-api
一个基于 NVIDIA Parakeet-tdt-0.6b 模型的本地语音转录服务。它提供了一个与 OpenAI API 兼容的接口和一个简洁的 Web 用户界面 |
|
Experimental |
| 2537 |
Leonard2310/LibrAI
iOS app with AI for an immersive audiobook experience, text-to-speech and... |
|
Experimental |
| 2538 |
jreremy/conformer
Pytorch implementation of conformer with with training script for end-to-end... |
|
Experimental |
| 2539 |
cadia-lvl/WebRICE
WebRICE (Web Reader ICE) is an open source web reader in development at... |
|
Experimental |
| 2540 |
Forced-Alignment-and-Vowel-Extraction/fave-asr
Interface for automated transcription and time alignment of conversational... |
|
Experimental |
| 2541 |
alexogeny/cortana
Your own personal assistant thanks to chat-gpt, whisper, and elevenlabs tts |
|
Experimental |
| 2542 |
maetshju/flux-blstm-implementation
An implementation of the Graves & Schmidhuber (2005) bidirectional LSTM in Flux. |
|
Experimental |
| 2543 |
nearkyh/AWS-Polly
How to use Amazon Polly TTS(Text To Speech) |
|
Experimental |
| 2544 |
theamazing0/global-subtitles-main
Closed Captioning Everywhere, With Assembly AI |
|
Experimental |
| 2545 |
speechly/react-client
An React client library for Speechly API |
|
Experimental |
| 2546 |
noir-neo/UniSpeech
iOS speech framework native plugin for Unity |
|
Experimental |
| 2547 |
BleachDev/tts-grabber
Every Google, Azure & IBM text to speech voice for free. |
|
Experimental |
| 2548 |
cosmoquester/speech-recognition
Develop speech recognition models with Tensorflow 2 |
|
Experimental |
| 2549 |
M86xKC/edge-tts
Simple TTS using MS Edge built-in voices |
|
Experimental |
| 2550 |
18F/tts-buy-cloudgov-vulnerability-scanner
Solicitation and acquisition documents created for the cloud.gov... |
|
Experimental |
| 2551 |
yokawasa/vscode-translator-voice
VS Code extension for multi-language text translation and TTS... |
|
Experimental |
| 2552 |
tasmirz/EyeWear
Eyewear with OCR and live WebRTC based calling for the visually impaired.... |
|
Experimental |
| 2553 |
Ralireza/spoken-digit-recognition
Classifying English spoken digit by Hidden Markov Model |
|
Experimental |
| 2554 |
Tristan296/Universal-MacAssistant
Advanced Personal Assistant created for macOS that utilises AppleScripts,... |
|
Experimental |
| 2555 |
Lqm1/openai-workers-ai
A Cloudflare Workers-based, OpenAI-compatible API project that provides... |
|
Experimental |
| 2556 |
Sukumar9944/Speech-to-Text-with-ChatGPT
This Python application combines speech recognition with the power of... |
|
Experimental |
| 2557 |
matusstas/openai-whisper-microservice
This is an OpenAI Whisper automatic speech recognition microservice |
|
Experimental |
| 2558 |
yanorei32/aitalked
W.I.P. GynoidTalk / VOICEROID2 Low-Level Rust Binding Library based on... |
|
Experimental |
| 2559 |
chienhsiang-hung/voice-and-wav-cloning
通過少量語音與影片樣本生成高質量的語音與影片克隆 ( AI 人像口白生成 ),並提供多種音頻處理技術來提升音質和真實感。 |
|
Experimental |
| 2560 |
hi-paris/wavlm-vocoder-french
WavLM-to-Audio neural vocoder for French speech reconstruction — layer... |
|
Experimental |
| 2561 |
ryanlintott/OEVoice
Old English text-to-speech using AVSpeechSynthesis and IPA pronunciations. |
|
Experimental |
| 2562 |
adhadse/Deepdubpy
A complete end-to-end Deep Learning system to generate high quality human... |
|
Experimental |
| 2563 |
ActiveNick/Unity-SpeechWithLUIS
Sample Unity project used to demonstrate the integration of Speech... |
|
Experimental |
| 2564 |
fikrikarim/volocal
Fully local voice AI for iOS |
|
Experimental |
| 2565 |
SohamRatnaparkhi/Voice-Assistant
Voice Assistant coded in Python! |
|
Experimental |
| 2566 |
huuquyet/PhoWhisper-next
Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js |
|
Experimental |
| 2567 |
codekraft-studio/vue-speech
Vue integration and components for the Web Speech API |
|
Experimental |
| 2568 |
Helther/voice-pick-tbot
Text To Speech Synthesis Telegram Bot with voice customization |
|
Experimental |
| 2569 |
systoolz/dosbtalk
unofficial API implementation for Text-to-Speech Engine by First Byte |
|
Experimental |
| 2570 |
i-bardinov/Godot-Android-Text-to-Speech
Godot Android Text to Speech plugin for Godot Engine 3.4 or higher |
|
Experimental |
| 2571 |
WindQAQ/tensorflow-wavenet
Implementation of WaveNet network based on Tensorflow. |
|
Experimental |
| 2572 |
IBM/iot-mic-sts-ifttt-slack
WARNING: This repository is no longer maintained :warning: This repository... |
|
Experimental |
| 2573 |
void-xtreme/audible-text-editor
An automated Sinhala audio Text Editor for visually impaired and blind students |
|
Experimental |
| 2574 |
jaganadhg/nemoexamples
Experiments with NVIDIA NeMo |
|
Experimental |
| 2575 |
hugobloem/esp-ha-speech
Local speech recognition on an ESP32 for Home Assistant |
|
Experimental |
| 2576 |
18F/tts-buy-code-review
Solicitation documents for the code review procurement being undertaken by TTS. |
|
Experimental |
| 2577 |
opensource-spraakherkenning-nl/asr_nl
Dutch Speech Recognition webservice |
|
Experimental |
| 2578 |
jeantimex/F5-TTS-Server
F5-TTS server APIs for voice cloning and text-to-speech generation with... |
|
Experimental |
| 2579 |
chimechallenge/chime-utils
Scripts for data generation, scoring and data manifest preparation for... |
|
Experimental |
| 2580 |
xiaominfc/aliyun_nls_c_demo
阿里云的实时语音识别服务(ASR)没有提供C的SDK,项目中需要,看了它java sdk的实现,就做了个C版demo |
|
Experimental |
| 2581 |
sap1119/voice-agent-0.01
A self-hosted, AI-powered voice assistant system with real-time voice... |
|
Experimental |
| 2582 |
mathquis/node-picotts
SVOX PicoTTS binding for Node.js |
|
Experimental |
| 2583 |
seven-io/go-client
Official Go API Client for seven.io |
|
Experimental |
| 2584 |
Ranjit2111/AI-Interview-Agent
Multi-agent AI system for interview practice. Features adaptive questioning,... |
|
Experimental |
| 2585 |
veralvx/xtts-finetune
XTTS fine-tuning via CLI |
|
Experimental |
| 2586 |
aria-music/zundacord
Japanese Text-to-speech bot for Discord, powered by VOICEVOX |
|
Experimental |
| 2587 |
jlia0/RealityTalk
RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling |
|
Experimental |
| 2588 |
sera619/S4M-2.0
German supported VoiceAssist without BigData |
|
Experimental |
| 2589 |
darsh-1010/Jarvis-A-Voice-Based-Assistant-Powered-by-LLaMA
Jarvis is a voice-based assistant built in Python that simplifies daily... |
|
Experimental |
| 2590 |
MelvilQ/stacksrs
A simple Spaced Repetition app for Android. |
|
Experimental |
| 2591 |
Yukaii/gakuon
Review Anki cards using Generative AI voice |
|
Experimental |
| 2592 |
sinProject-Inc/talk
Listening and Speaking |
|
Experimental |
| 2593 |
EtienneAb3d/SRT-Sync
Synchronize SRT timestamps over an existing accurate transcription |
|
Experimental |
| 2594 |
robotology/natural-speech
This repository contains a codebase to build automatic speech recognition... |
|
Experimental |
| 2595 |
dialpad/mucs_2021_dialpad
Dialpad team's submission to the MUCS 2021 workshop |
|
Experimental |
| 2596 |
NICEElevateAI/ElevateAIJavaSDK
Java SDK for ElevateAI |
|
Experimental |
| 2597 |
lelosaiyan/J.A.R.V.I.S.
A voice virtual desktop assistant for Windows 7/10 |
|
Experimental |
| 2598 |
MazueraAlvaro/speech-recognition-asterisk
A script for speech recognition in asterisk |
|
Experimental |
| 2599 |
speechly/react-example-repo-filtering
An example app for filtering data with Speechly and React |
|
Experimental |
| 2600 |
Babakinha/Dectalk
A Simple package for using Dectalk |
|
Experimental |