AliDmrcIo/speech_recognition
AI-Powered Speech Recognition & Diarization: A robust Streamlit application leveraging WhisperX and Faster-Whisper for accurate transcription and speaker separation. Features dual-mode processing (Fast/Pro), automatic speaker identification, color-coded Word (.docx) export, and CPU-optimized Docker deployment on AWS EC2.
Stars: —
Forks: —
Language: Python
License: —
Category:
Last pushed: Jan 05, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/AliDmrcIo/speech_recognition"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
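The same request can be made from Python, the repository's language. This is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical. Reading the three path segments as category, owner, and repository is an inference from the URL above, and treating the response body as JSON is an assumption, since the listing does not document the response format.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl command in this listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL in the same shape as the curl example.

    The meaning of the three segments (category/owner/repo) is assumed.
    """
    # Percent-encode each segment so unusual characters cannot alter the path.
    path = "/".join(quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + path


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and parse the body as JSON (assumed format)."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "AliDmrcIo", "speech_recognition")` would issue the same keyless request as the curl command and count against the daily limit. How a key is supplied is not stated here, so the sketch omits it.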
Higher-rated alternatives
zerounintezaragler/whisper_python
Whisper Python: getting text from an audio file no longer requires manual conversion, no...
Dicklesworthstone/franken_whisper
Agent-first Rust ASR orchestration stack: Bayesian backend routing across...
EMUNES/Auto-Subtitle-File-Generation
Automatically generate subtitle files with timelines.
sydkwests/kwest-whisper-analysis
Conducted a comprehensive technical analysis of the Whisper model on M-series hardware,...
atahanuz/yt2text
Extract text from a YouTube video in a single command, using OpenAI's Whisper speech recognition model.