fizamusthafa/whisper-app
This repository contains a web application for multilingual transcription using OpenAI's Whisper automatic speech recognition (ASR) model. Users upload an audio file in WAV, MP3, or M4A format and receive a transcription in the language spoken. The application is designed with accessibility and data privacy in mind.
Built with Streamlit, the application wraps OpenAI's Whisper model, which detects the spoken language automatically across 99+ languages. The UI provides drag-and-drop audio upload, and transcription runs server-side when triggered from the sidebar controls. For data privacy, temporary files are cleaned up and no user data is stored beyond the transcription session.
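The temporary-file handling described above can be sketched as follows. This is a minimal illustration, not the repository's actual code: the helper names `transcribe_upload` and `whisper_transcriber` are hypothetical, and the default model size `"base"` is an assumption. The only Whisper calls used are `whisper.load_model` and `model.transcribe` from the `openai-whisper` package.

```python
import os
import tempfile
from pathlib import Path
from typing import Callable

# Formats the listing says the app accepts.
ALLOWED_SUFFIXES = {".wav", ".mp3", ".m4a"}


def transcribe_upload(data: bytes, filename: str,
                      transcribe: Callable[[str], dict]) -> dict:
    """Write uploaded bytes to a temp file, transcribe it, then always delete it.

    Hypothetical helper: `transcribe` is any function that takes a file path
    and returns a result dict (for example, one built by whisper_transcriber).
    """
    suffix = Path(filename).suffix.lower()
    if suffix not in ALLOWED_SUFFIXES:
        raise ValueError(f"unsupported audio format: {suffix or filename}")

    # delete=False so the file can be reopened by path on every platform;
    # the finally block below is responsible for removing it.
    tmp = tempfile.NamedTemporaryFile(suffix=suffix, delete=False)
    try:
        tmp.write(data)
        tmp.close()
        return transcribe(tmp.name)
    finally:
        tmp.close()  # closing twice is harmless
        if os.path.exists(tmp.name):
            os.remove(tmp.name)  # no audio is left on disk, even on error


def whisper_transcriber(model_name: str = "base") -> Callable[[str], dict]:
    """Return a path -> result function backed by Whisper (needs openai-whisper)."""
    import whisper  # imported lazily so transcribe_upload works without it

    model = whisper.load_model(model_name)
    # With no `language` argument, Whisper detects the spoken language itself.
    return lambda path: model.transcribe(path)
```

In a Streamlit page, one plausible wiring is to pass the bytes and name of the object returned by `st.file_uploader` to `transcribe_upload`, together with `whisper_transcriber()`, and then display the `"text"` and `"language"` fields of the result. Keeping the deletion in a `finally` block means the audio is removed whether transcription succeeds or raises.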
Stars: 32
Forks: 11
Language: Python
License: not specified
Category: not listed
Last pushed: Feb 12, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/fizamusthafa/whisper-app"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
AbdullahHendy/live-translation
Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client...
Gr122lyBr/voicetag
Speaker identification powered by pyannote and resemblyzer
i4Ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
aws-solutions/content-localization-on-aws
Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated...
vdutts7/ai-rapper
Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise TTS, and OpenCV 🎵