fizamusthafa/whisper-app

This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.

/ 100

Emerging

Built with Streamlit, the application wraps OpenAI's Whisper model with automatic language detection across 99+ languages and handles temporary file cleanup to enforce data privacy. The UI provides drag-and-drop audio upload with server-side transcription triggered via sidebar controls, storing no persistent user data beyond the transcription session.

No License No Package No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

AbdullahHendy/live-translation

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client...

Gr122lyBr/voicetag

Speaker identification powered by pyannote and resemblyzer

i4Ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

aws-solutions/content-localization-on-aws

Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated...

vdutts7/ai-rapper

Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise TTS, and OpenCV 🎵

Explore Voice AI Tools

All categories Trending Voice AI directory Insights