Whisper Speech Transcription Transformer Models

Tools and applications for automatic speech recognition (ASR) and audio transcription using Whisper models. Includes implementations with various interfaces (API, GUI, web), fine-tuning for specific languages/accents, and integration with other AI systems. Does NOT include text-to-speech, voice cloning, audio classification without transcription, or general speech processing unrelated to transcription.

There are 21 whisper speech transcription models tracked. The highest-rated is Arkapravo-Ghosh/speech-to-text at 39/100 with 8 stars.

Get all 21 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=whisper-speech-transcription&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 Arkapravo-Ghosh/speech-to-text

Speech to Text Transcription using OpenAI Whisper v3 and FastAPI

39
Emerging
2 haiodo/oaitt

An OpenAI compatible transcriber using transformers and whisperx.

36
Emerging
3 purvanshjoshi/IndiVoice-DeepASR

Deep Learning framework for Indian-accented Speech-to-Text using Whisper and...

36
Emerging
4 biodatlab/thonburian-whisper

Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo...

35
Emerging
5 boned-fruitwood759/whisperx-asr-with-fastapi

🎤 Enable real-time speech recognition with WhisperX using FastAPI for...

23
Experimental
6 tomdewildt/whisper-experiment

Experiments using the Whisper model from Open AI

22
Experimental
7 scalable-ml-deep-learning/fine_tune_whisper

Fine-Tune Whisper for Italian ASR with transformers

20
Experimental
8 svn05/vietnamese-whisper-asr

Fine-tuned Whisper for Vietnamese ASR with Librosa preprocessing and Gradio demo.

20
Experimental
9 chalotrasahil/AI-Lecture-Studio

AI Lecture Studio is an NLP-driven system that transforms audio and video...

19
Experimental
10 mahiiyh/asr-primer

A complete implementation of an Automatic Speech Recognition (ASR) system...

19
Experimental
11 Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages

This project presents a multilingual pipeline for both speech-to-text and...

18
Experimental
12 EdVince/whisper-trtllm

Whisper in TensorRT-LLM

16
Experimental
13 Nazmul0005/Text2Audio_Audio2Text_Conversion_Using_HuggingFace

A demo project showcasing text-to-speech and speech-to-text conversions...

16
Experimental
14 romanyn36/whisperx-asr-with-fastapi

WhisperX ASR is a FastAPI-based application for automatic speech...

16
Experimental
15 hasanhalacli/whisper-german-finetuning

Fine-tune OpenAI Whisper for German speech recognition using LoRA with audio...

15
Experimental
16 ahmedbesbes/audiolizr

A bentoML-powered API to transcribe audio and make sense of it

14
Experimental
17 xAlpharax/whisper-stt-gradio

Gradio Interface for Transcription and Translation using the Whisper Large...

13
Experimental
18 RAHB-REALTORS-Association/transcriber-describer

Transcribes videos and describes them with OpenAI APIs or local models.

12
Experimental
19 thc1006/MTK-Breeze-ASR-25-colab-transcriptor

Taiwan Mandarin speech-to-text transcriber using MediaTek Breeze-ASR-25....

11
Experimental
20 samratrajsharma/OpenAI-Whisper-Fine-Tuned-ASR-using-LoRA-PEFT

End-to-end Hindi Speech AI project for improving ASR accuracy using...

11
Experimental
21 kulsoom-abdullah/Qwen2-VL-Audio-Adapter

Architecture grafting: Extending Qwen2-VL with Whisper encoder for speech...

11
Experimental