Whisper Speech Transcription Transformer Models

Tools and applications for automatic speech recognition (ASR) and audio transcription using Whisper models. Includes implementations with various interfaces (API, GUI, web), fine-tuning for specific languages/accents, and integration with other AI systems. Does NOT include text-to-speech, voice cloning, audio classification without transcription, or general speech processing unrelated to transcription.

There are 21 whisper speech transcription models tracked. The highest-rated is Arkapravo-Ghosh/speech-to-text at 39/100 with 8 stars.

Get all 21 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=whisper-speech-transcription&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Model	Score	Tier	Stars	Language
1	Arkapravo-Ghosh/speech-to-text Speech to Text Transcription using OpenAI Whisper v3 and FastAPI	39	Emerging	8	Python
2	haiodo/oaitt An OpenAI compatible transcriber using transformers and whisperx.	36	Emerging	6	Python
3	purvanshjoshi/IndiVoice-DeepASR Deep Learning framework for Indian-accented Speech-to-Text using Whisper and...	36	Emerging	2	Python
4	biodatlab/thonburian-whisper Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo...	35	Emerging	186	Jupyter Notebook
5	boned-fruitwood759/whisperx-asr-with-fastapi 🎤 Enable real-time speech recognition with WhisperX using FastAPI for...	23	Experimental	1	HTML
6	tomdewildt/whisper-experiment Experiments using the Whisper model from Open AI	22	Experimental	—	Jupyter Notebook
7	scalable-ml-deep-learning/fine_tune_whisper Fine-Tune Whisper for Italian ASR with transformers	20	Experimental	11	Jupyter Notebook
8	svn05/vietnamese-whisper-asr Fine-tuned Whisper for Vietnamese ASR with Librosa preprocessing and Gradio demo.	20	Experimental	1	Python
9	chalotrasahil/AI-Lecture-Studio AI Lecture Studio is an NLP-driven system that transforms audio and video...	19	Experimental	—	Python
10	mahiiyh/asr-primer A complete implementation of an Automatic Speech Recognition (ASR) system...	19	Experimental	—	Jupyter Notebook
11	Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages This project presents a multilingual pipeline for both speech-to-text and...	18	Experimental	3	Jupyter Notebook
12	EdVince/whisper-trtllm Whisper in TensorRT-LLM	16	Experimental	17	C++
13	Nazmul0005/Text2Audio_Audio2Text_Conversion_Using_HuggingFace A demo project showcasing text-to-speech and speech-to-text conversions...	16	Experimental	1	Jupyter Notebook
14	romanyn36/whisperx-asr-with-fastapi WhisperX ASR is a FastAPI-based application for automatic speech...	16	Experimental	1	HTML
15	hasanhalacli/whisper-german-finetuning Fine-tune OpenAI Whisper for German speech recognition using LoRA with audio...	15	Experimental	—	Python
16	ahmedbesbes/audiolizr A bentoML-powered API to transcribe audio and make sense of it	14	Experimental	39	Python
17	xAlpharax/whisper-stt-gradio Gradio Interface for Transcription and Translation using the Whisper Large...	13	Experimental	5	Python
18	RAHB-REALTORS-Association/transcriber-describer Transcribes videos and describes them with OpenAI APIs or local models.	12	Experimental	3	Python
19	thc1006/MTK-Breeze-ASR-25-colab-transcriptor Taiwan Mandarin speech-to-text transcriber using MediaTek Breeze-ASR-25....	11	Experimental	—	Python
20	samratrajsharma/OpenAI-Whisper-Fine-Tuned-ASR-using-LoRA-PEFT End-to-end Hindi Speech AI project for improving ASR accuracy using...	11	Experimental	—	Jupyter Notebook
21	kulsoom-abdullah/Qwen2-VL-Audio-Adapter Architecture grafting: Extending Qwen2-VL with Whisper encoder for speech...	11	Experimental	—	Jupyter Notebook