Whisper Speech Transcription Transformer Models
Tools and applications for automatic speech recognition (ASR) and audio transcription using Whisper models. Includes implementations with various interfaces (API, GUI, web), fine-tuning for specific languages/accents, and integration with other AI systems. Does NOT include text-to-speech, voice cloning, audio classification without transcription, or general speech processing unrelated to transcription.
There are 21 whisper speech transcription models tracked. The highest-rated is Arkapravo-Ghosh/speech-to-text at 39/100 with 8 stars.
Get all 21 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=whisper-speech-transcription&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
Arkapravo-Ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI |
|
Emerging |
| 2 |
haiodo/oaitt
An OpenAI compatible transcriber using transformers and whisperx. |
|
Emerging |
| 3 |
purvanshjoshi/IndiVoice-DeepASR
Deep Learning framework for Indian-accented Speech-to-Text using Whisper and... |
|
Emerging |
| 4 |
biodatlab/thonburian-whisper
Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo... |
|
Emerging |
| 5 |
boned-fruitwood759/whisperx-asr-with-fastapi
🎤 Enable real-time speech recognition with WhisperX using FastAPI for... |
|
Experimental |
| 6 |
tomdewildt/whisper-experiment
Experiments using the Whisper model from Open AI |
|
Experimental |
| 7 |
scalable-ml-deep-learning/fine_tune_whisper
Fine-Tune Whisper for Italian ASR with transformers |
|
Experimental |
| 8 |
svn05/vietnamese-whisper-asr
Fine-tuned Whisper for Vietnamese ASR with Librosa preprocessing and Gradio demo. |
|
Experimental |
| 9 |
chalotrasahil/AI-Lecture-Studio
AI Lecture Studio is an NLP-driven system that transforms audio and video... |
|
Experimental |
| 10 |
mahiiyh/asr-primer
A complete implementation of an Automatic Speech Recognition (ASR) system... |
|
Experimental |
| 11 |
Arnav-Sharmaa/Multilingual-Speech-to-Text-and-Speech-to-Speech-Content-Summarization-for-Indian-Languages
This project presents a multilingual pipeline for both speech-to-text and... |
|
Experimental |
| 12 |
EdVince/whisper-trtllm
Whisper in TensorRT-LLM |
|
Experimental |
| 13 |
Nazmul0005/Text2Audio_Audio2Text_Conversion_Using_HuggingFace
A demo project showcasing text-to-speech and speech-to-text conversions... |
|
Experimental |
| 14 |
romanyn36/whisperx-asr-with-fastapi
WhisperX ASR is a FastAPI-based application for automatic speech... |
|
Experimental |
| 15 |
hasanhalacli/whisper-german-finetuning
Fine-tune OpenAI Whisper for German speech recognition using LoRA with audio... |
|
Experimental |
| 16 |
ahmedbesbes/audiolizr
A bentoML-powered API to transcribe audio and make sense of it |
|
Experimental |
| 17 |
xAlpharax/whisper-stt-gradio
Gradio Interface for Transcription and Translation using the Whisper Large... |
|
Experimental |
| 18 |
RAHB-REALTORS-Association/transcriber-describer
Transcribes videos and describes them with OpenAI APIs or local models. |
|
Experimental |
| 19 |
thc1006/MTK-Breeze-ASR-25-colab-transcriptor
Taiwan Mandarin speech-to-text transcriber using MediaTek Breeze-ASR-25.... |
|
Experimental |
| 20 |
samratrajsharma/OpenAI-Whisper-Fine-Tuned-ASR-using-LoRA-PEFT
End-to-end Hindi Speech AI project for improving ASR accuracy using... |
|
Experimental |
| 21 |
kulsoom-abdullah/Qwen2-VL-Audio-Adapter
Architecture grafting: Extending Qwen2-VL with Whisper encoder for speech... |
|
Experimental |