Wav2Vec2 Speech Recognition Transformer Models

Fine-tuning and deployment of Wav2Vec2 models for automatic speech recognition (ASR) tasks, including multilingual and language-specific implementations. Does NOT include general speech-to-text pipelines, voice translation systems, or audio classification without ASR components.

13 Wav2Vec2 speech recognition models are tracked. One scores above 50 (Established tier). The highest-rated is MattyB95/Jabberjay at 53/100, with 5 stars and 620 monthly downloads.

Get all 13 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=wav2vec2-speech-recognition&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
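The same request can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_url` and `fetch_projects` are hypothetical, and because the response schema is not documented here, the payload is returned without assuming any particular fields.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Base endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers",
              subcategory="wav2vec2-speech-recognition",
              limit=20):
    """Return the request URL with the same query parameters as the curl call."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(timeout=30):
    """Fetch the listing and return the decoded JSON payload.

    The response schema is an unknown here, so the payload is returned
    as-is (a dict or a list, whatever the API sends).
    """
    with urlopen(build_url(), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects()` performs one request against the daily quota. Inspect the returned object before relying on specific keys, since none are confirmed by this page.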

#	Model and description	Score	Tier	Stars	Language
1	MattyB95/Jabberjay 🦜 Synthetic Voice Detection	53	Established	5	Python
2	guxm2021/ALT_SpeechBrain [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription	29	Experimental	49	Python
3	balaragavesh/w2vindia w2vindia is a self-supervised Wav2Vec 2.0 Base model pre-trained from...	19	Experimental	—	Python
4	emilykhidirova/speech-emotion-recognition Speech emotion recognition using fine-tuned Wav2Vec2	19	Experimental	—	Python
5	henilp105/TeluguASR Telugu ASR model trained on IIIT Hyderabad ASR Challenge dataset and...	18	Experimental	3	Jupyter Notebook
6	sebinbenjamin/wav2vec_demo A Python tool for transcribing speech from audio files using the Wav2Vec 2.0...	17	Experimental	3	Python
7	subhasis-ai/Hindi-ASR-Wav2Vec2 This repository demonstrates development of Hindi ASR model using transformers.	16	Experimental	4	Jupyter Notebook
8	guxm2021/MM_ALT [MM 2022] MM-ALT: A Multimodal Automatic Lyric Transcription System (Oral,...	15	Experimental	21	Python
9	jvel07/wav2vec2_patho Fine-tuning wav2vec2 for Pathological Speech Processing	15	Experimental	6	Jupyter Notebook
10	hammaad2002/ASRAdversarialAttacks An ASR (Automatic Speech Recognition) adversarial attack repository.	14	Experimental	39	Jupyter Notebook
11	TerboucheHacene/speech-keyword-spotting Speech Keyword detection using Wav2Vec Model	13	Experimental	5	Python
12	maximkm/DLA_ASR_HW ASR pytorch project	12	Experimental	1	Python
13	lucasvmigotto/emotion-analysis Audio emotion classifier with fine tuned openai/whisper-large-v3	11	Experimental	—	Python