Wav2Vec2 Speech Recognition Transformer Models
Fine-tuning and deployment of Wav2Vec2 models for automatic speech recognition (ASR) tasks, including multilingual and language-specific implementations. Does NOT include general speech-to-text pipelines, voice translation systems, or audio classification without ASR components.
There are 13 wav2vec2 speech recognition models tracked. 1 score above 50 (established tier). The highest-rated is MattyB95/Jabberjay at 53/100 with 5 stars and 620 monthly downloads.
Get all 13 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=wav2vec2-speech-recognition&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
MattyB95/Jabberjay
🦜 Synthetic Voice Detection |
|
Established |
| 2 |
guxm2021/ALT_SpeechBrain
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription |
|
Experimental |
| 3 |
balaragavesh/w2vindia
w2vindia is a self-supervised Wav2Vec 2.0 Base model pre-trained from... |
|
Experimental |
| 4 |
emilykhidirova/speech-emotion-recognition
Speech emotion recognition using fine-tuned Wav2Vec2 |
|
Experimental |
| 5 |
henilp105/TeluguASR
Telugu ASR model trained on IIIT Hyderabad ASR Challenge dataset and... |
|
Experimental |
| 6 |
sebinbenjamin/wav2vec_demo
A Python tool for transcribing speech from audio files using the Wav2Vec 2.0... |
|
Experimental |
| 7 |
subhasis-ai/Hindi-ASR-Wav2Vec2
This repository demonstrates development of Hindi ASR model using transformers. |
|
Experimental |
| 8 |
guxm2021/MM_ALT
[MM 2022] MM-ALT: A Multimodal Automatic Lyric Transcription System (Oral,... |
|
Experimental |
| 9 |
jvel07/wav2vec2_patho
Fine-tuning wav2vec2 to for Pathological Speech Processing |
|
Experimental |
| 10 |
hammaad2002/ASRAdversarialAttacks
An ASR (Automatic Speech Recognition) adversarial attack repository. |
|
Experimental |
| 11 |
TerboucheHacene/speech-keyword-spotting
Speech Keyword detection using Wav2Vec Model |
|
Experimental |
| 12 |
maximkm/DLA_ASR_HW
ASR pytorch project |
|
Experimental |
| 13 |
lucasvmigotto/emotion-analysis
Audio emotion classifier with fine tuned openai/whisper-large-v3 |
|
Experimental |