Wav2Vec2 ASR Models Voice AI Tools

Fine-tuning frameworks and implementations of Wav2Vec 2.0 for automatic speech recognition across languages. Does NOT include general ASR systems using other architectures (WaveNet, etc.), TTS, or non-ASR applications of Wav2Vec.

46 Wav2Vec2 ASR model tools are tracked. The highest-rated is liangstein/Chinese-speech-to-text, which scores 41/100 and has 163 stars.

Get all 46 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=wav2vec2-asr-models&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
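The same request can be made from a script. The sketch below is a minimal, hedged example using only the Python standard library. The helper names `build_url` and `fetch_projects` are hypothetical and are not part of the API. The response schema is not documented on this page, so the decoded JSON is returned as-is and no particular keys are assumed. The default `limit` of 20 mirrors the curl command; whether a larger value returns all 46 projects in one call is not stated here.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai", subcategory="wav2vec2-asr-models", limit=20):
    """Build the query URL with the same parameters as the curl command."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and decode the response body as JSON.

    The response schema is an unknown here, so the decoded value is
    returned unchanged instead of being indexed by assumed keys.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects(build_url())` performs the keyless request, which counts against the 100 requests/day limit mentioned above.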

# Tool Description Score Tier
1 liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

41
Emerging
2 louiskirsch/speechT

Open-source speech-to-text software written in TensorFlow

40
Emerging
3 oliverguhr/wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

39
Emerging
4 Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Repository containing experimentation platform on how to train, infer on...

39
Emerging
5 Open-Speech-EkStep/vakyansh-models

Open source speech to text models for Indic Languages

39
Emerging
6 silversparro/wav2letter.pytorch

A fully convolutional network for speech-to-text, built on PyTorch.

38
Emerging
7 juliuskunze/speechless

Speech-to-text based on wav2letter built for transfer learning

38
Emerging
8 m3hrdadfi/soxan

Wav2Vec for speech recognition, classification, and audio classification

37
Emerging
9 mailong25/self-supervised-speech-recognition

speech to text with self-supervised learning based on wav2vec 2.0 framework

35
Emerging
10 bhattbhavesh91/wav2vec2-huggingface-demo

Speech to Text with self-supervised learning based on wav2vec 2.0 framework...

34
Emerging
11 loretoparisi/wave2vec-recognize-docker

Wav2vec 2.0 recognition pipeline

33
Emerging
12 HarunoriKawano/Wav2vec2.0

Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised...

32
Emerging
13 khanld/ASR-Wav2vec-Finetune

Fine-tune Wav2vec 2.0 for Speech Recognition

31
Emerging
14 LearnedVector/Wav2Letter

Speech recognition model based on a FAIR research paper, built using PyTorch.

29
Experimental
15 Hamtech-ai/wav2vec2-fa

Fine-tune Wav2vec2, an ASR model released by Facebook

28
Experimental
16 phanxuanphucnd/wav2asr

A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.

27
Experimental
17 daanzu/wav2vec2_stt_python

Simple Python library, distributed via binary wheels with few direct...

26
Experimental
18 moxeeem/ASR-pronunciation-correction

This project presents a system for automatic pronunciation correction...

25
Experimental
19 ttop32/wav2vec2-live-japanese-translator

Real-time Japanese speech recognition translator using wav2vec2

24
Experimental
20 khanld/Wav2vec2-Pretraining

Wav2vec 2.0 Self-Supervised Pretraining

24
Experimental
21 baocin/hugging_face_example_STT_api

Demonstration of Hugging Face's (https://huggingface.co/) newly released...

24
Experimental
22 oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation

This repository contains scripts to prune Wav2vec2 using a...

23
Experimental
23 seanghay/wav2vec2-khmer-openslr

Wav2Vec2 with OpenSLR 42 (Khmer language)

23
Experimental
24 vietai/ASR

End-to-End Vietnamese Speech Recognition using wav2vec 2.0

22
Experimental
25 HySonLab/EntityKG

wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech

22
Experimental
26 KrishnaDN/BERTphone

Implementation of the paper "BERTphone: Phonetically-aware Encoder...

22
Experimental
27 mpoyraz/wav2vec2-turkish

Turkish Speech Recognition using Facebook's Wav2vec 2.0 models

22
Experimental
28 vadimkantorov/inferspeech

PyTorch speech2text inference script for the NVidia openseq2seq wav2letter...

21
Experimental
29 elerdg/ASR-for-low-resource-languages

Fine-tune wav2vec2-xls-r on data from low-resource-languages

19
Experimental
30 Dhruv16S/Transcribing-Video-to-Text

This repository is an implementation of the Wav2Vec2 model for converting...

18
Experimental
31 imvladikon/wav2vec2-hebrew

Speech Recognition for Hebrew (using wav2vec2 models)

17
Experimental
32 EN10/Speech-to-Text-WaveNet

Speech to Text

17
Experimental
33 ranchlai/wav2vec-2.0

Wav2vec2 English speech recognition in PaddlePaddle

16
Experimental
34 Ronnie-Leon76/Swahili-ASR

This repository contains the code for fine-tuning the XLS-R Wav2Vec2 model...

16
Experimental
35 egorsmkv/w2v2-bert-aligner

Aligner for wav2vec2-bert models

16
Experimental
36 nicolas-dufour/self-supervised-low-res-speech

This project transfers the self-supervised Wav2vec2 representation to low...

16
Experimental
37 Narasimha1997/wavenet-stt

An end-to-end speech recognition system with Wavenet. Built using C++ and python.

15
Experimental
38 Bushramjad/XLSR-Wav2Vec2-Speech-Recognition-Urdu

Speech Recognition in Urdu language by fine-tuning the pretrained...

15
Experimental
39 navalnica/wav2vec2-belarusian

Speech to Text model for Belarusian language

15
Experimental
40 RaggioAI/dondza-xitsonga-asr-wav2vec2

Dondza-Xitsonga Wav2Vec2 is an Automatic Speech Recognition model in...

15
Experimental
41 theolepage/wavlm_ssl_sv

SOTA method for self-supervised speaker verification leveraging a...

13
Experimental
42 erfanashams/w2v2viz

A domain-informed probe visualiser trained on wav2vec 2.0 representations.

13
Experimental
43 rodrigues-aline/wav2vec2_interpretation

Investigating wav2vec2 context representations and the effects of fine-tuning

13
Experimental
44 dsalnikov/wav2vec

Pure NumPy implementation of wav2vec 2.0

12
Experimental
45 ahammedrohit/Speech-Recognition-using-wav2vec2-with-minimum-GPU

Python Colab for speech recognition with wav2vec2. Since wav2vec2 requires...

11
Experimental
46 mead-ml/audio8

Deep audio modeling

10
Experimental