Wav2Vec2 ASR Models Voice AI Tools
Fine-tuning frameworks and implementations of Wav2Vec 2.0 for automatic speech recognition across languages. Does NOT include general ASR systems using other architectures (WaveNet, etc.), TTS, or non-ASR applications of Wav2Vec.
There are 46 wav2vec2 asr models tools tracked. The highest-rated is liangstein/Chinese-speech-to-text at 41/100 with 163 stars.
Get all 46 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=wav2vec2-asr-models&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
liangstein/Chinese-speech-to-text
Chinese Speech To Text Using Wavenet |
|
Emerging |
| 2 |
louiskirsch/speechT
An opensource speech-to-text software written in tensorflow |
|
Emerging |
| 3 |
oliverguhr/wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model. |
|
Emerging |
| 4 |
Open-Speech-EkStep/vakyansh-wav2vec2-experimentation
Repository containing experimentation platform on how to train, infer on... |
|
Emerging |
| 5 |
Open-Speech-EkStep/vakyansh-models
Open source speech to text models for Indic Languages |
|
Emerging |
| 6 |
silversparro/wav2letter.pytorch
A fully convolution-network for speech-to-text, built on pytorch. |
|
Emerging |
| 7 |
juliuskunze/speechless
Speech-to-text based on wav2letter built for transfer learning |
|
Emerging |
| 8 |
m3hrdadfi/soxan
Wav2Vec for speech recognition, classification, and audio classification |
|
Emerging |
| 9 |
mailong25/self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework |
|
Emerging |
| 10 |
bhattbhavesh91/wav2vec2-huggingface-demo
Speech to Text with self-supervised learning based on wav2vec 2.0 framework... |
|
Emerging |
| 11 |
loretoparisi/wave2vec-recognize-docker
Wave2vec 2.0 Recognize pipeline |
|
Emerging |
| 12 |
HarunoriKawano/Wav2vec2.0
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised... |
|
Emerging |
| 13 |
khanld/ASR-Wav2vec-Finetune
:zap: Finetune Wa2vec 2.0 For Speech Recognition |
|
Emerging |
| 14 |
LearnedVector/Wav2Letter
Speech Recognition model based off of FAIR research paper built using Pytorch. |
|
Experimental |
| 15 |
Hamtech-ai/wav2vec2-fa
fine-tune Wav2vec2. an ASR model released by Facebook |
|
Experimental |
| 16 |
phanxuanphucnd/wav2asr
A library version of wav2vec 2.0 framework for Automatic Speech Recognition task. |
|
Experimental |
| 17 |
daanzu/wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct... |
|
Experimental |
| 18 |
moxeeem/ASR-pronunciation-correction
Этот проект представляет систему автоматической коррекции произношения на... |
|
Experimental |
| 19 |
ttop32/wav2vec2-live-japanese-translator
real time japanese speech recognition translator using wav2vec2 |
|
Experimental |
| 20 |
khanld/Wav2vec2-Pretraining
Wav2vec 2.0 Self-Supervised Pretraining |
|
Experimental |
| 21 |
baocin/hugging_face_example_STT_api
Demonstration of Hugging Face's (https://huggingface.co/) newly released... |
|
Experimental |
| 22 |
oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation
This repository contains scripts to prune Wav2vec2 using a... |
|
Experimental |
| 23 |
seanghay/wav2vec2-khmer-openslr
Wav2Vec2 with OpenSLR 42 (Khmer language) |
|
Experimental |
| 24 |
vietai/ASR
End-to-End Vietnamese Speech Recognition using wav2vec 2.0 |
|
Experimental |
| 25 |
HySonLab/EntityKG
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech |
|
Experimental |
| 26 |
KrishnaDN/BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder... |
|
Experimental |
| 27 |
mpoyraz/wav2vec2-turkish
Turkish Speech Recognition using Facebook's Wav2vec 2.0 models |
|
Experimental |
| 28 |
vadimkantorov/inferspeech
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter... |
|
Experimental |
| 29 |
elerdg/ASR-for-low-resource-languages
Fine-tune wav2vec2-xls-r on data from low-resource-languages |
|
Experimental |
| 30 |
Dhruv16S/Transcribing-Video-to-Text
This repository is an implementation of the Wav2Vec2 model for converting... |
|
Experimental |
| 31 |
imvladikon/wav2vec2-hebrew
Speech Recognition for Hebrew (using wav2vec2 models) |
|
Experimental |
| 32 |
EN10/Speech-to-Text-WaveNet
Speech to Text |
|
Experimental |
| 33 |
ranchlai/wav2vec-2.0
Wav2vec2 English speech recognition in PaddlePaddle |
|
Experimental |
| 34 |
Ronnie-Leon76/Swahili-ASR
This repository contains the code for fine-tuning the XLS-R Wav2Vec2 model... |
|
Experimental |
| 35 |
egorsmkv/w2v2-bert-aligner
Aligner for wav2vec2-bert models |
|
Experimental |
| 36 |
nicolas-dufour/self-supervised-low-res-speech
This project transfert the self supervised Wav2vec2 representation to low... |
|
Experimental |
| 37 |
Narasimha1997/wavenet-stt
An end-to-end speech recognition system with Wavenet. Built using C++ and python. |
|
Experimental |
| 38 |
Bushramjad/XLSR-Wav2Vec2-Speech-Recognition-Urdu
Speech Recognition in Urdu language by fine-tuning the pretrained... |
|
Experimental |
| 39 |
navalnica/wav2vec2-belarusian
Speech to Text model for Belarusian language |
|
Experimental |
| 40 |
RaggioAI/dondza-xitsonga-asr-wav2vec2
Dondza-Xitsonga Wav2Vec2 é um modelo de Reconhecimento Automático de Fala em... |
|
Experimental |
| 41 |
theolepage/wavlm_ssl_sv
SOTA method for self-supervised speaker verification leveraging a... |
|
Experimental |
| 42 |
erfanashams/w2v2viz
A domain-informed probe visualiser trained on wav2vec 2.0 representations. |
|
Experimental |
| 43 |
rodrigues-aline/wav2vec2_interpretation
Investigating wav2vec2 context representations and the effects of fine-tuning |
|
Experimental |
| 44 |
dsalnikov/wav2vec
pure numpy implementation of wav2vec 2.0 |
|
Experimental |
| 45 |
ahammedrohit/Speech-Recognition-using-wav2vec2-with-minimum-GPU
Python Colab for speech recognition with wav2vec2. Since wav2vec2 requires... |
|
Experimental |
| 46 |
mead-ml/audio8
Deep audio modeling |
|
Experimental |