Open-Speech-EkStep/vakyansh-models
Open source speech to text models for Indic Languages
Built on wav2vec2 and Conformer architectures, the models employ self-supervised pretraining on 34,000+ hours of unlabeled Indic audio followed by language-specific fine-tuning, with cross-lingual representations (CLSRIL-23) enabling transfer across 23 languages. Beyond ASR, the toolkit includes complementary models for punctuation restoration, text-to-speech synthesis, language identification, and gender classification. TorchScript export formats enable deployment across edge and production environments without framework dependencies.
325 stars. No commits in the last 6 months.
Stars
325
Forks
49
Language
—
License
MIT
Category
Last pushed
Sep 16, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Open-Speech-EkStep/vakyansh-models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
liangstein/Chinese-speech-to-text
Chinese Speech To Text Using Wavenet
louiskirsch/speechT
An opensource speech-to-text software written in tensorflow
Open-Speech-EkStep/vakyansh-wav2vec2-experimentation
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
oliverguhr/wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
silversparro/wav2letter.pytorch
A fully convolution-network for speech-to-text, built on pytorch.