Open-Speech-EkStep/vakyansh-models

Open source speech to text models for Indic Languages

/ 100

Emerging

Built on wav2vec2 and Conformer architectures, the models employ self-supervised pretraining on 34,000+ hours of unlabeled Indic audio followed by language-specific fine-tuning, with cross-lingual representations (CLSRIL-23) enabling transfer across 23 languages. Beyond ASR, the toolkit includes complementary models for punctuation restoration, text-to-speech synthesis, language identification, and gender classification. TorchScript export formats enable deployment across edge and production environments without framework dependencies.

325 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

325

Forks

Language

—

License

MIT

Higher-rated alternatives

liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

louiskirsch/speechT

An opensource speech-to-text software written in tensorflow

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

oliverguhr/wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

silversparro/wav2letter.pytorch

A fully convolution-network for speech-to-text, built on pytorch.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights