dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

/ 100

Established

Implements QuartzNet architecture via NVIDIA NeMo with language model decoding using KenLM for improved accuracy. Trained on ~100 hours of Vietnamese speech (YouTube, radio, call center, TTS, and public datasets), the compact 13M-parameter model achieves fast inference. Offers both batch audio transcription and a Flask web application, with an active PyTorch-based v2.0 branch in development.

165 stars.

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

165

Forks

Language

Python

License

Apache-2.0

Related tools

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

srvk/eesen

The official repository of the Eesen project

sooftware/kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights