dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
Implements QuartzNet architecture via NVIDIA NeMo with language model decoding using KenLM for improved accuracy. Trained on ~100 hours of Vietnamese speech (YouTube, radio, call center, TTS, and public datasets), the compact 13M-parameter model achieves fast inference. Offers both batch audio transcription and a Flask web application, with an active PyTorch-based v2.0 branch in development.
165 stars.
Stars
165
Forks
56
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/dangvansam/viet-asr"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
srvk/eesen
The official repository of the Eesen project
sooftware/kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.