LAS_Mandarin_PyTorch and Listen-Attend-Spell
These tools are competitors, as `jackaduma/LAS_Mandarin_PyTorch` appears to be a specialized adaptation of the foundational Listen, Attend and Spell (LAS) framework implemented in `kaituoxu/Listen-Attend-Spell`, specifically targeting Chinese Mandarin with a pretrained model.
About LAS_Mandarin_PyTorch
jackaduma/LAS_Mandarin_PyTorch
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)
About Listen-Attend-Spell
kaituoxu/Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
The implementation uses a encoder-decoder architecture with attention mechanisms, where the listener (encoder) processes acoustic features via bidirectional LSTMs and the speller (decoder) generates character sequences autoregressively. It integrates with Kaldi for acoustic feature extraction and includes built-in support for Visdom visualization of training metrics. The framework includes a complete pipeline for data preparation, training, and decoding, with example scripts targeting the AISHELL Mandarin speech corpus.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work