kaituoxu/Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
The implementation uses a encoder-decoder architecture with attention mechanisms, where the listener (encoder) processes acoustic features via bidirectional LSTMs and the speller (decoder) generates character sequences autoregressively. It integrates with Kaldi for acoustic feature extraction and includes built-in support for Visdom visualization of training metrics. The framework includes a complete pipeline for data preparation, training, and decoding, with example scripts targeting the AISHELL Mandarin speech corpus.
207 stars. No commits in the last 6 months.
Stars
207
Forks
56
Language
Python
License
—
Category
Last pushed
Jan 08, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kaituoxu/Listen-Attend-Spell"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
khanld/chunkformer
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech...
upskyy/Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech...
WindQAQ/listen-attend-and-spell
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project...
jackaduma/LAS_Mandarin_PyTorch
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)