kaituoxu/Listen-Attend-Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

/ 100

Emerging

The implementation uses an encoder-decoder architecture with an attention mechanism: the listener (encoder) processes acoustic features with bidirectional LSTMs, and the speller (decoder) generates character sequences autoregressively. It integrates with Kaldi for acoustic feature extraction and has built-in support for Visdom visualization of training metrics. The repository provides a complete pipeline for data preparation, training, and decoding, with example scripts targeting the AISHELL Mandarin speech corpus.
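
As a rough illustration of the attention step between listener and speller, here is a minimal pure-Python sketch. It is not code from the repository: it assumes plain dot-product scoring (the repository's attention variant may differ), and the names `softmax` and `attend`, as well as the toy values, are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(decoder_state, encoder_outputs):
    """One decoder step of dot-product attention (an assumed scoring function).

    decoder_state:   list of floats, the speller's current hidden state.
    encoder_outputs: list of frames, each a list of floats from the listener.
    Returns (weights, context): one weight per frame, and their weighted sum.
    """
    # Score each encoder frame against the decoder state.
    scores = [
        sum(d * h for d, h in zip(decoder_state, frame))
        for frame in encoder_outputs
    ]
    weights = softmax(scores)
    # Context vector: attention-weighted average of the encoder frames.
    dim = len(encoder_outputs[0])
    context = [
        sum(w * frame[i] for w, frame in zip(weights, encoder_outputs))
        for i in range(dim)
    ]
    return weights, context


# Toy listener output: three time steps, two features each (made-up values).
encoder_outputs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_state = [2.0, 0.0]

weights, context = attend(decoder_state, encoder_outputs)
```

In an autoregressive speller, a context vector like this would be recomputed at every output step and combined with the previously emitted character to predict the next one.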

207 stars. No commits in the last 6 months.

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 22 / 25

Stars

207

Forks

Language

Python

License

—

Compare

Listen-Attend-Spell and LAS_Mandarin_PyTorch

Listen-Attend-Spell and Listen-Attend-Spell-v2

Higher-rated alternatives

khanld/chunkformer

ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription

sooftware/conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech...

upskyy/Squeezeformer

PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech...

WindQAQ/listen-attend-and-spell

Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project...

jackaduma/LAS_Mandarin_PyTorch

Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)
