kaituoxu/Listen-Attend-Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

Score: 40 / 100 (Emerging)

The implementation uses an encoder-decoder architecture with attention: the listener (encoder) processes acoustic features with bidirectional LSTMs, and the speller (decoder) generates character sequences autoregressively. It relies on Kaldi for acoustic feature extraction and has built-in support for Visdom visualization of training metrics. The repository provides a complete pipeline for data preparation, training, and decoding, with example scripts targeting the AISHELL Mandarin speech corpus.
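To make the attention step concrete, here is a minimal sketch in plain Python of how a speller state could attend over listener outputs at one decoding step. It assumes simple dot-product scoring and uses the hypothetical names `attend`, `decoder_state`, and `listener_outputs`; the repository's actual scoring function and tensor code may differ.

```python
import math

def attend(decoder_state, listener_outputs):
    """One attention step: return (context, weights).

    Assumption: plain dot-product scoring between the decoder state
    and each encoder frame; this is an illustration, not the repo's code.
    """
    # Score every encoder frame against the current decoder state.
    scores = [
        sum(s * h for s, h in zip(decoder_state, frame))
        for frame in listener_outputs
    ]
    # Softmax, subtracting the max score for numerical stability.
    peak = max(scores)
    exps = [math.exp(x - peak) for x in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Context vector: attention-weighted sum of the encoder frames.
    dim = len(listener_outputs[0])
    context = [
        sum(w * frame[i] for w, frame in zip(weights, listener_outputs))
        for i in range(dim)
    ]
    return context, weights

# Three toy encoder frames and one decoder state (made-up values).
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
context, weights = attend([2.0, 0.0], frames)
```

In an autoregressive speller, a context vector like this would be combined with the previous output character to predict the next one, and the step would repeat until an end-of-sequence symbol is produced.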

207 stars. No commits in the last 6 months.

No License | Stale 6m | No Package | No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 22 / 25


Stars: 207
Forks: 56
Language: Python
License: none
Last pushed: Jan 08, 2019
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kaituoxu/Listen-Attend-Spell"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
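The same request can be sketched in Python with only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path pattern is inferred from the single URL shown above, and the JSON response format is an assumption, since the response shape is not documented here.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category, owner, repo):
    """Build the endpoint URL (path pattern inferred from the curl example)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"

def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality data for one repository.

    Assumption: the endpoint returns a JSON body; adjust the parsing
    if the actual response format differs.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "kaituoxu", "Listen-Attend-Spell")` would issue the same GET request as the curl command; with the keyless limit of 100 requests/day, caching the result locally is a sensible precaution.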