zzw922cn/Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

50
/ 100
Established

Implements multiple RNN architectures (BiRNN, DBiRNN, ResNet) with configurable cell types (LSTM, GRU) and includes DeepSpeech2 and Capsule Network models for character and phoneme-level recognition. Provides preprocessing pipelines for TIMIT, LibriSpeech, and WSJ datasets with Layer Normalization RNN variants for improved training efficiency, plus an integrated language modeling module for decoding.

2,839 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 24 / 25

How are scores calculated?

Stars

2,839

Forks

534

Language

Python

License

MIT

Last pushed

Mar 24, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/zzw922cn/Automatic_Speech_Recognition"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.