scionoftech/DeepAsr

Keras (TensorFlow) implementations of Automatic Speech Recognition

Score: 48 / 100 (Emerging)

Implements end-to-end ASR trained with CTC loss, supporting Baidu's DeepSpeech2 and custom architectures, with cuDNN-accelerated LSTMs, multi-GPU distributed training, and mixed-precision training. Includes modular pipeline components for audio feature extraction (MFCC or spectrogram), greedy and beam-search decoders with language model integration, and on-the-fly data generation for large datasets. Targets 16 kHz mono WAV/FLAC input and provides pre-trained models for immediate inference or fine-tuning.
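To illustrate what a greedy CTC decoder does in general, here is a minimal pure-Python sketch. It is not DeepAsr's own API: the function name `ctc_greedy_decode`, the toy alphabet, and the choice of the last index as the blank symbol are all assumptions made for the example.

```python
from itertools import groupby


def ctc_greedy_decode(frame_probs, alphabet, blank_index=None):
    """Greedy (best-path) CTC decoding for a single utterance.

    frame_probs: one list of per-symbol probabilities for each time step.
    alphabet:    string whose i-th character is the label for index i.
    blank_index: index of the CTC blank; assumed to be the last index
                 (len(alphabet)) when not given. This is a convention
                 chosen for this sketch, not one taken from DeepAsr.
    """
    if blank_index is None:
        blank_index = len(alphabet)

    # 1. Pick the most probable symbol at every time step.
    best_path = [
        max(range(len(frame)), key=frame.__getitem__) for frame in frame_probs
    ]
    # 2. Collapse consecutive repeats of the same symbol.
    collapsed = [index for index, _ in groupby(best_path)]
    # 3. Drop blanks and map the remaining indices to characters.
    return "".join(alphabet[i] for i in collapsed if i != blank_index)


# Toy input: 3 labels ("a", "b", "c") plus a blank at index 3.
alphabet = "abc"
frames = [
    [0.10, 0.10, 0.70, 0.10],  # c
    [0.10, 0.10, 0.70, 0.10],  # c (repeat, collapsed)
    [0.10, 0.10, 0.10, 0.70],  # blank
    [0.80, 0.10, 0.05, 0.05],  # a
    [0.10, 0.10, 0.10, 0.70],  # blank (separates the two a's)
    [0.80, 0.10, 0.05, 0.05],  # a
    [0.10, 0.80, 0.05, 0.05],  # b
]
decoded = ctc_greedy_decode(frames, alphabet)
print(decoded)  # caab
```

The blank between the two `a` frames is what lets CTC emit a doubled letter. A beam-search decoder keeps several candidate paths instead of only the single best symbol per frame, which is where a language model can be used to rescore hypotheses.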

No commits in the last 6 months. Available on PyPI.

Stale (6m), No Dependents
Maintenance 0 / 25
Adoption 13 / 25
Maturity 18 / 25
Community 17 / 25


Stars: 24
Forks: 11
Language: Jupyter Notebook
License: AGPL-3.0
Last pushed: Jan 13, 2022
Monthly downloads: 683
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/scionoftech/DeepAsr"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
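The same request can be sketched in Python using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single URL in the curl command, and the response is assumed to be JSON, which this page does not state.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl command above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL (path layout inferred from the one known URL)."""
    return f"{API_BASE}/{quote(category)}/{quote(owner)}/{quote(repo)}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch and parse the quality record.

    Assumes the endpoint returns a JSON body; raises urllib.error.URLError
    on network failure and json.JSONDecodeError if the body is not JSON.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example call (performs a network request, so it is left commented out):
# record = fetch_quality("voice-ai", "scionoftech", "DeepAsr")
# print(record)
```

With the keyless limit of 100 requests/day, it is worth caching the result locally rather than calling the endpoint on every run.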