rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Archived
Implements modular ASR pipelines with CTC loss and greedy decoding, achieving 6.71% WER on LibriSpeech dev-clean without external language models. Built on TensorFlow 2.0 eager execution, it decouples feature extraction, neural network models, and decoders into independent components, so that more sophisticated feature extraction, data augmentation strategies, or language model decoders can be swapped in. Supports distributed training via TensorFlow Strategy and mixed-precision policies for optimized inference.
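To make the "CTC loss and greedy decoding" part of the description concrete, here is a minimal sketch of greedy (best-path) CTC decoding in plain Python. It is not taken from the repository: the function name `greedy_ctc_decode`, the toy two-letter alphabet, and the choice of the last index as the blank symbol are all assumptions made for illustration.

```python
from itertools import groupby
from typing import Sequence


def greedy_ctc_decode(
    frame_scores: Sequence[Sequence[float]],
    alphabet: str,
    blank: int,
) -> str:
    """Greedy (best-path) CTC decoding, illustrative only.

    frame_scores: one row of per-symbol scores for each time frame.
    alphabet:     characters for the non-blank indices (hypothetical layout).
    blank:        index of the CTC blank symbol (assumed here, not from the repo).
    """
    # 1. Pick the highest-scoring symbol index in every frame.
    best_path = [
        max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores
    ]
    # 2. Collapse runs of the same index into a single occurrence.
    collapsed = [index for index, _ in groupby(best_path)]
    # 3. Drop blanks and map the remaining indices to characters.
    return "".join(alphabet[index] for index in collapsed if index != blank)


# Toy input: 7 frames over the symbols "a" (0), "b" (1), and blank (2).
frames = [
    [0.90, 0.05, 0.05],  # a
    [0.80, 0.10, 0.10],  # a (repeat, collapsed)
    [0.10, 0.10, 0.80],  # blank (separates the two "a"s)
    [0.70, 0.20, 0.10],  # a
    [0.10, 0.80, 0.10],  # b
    [0.20, 0.70, 0.10],  # b (repeat, collapsed)
    [0.10, 0.10, 0.80],  # blank
]
decoded = greedy_ctc_decode(frames, alphabet="ab", blank=2)
print(decoded)
```

The blank frame between the second and third frames is what lets the decoder keep two consecutive "a" characters; without it the repeats would collapse into one. No language model is involved at any step, which matches the "without external language models" condition stated for the reported WER.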
223 stars. No commits in the last 6 months.
Stars: 223
Forks: 63
Language: Python
License: AGPL-3.0
Category:
Last pushed: Jun 15, 2020
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rolczynski/Automatic-Speech-Recognition"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models
shenasa-ai/speech2text
A Deep-Learning-Based Persian Speech Recognition System
subho406/TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge...