rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Archived

/ 100

Emerging

Implements modular ASR pipelines with CTC loss and greedy decoding, achieving 6.71% WER on LibriSpeech dev-clean without external language models. Built on TensorFlow 2.0 eager execution, it decouples feature extraction, neural network models, and decoders as independent components—allowing swapping of sophisticated feature extraction, data augmentation strategies, or language model decoders. Supports distributed training via TensorFlow Strategy and mixed-precision policies for optimized inference.

223 stars. No commits in the last 6 months.

Archived Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

223

Forks

Language

Python

License

AGPL-3.0

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

shenasa-ai/speech2text

A Deep-Learning-Based Persian Speech Recognition System

subho406/TF-Speech-Recognition-Challenge-Solution

Source code of the model used in Tensorflow Speech Recognition Challenge...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights