louiskirsch/speechT

Open-source speech-to-text software written in TensorFlow

47 / 100 (Emerging)

Implements the Wav2Letter architecture with CTC loss for end-to-end acoustic modeling, with a reported 8% letter error rate on LibriSpeech. Optional KenLM language model integration improves decoding over the greedy baseline. CLI tools cover LibriSpeech preprocessing, training with TensorBoard monitoring, live microphone transcription, and model evaluation. Pre-trained weights are available.
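As a rough illustration of the two ideas named above, greedy CTC decoding and letter error rate, here is a minimal pure-Python sketch. It is not taken from the speechT code base: the function names, the toy alphabet, and the choice of index 0 for the CTC blank are all assumptions made for the example.

```python
from itertools import groupby

BLANK = 0  # assumption: the CTC blank symbol sits at index 0 of the alphabet


def ctc_greedy_decode(frame_scores, alphabet, blank=BLANK):
    """Take the best label per frame, merge repeated labels, then drop blanks."""
    # Arg-max over each frame's scores gives the single most likely path.
    best_path = [max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores]
    # Consecutive duplicates collapse into one label.
    collapsed = [label for label, _ in groupby(best_path)]
    # Blanks only separate labels; they never appear in the output.
    return "".join(alphabet[label] for label in collapsed if label != blank)


def letter_error_rate(reference, hypothesis):
    """Character-level Levenshtein distance divided by the reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution or match
        previous = current
    return previous[-1] / len(reference)
```

With the toy alphabet "_abc" (blank first), a best path of c, c, blank, a, a, blank, b collapses to "cab". A language model such as KenLM would instead rescore several candidate paths, which this sketch does not attempt.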

160 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 21 / 25


Stars: 160
Forks: 34
Language: Python
License: Apache-2.0
Last pushed: Oct 15, 2022
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/louiskirsch/speechT"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
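The same request can be made from Python using only the standard library. This is a hedged sketch: the helper names are hypothetical, the reading of the URL path as category/owner/repo is inferred from the single example above, and because the response format is not documented here the body is returned as raw text rather than parsed.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base URL copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the category/owner/repo layout is an assumption."""
    segments = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(segments)


def fetch_quality(category, owner, repo, timeout=10):
    """Return the raw response body as text (no response schema is assumed)."""
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example call (performs a network request and counts against the daily limit):
# print(fetch_quality("voice-ai", "louiskirsch", "speechT"))
```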