louiskirsch/speechT

Open-source speech-to-text software written in TensorFlow

/ 100

Emerging

Implements the Wav2Letter architecture with CTC loss for end-to-end acoustic modeling, achieving an 8% letter error rate on LibriSpeech. Optional KenLM language model integration improves on plain greedy decoding. Provides CLI tools for preprocessing LibriSpeech data, training with TensorBoard monitoring, live microphone transcription, and model evaluation; pre-trained weights are available.
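
To make the terms "greedy decoding" and "letter error rate" concrete, here is a minimal sketch in plain Python. It is not taken from the speechT code base: the function names, the "_" blank symbol, and the toy frame sequence are all assumptions chosen for illustration. Greedy CTC decoding takes the most likely label per audio frame, collapses consecutive repeats, and drops the blank symbol. Letter error rate is the character-level edit distance divided by the reference length.

```python
from itertools import groupby

# Hypothetical blank symbol; the label set used by the project is not shown here.
BLANK = "_"


def ctc_greedy_decode(frame_labels, blank=BLANK):
    """Collapse consecutive repeated labels, then remove blanks."""
    collapsed = (label for label, _ in groupby(frame_labels))
    return "".join(label for label in collapsed if label != blank)


def edit_distance(ref, hyp):
    """Levenshtein distance between two strings, computed row by row."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # deletion
                curr[j - 1] + 1,         # insertion
                prev[j - 1] + (r != h),  # substitution (free on a match)
            ))
        prev = curr
    return prev[-1]


def letter_error_rate(ref, hyp):
    """Character-level edit distance normalised by the reference length."""
    if not ref:
        raise ValueError("reference transcript must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # Toy per-frame argmax labels; a blank between the two "l" runs keeps both.
    frames = list("hh_e_ll_ll_oo")
    print(ctc_greedy_decode(frames))
    print(letter_error_rate("hello", "hallo"))
```

A language model such as KenLM is typically used in a beam search that rescores candidate transcripts, rather than in this one-best-label-per-frame scheme, which is why it can improve on greedy output.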

160 stars. No commits in the last 6 months.

Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25


Stars

160

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

Open-Speech-EkStep/vakyansh-models

Open source speech to text models for Indic Languages

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Repository containing an experimentation platform for training and running inference on wav2vec2 models.

oliverguhr/wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

silversparro/wav2letter.pytorch

A fully convolutional network for speech-to-text, built on PyTorch.
