liangstein/Chinese-speech-to-text

Chinese Speech To Text Using Wavenet

/ 100

Emerging

Implements WaveNet architecture for character-level Chinese speech recognition trained on the THCHS30 dataset, eliminating the need for word vectorization by operating directly on phonetic units. Built with Keras/TensorFlow backend and achieves 0.2768 CTC loss on 10,000 training samples, though performance degrades significantly in noisy conditions and would benefit from larger, more diverse datasets.

163 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

163

Forks

Language

Python

License

Apache-2.0

Related tools

louiskirsch/speechT

An opensource speech-to-text software written in tensorflow

Open-Speech-EkStep/vakyansh-models

Open source speech to text models for Indic Languages

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

oliverguhr/wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

silversparro/wav2letter.pytorch

A fully convolution-network for speech-to-text, built on pytorch.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights