nobody132/masr

中文语音识别; Mandarin Automatic Speech Recognition;

/ 100

Emerging

Built on an end-to-end gated convolutional neural network (inspired by Facebook's Wav2letter but using GLU activation instead of ReLU/HardTanh for faster convergence), MASR trains on the AISHELL-1 dataset (150 hours, 4000+ Chinese characters) and achieves 14% character error rate on test sets. The architecture supports external language models to further reduce error rates to 8%, though performance remains limited compared to industrial systems trained on significantly larger datasets with domain-specific language models.

1,964 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 25 / 25

How are scores calculated?

Stars

1,964

Forks

483

Language

Python

License

—

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights