pannous/tensorflow-speech-recognition

🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks


Established

Implements end-to-end speech-to-text by feeding audio spectrograms through sequence-to-sequence models built from LSTM and dense layers, with real-time microphone input via PyAudio. The project includes modular training pipelines and toy classifiers for numbers and speakers, and is designed as a standalone Linux solution that draws on public datasets such as OpenSLR. **Note: This is archived, educational code. The maintainers recommend Mozilla DeepSpeech or OpenAI Whisper for production use.**
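To make the "audio spectrograms" input concrete, here is a minimal sketch of turning a waveform into a magnitude spectrogram. It is not taken from the project: the function names, frame length, and hop size are illustrative assumptions, and a naive DFT from the standard library stands in for the FFT routines a real pipeline would use.

```python
import cmath
import math


def frame_signal(samples, frame_len, hop):
    """Split a list of samples into overlapping fixed-length frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for the non-negative frequency bins."""
    n = len(frame)
    # A Hann window reduces spectral leakage at the frame edges.
    windowed = [x * 0.5 * (1 - math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(windowed)))
            for k in range(n // 2 + 1)]


def spectrogram(samples, frame_len=64, hop=32):
    """One magnitude spectrum per frame: a time-by-frequency grid."""
    return [magnitude_spectrum(f)
            for f in frame_signal(samples, frame_len, hop)]


# Synthetic input (an assumption for illustration): a 1 kHz sine at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(512)]
spec = spectrogram(tone)

# Each bin spans sample_rate / frame_len = 125 Hz, so the tone peaks in bin 8.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
print(len(spec), len(spec[0]), peak_bin)
```

Each row of `spec` is one time step, which is the kind of sequence an LSTM-based model would consume frame by frame. Audio captured from a microphone would replace the synthetic `tone` list.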

2,176 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25


Stars: 2,176

Forks: 633

Language: Python

License: not specified

Related tools

espnet/espnet

End-to-End Speech Processing Toolkit

yeyupiaoling/PPASR

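End-to-end Chinese speech recognition built on PaddlePaddle, from beginner tutorials to hands-on practice, with very simple introductory examples and practical enterprise projects. Supports the currently most popular models: DeepSpeech2, Conformer, and Squeezeformer.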

flashlight/wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

yeyupiaoling/PaddlePaddle-DeepSpeech

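Speech recognition built on PaddlePaddle, focused on Chinese. A complete project with good recognition accuracy. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.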

philipperemy/deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.
