mozilla/DeepSpeech

DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.

Archived

Emerging

Built on Baidu's Deep Speech research and TensorFlow, DeepSpeech implements a recurrent neural network architecture optimized for low-latency inference across heterogeneous hardware. The engine provides pre-trained models, supports custom model training, and exposes APIs for Python, Node.js, and C/C++ integration. Multi-platform support includes Docker containerization and native builds for Linux, macOS, and Windows.
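
As a rough illustration of the Python API mentioned above, the following sketch transcribes a single WAV file. It assumes the `deepspeech` 0.9.x package and NumPy are installed and that a pre-trained model file has been downloaded separately. The helper names `read_pcm16` and `transcribe` are hypothetical and chosen for this example only. Treat it as a starting point, not a reference implementation.

```python
import wave


def read_pcm16(path):
    """Return (sample_rate, raw_bytes) for a mono 16-bit PCM WAV file."""
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected mono 16-bit PCM audio")
        return wav.getframerate(), wav.readframes(wav.getnframes())


def transcribe(model_path, wav_path, scorer_path=None):
    """Transcribe one WAV file; all paths are supplied by the caller."""
    # Imported lazily so the WAV helper works without these packages installed.
    import numpy as np
    from deepspeech import Model

    model = Model(model_path)
    if scorer_path is not None:
        # Optional external language-model scorer.
        model.enableExternalScorer(scorer_path)

    rate, pcm = read_pcm16(wav_path)
    if rate != model.sampleRate():
        raise ValueError(f"model expects {model.sampleRate()} Hz, got {rate} Hz")

    # stt() takes a NumPy array of 16-bit samples and returns the transcript.
    return model.stt(np.frombuffer(pcm, dtype=np.int16))
```

A call might look like `transcribe("model.pbmm", "audio.wav")`, where both file names are placeholders. The sample-rate check is a design choice for this sketch: it fails early instead of resampling, since audio at the wrong rate tends to produce poor transcripts.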

26,741 stars. No commits in the last 6 months.

Archived, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25

Stars 26,741

Forks 4,103

Language C++

License MPL-2.0

Featured in

Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Higher-rated alternatives

Picovoice/porcupine

On-device wake word detection powered by deep learning

MycroftAI/mycroft-precise

A lightweight, simple-to-use, RNN wake word listener

arcosoph/nanowakeword

A lightweight, open-source, and intelligent wake word detection engine. Train custom,...

OAID/cortex-m-kws

Cortex M KWS example with Tengine Lite.

vineeths96/Spoken-Keyword-Spotting

In this repository, we explore using a hybrid system consisting of a Convolutional Neural...
