mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
ArchivedBuilt on Baidu's Deep Speech research and TensorFlow, DeepSpeech implements a recurrent neural network architecture optimized for low-latency inference across heterogeneous hardware. The engine provides pre-trained models, supports custom model training, and exposes APIs for Python, Node.js, and C/C++ integration. Multi-platform support includes Docker containerization and native builds for Linux, macOS, and Windows.
26,741 stars. No commits in the last 6 months.
Stars
26,741
Forks
4,103
Language
C++
License
MPL-2.0
Category
Last pushed
Jun 19, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mozilla/DeepSpeech"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Picovoice/porcupine
On-device wake word detection powered by deep learning
MycroftAI/mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
arcosoph/nanowakeword
A lightweight, open-source, and intelligent wake word detection engine. Train custom,...
OAID/cortex-m-kws
Cortex M KWS example with Tengine Lite.
vineeths96/Spoken-Keyword-Spotting
In this repository, we explore using a hybrid system consisting of a Convolutional Neural...