mgonzs13/whisper_ros
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
Integrates whisper.cpp (GGML format) with ROS 2's audio_common for unified audio I/O, while SileroVAD preprocesses streams to reduce transcription overhead by detecting voice activity. Supports hardware acceleration via CUDA for both Whisper inference and SileroVAD ONNX runtime, with configurable model paths, sampling parameters, and OpenVINO device selection for cross-platform deployment.
Stars
91
Forks
21
Language
C++
License
MIT
Category
Last pushed
Mar 06, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mgonzs13/whisper_ros"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++
ChetanXpro/nodejs-whisper
NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++...
vilassn/whisper_android
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
sandrohanea/whisper.net
Whisper.net. Speech to text made simple using Whisper Models
mybigday/whisper.rn
React Native binding of whisper.cpp.