mgonzs13/whisper_ros

Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2

57
/ 100
Established

Integrates whisper.cpp (GGML format) with ROS 2's audio_common for unified audio I/O, while SileroVAD preprocesses streams to reduce transcription overhead by detecting voice activity. Supports hardware acceleration via CUDA for both Whisper inference and SileroVAD ONNX runtime, with configurable model paths, sampling parameters, and OpenVINO device selection for cross-platform deployment.

No Package No Dependents
Maintenance 13 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

91

Forks

21

Language

C++

License

MIT

Last pushed

Mar 06, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mgonzs13/whisper_ros"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.