HolmesGPT/holmesgpt
SRE Agent - CNCF Sandbox Project
AI-driven agentic loop that autonomously queries live observability data across Kubernetes, cloud providers, databases, and SaaS platforms to diagnose production incidents and identify root causes. Handles petabyte-scale datasets through server-side filtering and memory-safe execution with per-tool limits and streaming output to disk. Integrates with Prometheus, Grafana, Datadog, PagerDuty, Slack, and 20+ platforms via built-in toolsets or custom REST APIs, with optional operator mode for continuous background monitoring and automated remediation.
1,967 stars. Actively maintained with 114 commits in the last 30 days.
Stars
1,967
Forks
258
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 12, 2026
Commits (30d)
114
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/HolmesGPT/holmesgpt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
kubewall/kubewall
kubewall - Single-Binary Kubernetes Dashboard with Multi-Cluster Management & AI Integration....
gofireflyio/aiac
Artificial Intelligence Infrastructure-as-Code Generator.
radareorg/r2ai
LLM-based reversing for radare2
mr-tbot/mesh-api
MESH-API (previously MESH-AI) — Off-Grid AI & API Router with over 30 API extensions for...
volcengine/veaiops
Volcano Engine AIOps Suite, provides cloud customers with out-of-the-box intelligent operations...