SRE Incident Automation AI Agents
AI agents for autonomous incident detection, root cause analysis, and remediation in production environments. Focuses on SRE-specific tools that integrate with observability platforms and cloud infrastructure. Does NOT include general monitoring dashboards, anomaly detection platforms without remediation, or incident classification frameworks.
45 SRE incident automation agents are tracked. Two score above 50 (Established tier). The highest-rated is scitix/siclaw at 53/100, with 69 stars and 550 monthly downloads.
Get the tracked projects as JSON (the example request below sets limit=20):
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=agents&subcategory=sre-incident-automation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
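The same request can be sketched in Python using only the standard library. The endpoint and query parameters are copied from the curl command above; the helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the JSON response is not documented here, so the sketch returns the parsed payload without assuming any fields. Whether a `limit` above 20 is accepted, and how a key is supplied, are also not stated in the source.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example; everything else is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="agents", subcategory="sre-incident-automation", limit=20):
    """Build the dataset URL with the same query parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=10.0):
    """Fetch and parse the JSON payload (hypothetical helper).

    The response schema is an unknown here, so the parsed object is
    returned as-is instead of being indexed by guessed field names.
    """
    with urllib.request.urlopen(build_url(limit=limit), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example usage (performs a network call, so it is left commented out):
# data = fetch_projects(limit=20)
# print(type(data))
```

Keeping URL construction separate from the network call makes it easy to check the request offline and to stay within the 100 requests/day keyless quota while experimenting.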
| # | Agent | Score | Tier |
|---|---|---|---|
| 1 | scitix/siclaw: AI-powered SRE platform — read-only infrastructure diagnostics with deep... | | Established |
| 2 | Arvo-AI/aurora: Aurora — Open source AI-powered agentic incident management & root cause... | | Established |
| 3 | pavangudiwada/awesome-ai-sre: AI SRE tools for RCA, Incident Response, Cost-Saving, Infra management,... | | Emerging |
| 4 | a2wio/lucas: A2W's SRE agent for Kubernetes | | Emerging |
| 5 | chatwoot/faultline: An open-source AI agent for infrastructure debugging. | | Emerging |
| 6 | datolabs-io/opsy: Opsy - Your AI-Powered SRE Colleague | | Emerging |
| 7 | whitepaper27/Sentri: AI-powered autonomous DBA agent — detects, diagnoses, and fixes Oracle... | | Emerging |
| 8 | avivl/cloud-sre-agent: An autonomous SRE agent that monitors cloud logs across multiple platforms,... | | Emerging |
| 9 | codeready-toolchain/tarsy: Intelligent Site Reliability Engineering agent for automatic alert processing | | Emerging |
| 10 | ismailperim/oncallmate: 🚨 Autonomous AI SRE agent that investigates Docker incidents while you... | | Emerging |
| 11 | qicesun/SRE-Agent-App: An Autonomous AI SRE Agent for Kubernetes, built with Java Spring Boot &... | | Emerging |
| 12 | qingwave/kubewizard: ✨Kubewizard is An AI-Agent for automated Kubernetes troubleshooting, and... | | Experimental |
| 13 | Joeen-AI-Labs/Netiarius: CLI agent for Linux server network troubleshooting and repair, with built-in... | | Experimental |
| 14 | vitas/evidra: Flight recorder for Infrastructure Automation. Behavioral Reliability for... | | Experimental |
| 15 | hanu-tayal/ai-oncall-agent: AI agents that replace human on-call engineers — automated error analysis,... | | Experimental |
| 16 | codenamev/ruby_llm-ups: ups.dev status page integration for RubyLLM — automatic agent heartbeats,... | | Experimental |
| 17 | AxonLabsDev/nervmap: Infrastructure cartography CLI — discover services, map dependencies, trace... | | Experimental |
| 18 | haoranc/agent-estimate: The first open-source effort estimation tool built for AI coding agents.... | | Experimental |
| 19 | kiloloop/agent-estimate: The first open-source effort estimation tool built for AI coding agents.... | | Experimental |
| 20 | javakishore-veleti/Claims-Processor-With-SRE: A multi-tenant healthcare claims processing platform with AI-powered... | | Experimental |
| 21 | imIbAd404/sre-agent: 🚀 Automate self-healing and root cause analysis for financial services with... | | Experimental |
| 22 | dbwls99706/deadends.dev: Structured failure knowledge infrastructure for AI agents — dead ends,... | | Experimental |
| 23 | jayta1314/awesome-ai-sre: Curate and explore a comprehensive list of AI-driven tools and resources... | | Experimental |
| 24 | GagauzSergii/anomaly_detection_platform: Distributed real-time AIOps platform for metric ingestion and anomaly... | | Experimental |
| 25 | obtFusi/network-agent: CLI agent for network analysis via natural language (Venice.ai) | | Experimental |
| 26 | anonymousgirl123/ai-incident-analyzer: Build a production-style AI system that ingests logs and metrics, detects... | | Experimental |
| 27 | koustubh-v/AutoDevOps-AI: Autonomous SRE agent that recursively audits, traces, and self-heals... | | Experimental |
| 28 | csa7mdm/AutoMender: Autonomous AI Agent that detects, analyzes, and self-heals .NET runtime... | | Experimental |
| 29 | iemafzalhassan/OutagePilot: OutagePilot uses a multi-agent system to autonomously detect, diagnose, and... | | Experimental |
| 30 | agamm/awesome-ai-sre: A curated list of 100+ AI-powered tools, platforms, and resources for Site... | | Experimental |
| 31 | sinzin91/awesome-sre-skills: A curated list of AI agent skills for Site Reliability Engineering —... | | Experimental |
| 32 | agentincident/agentincident: The open incident format for autonomous AI agents. Record, classify, and... | | Experimental |
| 33 | sydasif/network-automation-agent: Run commands on network device with LLM using netmiko | | Experimental |
| 34 | Suraj-kumar00/DataIncidentManager: AI-Powered Autonomous Incident Management for Data Teams | | Experimental |
| 35 | bblackheart013/semantic-devops-bot: AI-powered DevOps Assistant that reads error logs, suggests fixes, and... | | Experimental |
| 36 | charles-adedotun/kubepulse: Intelligent Kubernetes health monitoring with AI-powered diagnostics,... | | Experimental |
| 37 | kyisaiah47/cloudwatch-genius: AI-powered DevOps agent using Amazon Bedrock & Claude 3 Sonnet for... | | Experimental |
| 38 | ghantakiran/ShieldOps: AI-Powered Autonomous SRE Platform — Autonomous agents for investigation,... | | Experimental |
| 39 | AdityaIndoori/Sentry: Autonomous AI service monitor multi-agent pipeline (Triage, Detective,... | | Experimental |
| 40 | rubsj/ai-devops-assistant: Multi-agent DevOps AI assistant for pipeline monitoring, log analysis, root... | | Experimental |
| 41 | kaiojoceli51/ShieldOps: Automate incident investigation, remediation, and security enforcement... | | Experimental |
| 42 | brngg/herald: AI agent that detects, diagnoses, and remediates Kubernetes incidents with... | | Experimental |
| 43 | tareksyria/SREAgents: 🤖 Build and manage AI-driven SRE agents to automate operations tasks with... | | Experimental |
| 44 | kamaleshanantha/-metr-time-horizon-feb-2026: Interactive visualization of METR AI agent time horizon benchmark with... | | Experimental |
| 45 | DilshanPGN/IncidentIQ: AI-driven observability & incident-analysis agent that plugs into Java... | | Experimental |