microsoft/OpenRCA
[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?
Comprises a benchmark dataset across three production systems (Telecom, Bank, Market) with multi-modal telemetry including KPI time series, dependency traces, and semi-structured logs. Introduces RCA-agent, a Python-based agentic baseline that retrieves and analyzes telemetry programmatically rather than processing full contexts, reducing token overhead while maintaining reasoning capability. Supports custom task generation and evaluation with standardized metrics for assessing LLM performance on root cause localization in complex distributed systems.
292 stars.
Stars
292
Forks
36
Language
Python
License
MIT
Category
Last pushed
Feb 24, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/microsoft/OpenRCA"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
PacificAI/langtest
Deliver safe & effective language models
TrustGen/TrustEval-toolkit
[ICLR'26, NAACL'25 Demo] Toolkit & Benchmark for evaluating the trustworthiness of generative...
Babelscape/ALERT
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language...
ChenWu98/agent-attack
[ICLR 2025] Dissecting adversarial robustness of multimodal language model agents
ast-fortiss-tum/STELLAR
STELLAR: A Search-Based Testing Framework for Large Language Model Applications" (SANER 2026) 🏆