Agent Observability Debugging AI Agents

Tools for tracing, visualizing, and debugging AI agent execution—including root-cause analysis, log monitoring, decision attribution, and hallucination detection. Does NOT include general application monitoring, infrastructure observability, or agent frameworks themselves.

There are 117 agent observability debugging agents tracked. 1 score above 70 (verified tier). The highest-rated is truera/trulens at 74/100 with 3,160 stars. 1 of the top 10 are actively maintained.

Get all 117 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=agents&subcategory=agent-observability-debugging&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Agent Score Tier
1 truera/trulens

Evaluation and Tracking for LLM Experiments and AI Agents

74
Verified
2 traceroot-ai/traceroot

Find the Root Cause in Your Code's Trace

61
Established
3 future-agi/traceAI

Open Source AI Tracing Framework built on Opentelemetry for AI Applications...

52
Established
4 VishApp/multiagent-debugger

Multi-Agent Debugger: An AI-powered debugging system using CrewAI to...

49
Emerging
5 evilmartians/agent-prism

React components for visualizing traces from AI agents

46
Emerging
6 InftyAI/alphatrion

⚒️ AlphaTrion is an open-source observability platform for AI agents,...

42
Emerging
7 Mandark-droid/genai_otel_instrument

GenAI OpenTelemetry Auto-Instrumentation Library A comprehensive wrapper for...

40
Emerging
8 fwdai/reticle

AI Engineering DevTools - design, simulate, and debug LLM interactions with...

39
Emerging
9 triggerdotdev/agentcrumbs

Debug mode for any AI agent. Structured tracing agents add inline, stripped...

39
Emerging
10 prateekdevisingh/kakveda

Open-source failure intelligence platform for LLM & agent systems. Adds...

39
Emerging
11 dkondo/agent-tackle-box

A toolkit for developing AI agents, including agent-debugger: Terminal...

38
Emerging
12 vstorm-co/logfire-assistant

AI-powered tool that helps you debug, analyze, and understand your...

38
Emerging
13 ai-debugger-inc/aidb

A language-agnostic debugging interface for AI agents.

38
Emerging
14 IIfaitdoux/agent-devtools

Debug AI agents in real time with Chrome DevTools to pause, inspect, and...

35
Emerging
15 Alex188dot/agensic

Forensic Observability for AI Agents

35
Emerging
16 Siddhant-K-code/agent-trace

strace for AI agents. Capture and replay every tool call, prompt, and...

34
Emerging
17 QuesmaOrg/otel-bench

OpenTelemetry Benchmark - can AI trace your failed login?

30
Emerging
18 getdeeptracer/deeptracer-js

AI agent for log monitoring

26
Experimental
19 Rxflex/agenttrace

AgentTrace is an open-source, local-first step debugger for AI agents. It...

26
Experimental
20 abczsl520/debug-methodology

Systematic debugging methodology for AI agents and developers. Prevents...

25
Experimental
21 fukami/minitrace

A session trace format for capturing human-AI coding interactions across frameworks.

25
Experimental
22 ClemenceChee/AgentFlow

Monitor any AI agent system. Auto-detects failures, sends alerts. Zero...

25
Experimental
23 acailic/agent_debugger

Local-first agent debugger with replay, failure memory, smart highlights,...

24
Experimental
24 rylinjames/litmus

Record and deterministically replay AI agent executions. Flight recorder for...

24
Experimental
25 phantom5125/omnismi

Cross-vendor GPU observability for AI agents and Python apps

24
Experimental
26 samlanda12/agentgauge

Lightweight Prometheus exporter for AI agent pipelines.

23
Experimental
27 olliekod/agent-tracer

Crash dumps for AI agents. Record and replay LLM interactions locally.

23
Experimental
28 sauravbhattacharya001/agentlens

AgentLens — Observability and Explainability for AI Agents

23
Experimental
29 enkronos/traceforge

Portable trace envelopes for governed agent execution.

23
Experimental
30 tn-pisama/pisama

Multi-agent failure detection — 17 detectors for LLM orchestration systems

23
Experimental
31 umairb0/agenttrace

Trace and debug AI agent behavior locally with a step-by-step visual tool...

23
Experimental
32 Exploreunive/agentlens

Explain why your agent failed — root-cause debugging, memory attribution,...

23
Experimental
33 thekateproject/kate-sdk

Open source observability and auto-evals for AI agents

23
Experimental
34 clouatre-labs/sre-shadow-replay

Supplementary materials for SRE shadow-mode PR replay experiment

23
Experimental
35 LuciferForge/ai-trace

Zero-dependency AI agent decision tracer. Records every step — what it saw,...

23
Experimental
36 tranhoangtu-it/agentlens

Self-hosted AI agent observability with tool-call tracing and decision tree...

22
Experimental
37 sumankalyan123/langsight

Complete observability for everything an AI agent calls — traces, costs,...

22
Experimental
38 JSLEEKR/agent-test-recorder

Record and replay LLM API calls for deterministic testing. The VCR for AI...

22
Experimental
39 akramo660/agentdoctor-oss

Analyze AI agent logs locally to detect failure patterns and measure...

22
Experimental
40 dunetrace/dunetrace

Behavioral runtime observability for AI agents

22
Experimental
41 ManasVardhan/agent-replay

🔄 Record, replay, and debug AI agent execution traces

22
Experimental
42 tripledoublev/v100

Experimental harness for studying long-horizon LLM agents through...

22
Experimental
43 ian-flores/securetrace

Observability, tracing, and cost accounting for R LLM agent workflows

22
Experimental
44 speed785/agentlens

DevTools for AI agents: drop-in observability layer that measures latency,...

22
Experimental
45 AnupamaCVenugopal/AutosarDebuggingAutomation

AI-assisted AUTOSAR debugging pipeline for baseline-vs-failure path...

22
Experimental
46 Microck/indagine

A meta-agent that investigates broken AI agents. Feed it a failure trace and...

22
Experimental
47 wharfe/agent-trust-telemetry

Trust telemetry middleware for multi-agent systems — makes instruction...

22
Experimental
48 Adriano886/agente-admin-observabilidad

🚀 Automate alert analysis with Agno Framework and Grafana Stack, correlating...

22
Experimental
49 eosho/agent-trace-opentelemetry

Tracing for AI assisted development.

22
Experimental
50 Oldcircle/trace-viewer

Visual diagnostic tool for OpenClaw agent execution traces — see every LLM...

22
Experimental
51 Angelopvtac/agenttrace-sdk

Cross-agent observability for AI workflows — the call stack for AI

22
Experimental
52 sajidurdev/calltrace

Fast CLI tool for tracing symbol relationships in a codebase.

22
Experimental
53 ELVOR1236/agentlens

Track AI agent actions and tool use in Chrome DevTools to debug decisions...

22
Experimental
54 nexus66666/Augur-Runtime-Debugging-Agent

🔍 Enhance your coding with Augur, the AI-native runtime debugger that...

22
Experimental
55 idreesaziz/agent-trace

Universal local debugger and visualizer for multi-agent workflows.

22
Experimental
56 StanislavBG/agent-trace

CLI-first local observability for AI agents — OTel GenAI semantics stored in...

22
Experimental
57 TECHKNOWMAD-LABS/trace-agent

Agent observability. pip + MCP server + Claude skill.

22
Experimental
58 mailtocsprasad/ai-kd

AI-augmented WinDbg extension for automated crash dump triage using Claude

22
Experimental
59 crithstudio-hash/vcr-llm

Record and replay LLM API conversations for deterministic testing. Zero...

22
Experimental
60 advitrocks9/openflux

Open standard for AI agent telemetry. 9 frameworks, one schema, zero deps.

22
Experimental
61 LangSight/langsight

Complete observability for everything an AI agent calls — traces, costs,...

22
Experimental
62 zzhiyuann/agentlens

Chrome DevTools for AI agents — record, replay, inspect, and test agent...

22
Experimental
63 azharmateen/agent-brew

Record, replay, and diff WebSocket sessions for debugging.

22
Experimental
64 Zijian-Ni/agent-replay

🔄 Record, replay, and debug AI agent execution traces — the DevTools for AI agents

22
Experimental
65 rifft-dev/rifft

Cross-framework debugger for multi-agent AI systems

22
Experimental
66 RapidBotStudio888/Autodebug_pro01

Stop wasting hours on stack traces. Autodebug_pro identifies, explains, and...

22
Experimental
67 nedbpowell/agenttrace-react

Headless React primitives for AI agent execution traces, approval gates, and...

22
Experimental
68 JSLEEKR/agent-trace-debugger

Chrome DevTools for AI agent pipelines - debug, replay, and diff agent executions

22
Experimental
69 darshankparmar/agent-observatory

Lightweight observability layer for AI agents with tracing, spans and...

21
Experimental
70 vtqveant/symbolic-mlir-debugger

Dynamic Symbolic (Concolic) Debugger for MLIR

21
Experimental
71 guialfredo/columbo-root-cause-explorer

🕵️ AI-powered root cause analysis for containerized environments. ...

21
Experimental
72 getagentd/agentd-py

Observability SDK for AI agents. Drop-in replacement for Claude Agent SDK.

21
Experimental
73 mttetc/AgentReplay

DevTools for replaying AI coding agent sessions. Reads Claude Code JSONL...

20
Experimental
74 ade-engine/ade

The autonomous debugging engine for modern codebases.

20
Experimental
75 tracemem/tracemem-vercel-ai

TraceMem integration for Vercel AI SDK

20
Experimental
76 AI8-Algorithm-Intelligence-Section-8/Dissect

Dissect helps you understand what's happening inside your AI agent systems....

20
Experimental
77 ishu86/Agent-Debugger

Chrome DevTools for AI agents — time-travel debugging with fork, patch, and...

20
Experimental
78 Sabyasachig/ai-cost-observatory

Open-source observability layer for AI agents - Track, analyze, and optimize...

20
Experimental
79 polsebas/agente-admin-observabilidad

Sistema de análisis automático de alertas con Agno Framework + Grafana...

20
Experimental
80 moondef/llm-trace

Structured execution traces for LLM debugging – lets AI coding tools see...

20
Experimental
81 vexorlabs/beacon

Chrome DevTools for AI Agents. Open-source, local-first debugging with...

19
Experimental
82 milyas2001/meridian-agent-observability

MERIDIAN - Distributed AI Agent Observability with Causal Tracing

19
Experimental
83 prajitdatta/AI-Agent-Autopsy

Cut open broken LLM agents. Find what killed them. Fix it before you ship.

19
Experimental
84 yuan-cloud/agent-cassette

Record once → replay forever → deterministic tests for AI agents.

19
Experimental
85 stefanoamorelli/opentelemetry-instrumentation-dust

OpenTelemetry instrumentation package for the Dust SDK. Automatically...

19
Experimental
86 Bluefactordev/InnerTrace

Deterministic, causal tracing for complex execution flows — designed for LLM...

19
Experimental
87 marcosgabbardo/wiretaps

See what your AI agents are sending to LLMs.

19
Experimental
88 thomasahle/trace-taxi

Trace Taxi Trace Viewer

19
Experimental
89 certainly-param/tracelens

Tracelens - Visual Debugger and Replay Engine for LangGraph Agentic...

19
Experimental
90 songyang-dev/agent-motive

Intercept LLM agent calls for debugging

19
Experimental
91 UPwith-me/Augur-Runtime-Debugging-Agent

An autonomous AI debugging agent powered by the Debug Adapter Protocol...

18
Experimental
92 trylynxai/reagent

Replay Debugger for AI Agents

17
Experimental
93 breezy89757/WinDbgAssist

AI-powered automated debugging terminal for WinDbg & .NET Dump analysis.

16
Experimental
94 originaonxi/tdad-replication

Live proof of arXiv:2603.17973 — 100% regression reduction, 30 API calls

15
Experimental
95 MoneyCat-inc/otel-agent-coordination

Telemetry-based AI agent coordination framework using OpenTelemetry for...

15
Experimental
96 amitmishrg/agenticlens

Visual debugging, tracing, and replay for agent workflows.

14
Experimental
97 JohnODowdAI/replaykit

Turn failed agent traces into replayable regression cases.

14
Experimental
98 AndriGitDev/synapse

🧠 Watch AI Agents Think - Visualize AI decision-making in real-time

14
Experimental
99 tomsik21/edge-watchdog

AI-assisted infrastructure monitoring service built with Node.js,...

14
Experimental
100 GeoffreyWang1117/AgentTrace

AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed...

14
Experimental
101 a2a-settlement/otel-agent-provenance

OpenTelemetry semantic conventions and instrumentation for agent provenance,...

14
Experimental
102 bearwash/agent-lens

The Git for AI Agents: Rewind, Branch, and Debug Multi-step Reasoning in Real-time.

14
Experimental
103 dev-k99/pulsetrace

Open-source AI agent monitoring dashboard — OTel traces, eval metrics, agent...

14
Experimental
104 diwushennian4955/crewai-observability-nexaapi

CrewAI Agent Observability with LoongSuite + NexaAPI: Build monitored,...

14
Experimental
105 AhmedAllam0/ghosttrace

👻 See what your AI agent almost did. Record agent decisions including...

13
Experimental
106 amirkiarafiei/repo-learn

Visualizing Deep Agents in Long-Horizon Tasks: Towards Explainable and...

12
Experimental
107 CaoDuyThanh/drtrace

AI-Powered Log Analysis for Instant Root-Cause Explanations through natural...

12
Experimental
108 rodrigoguedes09/AI-decision-timeline-system

A visual-first platform to trace, replay, and explain AI decisions with full...

12
Experimental
109 sru4ka/agentpulse

Real-time observability for AI agents. Track costs, monitor errors, replay prompts

12
Experimental
110 logicoflife/crewai-decision-trace

Semantic decision telemetry integration for CrewAI.

11
Experimental
111 raggedymoon/predictagent

Patent-pending AI agent failure prediction - Prevent silent failures before...

11
Experimental
112 Makonmm/LoggerAI

Logger AI is an offensive tool for red team that uses AI agents.

11
Experimental
113 JRF-2018/simplest-pal

The Simplest PDB Automation Layer for AI-Driven Debugging

11
Experimental
114 CoRAL-ASU/TraceBack

TRACEBACK: multi-agent attribution framework, traces table-based answers...

11
Experimental
115 om-gupta-30/GCP-Log-Monitor-Agent

AI-powered log monitoring and alerting agent built on Google Cloud Platform

11
Experimental
116 JRF-2018/jrf_pdb_agent_lib

A library for AI-driven debugging and human-AI collaboration using PDB.

11
Experimental
117 pedroalexleite/Learning-LangFuse

LLM observability with LangFuse: tracing, evaluation, and monitoring, from a...

11
Experimental

Comparisons in this category