Agent Observability Debugging AI Agents
Tools for tracing, visualizing, and debugging AI agent execution—including root-cause analysis, log monitoring, decision attribution, and hallucination detection. Does NOT include general application monitoring, infrastructure observability, or agent frameworks themselves.
There are 117 agent observability debugging agents tracked. 1 score above 70 (verified tier). The highest-rated is truera/trulens at 74/100 with 3,160 stars. 1 of the top 10 are actively maintained.
Get all 117 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=agents&subcategory=agent-observability-debugging&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Agent | Score | Tier |
|---|---|---|---|
| 1 |
truera/trulens
Evaluation and Tracking for LLM Experiments and AI Agents |
|
Verified |
| 2 |
traceroot-ai/traceroot
Find the Root Cause in Your Code's Trace |
|
Established |
| 3 |
future-agi/traceAI
Open Source AI Tracing Framework built on Opentelemetry for AI Applications... |
|
Established |
| 4 |
VishApp/multiagent-debugger
Multi-Agent Debugger: An AI-powered debugging system using CrewAI to... |
|
Emerging |
| 5 |
evilmartians/agent-prism
React components for visualizing traces from AI agents |
|
Emerging |
| 6 |
InftyAI/alphatrion
⚒️ AlphaTrion is an open-source observability platform for AI agents,... |
|
Emerging |
| 7 |
Mandark-droid/genai_otel_instrument
GenAI OpenTelemetry Auto-Instrumentation Library A comprehensive wrapper for... |
|
Emerging |
| 8 |
fwdai/reticle
AI Engineering DevTools - design, simulate, and debug LLM interactions with... |
|
Emerging |
| 9 |
triggerdotdev/agentcrumbs
Debug mode for any AI agent. Structured tracing agents add inline, stripped... |
|
Emerging |
| 10 |
prateekdevisingh/kakveda
Open-source failure intelligence platform for LLM & agent systems. Adds... |
|
Emerging |
| 11 |
dkondo/agent-tackle-box
A toolkit for developing AI agents, including agent-debugger: Terminal... |
|
Emerging |
| 12 |
vstorm-co/logfire-assistant
AI-powered tool that helps you debug, analyze, and understand your... |
|
Emerging |
| 13 |
ai-debugger-inc/aidb
A language-agnostic debugging interface for AI agents. |
|
Emerging |
| 14 |
IIfaitdoux/agent-devtools
Debug AI agents in real time with Chrome DevTools to pause, inspect, and... |
|
Emerging |
| 15 |
Alex188dot/agensic
Forensic Observability for AI Agents |
|
Emerging |
| 16 |
Siddhant-K-code/agent-trace
strace for AI agents. Capture and replay every tool call, prompt, and... |
|
Emerging |
| 17 |
QuesmaOrg/otel-bench
OpenTelemetry Benchmark - can AI trace your failed login? |
|
Emerging |
| 18 |
getdeeptracer/deeptracer-js
AI agent for log monitoring |
|
Experimental |
| 19 |
Rxflex/agenttrace
AgentTrace is an open-source, local-first step debugger for AI agents. It... |
|
Experimental |
| 20 |
abczsl520/debug-methodology
Systematic debugging methodology for AI agents and developers. Prevents... |
|
Experimental |
| 21 |
fukami/minitrace
A session trace format for capturing human-AI coding interactions across frameworks. |
|
Experimental |
| 22 |
ClemenceChee/AgentFlow
Monitor any AI agent system. Auto-detects failures, sends alerts. Zero... |
|
Experimental |
| 23 |
acailic/agent_debugger
Local-first agent debugger with replay, failure memory, smart highlights,... |
|
Experimental |
| 24 |
rylinjames/litmus
Record and deterministically replay AI agent executions. Flight recorder for... |
|
Experimental |
| 25 |
phantom5125/omnismi
Cross-vendor GPU observability for AI agents and Python apps |
|
Experimental |
| 26 |
samlanda12/agentgauge
Lightweight Prometheus exporter for AI agent pipelines. |
|
Experimental |
| 27 |
olliekod/agent-tracer
Crash dumps for AI agents. Record and replay LLM interactions locally. |
|
Experimental |
| 28 |
sauravbhattacharya001/agentlens
AgentLens — Observability and Explainability for AI Agents |
|
Experimental |
| 29 |
enkronos/traceforge
Portable trace envelopes for governed agent execution. |
|
Experimental |
| 30 |
tn-pisama/pisama
Multi-agent failure detection — 17 detectors for LLM orchestration systems |
|
Experimental |
| 31 |
umairb0/agenttrace
Trace and debug AI agent behavior locally with a step-by-step visual tool... |
|
Experimental |
| 32 |
Exploreunive/agentlens
Explain why your agent failed — root-cause debugging, memory attribution,... |
|
Experimental |
| 33 |
thekateproject/kate-sdk
Open source observability and auto-evals for AI agents |
|
Experimental |
| 34 |
clouatre-labs/sre-shadow-replay
Supplementary materials for SRE shadow-mode PR replay experiment |
|
Experimental |
| 35 |
LuciferForge/ai-trace
Zero-dependency AI agent decision tracer. Records every step — what it saw,... |
|
Experimental |
| 36 |
tranhoangtu-it/agentlens
Self-hosted AI agent observability with tool-call tracing and decision tree... |
|
Experimental |
| 37 |
sumankalyan123/langsight
Complete observability for everything an AI agent calls — traces, costs,... |
|
Experimental |
| 38 |
JSLEEKR/agent-test-recorder
Record and replay LLM API calls for deterministic testing. The VCR for AI... |
|
Experimental |
| 39 |
akramo660/agentdoctor-oss
Analyze AI agent logs locally to detect failure patterns and measure... |
|
Experimental |
| 40 |
dunetrace/dunetrace
Behavioral runtime observability for AI agents |
|
Experimental |
| 41 |
ManasVardhan/agent-replay
🔄 Record, replay, and debug AI agent execution traces |
|
Experimental |
| 42 |
tripledoublev/v100
Experimental harness for studying long-horizon LLM agents through... |
|
Experimental |
| 43 |
ian-flores/securetrace
Observability, tracing, and cost accounting for R LLM agent workflows |
|
Experimental |
| 44 |
speed785/agentlens
DevTools for AI agents: drop-in observability layer that measures latency,... |
|
Experimental |
| 45 |
AnupamaCVenugopal/AutosarDebuggingAutomation
AI-assisted AUTOSAR debugging pipeline for baseline-vs-failure path... |
|
Experimental |
| 46 |
Microck/indagine
A meta-agent that investigates broken AI agents. Feed it a failure trace and... |
|
Experimental |
| 47 |
wharfe/agent-trust-telemetry
Trust telemetry middleware for multi-agent systems — makes instruction... |
|
Experimental |
| 48 |
Adriano886/agente-admin-observabilidad
🚀 Automate alert analysis with Agno Framework and Grafana Stack, correlating... |
|
Experimental |
| 49 |
eosho/agent-trace-opentelemetry
Tracing for AI assisted development. |
|
Experimental |
| 50 |
Oldcircle/trace-viewer
Visual diagnostic tool for OpenClaw agent execution traces — see every LLM... |
|
Experimental |
| 51 |
Angelopvtac/agenttrace-sdk
Cross-agent observability for AI workflows — the call stack for AI |
|
Experimental |
| 52 |
sajidurdev/calltrace
Fast CLI tool for tracing symbol relationships in a codebase. |
|
Experimental |
| 53 |
ELVOR1236/agentlens
Track AI agent actions and tool use in Chrome DevTools to debug decisions... |
|
Experimental |
| 54 |
nexus66666/Augur-Runtime-Debugging-Agent
🔍 Enhance your coding with Augur, the AI-native runtime debugger that... |
|
Experimental |
| 55 |
idreesaziz/agent-trace
Universal local debugger and visualizer for multi-agent workflows. |
|
Experimental |
| 56 |
StanislavBG/agent-trace
CLI-first local observability for AI agents — OTel GenAI semantics stored in... |
|
Experimental |
| 57 |
TECHKNOWMAD-LABS/trace-agent
Agent observability. pip + MCP server + Claude skill. |
|
Experimental |
| 58 |
mailtocsprasad/ai-kd
AI-augmented WinDbg extension for automated crash dump triage using Claude |
|
Experimental |
| 59 |
crithstudio-hash/vcr-llm
Record and replay LLM API conversations for deterministic testing. Zero... |
|
Experimental |
| 60 |
advitrocks9/openflux
Open standard for AI agent telemetry. 9 frameworks, one schema, zero deps. |
|
Experimental |
| 61 |
LangSight/langsight
Complete observability for everything an AI agent calls — traces, costs,... |
|
Experimental |
| 62 |
zzhiyuann/agentlens
Chrome DevTools for AI agents — record, replay, inspect, and test agent... |
|
Experimental |
| 63 |
azharmateen/agent-brew
Record, replay, and diff WebSocket sessions for debugging. |
|
Experimental |
| 64 |
Zijian-Ni/agent-replay
🔄 Record, replay, and debug AI agent execution traces — the DevTools for AI agents |
|
Experimental |
| 65 |
rifft-dev/rifft
Cross-framework debugger for multi-agent AI systems |
|
Experimental |
| 66 |
RapidBotStudio888/Autodebug_pro01
Stop wasting hours on stack traces. Autodebug_pro identifies, explains, and... |
|
Experimental |
| 67 |
nedbpowell/agenttrace-react
Headless React primitives for AI agent execution traces, approval gates, and... |
|
Experimental |
| 68 |
JSLEEKR/agent-trace-debugger
Chrome DevTools for AI agent pipelines - debug, replay, and diff agent executions |
|
Experimental |
| 69 |
darshankparmar/agent-observatory
Lightweight observability layer for AI agents with tracing, spans and... |
|
Experimental |
| 70 |
vtqveant/symbolic-mlir-debugger
Dynamic Symbolic (Concolic) Debugger for MLIR |
|
Experimental |
| 71 |
guialfredo/columbo-root-cause-explorer
🕵️ AI-powered root cause analysis for containerized environments. ... |
|
Experimental |
| 72 |
getagentd/agentd-py
Observability SDK for AI agents. Drop-in replacement for Claude Agent SDK. |
|
Experimental |
| 73 |
mttetc/AgentReplay
DevTools for replaying AI coding agent sessions. Reads Claude Code JSONL... |
|
Experimental |
| 74 |
ade-engine/ade
The autonomous debugging engine for modern codebases. |
|
Experimental |
| 75 |
tracemem/tracemem-vercel-ai
TraceMem integration for Vercel AI SDK |
|
Experimental |
| 76 |
AI8-Algorithm-Intelligence-Section-8/Dissect
Dissect helps you understand what's happening inside your AI agent systems.... |
|
Experimental |
| 77 |
ishu86/Agent-Debugger
Chrome DevTools for AI agents — time-travel debugging with fork, patch, and... |
|
Experimental |
| 78 |
Sabyasachig/ai-cost-observatory
Open-source observability layer for AI agents - Track, analyze, and optimize... |
|
Experimental |
| 79 |
polsebas/agente-admin-observabilidad
Sistema de análisis automático de alertas con Agno Framework + Grafana... |
|
Experimental |
| 80 |
moondef/llm-trace
Structured execution traces for LLM debugging – lets AI coding tools see... |
|
Experimental |
| 81 |
vexorlabs/beacon
Chrome DevTools for AI Agents. Open-source, local-first debugging with... |
|
Experimental |
| 82 |
milyas2001/meridian-agent-observability
MERIDIAN - Distributed AI Agent Observability with Causal Tracing |
|
Experimental |
| 83 |
prajitdatta/AI-Agent-Autopsy
Cut open broken LLM agents. Find what killed them. Fix it before you ship. |
|
Experimental |
| 84 |
yuan-cloud/agent-cassette
Record once → replay forever → deterministic tests for AI agents. |
|
Experimental |
| 85 |
stefanoamorelli/opentelemetry-instrumentation-dust
OpenTelemetry instrumentation package for the Dust SDK. Automatically... |
|
Experimental |
| 86 |
Bluefactordev/InnerTrace
Deterministic, causal tracing for complex execution flows — designed for LLM... |
|
Experimental |
| 87 |
marcosgabbardo/wiretaps
See what your AI agents are sending to LLMs. |
|
Experimental |
| 88 |
thomasahle/trace-taxi
Trace Taxi Trace Viewer |
|
Experimental |
| 89 |
certainly-param/tracelens
Tracelens - Visual Debugger and Replay Engine for LangGraph Agentic... |
|
Experimental |
| 90 |
songyang-dev/agent-motive
Intercept LLM agent calls for debugging |
|
Experimental |
| 91 |
UPwith-me/Augur-Runtime-Debugging-Agent
An autonomous AI debugging agent powered by the Debug Adapter Protocol... |
|
Experimental |
| 92 |
trylynxai/reagent
Replay Debugger for AI Agents |
|
Experimental |
| 93 |
breezy89757/WinDbgAssist
AI-powered automated debugging terminal for WinDbg & .NET Dump analysis. |
|
Experimental |
| 94 |
originaonxi/tdad-replication
Live proof of arXiv:2603.17973 — 100% regression reduction, 30 API calls |
|
Experimental |
| 95 |
MoneyCat-inc/otel-agent-coordination
Telemetry-based AI agent coordination framework using OpenTelemetry for... |
|
Experimental |
| 96 |
amitmishrg/agenticlens
Visual debugging, tracing, and replay for agent workflows. |
|
Experimental |
| 97 |
JohnODowdAI/replaykit
Turn failed agent traces into replayable regression cases. |
|
Experimental |
| 98 |
AndriGitDev/synapse
🧠 Watch AI Agents Think - Visualize AI decision-making in real-time |
|
Experimental |
| 99 |
tomsik21/edge-watchdog
AI-assisted infrastructure monitoring service built with Node.js,... |
|
Experimental |
| 100 |
GeoffreyWang1117/AgentTrace
AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed... |
|
Experimental |
| 101 |
a2a-settlement/otel-agent-provenance
OpenTelemetry semantic conventions and instrumentation for agent provenance,... |
|
Experimental |
| 102 |
bearwash/agent-lens
The Git for AI Agents: Rewind, Branch, and Debug Multi-step Reasoning in Real-time. |
|
Experimental |
| 103 |
dev-k99/pulsetrace
Open-source AI agent monitoring dashboard — OTel traces, eval metrics, agent... |
|
Experimental |
| 104 |
diwushennian4955/crewai-observability-nexaapi
CrewAI Agent Observability with LoongSuite + NexaAPI: Build monitored,... |
|
Experimental |
| 105 |
AhmedAllam0/ghosttrace
👻 See what your AI agent almost did. Record agent decisions including... |
|
Experimental |
| 106 |
amirkiarafiei/repo-learn
Visualizing Deep Agents in Long-Horizon Tasks: Towards Explainable and... |
|
Experimental |
| 107 |
CaoDuyThanh/drtrace
AI-Powered Log Analysis for Instant Root-Cause Explanations through natural... |
|
Experimental |
| 108 |
rodrigoguedes09/AI-decision-timeline-system
A visual-first platform to trace, replay, and explain AI decisions with full... |
|
Experimental |
| 109 |
sru4ka/agentpulse
Real-time observability for AI agents. Track costs, monitor errors, replay prompts |
|
Experimental |
| 110 |
logicoflife/crewai-decision-trace
Semantic decision telemetry integration for CrewAI. |
|
Experimental |
| 111 |
raggedymoon/predictagent
Patent-pending AI agent failure prediction - Prevent silent failures before... |
|
Experimental |
| 112 |
Makonmm/LoggerAI
Logger AI is an offensive tool for red team that uses AI agents. |
|
Experimental |
| 113 |
JRF-2018/simplest-pal
The Simplest PDB Automation Layer for AI-Driven Debugging |
|
Experimental |
| 114 |
CoRAL-ASU/TraceBack
TRACEBACK: multi-agent attribution framework, traces table-based answers... |
|
Experimental |
| 115 |
om-gupta-30/GCP-Log-Monitor-Agent
AI-powered log monitoring and alerting agent built on Google Cloud Platform |
|
Experimental |
| 116 |
JRF-2018/jrf_pdb_agent_lib
A library for AI-driven debugging and human-AI collaboration using PDB. |
|
Experimental |
| 117 |
pedroalexleite/Learning-LangFuse
LLM observability with LangFuse: tracing, evaluation, and monitoring, from a... |
|
Experimental |