Agent Observability Debugging AI Agents

Tools for tracing, visualizing, and debugging AI agent execution—including root-cause analysis, log monitoring, decision attribution, and hallucination detection. Does NOT include general application monitoring, infrastructure observability, or agent frameworks themselves.

There are 117 agent observability debugging agents tracked. 1 score above 70 (verified tier). The highest-rated is truera/trulens at 74/100 with 3,160 stars. 1 of the top 10 are actively maintained.

Get all 117 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=agents&subcategory=agent-observability-debugging&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Agent	Score	Tier	Stars	Language
1	truera/trulens Evaluation and Tracking for LLM Experiments and AI Agents	74	Verified	3,160	Python
2	traceroot-ai/traceroot Find the Root Cause in Your Code's Trace	61	Established	407	TypeScript
3	future-agi/traceAI Open Source AI Tracing Framework built on Opentelemetry for AI Applications...	52	Established	57	Python
4	VishApp/multiagent-debugger Multi-Agent Debugger: An AI-powered debugging system using CrewAI to...	49	Emerging	35	Python
5	evilmartians/agent-prism React components for visualizing traces from AI agents	46	Emerging	303	TypeScript
6	InftyAI/alphatrion ⚒️ AlphaTrion is an open-source observability platform for AI agents,...	42	Emerging	13	Python
7	Mandark-droid/genai_otel_instrument GenAI OpenTelemetry Auto-Instrumentation Library A comprehensive wrapper for...	40	Emerging	1	Python
8	fwdai/reticle AI Engineering DevTools - design, simulate, and debug LLM interactions with...	39	Emerging	6	TypeScript
9	triggerdotdev/agentcrumbs Debug mode for any AI agent. Structured tracing agents add inline, stripped...	39	Emerging	5	TypeScript
10	prateekdevisingh/kakveda Open-source failure intelligence platform for LLM & agent systems. Adds...	39	Emerging	9	Python
11	dkondo/agent-tackle-box A toolkit for developing AI agents, including agent-debugger: Terminal...	38	Emerging	49	Python
12	vstorm-co/logfire-assistant AI-powered tool that helps you debug, analyze, and understand your...	38	Emerging	10	Python
13	ai-debugger-inc/aidb A language-agnostic debugging interface for AI agents.	38	Emerging	13	Python
14	IIfaitdoux/agent-devtools Debug AI agents in real time with Chrome DevTools to pause, inspect, and...	35	Emerging	1	Python
15	Alex188dot/agensic Forensic Observability for AI Agents	35	Emerging	9	Python
16	Siddhant-K-code/agent-trace strace for AI agents. Capture and replay every tool call, prompt, and...	34	Emerging	10	Python
17	QuesmaOrg/otel-bench OpenTelemetry Benchmark - can AI trace your failed login?	30	Emerging	16	Shell
18	getdeeptracer/deeptracer-js AI agent for log monitoring	26	Experimental	5	TypeScript
19	Rxflex/agenttrace AgentTrace is an open-source, local-first step debugger for AI agents. It...	26	Experimental	6	Python
20	abczsl520/debug-methodology Systematic debugging methodology for AI agents and developers. Prevents...	25	Experimental	18	—
21	fukami/minitrace A session trace format for capturing human-AI coding interactions across frameworks.	25	Experimental	4	Python
22	ClemenceChee/AgentFlow Monitor any AI agent system. Auto-detects failures, sends alerts. Zero...	25	Experimental	4	TypeScript
23	acailic/agent_debugger Local-first agent debugger with replay, failure memory, smart highlights,...	24	Experimental	2	Python
24	rylinjames/litmus Record and deterministically replay AI agent executions. Flight recorder for...	24	Experimental	2	Python
25	phantom5125/omnismi Cross-vendor GPU observability for AI agents and Python apps	24	Experimental	2	Python
26	samlanda12/agentgauge Lightweight Prometheus exporter for AI agent pipelines.	23	Experimental	1	Python
27	olliekod/agent-tracer Crash dumps for AI agents. Record and replay LLM interactions locally.	23	Experimental	1	Python
28	sauravbhattacharya001/agentlens AgentLens — Observability and Explainability for AI Agents	23	Experimental	1	Python
29	enkronos/traceforge Portable trace envelopes for governed agent execution.	23	Experimental	1	TypeScript
30	tn-pisama/pisama Multi-agent failure detection — 17 detectors for LLM orchestration systems	23	Experimental	1	—
31	umairb0/agenttrace Trace and debug AI agent behavior locally with a step-by-step visual tool...	23	Experimental	1	Python
32	Exploreunive/agentlens Explain why your agent failed — root-cause debugging, memory attribution,...	23	Experimental	1	Python
33	thekateproject/kate-sdk Open source observability and auto-evals for AI agents	23	Experimental	1	Python
34	clouatre-labs/sre-shadow-replay Supplementary materials for SRE shadow-mode PR replay experiment	23	Experimental	1	Python
35	LuciferForge/ai-trace Zero-dependency AI agent decision tracer. Records every step — what it saw,...	23	Experimental	1	Python
36	tranhoangtu-it/agentlens Self-hosted AI agent observability with tool-call tracing and decision tree...	22	Experimental	—	Python
37	sumankalyan123/langsight Complete observability for everything an AI agent calls — traces, costs,...	22	Experimental	—	Python
38	JSLEEKR/agent-test-recorder Record and replay LLM API calls for deterministic testing. The VCR for AI...	22	Experimental	—	TypeScript
39	akramo660/agentdoctor-oss Analyze AI agent logs locally to detect failure patterns and measure...	22	Experimental	—	TypeScript
40	dunetrace/dunetrace Behavioral runtime observability for AI agents	22	Experimental	—	Python
41	ManasVardhan/agent-replay 🔄 Record, replay, and debug AI agent execution traces	22	Experimental	—	Python
42	tripledoublev/v100 Experimental harness for studying long-horizon LLM agents through...	22	Experimental	—	Go
43	ian-flores/securetrace Observability, tracing, and cost accounting for R LLM agent workflows	22	Experimental	—	R
44	speed785/agentlens DevTools for AI agents: drop-in observability layer that measures latency,...	22	Experimental	—	Python
45	AnupamaCVenugopal/AutosarDebuggingAutomation AI-assisted AUTOSAR debugging pipeline for baseline-vs-failure path...	22	Experimental	—	HTML
46	Microck/indagine A meta-agent that investigates broken AI agents. Feed it a failure trace and...	22	Experimental	—	Python
47	wharfe/agent-trust-telemetry Trust telemetry middleware for multi-agent systems — makes instruction...	22	Experimental	—	Python
48	Adriano886/agente-admin-observabilidad 🚀 Automate alert analysis with Agno Framework and Grafana Stack, correlating...	22	Experimental	—	TypeScript
49	eosho/agent-trace-opentelemetry Tracing for AI assisted development.	22	Experimental	—	Python
50	Oldcircle/trace-viewer Visual diagnostic tool for OpenClaw agent execution traces — see every LLM...	22	Experimental	—	TypeScript
51	Angelopvtac/agenttrace-sdk Cross-agent observability for AI workflows — the call stack for AI	22	Experimental	—	TypeScript
52	sajidurdev/calltrace Fast CLI tool for tracing symbol relationships in a codebase.	22	Experimental	—	Rust
53	ELVOR1236/agentlens Track AI agent actions and tool use in Chrome DevTools to debug decisions...	22	Experimental	—	Python
54	nexus66666/Augur-Runtime-Debugging-Agent 🔍 Enhance your coding with Augur, the AI-native runtime debugger that...	22	Experimental	—	TypeScript
55	idreesaziz/agent-trace Universal local debugger and visualizer for multi-agent workflows.	22	Experimental	—	Python
56	StanislavBG/agent-trace CLI-first local observability for AI agents — OTel GenAI semantics stored in...	22	Experimental	—	TypeScript
57	TECHKNOWMAD-LABS/trace-agent Agent observability. pip + MCP server + Claude skill.	22	Experimental	—	Python
58	mailtocsprasad/ai-kd AI-augmented WinDbg extension for automated crash dump triage using Claude	22	Experimental	—	Python
59	crithstudio-hash/vcr-llm Record and replay LLM API conversations for deterministic testing. Zero...	22	Experimental	—	Python
60	advitrocks9/openflux Open standard for AI agent telemetry. 9 frameworks, one schema, zero deps.	22	Experimental	—	Python
61	LangSight/langsight Complete observability for everything an AI agent calls — traces, costs,...	22	Experimental	—	Python
62	zzhiyuann/agentlens Chrome DevTools for AI agents — record, replay, inspect, and test agent...	22	Experimental	—	TypeScript
63	azharmateen/agent-brew Record, replay, and diff WebSocket sessions for debugging.	22	Experimental	—	JavaScript
64	Zijian-Ni/agent-replay 🔄 Record, replay, and debug AI agent execution traces — the DevTools for AI agents	22	Experimental	—	TypeScript
65	rifft-dev/rifft Cross-framework debugger for multi-agent AI systems	22	Experimental	—	TypeScript
66	RapidBotStudio888/Autodebug_pro01 Stop wasting hours on stack traces. Autodebug_pro identifies, explains, and...	22	Experimental	—	—
67	nedbpowell/agenttrace-react Headless React primitives for AI agent execution traces, approval gates, and...	22	Experimental	—	TypeScript
68	JSLEEKR/agent-trace-debugger Chrome DevTools for AI agent pipelines - debug, replay, and diff agent executions	22	Experimental	—	TypeScript
69	darshankparmar/agent-observatory Lightweight observability layer for AI agents with tracing, spans and...	21	Experimental	—	Python
70	vtqveant/symbolic-mlir-debugger Dynamic Symbolic (Concolic) Debugger for MLIR	21	Experimental	2	Python
71	guialfredo/columbo-root-cause-explorer 🕵️ AI-powered root cause analysis for containerized environments. ...	21	Experimental	2	Python
72	getagentd/agentd-py Observability SDK for AI agents. Drop-in replacement for Claude Agent SDK.	21	Experimental	2	Python
73	mttetc/AgentReplay DevTools for replaying AI coding agent sessions. Reads Claude Code JSONL...	20	Experimental	1	Svelte
74	ade-engine/ade The autonomous debugging engine for modern codebases.	20	Experimental	1	—
75	tracemem/tracemem-vercel-ai TraceMem integration for Vercel AI SDK	20	Experimental	1	TypeScript
76	AI8-Algorithm-Intelligence-Section-8/Dissect Dissect helps you understand what's happening inside your AI agent systems....	20	Experimental	1	Python
77	ishu86/Agent-Debugger Chrome DevTools for AI agents — time-travel debugging with fork, patch, and...	20	Experimental	1	Python
78	Sabyasachig/ai-cost-observatory Open-source observability layer for AI agents - Track, analyze, and optimize...	20	Experimental	1	Python
79	polsebas/agente-admin-observabilidad Sistema de análisis automático de alertas con Agno Framework + Grafana...	20	Experimental	1	TypeScript
80	moondef/llm-trace Structured execution traces for LLM debugging – lets AI coding tools see...	20	Experimental	1	TypeScript
81	vexorlabs/beacon Chrome DevTools for AI Agents. Open-source, local-first debugging with...	19	Experimental	—	Python
82	milyas2001/meridian-agent-observability MERIDIAN - Distributed AI Agent Observability with Causal Tracing	19	Experimental	—	Python
83	prajitdatta/AI-Agent-Autopsy Cut open broken LLM agents. Find what killed them. Fix it before you ship.	19	Experimental	—	Python
84	yuan-cloud/agent-cassette Record once → replay forever → deterministic tests for AI agents.	19	Experimental	—	TypeScript
85	stefanoamorelli/opentelemetry-instrumentation-dust OpenTelemetry instrumentation package for the Dust SDK. Automatically...	19	Experimental	—	TypeScript
86	Bluefactordev/InnerTrace Deterministic, causal tracing for complex execution flows — designed for LLM...	19	Experimental	—	Python
87	marcosgabbardo/wiretaps See what your AI agents are sending to LLMs.	19	Experimental	—	Python
88	thomasahle/trace-taxi Trace Taxi Trace Viewer	19	Experimental	10	Svelte
89	certainly-param/tracelens Tracelens - Visual Debugger and Replay Engine for LangGraph Agentic...	19	Experimental	—	Python
90	songyang-dev/agent-motive Intercept LLM agent calls for debugging	19	Experimental	—	Python
91	UPwith-me/Augur-Runtime-Debugging-Agent An autonomous AI debugging agent powered by the Debug Adapter Protocol...	18	Experimental	4	TypeScript
92	trylynxai/reagent Replay Debugger for AI Agents	17	Experimental	3	Python
93	breezy89757/WinDbgAssist AI-powered automated debugging terminal for WinDbg & .NET Dump analysis.	16	Experimental	1	C#
94	originaonxi/tdad-replication Live proof of arXiv:2603.17973 — 100% regression reduction, 30 API calls	15	Experimental	1	Python
95	MoneyCat-inc/otel-agent-coordination Telemetry-based AI agent coordination framework using OpenTelemetry for...	15	Experimental	—	—
96	amitmishrg/agenticlens Visual debugging, tracing, and replay for agent workflows.	14	Experimental	—	JavaScript
97	JohnODowdAI/replaykit Turn failed agent traces into replayable regression cases.	14	Experimental	—	Python
98	AndriGitDev/synapse 🧠 Watch AI Agents Think - Visualize AI decision-making in real-time	14	Experimental	—	TypeScript
99	tomsik21/edge-watchdog AI-assisted infrastructure monitoring service built with Node.js,...	14	Experimental	—	TypeScript
100	GeoffreyWang1117/AgentTrace AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed...	14	Experimental	—	Python
101	a2a-settlement/otel-agent-provenance OpenTelemetry semantic conventions and instrumentation for agent provenance,...	14	Experimental	—	Python
102	bearwash/agent-lens The Git for AI Agents: Rewind, Branch, and Debug Multi-step Reasoning in Real-time.	14	Experimental	—	TypeScript
103	dev-k99/pulsetrace Open-source AI agent monitoring dashboard — OTel traces, eval metrics, agent...	14	Experimental	—	Python
104	diwushennian4955/crewai-observability-nexaapi CrewAI Agent Observability with LoongSuite + NexaAPI: Build monitored,...	14	Experimental	—	—
105	AhmedAllam0/ghosttrace 👻 See what your AI agent almost did. Record agent decisions including...	13	Experimental	2	Python
106	amirkiarafiei/repo-learn Visualizing Deep Agents in Long-Horizon Tasks: Towards Explainable and...	12	Experimental	1	TypeScript
107	CaoDuyThanh/drtrace AI-Powered Log Analysis for Instant Root-Cause Explanations through natural...	12	Experimental	1	Python
108	rodrigoguedes09/AI-decision-timeline-system A visual-first platform to trace, replay, and explain AI decisions with full...	12	Experimental	1	Python
109	sru4ka/agentpulse Real-time observability for AI agents. Track costs, monitor errors, replay prompts	12	Experimental	1	TypeScript
110	logicoflife/crewai-decision-trace Semantic decision telemetry integration for CrewAI.	11	Experimental	—	Python
111	raggedymoon/predictagent Patent-pending AI agent failure prediction - Prevent silent failures before...	11	Experimental	—	—
112	Makonmm/LoggerAI Logger AI is an offensive tool for red team that uses AI agents.	11	Experimental	—	Python
113	JRF-2018/simplest-pal The Simplest PDB Automation Layer for AI-Driven Debugging	11	Experimental	—	Python
114	CoRAL-ASU/TraceBack TRACEBACK: multi-agent attribution framework, traces table-based answers...	11	Experimental	—	HTML
115	om-gupta-30/GCP-Log-Monitor-Agent AI-powered log monitoring and alerting agent built on Google Cloud Platform	11	Experimental	—	Python
116	JRF-2018/jrf_pdb_agent_lib A library for AI-driven debugging and human-AI collaboration using PDB.	11	Experimental	—	Python
117	pedroalexleite/Learning-LangFuse LLM observability with LangFuse: tracing, evaluation, and monitoring, from a...	11	Experimental	—	Jupyter Notebook

Comparisons in this category

agent-trace and agenttrace (34 vs 26)