Harshit-J004/toolguard
Pytest-style reliability testing for AI agent tool chains. Catches hallucinated payloads, schema errors, and cascading failures before production.
Provides automated fuzzing and DAG-based execution tracing that identifies tool-chain vulnerabilities without live LLM calls—instead using type hints to generate deterministic failure scenarios (null propagation, type mismatches, cascading errors). Integrates natively with LangChain, CrewAI, Swarm, and AutoGen through context vars instrumentation, while a 6-layer security firewall adds human-in-the-loop approval gates for high-risk tool execution and recursive prompt-injection detection.
Stars
5
Forks
1
Language
Python
License
MIT
Category
Last pushed
Mar 17, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/Harshit-J004/toolguard"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
aptible/unpage
Unpage is the open source framework for building SRE agents with infrastructure context and...
valmi-io/value
⚡ "Value" - https://value.valmi.io . Valmi Value is Outcome-based billing and payments...
driftbase-labs/driftbase-python
Local-first behavioral drift monitoring for AI agents. One decorator, SQLite, no cloud.
2001Haru/TokenWaster
The Most Useless EVER agent assistant in the Human History. Always trying to read everything in...
govynAI/govyn
Governance proxy for AI agents — per-agent budgets, cost tracking, loop detection, and policy...