Harshit-J004/toolguard

Pytest-style reliability testing for AI agent tool chains. Catches hallucinated payloads, schema errors, and cascading failures before production.

/ 100

Emerging

Provides automated fuzzing and DAG-based execution tracing that identifies tool-chain vulnerabilities without live LLM calls—instead using type hints to generate deterministic failure scenarios (null propagation, type mismatches, cascading errors). Integrates natively with LangChain, CrewAI, Swarm, and AutoGen through context vars instrumentation, while a 6-layer security firewall adds human-in-the-loop approval gates for high-risk tool execution and recursive prompt-injection detection.

No Package No Dependents

Maintenance 13 / 25

Adoption 4 / 25

Maturity 9 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

aptible/unpage

Unpage is the open source framework for building SRE agents with infrastructure context and...

valmi-io/value

⚡ "Value" - https://value.valmi.io . Valmi Value is Outcome-based billing and payments...

driftbase-labs/driftbase-python

Local-first behavioral drift monitoring for AI agents. One decorator, SQLite, no cloud.

2001Haru/TokenWaster

The Most Useless EVER agent assistant in the Human History. Always trying to read everything in...

govynAI/govyn

Governance proxy for AI agents — per-agent budgets, cost tracking, loop detection, and policy...

Explore AI Agents

All categories Trending AI Agent directory Insights