Pro-GenAI/Agent-Action-Guard

🛡️ Safe AI Agents through Action Classifier

/ 100

Established

Intercepts tool calls from AI agents in real-time using a lightweight neural classifier trained on the HarmActions dataset to block unsafe actions before execution. Addresses a critical gap: testing revealed 95%+ of LLMs execute harmful actions when given access to dangerous tools, often while claiming refusal. Integrates seamlessly into agent loops as a middleware layer between agents and their tool implementations.

Available on PyPI.

Maintenance 13 / 25

Adoption 11 / 25

Maturity 18 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Featured in

Agent Governance in 2026: Who's Building the Guardrails? Agent Platforms Are Four Problems, Not One Your Agent Doesn't Have an Email Address (Yet) Your Agent is Hitting its Ceiling — Who's Actually Fixing It

Related agents

microsoft/agent-governance-toolkit

AI Agent Governance Toolkit — Policy enforcement, zero-trust identity, execution sandboxing, and...

ucsandman/DashClaw

🛡️Decision infrastructure for AI agents. Intercept actions, enforce guard policies, require...

vstorm-co/pydantic-ai-middleware

Middleware layer for Pydantic AI — intercept, transform & guard agent calls with 7 lifecycle...

mattijsmoens/sovereign-shield

AI security framework: tamper-proof action auditing, prompt injection firewall, ethical...

vstorm-co/pydantic-ai-shields

Guardrail capabilities for Pydantic AI — cost tracking, prompt injection detection, PII...

Explore AI Agents

All categories Trending AI Agent directory Insights