toxy4ny/kevlar-benchmark

Kevlar Benchmark: OWASP Top 10 for Agentic Apps (AI-Agents) 2026 a Red Team Benchmark

/ 100

Emerging

Automated red-team framework that tests all 10 OWASP Agent-Specific Injection (ASI) vulnerabilities through modular exploit simulators (goal hijacking, RCE chains, memory poisoning) ordered by real-world attack criticality. Includes a prioritized threat orchestrator, detection engines for data exfiltration and goal drift, and AIVSS scoring integration. Supports both mock and real LangChain agent modes via CLI with individual ASI test scripts, CI/CD integration, and structured JSON reporting.

No Package No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 9 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

Ed1s0nZ/CyberStrikeAI

CyberStrikeAI is an AI-native security testing platform built in Go. It integrates 100+ security...

vxcontrol/pentagi

✨ Fully autonomous AI Agents system capable of performing complex penetration testing tasks

GH05TCREW/pentestagent

PentestAgent is an AI agent framework for black-box security testing, supporting bug bounty,...

SanMuzZzZz/LuaN1aoAgent

LuaN1aoAgent is a cognitive-driven AI hacker. It is a fully autonomous AI penetration testing...

asaotomo/FofaMap

FofaMap v2.0 是一款基于 Python3 开发的全网首个 AI 驱动红队资产测绘智能体。在延续原有 FOFA 数据采集、存活检测、统计聚合、图标 Hash...

Explore AI Agents

All categories Trending AI Agent directory Insights