qualifire-dev/rogue

AI Agent Evaluator & Red Team Platform

/ 100

Established

Provides dual-mode testing for AI agents: automatic evaluation against business policies, and red team attacks spanning 75+ vulnerabilities across 12 security categories with CVSS-based risk scoring. Supports multiple agent protocols (A2A, MCP, and direct Python functions) through a client-server architecture with TUI, CLI, and programmatic interfaces. Includes 8 compliance frameworks (among them OWASP, MITRE, NIST, the EU AI Act, and GDPR) and supports reproducible, seeded scans for regression testing.
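To illustrate why seeding makes a scan usable for regression testing, here is a minimal Python sketch. It does not use Rogue's API: the attack names, `ATTACKS`, and `plan_scan` are hypothetical and exist only to show that a fixed seed yields the same attack selection on every run.

```python
import random

# Hypothetical attack catalogue; these names are illustrative, not Rogue's.
ATTACKS = [
    "prompt_injection",
    "system_prompt_leak",
    "pii_extraction",
    "tool_misuse",
    "jailbreak_roleplay",
    "data_exfiltration",
]


def plan_scan(seed: int, sample_size: int = 3) -> list[str]:
    """Pick a reproducible subset of attacks for one scan (assumed helper)."""
    # A local generator keeps the selection independent of global random state.
    rng = random.Random(seed)
    return rng.sample(ATTACKS, sample_size)


baseline = plan_scan(seed=42)
rerun = plan_scan(seed=42)
print(baseline == rerun)  # True: same seed, same attack plan
```

Because the selection depends only on the seed, a finding from one scan can be rechecked after a fix by rerunning with the same seed.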

1,012 stars. Actively maintained with 2 commits in the last 30 days.

No package, no dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 23 / 25

Stars: 1,012

Forks: 160

Language: Python

License: not listed

Featured in

You're Shipping AI You Can't Measure

Related agents

StonyBrookNLP/appworld

🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and...

future-agi/ai-evaluation

Evaluation Framework for all your AI related Workflows

microsoft/WindowsAgentArena

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of...

agentscope-ai/OpenJudge

OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards

SparkBeyond/agentune

Tune your AI Agent to best meet its KPI with a cyclic process of analyze, improve and simulate
