qualifire-dev/rogue
AI Agent Evaluator & Red Team Platform
Provides dual-mode testing for AI agents: automatic evaluation against business policies and red team attacks spanning 75+ vulnerabilities across 12 security categories with CVSS-based risk scoring. Supports multiple agent protocols (A2A, MCP, direct Python functions) with a client-server architecture offering TUI, CLI, and programmatic interfaces. Covers 8 compliance frameworks, including OWASP, MITRE, NIST, EU AI Act, and GDPR, and supports reproducible scans via seeding for regression testing.
1,012 stars. Actively maintained with 2 commits in the last 30 days.
Stars: 1,012
Forks: 160
Language: Python
License: —
Last pushed: Mar 04, 2026
Commits (30d): 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/qualifire-dev/rogue"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
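The same request can be made from Python using only the standard library. This is a minimal sketch: the helper names `quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which the page does not state. The response fields are not documented here, so the example prints whatever comes back rather than reading specific keys.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality endpoint URL for an owner/repo pair (hypothetical helper)."""
    owner_q = urllib.parse.quote(owner, safe="")
    repo_q = urllib.parse.quote(repo, safe="")
    return f"{BASE_URL}/{owner_q}/{repo_q}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes (unverified) that the body is JSON."""
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # No API key is sent; the page says anonymous access is limited to 100 requests/day.
    data = fetch_quality("qualifire-dev", "rogue")
    print(json.dumps(data, indent=2))
```

Given the anonymous limit of 100 requests/day, it is probably worth caching the result locally rather than calling the endpoint on every run.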
Related agents
StonyBrookNLP/appworld
🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and...
future-agi/ai-evaluation
Evaluation Framework for all your AI related Workflows
microsoft/WindowsAgentArena
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of...
agentscope-ai/OpenJudge
OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards
SparkBeyond/agentune
Tune your AI Agent to best meet its KPI with a cyclic process of analyze, improve and simulate