trailofbits/pajaMAS
Multi-agent system (MAS) hijacking demos
Demonstrates control-flow exploitation attacks across six scenarios—from basic orchestrator hijacking and malicious tools to persistent memory poisoning and unintended agent cycles—using LLM-based multi-agent systems built with Anthropic's API. Attacks manipulate inter-agent communication and workflows by injecting malicious prompts into web content, tool responses, and agent memory, exposing how MAS architectures amplify existing agentic AI vulnerabilities. Includes a naive defense example to illustrate ineffective guardrails against these novel attack vectors.
Stars
42
Forks
3
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 08, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/trailofbits/pajaMAS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
4thfever/cultivation-world-simulator
基于 AI Agent 工作流的修仙世界模拟器,旨在还原智能、开放的仙侠世界。| An open-source Cultivation World Simulator using...
nikmcfly/MiroFish-Offline
Offline multi-agent simulation & prediction engine. English fork of MiroFish with Neo4j + Ollama...
oil-oil/wolfcha
AI-powered Werewolf (Mafia) social deduction game where every player is controlled by top LLMs...
yasserfarouk/negmas
Negotiation Multi-Agent System (A negotiation library designed for situated negotiations within...
cormas/cormas
CORMAS (COmmon pool Ressources and Multi-Agent Simulations)