trailofbits/pajaMAS

Multi-agent system (MAS) hijacking demos

/ 100

Emerging

Demonstrates control-flow exploitation attacks across six scenarios—from basic orchestrator hijacking and malicious tools to persistent memory poisoning and unintended agent cycles—using LLM-based multi-agent systems built with Anthropic's API. Attacks manipulate inter-agent communication and workflows by injecting malicious prompts into web content, tool responses, and agent memory, exposing how MAS architectures amplify existing agentic AI vulnerabilities. Includes a naive defense example to illustrate ineffective guardrails against these novel attack vectors.

No Package No Dependents

Maintenance 13 / 25

Adoption 8 / 25

Maturity 15 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

4thfever/cultivation-world-simulator

基于 AI Agent 工作流的修仙世界模拟器，旨在还原智能、开放的仙侠世界。| An open-source Cultivation World Simulator using...

nikmcfly/MiroFish-Offline

Offline multi-agent simulation & prediction engine. English fork of MiroFish with Neo4j + Ollama...

oil-oil/wolfcha

AI-powered Werewolf (Mafia) social deduction game where every player is controlled by top LLMs...

yasserfarouk/negmas

Negotiation Multi-Agent System (A negotiation library designed for situated negotiations within...

cormas/cormas

CORMAS (COmmon pool Ressources and Multi-Agent Simulations)

Explore AI Agents

All categories Trending AI Agent directory Insights