marlbenchmark/on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

51
/ 100
Established

Implements centralized training with decentralized execution (CTDE) using shared policy networks across agents, with support for diverse benchmarks including SMAC v2, Google Research Football, Hanabi, and MPE environments. The codebase provides environment wrappers, rollout runners, and pre-tuned hyperparameter scripts for each scenario, emphasizing reproducibility through detailed configuration management and documented training curves from the original paper.

1,914 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

1,914

Forks

371

Language

Python

License

MIT

Last pushed

Jul 18, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/marlbenchmark/on-policy"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.