marlbenchmark/on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

/ 100

Established

Implements centralized training with decentralized execution (CTDE) using shared policy networks across agents, with support for diverse benchmarks including SMAC v2, Google Research Football, Hanabi, and MPE environments. The codebase provides environment wrappers, rollout runners, and pre-tuned hyperparameter scripts for each scenario, emphasizing reproducibility through detailed configuration management and documented training curves from the original paper.

1,914 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

How are scores calculated?

Stars

1,914

Forks

371

Language

Python

License

MIT

Related agents

facebookresearch/BenchMARL

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL...

datamllab/rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Toni-SM/skrl

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX, and NVIDIA Warp) with...

utiasDSL/gym-pybullet-drones

PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control

koulanurag/ma-gym

A collection of multi agent environments based on OpenAI gym.

Explore AI Agents

All categories Trending AI Agent directory Insights