Game-Playing Agents

AI agents that learn to play and solve games through search algorithms, reinforcement learning, and game-tree exploration. Includes implementations for board games (Connect Four, Pac-Man, Mancala), puzzle games (8-Puzzle), and classic game-playing techniques (Minimax, MCTS, Alpha-Beta pruning, DQN). Does NOT include general reinforcement learning frameworks, non-game simulations, or agent orchestration platforms.

There are 168 game-playing agents tracked. 3 score 70 or higher (Verified tier). The highest-rated is facebookresearch/BenchMARL at 76/100 with 580 stars and 785 monthly downloads. 3 of the top 10 are actively maintained.

Get all 168 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=agents&subcategory=game-playing-agents&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
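The same request can be made from a script. The following is a minimal Python sketch using only the standard library. The helper names `build_url` and `fetch_projects` are illustrative, not part of the API, and the shape of the JSON response is not described on this page, so the sketch returns the decoded payload without assuming any field names. The example URL passes `limit=20`; whether a larger limit or paging is needed to retrieve all 168 entries is not stated here.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="agents", subcategory="game-playing-agents", limit=20):
    """Build the query URL used in the curl example (hypothetical helper)."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and return the decoded JSON payload.

    The response structure is an unknown here, so nothing is unpacked.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Building the URL needs no network access.
print(build_url())
```

Calling `fetch_projects(build_url())` performs the actual request and counts against the daily request allowance mentioned above.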

# Agent Score Tier
1 facebookresearch/BenchMARL

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning...

76
Verified
2 datamllab/rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc,...

70
Verified
3 Toni-SM/skrl

Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX,...

70
Verified
4 utiasDSL/gym-pybullet-drones

PyBullet Gymnasium environments for single and multi-agent reinforcement...

64
Established
5 koulanurag/ma-gym

A collection of multi agent environments based on OpenAI gym.

63
Established
6 AgileRL/AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL...

63
Established
7 gtri/scrimmage

Multi-Agent Robotics Simulator

60
Established
8 proroklab/VectorizedMultiAgentSimulator

VMAS is a vectorized differentiable simulator designed for efficient...

60
Established
9 idsc-frazzoli/dg-commons

Driving games common tools

60
Established
10 APLA-Toolbox/pymapf

📍🗺️ A Python library for Multi-Agents Planning and Pathfinding (Centralized...

58
Established
11 microsoft/maro

Multi-Agent Resource Optimization (MARO) platform is an instance of...

52
Established
12 marlbenchmark/on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

51
Established
13 geek-ai/MAgent

A Platform for Many-Agent Reinforcement Learning

50
Established
14 PathPlanning/Continuous-CBS

Continuous CBS - a modification of conflict based search algorithm, that...

50
Established
15 bark-simulator/bark

Open-Source Framework for Development, Simulation and Benchmarking of...

49
Emerging
16 semitable/robotic-warehouse

Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment

49
Emerging
17 mit-acl/mader

Trajectory Planner in Multi-Agent and Dynamic Environments

48
Emerging
18 ArnaudFickinger/gym-multigrid

Lightweight multi-agent gridworld Gym environment

47
Emerging
19 PathPlanning/AA-SIPP-m

Algorithm for prioritized multi-agent path finding (MAPF) in grid-worlds....

47
Emerging
20 multi-commander/Multi-Commander

Multi & Single Agent Reinforcement Learning for Traffic Signal Control Problem

46
Emerging
21 garlicdevs/Fruit-API

A Universal Deep Reinforcement Learning Framework

44
Emerging
22 woctezuma/puissance4

AI for the game "Connect Four". Available on PyPI.

44
Emerging
23 SMARTlab-Purdue/robotarium-rendezvous-RSSDOA

This repository contains the Matlab source codes (to use in Robotarium...

43
Emerging
24 Pieter-Cawood/M-TA-Prioritized-MAPD

Multi-Agent Pickup and Delivery implementation

42
Emerging
25 asieradzk/RL_Matrix

Deep Reinforcement Learning in C#

42
Emerging
26 crowddynamics/crowddynamics

Continuous-time multi-agent crowd simulation engine implemented in Python...

41
Emerging
27 sair-lab/formation

[T-Cyber 2020] Cooperative Pursuit with Multi-pursuer and One Faster...

40
Emerging
28 DamianoBrunori/DAMIAN-Delay-Aware-MultI-Aerial-Navigation-DRL-based-environment-

An OpenAIGym-based framework allowing to test Delay-Aware Deep Reinforcement...

40
Emerging
29 opendilab/GoBigger

[ICLR 2023] Come & try Decision-Intelligence version of "Agar"! Gobigger...

40
Emerging
30 abhisheknaik96/MultiAgentTORCS

The multi-agent version of TORCS for developing control algorithms for fully...

39
Emerging
31 gdalle/MultiAgentPathFinding.jl

Structures and algorithms for Multi-Agent PathFinding in Julia

38
Emerging
32 opendilab/ACE

[AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative...

37
Emerging
33 santifiorino/dino-reinforcement-learning

Evolutionary Reinforcement Learning for Dino Game: Train an AI agent to...

37
Emerging
34 ProValarous/Predator-Prey-Archetype-Gridworld-Environment

A minimalist, discrete multi-agent predator-prey archetype environment.

36
Emerging
35 kyegomez/OpioidRL

OpioidRL is a cutting-edge reinforcement learning (RL) library that...

35
Emerging
36 zhixiangli/gomoku-battle

Gomoku Battle is a cross-language cross-system battle platform.

35
Emerging
37 JacopoPan/gym-marl-reconnaissance

Gym environment for cooperative multi-agent reinforcement learning in...

34
Emerging
38 KaleabTessera/HyperMARL

Adaptive Hypernetworks for Multi-Agent RL. NeurIPS 2025.

34
Emerging
39 trajectoryRL/trajectoryRL

Bittensor Subnet 11 - Decentralized Reinforcement Learning for optimizing...

34
Emerging
40 DamianoBrunori/MultiUAV-OpenAIGym

An OpenAIGym-based framework allowing to test hybrid approaches (RL + path...

33
Emerging
41 puyuan1996/MARL

Implementation for mSAC methods in PyTorch

32
Emerging
42 KornbergFresnel/ModelRepo

reproduce some RL or Multi-Agent models

32
Emerging
43 samshipengs/Coordinated-Multi-Agent-Imitation-Learning

This is an implementation of the paper "Coordinated Multi Agent Imitation...

32
Emerging
44 orlov-ai/beer-game-env

Beer Game implemented as an OpenAI gym environment.

31
Emerging
45 TARTRL/RankingCost

The Ranking Cost algorithm for multi-path routing in gridworlds (multi-agent path planning, circuit routing).

31
Emerging
46 kooktaelee/D2OC

Python and MATLAB codes for Density-Driven Optimal Control (D2OC) using...

30
Emerging
47 hanzheteng/pioneer_mrs

A flexible development platform for Pioneer Multi-Robot Systems (MRS)...

30
Emerging
48 James4Ever0/vimgolf-gym

OpenAI gym style Vimgolf environment and benchmark for AI

30
Emerging
49 DeepGym/deepgym

RL training environments with verifiable rewards for coding agents. Works...

29
Experimental
50 zombie-einstein/esquilax

JAX Multi-Agent RL, Neuro-Evolution, and A-Life Library

29
Experimental
51 IanRDavies/LeMOL

Experimenting with meta-learning approaches to opponent modelling in MARL....

29
Experimental
52 T3AS/MAD-ARL

Python project for the paper "Adversarial Deep Reinforcement Learning for...

28
Experimental
53 DiligentPanda/MAPF-LRR2023

This is the repo for the team Pikachu's solution in the League of Robot...

28
Experimental
54 VideojogosLusofona/color-shape-links-ai-competition

AI competition for IEEE CoG 2021

28
Experimental
55 kaiyoo/AI-agent-Azul-Game-Competition

AI agent game competition - Reinforcement learning (Monte Carlo Tree Search,...

28
Experimental
56 EttoreCaputo/hadron-game

This is a game AI project for the course "Artificial Intelligence" at the...

27
Experimental
57 opendilab/Gobigger-Explore

Still struggling with the high threshold or looking for the appropriate...

27
Experimental
58 zzbuzzard/boxjump

Box Jump is a co-operative multi-agent reinforcement learning environment!

27
Experimental
59 zagoli/MultiAgentPathFinding

Implementation of Sven Koenig's class project about MAPF

27
Experimental
60 T3AS/Benchmarking-QRS-2022

Implementation of "Evaluating the Robustness of Deep Reinforcement Learning...

26
Experimental
61 LijunSun90/pursuitMatrixWorld

Multi-agent pursuit in matrix world (pursuitMW)

25
Experimental
62 NcJie/multiagent-ddpg

Multi-agent DDPG on ml-agents environment

25
Experimental
63 ajheshbasnet/reinforcement-learning-agents

a collection of advanced reinforcement learning (rl) agents and...

25
Experimental
64 ethanmclark1/blocksworld3d

3D version of the classic Blocksworld environment for reinforcement learning

25
Experimental
65 T3AS/ReMAV

Implementation of "ReMAV: Reward Modeling of Autonomous Vehicles for Finding...

24
Experimental
66 SafeRoboticsLab/opinion_game

Multi-agent coordination using game theory and nonlinear opinion dynamics - CDC 2023

24
Experimental
67 zhouker94/Multi-Agent-DRL

Multiagent deep reinforcement learning research project

24
Experimental
68 AdrianDiepeveen/Deep-Reinforcement-Learning-Multi-Agent-Autonomous-Drone-Emergency-Response-System

Deep reinforcement learning system for coordinating autonomous drones in...

24
Experimental
69 praveen-palanisamy/macad-agents

Agents code for Multi-Agent Connected Autonomous Driving (MACAD) described...

24
Experimental
70 bjmwang/minority_game

A concise Python 3 implementation of the minority game

24
Experimental
71 collisonda/fyp-matlab-code

MEng Final Year Project - Reinforcement Learned Collision Avoidance

23
Experimental
72 omron-sinicx/ncf2

Official implementation of "Counterfactual Fairness Filter for Fair-Delay...

23
Experimental
73 sukhitashvili/pong

A reimplementation of Andrej Karpathy's repository for an RL self-learning...

22
Experimental
74 laperdida23/go-playing-agent

🤖 Master Go on a 5x5 board with this AI agent using advanced algorithms for...

22
Experimental
75 Eation5/Reinforcement-Learning-Environments

Collection of custom reinforcement learning environments and agents...

22
Experimental
76 oft2026/mspm

Unofficial PyTorch reproduction of MSPM: A Modularized and Scalable...

22
Experimental
77 zhy0/dmarket_rl

Fast single unit, double auction market for reinforcement learning

22
Experimental
78 jeffasante/RepoGym

A reinforcement learning environment for AI coding agents built from real...

22
Experimental
79 legalaspro/unity_multiagent_rl

Multi-agent reinforcement learning framework for Unity environments....

22
Experimental
80 infinitycloud-ch/roboticprogramai

Cognitive robotics: Unitree Go2 + Isaac Sim 5.1.0 + ROS2 on DGX Spark...

22
Experimental
81 FareedKhan-dev/ai-gaming-agent

A step by step implementation of building an AI agent that plays 3d shooting game

22
Experimental
82 Hanny658/MAVSPOI

Multi-Agent Voting Scheme for real-time POI Recommendation. With API-based...

22
Experimental
83 martinchapman/hands

Run AI Search Game (Hide-and-Seek) simulations as a decision-support tool

21
Experimental
84 biological-alignment-benchmarks/zoo_to_gym_multiagent_adapter

Enables you to convert a PettingZoo environment to a Gym environment while...

21
Experimental
85 LukasSchaefer/MSc_Curiosity_MARL

MSc Informatics dissertation project - University of Edinburgh: Curiosity in...

21
Experimental
86 CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

We use policy gradient to help agents learn optimal policies in a...

21
Experimental
87 william-dan/rl-elevator

Event‑driven elevator dispatch Gymnasium environment with FIFO/LOOK...

21
Experimental
88 aldoeliacim/sumo-mappo

MAPPO experiments for unsignalized intersections using SUMO and Gym.

21
Experimental
89 davide97l/Pacman

Implementation of many popular AI algorithms to play the game of Pacman such...

21
Experimental
90 omarathon/rl-multi-agent-car-parking

simulation/RL - multi-agent car parking using reinforcement learning

20
Experimental
91 BinLee26/InterAgent

[CVPR2026] InterAgent: Physics-based Multi-agent Command Execution via...

20
Experimental
92 SafeRoboticsLab/Who_Plays_First

Repository for "Who Plays First? Optimizing the Order of Play in Stackelberg...

20
Experimental
93 uzumstanley/Multi-Agent-AI-Researcher

Multi-Agent-AI-Researcher-Powered-by-DeepSeek-R1-main

19
Experimental
94 studiofarzulla/friction-marl

Multi-Agent Reinforcement Learning with Friction Dynamics — code companion...

19
Experimental
95 Lucien-MG/gym-codingame

A high-performance Gymnasium wrapper for CodinGame engines, enabling...

19
Experimental
96 SinyZXJ/COMPASS

COMPASS: Cooperative Multi-Agent Persistent Surveillance using...

19
Experimental
97 NaimurRahman-Niaz02/Ultimate-Tic-Tac-Toe-with-AI-Agent

Ultimate Tic-Tac-Toe is a strategic web game featuring an AI opponent where...

19
Experimental
98 DongChen06/PVRL

Photovoltaic control using RL methods.

18
Experimental
99 Wadaboa/flatland-challenge

Multi-agent reinforcement learning on trains, for Deep Learning class at UNIBO

18
Experimental
100 ZhuohuiZhang/TGCNet

This is the official implementation of [AAAI'25 Oral] accepted paper:...

18
Experimental
101 Cognitive-AI-Systems/mats-lp

[AAAI-2024] MATS-LP addresses the challenging problem of decentralized...

18
Experimental
102 ElectroCubic/Multi-Agent-Pathfinding-Sim

An interactive multi-agent pathfinding simulator in Python using Pygame, and...

17
Experimental
103 osamahmada2024/3D-Maze-Space-Arena

A high-performance 3D maze simulation featuring 9 AI pathfinding algorithms,...

17
Experimental
104 nicoleorzan/marl-mo

Multi-Objective Multi-Agent RL with non-linear utility functions

17
Experimental
105 jellyheadandrew/autoresearch-robotics

Autonomous robotics research with simulation feedback

17
Experimental
106 CognitiveAISystems/mats-lp

[AAAI-2024] MATS-LP addresses the challenging problem of decentralized...

17
Experimental
107 Piyushi-0/Fair-MAMAB

Code for our AAMAS '25 oral paper, 'Multi-agent Multi-armed Bandits with...

16
Experimental
108 sahajrajmalla/dhumbal-ai

Code and simulations for "Optimizing AI Agents for Dhumbal," the first AI...

16
Experimental
109 aiaaee/Stochastic-Policy-Iteration-in-Markov-Environments

This project implements policy iteration in a stochastic environment using...

16
Experimental
110 pspanoudakis/Berkeley-Pacman-Projects

Berkeley Pac-Man 🤤◽◽◽👻 projects 0, 1 & 2 solutions

16
Experimental
111 iamvigneshwars/ai-walkers-ppo-pytorch

AI agent learns to walk, run, hop and crawl without any given data using...

16
Experimental
112 stillonearth/bevy_rl_shooter

Multi-Agent FPS Gym Environment with bevy_rl

16
Experimental
113 ormai/hypersonic

Bomberman-like, turn-based game played by two competing AI Agents

15
Experimental
114 azizi-zahra/xoshift-ai-agent

AI Project - An AI agent for playing XOShift with Python (Spring 2025)

15
Experimental
115 metazoic/hierlearning

HierLearning is a C++11 implementation of a multi-agent, hierarchical...

15
Experimental
116 imbulana/multi-agent-perceiver

Multi-Agent Perceiver Critic for Robotic Warehouse (RWARE) Coordination.

15
Experimental
117 SvetLuna-Lab/Highrise-fire-uav-response-demo-

Simulation demo: coordinating a small fleet of UAVs to suppress fires on...

15
Experimental
118 iam-weijie/alphataxx

AI agent that plays Ataxx

15
Experimental
119 Nikhil-Singla/go-playing-agent

Go Playing AI Agent: A sophisticated artificial intelligence system that...

15
Experimental
120 jianzhnie/RLZero

A clean and easy implementation of MuZero, AlphaZero and Self-Play...

15
Experimental
121 rap-lab-org/public_pymcpf-d

Multi-Agent Combinatorial Path Finding with Heterogeneous Task Duration (MCPF-D)

15
Experimental
122 reubenwong97/NFSP-PEG-GridWorld

Implementation of Neural Fictitious Self-Play for a GridWorld based...

15
Experimental
123 damn8daniel/multi-agent-rl

Multi-Agent RL: MAPPO, cooperative environments, centralized training...

14
Experimental
124 Chris-airobot/adversarial-rl-project

Research project on adversarial reinforcement learning, PPO training, and...

14
Experimental
125 marojeff123/Snake-double-deep-Q-learning

🐍 Implement deep Q-learning to enhance the Snake game experience, enabling...

14
Experimental
126 alextousss/wargames

Two agents shooting at each other, controlled by a neural network optimized...

14
Experimental
127 minhtoan-tran/nckh2026-uet-warehouse-robots

Student research: Multi-robot warehouse optimization (Simulation & Physical...

14
Experimental
128 kvr06-ai/trust-based-public-goods-game

Multi-agent simulation of a public goods game with trust dynamics and...

14
Experimental
129 pulakk/ConflictAvoidantCBS-MAPF

Conflict Avoidant CBS (CA-CBS)

14
Experimental
130 LuddeWessen/assembly-robot-manager-minizinc

A scheduler to manage a multi tool dual arm robot while avoiding arm-to-arm...

14
Experimental
131 Dophinjet/lunarlander-dqn-comparison

🚀 Analyze and compare value-based deep reinforcement learning algorithms on...

14
Experimental
132 MAYANK12-WQ/multi-agent-robotics-lab

5 AI agents collaborate in real-time to research, implement, test, review,...

14
Experimental
133 camlischke1/marl-anomaly-detect

This project tests multiple different machine learning algorithms that can...

14
Experimental
134 jaychampaneri14/multi-agent-sim

Competitive and cooperative multi-agent RL environment

14
Experimental
135 gagan0116/Multi_Agent_Traffic_Control_RL

Decentralized multi-agent RL traffic signal control using Dueling DQN + GAT...

14
Experimental
136 DavidMANZI-093/RexAI

A Chrome Dinosaur game clone powered by NEAT (NeuroEvolution of Augmenting...

14
Experimental
137 kennycornellius-collab/SnakeGameRL

Training an MLP and a CNN based policy to compare the performance in a game of snake

14
Experimental
138 alpc91/SMERL

[ICML 2024] Official environments and JAX-implementations for...

13
Experimental
139 GUT-AI/multi-robot-path-planning

Multi-Robot Path Planning

13
Experimental
140 Sebastian-Griesbach/Minimax-Multi-Agent-Deep-Deterministic-Policy-Gradient

A general pytorch implementation of the Minimax Multi-Agent Deep...

13
Experimental
141 alessandrositta/Flatland_challenge

Repository containing the code and explanation of a solution to the Flatland...

13
Experimental
142 jyotishp/multiagent-collision-avoidance

Decentralized multi-agent collision avoidance

12
Experimental
143 ghayda-njaafreh/tetris-ml-agent

Tetris game in Python (Pygame) with an ML-based agent + training/testing curves.

12
Experimental
144 tuan-nv0505/Snake-Q-learning

Q-learning for playing Snake game

12
Experimental
145 wilrop/ramo

Algorithms for computing or learning equilibria in multi-objective games

12
Experimental
146 YohannTPN/CrossyRoadAI

AI agent that learns to play Crossy Road using genetic algorithms and neural...

12
Experimental
147 muhammadwaheedairi/hackathon_textbook_ai_robotics

Personal AI-Robotics portfolio: ROS 2, Gazebo, NVIDIA Isaac, VLA systems —...

12
Experimental
148 ben-ogden/rllib-trading-arena

🏟️ A competitive multi-agent trading arena using RLlib + Ray

12
Experimental
149 damat-le/mage

Multi-Agent Grid Environment (MAGE)

12
Experimental
150 hansman/multi-agent-reinforcement-learning

Multi-Agent Reinforcement Learning with Deep Sarsa Agents

12
Experimental
151 gkc741/Snake-AI

Snake AI-agent using Neuroevolution

12
Experimental
152 tuan-nv0505/Snake-Deep-Q-Learning

Deep Q-learning (DQL) for playing Snake game

11
Experimental
153 gdalle/Flatland.jl

A barebones Julia version of the Flatland railway simulator

11
Experimental
154 Farid-Karimi/Shover

Shover-World is a grid-based reinforcement learning environment built on the...

11
Experimental
155 Rana-inan/WizardOfWor-AiRemake

A Python remake of the classic Wizard of Wor game with AI-controlled players.

11
Experimental
156 khanbilal-devop/intelligent-agent-frameworks

Implementations of AI search algorithms for goal-based agents. Includes...

11
Experimental
157 SorerBOT/Watering-Problem

Using Planning, Markov Decision Processes, Reinforcement Learning and other...

11
Experimental
158 Jeffawe/Space-Shooter

Space Shooter Retro Game built using Amazon Q

11
Experimental
159 lu-m-dev/python-games

A collection of classic games with human and AI agents

11
Experimental
160 h24abdal/tic-tac-toe-reinforcement-learning

Reinforcement learning solution for tic tac toe implemented in python.

11
Experimental
161 sakshampandey1901/MountainCar

A Deep Q-Network on Gymnasium’s MountainCar-v0 with reward shaping and...

11
Experimental
162 irgidev/kaggle-connect-x-agent

An AI agent designed to play the 'Connect X' game from the Kaggle...

11
Experimental
163 tene04/Neuroevolution_vs_DeepQ-Learning

Comparison of Neuroevolution and DQL to train agents on gym environment (Flappy bird)

11
Experimental
164 plss12/Connect-X-AlphaZero

Reinforcement learning agents for Connect4, featuring a robust AlphaZero...

11
Experimental
165 alexisjapas/mystic-square

Multi-agent mystic square game solver

11
Experimental
166 Dodo2k01/HeartsGame

University School Project that required us to implement a game engine, ui,...

11
Experimental
167 ItsOrv/Ai-Society

Complete RL implementation from scratch: neural networks, PPO, 3D...

11
Experimental
168 superboySB/matrix-game-baselines

learning value-based method, started by one-step matrix games

10
Experimental
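
The page does not state where one tier ends and the next begins. As a rough sketch, the ranges can be read off the listing itself: the Python snippet below takes the highest- and lowest-scoring row of each tier as it appears above and reports the observed score range. These are observed ranges only, not official thresholds; in particular, no listed project scores between 65 and 69, so the boundary between Established and Verified cannot be pinned down from this data. The function name `observed_ranges` is illustrative.

```python
# (score, tier) pairs copied from the listing above: the highest- and
# lowest-scoring row shown for each tier.
OBSERVED = [
    (76, "Verified"), (70, "Verified"),
    (64, "Established"), (50, "Established"),
    (49, "Emerging"), (30, "Emerging"),
    (29, "Experimental"), (10, "Experimental"),
]


def observed_ranges(rows):
    """Map each tier to the (lowest, highest) score seen for it."""
    ranges = {}
    for score, tier in rows:
        low, high = ranges.get(tier, (score, score))
        ranges[tier] = (min(low, score), max(high, score))
    return ranges


for tier, (low, high) in observed_ranges(OBSERVED).items():
    print(f"{tier}: {low}-{high}")
```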