Game-Playing Agents

AI agents that learn to play and solve games through search algorithms, reinforcement learning, and game-tree exploration. Includes implementations for board games (Connect Four, Pac-Man, Mancala), puzzle games (8-Puzzle), and classic game-playing techniques (Minimax, MCTS, Alpha-Beta pruning, DQN). Does NOT include general reinforcement learning frameworks, non-game simulations, or agent orchestration platforms.

There are 168 game-playing agents tracked. 3 score above 70 (verified tier). The highest-rated is facebookresearch/BenchMARL at 76/100 with 580 stars and 785 monthly downloads. 3 of the top 10 are actively maintained.

Get all 168 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=agents&subcategory=game-playing-agents&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Agent	Score	Tier	Stars	Language
1	facebookresearch/BenchMARL BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning...	76	Verified	580	Python
2	datamllab/rlcard Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc,...	70	Verified	3,416	Python
3	Toni-SM/skrl Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX,...	70	Verified	1,011	Python
4	utiasDSL/gym-pybullet-drones PyBullet Gymnasium environments for single and multi-agent reinforcement...	64	Established	1,885	Python
5	koulanurag/ma-gym A collection of multi agent environments based on OpenAI gym.	63	Established	629	Python
6	AgileRL/AgileRL Streamlining reinforcement learning with RLOps. State-of-the-art RL...	63	Established	896	Python
7	gtri/scrimmage Multi-Agent Robotics Simulator	60	Established	176	C++
8	proroklab/VectorizedMultiAgentSimulator VMAS is a vectorized differentiable simulator designed for efficient...	60	Established	531	Python
9	idsc-frazzoli/dg-commons Driving games common tools	60	Established	30	Python
10	APLA-Toolbox/pymapf 📍🗺️ A Python library for Multi-Agents Planning and Pathfinding (Centralized...	58	Established	76	Python
11	microsoft/maro Multi-Agent Resource Optimization (MARO) platform is an instance of...	52	Established	910	Python
12	marlbenchmark/on-policy This is the official implementation of Multi-Agent PPO (MAPPO).	51	Established	1,914	Python
13	geek-ai/MAgent A Platform for Many-Agent Reinforcement Learning	50	Established	1,758	Python
14	PathPlanning/Continuous-CBS Continuous CBS - a modification of conflict based search algorithm, that...	50	Established	259	C++
15	bark-simulator/bark Open-Source Framework for Development, Simulation and Benchmarking of...	49	Emerging	304	C++
16	semitable/robotic-warehouse Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment	49	Emerging	416	Python
17	mit-acl/mader Trajectory Planner in Multi-Agent and Dynamic Environments	48	Emerging	598	C++
18	ArnaudFickinger/gym-multigrid Lightweight multi-agent gridworld Gym environment	47	Emerging	213	Python
19	PathPlanning/AA-SIPP-m Algorithm for prioritized multi-agent path finding (MAPF) in grid-worlds....	47	Emerging	124	C++
20	multi-commander/Multi-Commander Multi & Single Agent Reinforcement Learning for Traffic Signal Control Problem	46	Emerging	130	Python
21	garlicdevs/Fruit-API A Universal Deep Reinforcement Learning Framework	44	Emerging	71	Python
22	woctezuma/puissance4 AI for the game "Connect Four". Available on PyPI.	44	Emerging	5	Python
23	SMARTlab-Purdue/robotarium-rendezvous-RSSDOA This repository contains the Matlab source codes (to use in Robotarium...	43	Emerging	48	MATLAB
24	Pieter-Cawood/M-TA-Prioritized-MAPD Multi-Agent Pickup and Delivery implementation	42	Emerging	38	Python
25	asieradzk/RL_Matrix Deep Reinforcement Learning in C#	42	Emerging	302	C#
26	crowddynamics/crowddynamics Continuous-time multi-agent crowd simulation engine implemented in Python...	41	Emerging	45	Python
27	sair-lab/formation [T-Cyber 2020] Cooperative Pursuit with Multi-pursuer and One Faster...	40	Emerging	46	Python
28	DamianoBrunori/DAMIAN-Delay-Aware-MultI-Aerial-Navigation-DRL-based-environment- An OpenAIGym-based framework allowing to test Delay-Aware Deep Reinforcement...	40	Emerging	57	Python
29	opendilab/GoBigger [ICLR 2023] Come & try Decision-Intelligence version of "Agar"! Gobigger...	40	Emerging	481	Python
30	abhisheknaik96/MultiAgentTORCS The multi-agent version of TORCS for developing control algorithms for fully...	39	Emerging	146	Python
31	gdalle/MultiAgentPathFinding.jl Structures and algorithms for Multi-Agent PathFinding in Julia	38	Emerging	15	Julia
32	opendilab/ACE [AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative...	37	Emerging	238	Python
33	santifiorino/dino-reinforcement-learning Evolutionary Reinforcement Learning for Dino Game: Train an AI agent to...	37	Emerging	90	Processing
34	ProValarous/Predator-Prey-Archetype-Gridworld-Environment A minimalist, discrete multi-agent predator-prey archytype environment.	36	Emerging	6	Python
35	kyegomez/OpioidRL OpioidRL is a cutting-edge reinforcement learning (RL) library that...	35	Emerging	8	Python
36	zhixiangli/gomoku-battle Gomoku Battle is a cross-language cross-system battle platform.	35	Emerging	19	Java
37	JacopoPan/gym-marl-reconnaissance Gym environment for cooperative multi-agent reinforcement learning in...	34	Emerging	51	Python
38	KaleabTessera/HyperMARL Adaptive Hypernetworks for Multi-Agent RL. NeurIPS 2025.	34	Emerging	14	Python
39	trajectoryRL/trajectoryRL Bittensor Subnet 11 - Decentralized Reinforcement Learning for optimizing...	34	Emerging	7	Python
40	DamianoBrunori/MultiUAV-OpenAIGym An OpenaAIGym-based framework allowing to test hybrid approaches (RL + path...	33	Emerging	142	Python
41	puyuan1996/MARL Implementation for mSAC methods in PyTorch	32	Emerging	41	Python
42	KornbergFresnel/ModelRepo reproduce some RL or Multi-Agent models	32	Emerging	35	Python
43	samshipengs/Coordinated-Multi-Agent-Imitation-Learning This is an implementation of the paper "Coordinated Multi Agent Imitation...	32	Emerging	41	Jupyter Notebook
44	orlov-ai/beer-game-env Beer Game implemented as an OpenAI gym environment.	31	Emerging	17	Python
45	TARTRL/RankingCost The Ranking Cost algorithm for multi-path routing of gridworld.(多智能体路径规划，电路规划)	31	Emerging	19	Python
46	kooktaelee/D2OC Python and MATLAB codes for Density-Driven Optimal Control (D2OC) using...	30	Emerging	12	MATLAB
47	hanzheteng/pioneer_mrs A flexible development platform for Pioneer Multi-Robot Systems (MRS)...	30	Emerging	13	Python
48	James4Ever0/vimgolf-gym OpenAI gym style Vimgolf environment and benchmark for AI	30	Emerging	11	Python
49	DeepGym/deepgym RL training environments with verifiable rewards for coding agents. Works...	29	Experimental	3	Python
50	zombie-einstein/esquilax JAX Multi-Agent RL, Neuro-Evolution, and A-Life Library	29	Experimental	12	Python
51	IanRDavies/LeMOL Experimenting with meta-learning approaches to opponent modelling in MARL....	29	Experimental	14	Python
52	T3AS/MAD-ARL Python project for the paper "Adversarial Deep Reinforcement Learning for...	28	Experimental	14	Python
53	DiligentPanda/MAPF-LRR2023 This is the repo for the team Pikachu's solution in the League of Robot...	28	Experimental	27	C++
54	VideojogosLusofona/color-shape-links-ai-competition AI competition for IEEE CoG 2021	28	Experimental	3	C#
55	kaiyoo/AI-agent-Azul-Game-Competition AI agent game competition - Reinforcement learning (Monte Carlo Tree Search,...	28	Experimental	11	Python
56	EttoreCaputo/hadron-game This is a game AI project for the course "Artificial Intelligence" at the...	27	Experimental	5	Python
57	opendilab/Gobigger-Explore Still struggling with the high threshold or looking for the appropriate...	27	Experimental	182	Python
58	zzbuzzard/boxjump Box Jump is a co-operative multi-agent reinforcement learning environment!	27	Experimental	13	Python
59	zagoli/MultiAgentPathFinding Implementation of Sven Koenig's class project about MAPF	27	Experimental	6	Python
60	T3AS/Benchmarking-QRS-2022 Implementation of "Evaluating the Robustness of Deep Reinforcement Learning...	26	Experimental	7	Python
61	LijunSun90/pursuitMatrixWorld Multi-agent pursuit in matrix world (pursuitMW)	25	Experimental	5	Python
62	NcJie/multiagent-ddpg Multi-agent DDPG on ml-agents environment	25	Experimental	4	Jupyter Notebook
63	ajheshbasnet/reinforcement-learning-agents a collection of advanced reinforcement learning (rl) agents and...	25	Experimental	19	Jupyter Notebook
64	ethanmclark1/blocksworld3d 3D version of the classic Blocksworld environment for reinforcement learning	25	Experimental	3	Python
65	T3AS/ReMAV Implementation of "ReMAV: Reward Modeling of Autonomous Vehicles for Finding...	24	Experimental	4	Jupyter Notebook
66	SafeRoboticsLab/opinion_game Multi-agent coordination using game theory and nonlinear opinion dynamics - CDC 2023	24	Experimental	14	Python
67	zhouker94/Multi-Agent-DRL Multiagent deep reinforcement learning research project	24	Experimental	29	Jupyter Notebook
68	AdrianDiepeveen/Deep-Reinforcement-Learning-Multi-Agent-Autonomous-Drone-Emergency-Response-System Deep reinforcement learning system for coordinating autonomous drones in...	24	Experimental	1	Python
69	praveen-palanisamy/macad-agents Agents code for Multi-Agent Connected Autonomous Driving (MACAD) described...	24	Experimental	24	Python
70	bjmwang/minority_game 少数派博弈的一个简明Python3实现	24	Experimental	3	Python
71	collisonda/fyp-matlab-code MEng Final Year Project - Reinforcement Learned Collision Avoidance	23	Experimental	2	MATLAB
72	omron-sinicx/ncf2 Official implementation of "Counterfactual Fairness Filter for Fair-Delay...	23	Experimental	2	Jupyter Notebook
73	sukhitashvili/pong A reimplementation of Andrej Karpathy's repository for an RL self-learning...	22	Experimental	17	Python
74	laperdida23/go-playing-agent 🤖 Master Go on a 5x5 board with this AI agent using advanced algorithms for...	22	Experimental	—	Python
75	Eation5/Reinforcement-Learning-Environments Collection of custom reinforcement learning environments and agents...	22	Experimental	—	Scala
76	oft2026/mspm Unofficial PyTorch reproduction of MSPM: A Modularized and Scalable...	22	Experimental	—	Python
77	zhy0/dmarket_rl Fast single unit, double auction market for reinforcement learning	22	Experimental	7	Python
78	jeffasante/RepoGym A reinforcement learning environment for AI coding agents built from real...	22	Experimental	—	Rust
79	legalaspro/unity_multiagent_rl Multi-agent reinforcement learning framework for Unity environments....	22	Experimental	12	Python
80	infinitycloud-ch/roboticprogramai Cognitive robotics: Unitree Go2 + Isaac Sim 5.1.0 + ROS2 on DGX Spark...	22	Experimental	—	Python
81	FareedKhan-dev/ai-gaming-agent A step by step implementation of building an AI agent that plays 3d shooting game	22	Experimental	20	Python
82	Hanny658/MAVSPOI Multi-Agent Voting Scheme for real-time POI Recommendation. With API-based...	22	Experimental	—	—
83	martinchapman/hands Run AI Search Game (Hide-and-Seek) simulations as a decision-support tool	21	Experimental	2	Java
84	biological-alignment-benchmarks/zoo_to_gym_multiagent_adapter Enables you to convert a PettingZoo environment to a Gym environment while...	21	Experimental	2	Python
85	LukasSchaefer/MSc_Curiosity_MARL MSc Informatics dissertation project - University of Edinburgh: Curiosity in...	21	Experimental	13	Python
86	CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting We use policy gradient to help agents learn optimal policies in a...	21	Experimental	12	Jupyter Notebook
87	william-dan/rl-elevator Event‑driven elevator dispatch Gymnasium environment with FIFO/LOOK...	21	Experimental	—	Jupyter Notebook
88	aldoeliacim/sumo-mappo MAPPO experiments for unsignalized intersections using SUMO and Gym.	21	Experimental	2	Jupyter Notebook
89	davide97l/Pacman Implementation of many popular AI algorithms to play the game of Pacman such...	21	Experimental	13	Python
90	omarathon/rl-multi-agent-car-parking simulation/RL - multi-agent car parking using reinforcement learning	20	Experimental	12	C#
91	BinLee26/InterAgent [CVPR2026] InterAgent: Physics-based Multi-agent Command Execution via...	20	Experimental	1	Python
92	SafeRoboticsLab/Who_Plays_First Repository for "Who Plays First? Optimizing the Order of Play in Stackelberg...	20	Experimental	18	Python
93	uzumstanley/Multi-Agent-AI-Researcher Multi-Agent-AI-Researcher-Powered-by-DeepSeek-R1-main	19	Experimental	12	Python
94	studiofarzulla/friction-marl Multi-Agent Reinforcement Learning with Friction Dynamics — code companion...	19	Experimental	—	Python
95	Lucien-MG/gym-codingame A high-performance Gymnasium wrapper for CodinGame engines, enabling...	19	Experimental	—	Python
96	SinyZXJ/COMPASS COMPASS: Cooperative Multi-Agent Persistent Surveillance using...	19	Experimental	6	Python
97	NaimurRahman-Niaz02/Ultimate-Tic-Tac-Toe-with-AI-Agent Ultimate Tic-Tac-Toe is a strategic web game featuring an AI opponent where...	19	Experimental	—	JavaScript
98	DongChen06/PVRL Photovoltaic control using RL methods.	18	Experimental	7	MATLAB
99	Wadaboa/flatland-challenge Multi-agent reinforcement learning on trains, for Deep Learning class at UNIBO	18	Experimental	21	TeX
100	ZhuohuiZhang/TGCNet This is the official implementation of [AAAI'25 Oral] accepted paper:...	18	Experimental	11	Python
101	Cognitive-AI-Systems/mats-lp [AAAI-2024] MATS-LP addresses the challenging problem of decentralized...	18	Experimental	29	C++
102	ElectroCubic/Multi-Agent-Pathfinding-Sim An interactive multi-agent pathfinding simulator in Python using Pygame, and...	17	Experimental	—	Python
103	osamahmada2024/3D-Maze-Space-Arena A high-performance 3D maze simulation featuring 9 AI pathfinding algorithms,...	17	Experimental	2	Python
104	nicoleorzan/marl-mo Multi-Objective Multi-Agent RL with non-linear utility functions	17	Experimental	2	Python
105	jellyheadandrew/autoresearch-robotics Autonomous robotics research with simulation feedback	17	Experimental	3	Python
106	CognitiveAISystems/mats-lp [AAAI-2024] MATS-LP addresses the challenging problem of decentralized...	17	Experimental	2	—
107	Piyushi-0/Fair-MAMAB Code for our AAMAS '25 oral paper, 'Multi-agent Multi-armed Bandits with...	16	Experimental	1	Jupyter Notebook
108	sahajrajmalla/dhumbal-ai Code and simulations for "Optimizing AI Agents for Dhumbal," the first AI...	16	Experimental	1	Jupyter Notebook
109	aiaaee/Stochastic-Policy-Iteration-in-Markov-Environments This project implements policy iteration in a stochastic environment using...	16	Experimental	1	Jupyter Notebook
110	pspanoudakis/Berkeley-Pacman-Projects Berkeley Pac-Man 🤤◽◽◽👻 projects 0, 1 & 2 solutions	16	Experimental	4	Python
111	iamvigneshwars/ai-walkers-ppo-pytorch AI agent learns to walk, run, hop and crawl with out any given data using...	16	Experimental	3	Python
112	stillonearth/bevy_rl_shooter Multi-Agent FPS Gym Environment with bevy_rl	16	Experimental	24	Rust
113	ormai/hypersonic Bomberman-like, turn-based game played by two competing AI Agents	15	Experimental	—	Python
114	azizi-zahra/xoshift-ai-agent AI Project - An AI agent for playing XOShift with Python (Spring 2025)	15	Experimental	7	Python
115	metazoic/hierlearning HierLearning is a C++11 implementation of a multi-agent, hierarchical...	15	Experimental	2	C++
116	imbulana/multi-agent-perceiver Multi-Agent Perceiver Critic for Robotic Warehouse (RWARE) Coordination.	15	Experimental	—	Python
117	SvetLuna-Lab/Highrise-fire-uav-response-demo- Simulation demo: coordinating a small fleet of UAVs to suppress fires on...	15	Experimental	—	Python
118	iam-weijie/alphataxx AI agent that plays Ataxx	15	Experimental	—	Python
119	Nikhil-Singla/go-playing-agent Go Playing AI Agent: A sophisticated artificial intelligence system that...	15	Experimental	—	Python
120	jianzhnie/RLZero A clean and easy implementation of MuZero, AlphaZero and Self-Play...	15	Experimental	17	Python
121	rap-lab-org/public_pymcpf-d Multi-Agent Combinatorial Path Finding with Heterogeneous Task Duration (MCPF-D)	15	Experimental	6	Python
122	reubenwong97/NFSP-PEG-GridWorld Implementation of Neural Fictitious Self-Play for a GridWorld based...	15	Experimental	1	Jupyter Notebook
123	damn8daniel/multi-agent-rl Multi-Agent RL: MAPPO, cooperative environments, centralized training...	14	Experimental	—	Python
124	Chris-airobot/adversarial-rl-project Research project on adversarial reinforcement learning, PPO training, and...	14	Experimental	—	Python
125	marojeff123/Snake-double-deep-Q-learning 🐍 Implement deep Q-learning to enhance the Snake game experience, enabling...	14	Experimental	—	Python
126	alextousss/wargames Two agents shooting at each other, controlled by a neural network optimized...	14	Experimental	24	Python
127	minhtoan-tran/nckh2026-uet-warehouse-robots Student research: Multi-robot warehouse optimization (Simulation & Physical...	14	Experimental	—	Python
128	kvr06-ai/trust-based-public-goods-game Multi-agent simulation of a public goods game with trust dynamics and...	14	Experimental	3	Python
129	pulakk/ConflictAvoidantCBS-MAPF Conflict Avoidant CBS (CA-CBS)	14	Experimental	9	C#
130	LuddeWessen/assembly-robot-manager-minizinc A scheduler to manage a multi tool dual arm robot while avoiding arm-to-arm...	14	Experimental	11	Python
131	Dophinjet/lunarlander-dqn-comparison 🚀 Analyze and compare value-based deep reinforcement learning algorithms on...	14	Experimental	—	Jupyter Notebook
132	MAYANK12-WQ/multi-agent-robotics-lab 5 AI agents collaborate in real-time to research, implement, test, review,...	14	Experimental	—	Python
133	camlischke1/marl-anomaly-detect This project tests multiple different machine learning algorithms that can...	14	Experimental	1	Python
134	jaychampaneri14/multi-agent-sim Competitive and cooperative multi-agent RL environment	14	Experimental	—	Python
135	gagan0116/Multi_Agent_Traffic_Control_RL Decentralized multi-agent RL traffic signal control using Dueling DQN + GAT...	14	Experimental	—	Jupyter Notebook
136	DavidMANZI-093/RexAI A Chrome Dinosaur game clone powered by NEAT (NeuroEvolution of Augmenting...	14	Experimental	4	Python
137	kennycornellius-collab/SnakeGameRL Training a MLP and CNN based policy to compare the performance in a game of snake	14	Experimental	—	Python
138	alpc91/SMERL [ICML 2024] Official environments and JAX-implementations for...	13	Experimental	6	Python
139	GUT-AI/multi-robot-path-planning Multi-Robot Path Planning	13	Experimental	5	—
140	Sebastian-Griesbach/Minimax-Multi-Agent-Deep-Deterministic-Policy-Gradient A general pytorch implementation of the Minimax Multi-Agent Deep...	13	Experimental	5	Python
141	alessandrositta/Flatland_challenge Repository containing the code and explanation of a solution to the Flatland...	13	Experimental	8	Python
142	jyotishp/multiagent-collision-avoidance Decentralized multi-agent collision avoidance	12	Experimental	12	MATLAB
143	ghayda-njaafreh/tetris-ml-agent Tetris game in Python (Pygame) with an ML-based agent + training/testing curves.	12	Experimental	1	Python
144	tuan-nv0505/Snake-Q-learning Q-learning for playing Snake game	12	Experimental	9	Python
145	wilrop/ramo Algorithms for computing or learning equilibria in multi-objective games	12	Experimental	4	Python
146	YohannTPN/CrossyRoadAI AI agent that learns to play Crossy Road using genetic algorithms and neural...	12	Experimental	1	Java
147	muhammadwaheedairi/hackathon_textbook_ai_robotics Personal AI-Robotics portfolio: ROS 2, Gazebo, NVIDIA Isaac, VLA systems —...	12	Experimental	1	Shell
148	ben-ogden/rllib-trading-arena 🏟️ A competitive multi-agent trading arena using RLlib + Ray	12	Experimental	1	Python
149	damat-le/mage Multi-Agent Grid Environment (MAGE)	12	Experimental	3	Python
150	hansman/multi-agent-reinforcement-learning Multi-Agent Reinforcement Learning with Deep Sarsa Agents	12	Experimental	3	JavaScript
151	gkc741/Snake-AI Snake AI-agent using Neuroevolution	12	Experimental	1	C
152	tuan-nv0505/Snake-Deep-Q-Learning Deep Q-learning (DQL) for playing Snake game	11	Experimental	8	Python
153	gdalle/Flatland.jl A barebones Julia version of the Flatland railway simulator	11	Experimental	2	Julia
154	Farid-Karimi/Shover Shover-World is a grid-based reinforcement learning environment built on the...	11	Experimental	—	Python
155	Rana-inan/WizardOfWor-AiRemake A Python remake of the classic Wizard of Wor game with AI-controlled players.	11	Experimental	—	Python
156	khanbilal-devop/intelligent-agent-frameworks Implementations of AI search algorithms for goal-based agents. Includes...	11	Experimental	—	Python
157	SorerBOT/Watering-Problem Using Planning, Markov Decision Processes, Reinforcement Learning and other...	11	Experimental	—	Python
158	Jeffawe/Space-Shooter Space Shooter Retro Game built using Amazon Q	11	Experimental	—	Python
159	lu-m-dev/python-games A collection of classic games with human and AI agents	11	Experimental	—	Python
160	h24abdal/tic-tac-toe-reinforcement-learning Reinforcement learning solution for tic tac toe implemented in python.	11	Experimental	—	Jupyter Notebook
161	sakshampandey1901/MountainCar A Deep Q-Network on Gymnasium’s MountainCar-v0 with reward shaping and...	11	Experimental	—	Python
162	irgidev/kaggle-connect-x-agent An AI agent designed to play the 'Connect X' game from the Kaggle...	11	Experimental	—	Jupyter Notebook
163	tene04/Neuroevolution_vs_DeepQ-Learning Comparison of Neuroevolution and DQL to train agents on gym environment (Flappy bird)	11	Experimental	—	Jupyter Notebook
164	plss12/Connect-X-AlphaZero Reinforcement learning agents for Connect4, featuring a robust AlphaZero...	11	Experimental	—	Python
165	alexisjapas/mystic-square Multi-agent mystic square game solver	11	Experimental	2	Python
166	Dodo2k01/HeartsGame University School Project that required us to implement a game engine, ui,...	11	Experimental	—	Java
167	ItsOrv/Ai-Society Complete RL implementation from scratch: neural networks, PPO, 3D...	11	Experimental	—	Python
168	superboySB/matrix-game-baselines learning value-based method, started by one-step matrix games	10	Experimental	1	Python