Game-Playing Agents
AI agents that learn to play and solve games through search algorithms, reinforcement learning, and game-tree exploration. Includes implementations for board and arcade games (Connect Four, Mancala, Pac-Man), puzzle games (8-Puzzle), and classic game-playing techniques (Minimax, Alpha-Beta pruning, MCTS, DQN). Does NOT include general reinforcement learning frameworks, non-game simulations, or agent orchestration platforms.
There are 168 game-playing agents tracked. 3 score above 70 (verified tier). The highest-rated is facebookresearch/BenchMARL at 76/100 with 580 stars and 785 monthly downloads. 3 of the top 10 are actively maintained.
Get all 168 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=agents&subcategory=game-playing-agents&limit=20"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
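The same request can be made from Python. The sketch below uses only the standard library and the endpoint and query parameters from the curl command above. The helper names `build_url` and `fetch_projects` are illustrative, not part of any published client, and the shape of the JSON response is not documented here, so the result is returned as parsed without assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example; everything else is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the request URL with the same query parameters as the curl example."""
    params = {
        "domain": "agents",
        "subcategory": "game-playing-agents",
        "limit": limit,
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit=20, timeout=10):
    """Fetch the listing and return the parsed JSON (schema not assumed)."""
    with urlopen(build_url(limit), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_projects()` performs one request against the daily quota. Whether a `limit` larger than 20 is accepted is not stated on this page.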
| # | Agent | Score | Tier |
|---|---|---|---|
| 1 | facebookresearch/BenchMARL: BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning... | | Verified |
| 2 | datamllab/rlcard: Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc,... | | Verified |
| 3 | Toni-SM/skrl: Modular Reinforcement Learning (RL) library (implemented in PyTorch, JAX,... | | Verified |
| 4 | utiasDSL/gym-pybullet-drones: PyBullet Gymnasium environments for single and multi-agent reinforcement... | | Established |
| 5 | koulanurag/ma-gym: A collection of multi agent environments based on OpenAI gym. | | Established |
| 6 | AgileRL/AgileRL: Streamlining reinforcement learning with RLOps. State-of-the-art RL... | | Established |
| 7 | gtri/scrimmage: Multi-Agent Robotics Simulator | | Established |
| 8 | proroklab/VectorizedMultiAgentSimulator: VMAS is a vectorized differentiable simulator designed for efficient... | | Established |
| 9 | idsc-frazzoli/dg-commons: Driving games common tools | | Established |
| 10 | APLA-Toolbox/pymapf: 📍🗺️ A Python library for Multi-Agents Planning and Pathfinding (Centralized... | | Established |
| 11 | microsoft/maro: Multi-Agent Resource Optimization (MARO) platform is an instance of... | | Established |
| 12 | marlbenchmark/on-policy: This is the official implementation of Multi-Agent PPO (MAPPO). | | Established |
| 13 | geek-ai/MAgent: A Platform for Many-Agent Reinforcement Learning | | Established |
| 14 | PathPlanning/Continuous-CBS: Continuous CBS - a modification of conflict based search algorithm, that... | | Established |
| 15 | bark-simulator/bark: Open-Source Framework for Development, Simulation and Benchmarking of... | | Emerging |
| 16 | semitable/robotic-warehouse: Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment | | Emerging |
| 17 | mit-acl/mader: Trajectory Planner in Multi-Agent and Dynamic Environments | | Emerging |
| 18 | ArnaudFickinger/gym-multigrid: Lightweight multi-agent gridworld Gym environment | | Emerging |
| 19 | PathPlanning/AA-SIPP-m: Algorithm for prioritized multi-agent path finding (MAPF) in grid-worlds.... | | Emerging |
| 20 | multi-commander/Multi-Commander: Multi & Single Agent Reinforcement Learning for Traffic Signal Control Problem | | Emerging |
| 21 | garlicdevs/Fruit-API: A Universal Deep Reinforcement Learning Framework | | Emerging |
| 22 | woctezuma/puissance4: AI for the game "Connect Four". Available on PyPI. | | Emerging |
| 23 | SMARTlab-Purdue/robotarium-rendezvous-RSSDOA: This repository contains the Matlab source codes (to use in Robotarium... | | Emerging |
| 24 | Pieter-Cawood/M-TA-Prioritized-MAPD: Multi-Agent Pickup and Delivery implementation | | Emerging |
| 25 | asieradzk/RL_Matrix: Deep Reinforcement Learning in C# | | Emerging |
| 26 | crowddynamics/crowddynamics: Continuous-time multi-agent crowd simulation engine implemented in Python... | | Emerging |
| 27 | sair-lab/formation: [T-Cyber 2020] Cooperative Pursuit with Multi-pursuer and One Faster... | | Emerging |
| 28 | DamianoBrunori/DAMIAN-Delay-Aware-MultI-Aerial-Navigation-DRL-based-environment-: An OpenAIGym-based framework allowing to test Delay-Aware Deep Reinforcement... | | Emerging |
| 29 | opendilab/GoBigger: [ICLR 2023] Come & try Decision-Intelligence version of "Agar"! Gobigger... | | Emerging |
| 30 | abhisheknaik96/MultiAgentTORCS: The multi-agent version of TORCS for developing control algorithms for fully... | | Emerging |
| 31 | gdalle/MultiAgentPathFinding.jl: Structures and algorithms for Multi-Agent PathFinding in Julia | | Emerging |
| 32 | opendilab/ACE: [AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative... | | Emerging |
| 33 | santifiorino/dino-reinforcement-learning: Evolutionary Reinforcement Learning for Dino Game: Train an AI agent to... | | Emerging |
| 34 | ProValarous/Predator-Prey-Archetype-Gridworld-Environment: A minimalist, discrete multi-agent predator-prey archetype environment. | | Emerging |
| 35 | kyegomez/OpioidRL: OpioidRL is a cutting-edge reinforcement learning (RL) library that... | | Emerging |
| 36 | zhixiangli/gomoku-battle: Gomoku Battle is a cross-language cross-system battle platform. | | Emerging |
| 37 | JacopoPan/gym-marl-reconnaissance: Gym environment for cooperative multi-agent reinforcement learning in... | | Emerging |
| 38 | KaleabTessera/HyperMARL: Adaptive Hypernetworks for Multi-Agent RL. NeurIPS 2025. | | Emerging |
| 39 | trajectoryRL/trajectoryRL: Bittensor Subnet 11 - Decentralized Reinforcement Learning for optimizing... | | Emerging |
| 40 | DamianoBrunori/MultiUAV-OpenAIGym: An OpenAIGym-based framework allowing to test hybrid approaches (RL + path... | | Emerging |
| 41 | puyuan1996/MARL: Implementation for mSAC methods in PyTorch | | Emerging |
| 42 | KornbergFresnel/ModelRepo: reproduce some RL or Multi-Agent models | | Emerging |
| 43 | samshipengs/Coordinated-Multi-Agent-Imitation-Learning: This is an implementation of the paper "Coordinated Multi Agent Imitation... | | Emerging |
| 44 | orlov-ai/beer-game-env: Beer Game implemented as an OpenAI gym environment. | | Emerging |
| 45 | TARTRL/RankingCost: The Ranking Cost algorithm for multi-path routing in gridworlds (multi-agent path planning, circuit routing). | | Emerging |
| 46 | kooktaelee/D2OC: Python and MATLAB codes for Density-Driven Optimal Control (D2OC) using... | | Emerging |
| 47 | hanzheteng/pioneer_mrs: A flexible development platform for Pioneer Multi-Robot Systems (MRS)... | | Emerging |
| 48 | James4Ever0/vimgolf-gym: OpenAI gym style Vimgolf environment and benchmark for AI | | Emerging |
| 49 | DeepGym/deepgym: RL training environments with verifiable rewards for coding agents. Works... | | Experimental |
| 50 | zombie-einstein/esquilax: JAX Multi-Agent RL, Neuro-Evolution, and A-Life Library | | Experimental |
| 51 | IanRDavies/LeMOL: Experimenting with meta-learning approaches to opponent modelling in MARL.... | | Experimental |
| 52 | T3AS/MAD-ARL: Python project for the paper "Adversarial Deep Reinforcement Learning for... | | Experimental |
| 53 | DiligentPanda/MAPF-LRR2023: This is the repo for the team Pikachu's solution in the League of Robot... | | Experimental |
| 54 | VideojogosLusofona/color-shape-links-ai-competition: AI competition for IEEE CoG 2021 | | Experimental |
| 55 | kaiyoo/AI-agent-Azul-Game-Competition: AI agent game competition - Reinforcement learning (Monte Carlo Tree Search,... | | Experimental |
| 56 | EttoreCaputo/hadron-game: This is a game AI project for the course "Artificial Intelligence" at the... | | Experimental |
| 57 | opendilab/Gobigger-Explore: Still struggling with the high threshold or looking for the appropriate... | | Experimental |
| 58 | zzbuzzard/boxjump: Box Jump is a co-operative multi-agent reinforcement learning environment! | | Experimental |
| 59 | zagoli/MultiAgentPathFinding: Implementation of Sven Koenig's class project about MAPF | | Experimental |
| 60 | T3AS/Benchmarking-QRS-2022: Implementation of "Evaluating the Robustness of Deep Reinforcement Learning... | | Experimental |
| 61 | LijunSun90/pursuitMatrixWorld: Multi-agent pursuit in matrix world (pursuitMW) | | Experimental |
| 62 | NcJie/multiagent-ddpg: Multi-agent DDPG on ml-agents environment | | Experimental |
| 63 | ajheshbasnet/reinforcement-learning-agents: a collection of advanced reinforcement learning (rl) agents and... | | Experimental |
| 64 | ethanmclark1/blocksworld3d: 3D version of the classic Blocksworld environment for reinforcement learning | | Experimental |
| 65 | T3AS/ReMAV: Implementation of "ReMAV: Reward Modeling of Autonomous Vehicles for Finding... | | Experimental |
| 66 | SafeRoboticsLab/opinion_game: Multi-agent coordination using game theory and nonlinear opinion dynamics - CDC 2023 | | Experimental |
| 67 | zhouker94/Multi-Agent-DRL: Multiagent deep reinforcement learning research project | | Experimental |
| 68 | AdrianDiepeveen/Deep-Reinforcement-Learning-Multi-Agent-Autonomous-Drone-Emergency-Response-System: Deep reinforcement learning system for coordinating autonomous drones in... | | Experimental |
| 69 | praveen-palanisamy/macad-agents: Agents code for Multi-Agent Connected Autonomous Driving (MACAD) described... | | Experimental |
| 70 | bjmwang/minority_game: A concise Python 3 implementation of the minority game | | Experimental |
| 71 | collisonda/fyp-matlab-code: MEng Final Year Project - Reinforcement Learned Collision Avoidance | | Experimental |
| 72 | omron-sinicx/ncf2: Official implementation of "Counterfactual Fairness Filter for Fair-Delay... | | Experimental |
| 73 | sukhitashvili/pong: A reimplementation of Andrej Karpathy's repository for an RL self-learning... | | Experimental |
| 74 | laperdida23/go-playing-agent: 🤖 Master Go on a 5x5 board with this AI agent using advanced algorithms for... | | Experimental |
| 75 | Eation5/Reinforcement-Learning-Environments: Collection of custom reinforcement learning environments and agents... | | Experimental |
| 76 | oft2026/mspm: Unofficial PyTorch reproduction of MSPM: A Modularized and Scalable... | | Experimental |
| 77 | zhy0/dmarket_rl: Fast single unit, double auction market for reinforcement learning | | Experimental |
| 78 | jeffasante/RepoGym: A reinforcement learning environment for AI coding agents built from real... | | Experimental |
| 79 | legalaspro/unity_multiagent_rl: Multi-agent reinforcement learning framework for Unity environments.... | | Experimental |
| 80 | infinitycloud-ch/roboticprogramai: Cognitive robotics: Unitree Go2 + Isaac Sim 5.1.0 + ROS2 on DGX Spark... | | Experimental |
| 81 | FareedKhan-dev/ai-gaming-agent: A step by step implementation of building an AI agent that plays 3d shooting game | | Experimental |
| 82 | Hanny658/MAVSPOI: Multi-Agent Voting Scheme for real-time POI Recommendation. With API-based... | | Experimental |
| 83 | martinchapman/hands: Run AI Search Game (Hide-and-Seek) simulations as a decision-support tool | | Experimental |
| 84 | biological-alignment-benchmarks/zoo_to_gym_multiagent_adapter: Enables you to convert a PettingZoo environment to a Gym environment while... | | Experimental |
| 85 | LukasSchaefer/MSc_Curiosity_MARL: MSc Informatics dissertation project - University of Edinburgh: Curiosity in... | | Experimental |
| 86 | CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting: We use policy gradient to help agents learn optimal policies in a... | | Experimental |
| 87 | william-dan/rl-elevator: Event‑driven elevator dispatch Gymnasium environment with FIFO/LOOK... | | Experimental |
| 88 | aldoeliacim/sumo-mappo: MAPPO experiments for unsignalized intersections using SUMO and Gym. | | Experimental |
| 89 | davide97l/Pacman: Implementation of many popular AI algorithms to play the game of Pacman such... | | Experimental |
| 90 | omarathon/rl-multi-agent-car-parking: simulation/RL - multi-agent car parking using reinforcement learning | | Experimental |
| 91 | BinLee26/InterAgent: [CVPR2026] InterAgent: Physics-based Multi-agent Command Execution via... | | Experimental |
| 92 | SafeRoboticsLab/Who_Plays_First: Repository for "Who Plays First? Optimizing the Order of Play in Stackelberg... | | Experimental |
| 93 | uzumstanley/Multi-Agent-AI-Researcher: Multi-Agent-AI-Researcher-Powered-by-DeepSeek-R1-main | | Experimental |
| 94 | studiofarzulla/friction-marl: Multi-Agent Reinforcement Learning with Friction Dynamics — code companion... | | Experimental |
| 95 | Lucien-MG/gym-codingame: A high-performance Gymnasium wrapper for CodinGame engines, enabling... | | Experimental |
| 96 | SinyZXJ/COMPASS: COMPASS: Cooperative Multi-Agent Persistent Surveillance using... | | Experimental |
| 97 | NaimurRahman-Niaz02/Ultimate-Tic-Tac-Toe-with-AI-Agent: Ultimate Tic-Tac-Toe is a strategic web game featuring an AI opponent where... | | Experimental |
| 98 | DongChen06/PVRL: Photovoltaic control using RL methods. | | Experimental |
| 99 | Wadaboa/flatland-challenge: Multi-agent reinforcement learning on trains, for Deep Learning class at UNIBO | | Experimental |
| 100 | ZhuohuiZhang/TGCNet: This is the official implementation of [AAAI'25 Oral] accepted paper:... | | Experimental |
| 101 | Cognitive-AI-Systems/mats-lp: [AAAI-2024] MATS-LP addresses the challenging problem of decentralized... | | Experimental |
| 102 | ElectroCubic/Multi-Agent-Pathfinding-Sim: An interactive multi-agent pathfinding simulator in Python using Pygame, and... | | Experimental |
| 103 | osamahmada2024/3D-Maze-Space-Arena: A high-performance 3D maze simulation featuring 9 AI pathfinding algorithms,... | | Experimental |
| 104 | nicoleorzan/marl-mo: Multi-Objective Multi-Agent RL with non-linear utility functions | | Experimental |
| 105 | jellyheadandrew/autoresearch-robotics: Autonomous robotics research with simulation feedback | | Experimental |
| 106 | CognitiveAISystems/mats-lp: [AAAI-2024] MATS-LP addresses the challenging problem of decentralized... | | Experimental |
| 107 | Piyushi-0/Fair-MAMAB: Code for our AAMAS '25 oral paper, 'Multi-agent Multi-armed Bandits with... | | Experimental |
| 108 | sahajrajmalla/dhumbal-ai: Code and simulations for "Optimizing AI Agents for Dhumbal," the first AI... | | Experimental |
| 109 | aiaaee/Stochastic-Policy-Iteration-in-Markov-Environments: This project implements policy iteration in a stochastic environment using... | | Experimental |
| 110 | pspanoudakis/Berkeley-Pacman-Projects: Berkeley Pac-Man 🤤◽◽◽👻 projects 0, 1 & 2 solutions | | Experimental |
| 111 | iamvigneshwars/ai-walkers-ppo-pytorch: AI agent learns to walk, run, hop and crawl without any given data using... | | Experimental |
| 112 | stillonearth/bevy_rl_shooter: Multi-Agent FPS Gym Environment with bevy_rl | | Experimental |
| 113 | ormai/hypersonic: Bomberman-like, turn-based game played by two competing AI Agents | | Experimental |
| 114 | azizi-zahra/xoshift-ai-agent: AI Project - An AI agent for playing XOShift with Python (Spring 2025) | | Experimental |
| 115 | metazoic/hierlearning: HierLearning is a C++11 implementation of a multi-agent, hierarchical... | | Experimental |
| 116 | imbulana/multi-agent-perceiver: Multi-Agent Perceiver Critic for Robotic Warehouse (RWARE) Coordination. | | Experimental |
| 117 | SvetLuna-Lab/Highrise-fire-uav-response-demo-: Simulation demo: coordinating a small fleet of UAVs to suppress fires on... | | Experimental |
| 118 | iam-weijie/alphataxx: AI agent that plays Ataxx | | Experimental |
| 119 | Nikhil-Singla/go-playing-agent: Go Playing AI Agent: A sophisticated artificial intelligence system that... | | Experimental |
| 120 | jianzhnie/RLZero: A clean and easy implementation of MuZero, AlphaZero and Self-Play... | | Experimental |
| 121 | rap-lab-org/public_pymcpf-d: Multi-Agent Combinatorial Path Finding with Heterogeneous Task Duration (MCPF-D) | | Experimental |
| 122 | reubenwong97/NFSP-PEG-GridWorld: Implementation of Neural Fictitious Self-Play for a GridWorld based... | | Experimental |
| 123 | damn8daniel/multi-agent-rl: Multi-Agent RL: MAPPO, cooperative environments, centralized training... | | Experimental |
| 124 | Chris-airobot/adversarial-rl-project: Research project on adversarial reinforcement learning, PPO training, and... | | Experimental |
| 125 | marojeff123/Snake-double-deep-Q-learning: 🐍 Implement deep Q-learning to enhance the Snake game experience, enabling... | | Experimental |
| 126 | alextousss/wargames: Two agents shooting at each other, controlled by a neural network optimized... | | Experimental |
| 127 | minhtoan-tran/nckh2026-uet-warehouse-robots: Student research: Multi-robot warehouse optimization (Simulation & Physical... | | Experimental |
| 128 | kvr06-ai/trust-based-public-goods-game: Multi-agent simulation of a public goods game with trust dynamics and... | | Experimental |
| 129 | pulakk/ConflictAvoidantCBS-MAPF: Conflict Avoidant CBS (CA-CBS) | | Experimental |
| 130 | LuddeWessen/assembly-robot-manager-minizinc: A scheduler to manage a multi tool dual arm robot while avoiding arm-to-arm... | | Experimental |
| 131 | Dophinjet/lunarlander-dqn-comparison: 🚀 Analyze and compare value-based deep reinforcement learning algorithms on... | | Experimental |
| 132 | MAYANK12-WQ/multi-agent-robotics-lab: 5 AI agents collaborate in real-time to research, implement, test, review,... | | Experimental |
| 133 | camlischke1/marl-anomaly-detect: This project tests multiple different machine learning algorithms that can... | | Experimental |
| 134 | jaychampaneri14/multi-agent-sim: Competitive and cooperative multi-agent RL environment | | Experimental |
| 135 | gagan0116/Multi_Agent_Traffic_Control_RL: Decentralized multi-agent RL traffic signal control using Dueling DQN + GAT... | | Experimental |
| 136 | DavidMANZI-093/RexAI: A Chrome Dinosaur game clone powered by NEAT (NeuroEvolution of Augmenting... | | Experimental |
| 137 | kennycornellius-collab/SnakeGameRL: Training a MLP and CNN based policy to compare the performance in a game of snake | | Experimental |
| 138 | alpc91/SMERL: [ICML 2024] Official environments and JAX-implementations for... | | Experimental |
| 139 | GUT-AI/multi-robot-path-planning: Multi-Robot Path Planning | | Experimental |
| 140 | Sebastian-Griesbach/Minimax-Multi-Agent-Deep-Deterministic-Policy-Gradient: A general pytorch implementation of the Minimax Multi-Agent Deep... | | Experimental |
| 141 | alessandrositta/Flatland_challenge: Repository containing the code and explanation of a solution to the Flatland... | | Experimental |
| 142 | jyotishp/multiagent-collision-avoidance: Decentralized multi-agent collision avoidance | | Experimental |
| 143 | ghayda-njaafreh/tetris-ml-agent: Tetris game in Python (Pygame) with an ML-based agent + training/testing curves. | | Experimental |
| 144 | tuan-nv0505/Snake-Q-learning: Q-learning for playing Snake game | | Experimental |
| 145 | wilrop/ramo: Algorithms for computing or learning equilibria in multi-objective games | | Experimental |
| 146 | YohannTPN/CrossyRoadAI: AI agent that learns to play Crossy Road using genetic algorithms and neural... | | Experimental |
| 147 | muhammadwaheedairi/hackathon_textbook_ai_robotics: Personal AI-Robotics portfolio: ROS 2, Gazebo, NVIDIA Isaac, VLA systems —... | | Experimental |
| 148 | ben-ogden/rllib-trading-arena: 🏟️ A competitive multi-agent trading arena using RLlib + Ray | | Experimental |
| 149 | damat-le/mage: Multi-Agent Grid Environment (MAGE) | | Experimental |
| 150 | hansman/multi-agent-reinforcement-learning: Multi-Agent Reinforcement Learning with Deep Sarsa Agents | | Experimental |
| 151 | gkc741/Snake-AI: Snake AI-agent using Neuroevolution | | Experimental |
| 152 | tuan-nv0505/Snake-Deep-Q-Learning: Deep Q-learning (DQL) for playing Snake game | | Experimental |
| 153 | gdalle/Flatland.jl: A barebones Julia version of the Flatland railway simulator | | Experimental |
| 154 | Farid-Karimi/Shover: Shover-World is a grid-based reinforcement learning environment built on the... | | Experimental |
| 155 | Rana-inan/WizardOfWor-AiRemake: A Python remake of the classic Wizard of Wor game with AI-controlled players. | | Experimental |
| 156 | khanbilal-devop/intelligent-agent-frameworks: Implementations of AI search algorithms for goal-based agents. Includes... | | Experimental |
| 157 | SorerBOT/Watering-Problem: Using Planning, Markov Decision Processes, Reinforcement Learning and other... | | Experimental |
| 158 | Jeffawe/Space-Shooter: Space Shooter Retro Game built using Amazon Q | | Experimental |
| 159 | lu-m-dev/python-games: A collection of classic games with human and AI agents | | Experimental |
| 160 | h24abdal/tic-tac-toe-reinforcement-learning: Reinforcement learning solution for tic tac toe implemented in python. | | Experimental |
| 161 | sakshampandey1901/MountainCar: A Deep Q-Network on Gymnasium’s MountainCar-v0 with reward shaping and... | | Experimental |
| 162 | irgidev/kaggle-connect-x-agent: An AI agent designed to play the 'Connect X' game from the Kaggle... | | Experimental |
| 163 | tene04/Neuroevolution_vs_DeepQ-Learning: Comparison of Neuroevolution and DQL to train agents on gym environment (Flappy bird) | | Experimental |
| 164 | plss12/Connect-X-AlphaZero: Reinforcement learning agents for Connect4, featuring a robust AlphaZero... | | Experimental |
| 165 | alexisjapas/mystic-square: Multi-agent mystic square game solver | | Experimental |
| 166 | Dodo2k01/HeartsGame: University School Project that required us to implement a game engine, ui,... | | Experimental |
| 167 | ItsOrv/Ai-Society: Complete RL implementation from scratch: neural networks, PPO, 3D... | | Experimental |
| 168 | superboySB/matrix-game-baselines: learning value-based method, started by one-step matrix games | | Experimental |