Grid World Q-Learning ML Frameworks

Educational implementations of Q-learning and reinforcement learning algorithms applied to grid-based navigation environments. Includes projects demonstrating value iteration, SARSA, and related RL techniques in controlled simulations. Does NOT include general reinforcement learning frameworks, complex game AI, or non-grid-based environments.

There are 32 grid world q-learning frameworks tracked. 1 score above 50 (established tier). The highest-rated is Farama-Foundation/stable-retro at 54/100 with 354 stars.

Get all 32 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=grid-world-q-learning&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 Farama-Foundation/stable-retro

Retro games for Reinforcement Learning Research

54
Established
2 MatPoliquin/stable-retro-scripts

Train models on retro games. AI vs AI contest. Pytorch C++ plugin for...

43
Emerging
3 nadavbh12/Retro-Learning-Environment

The Retro Learning Environment (RLE) -- a learning framework for AI

40
Emerging
4 svpino/cs7641-assignment4

CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes

34
Emerging
5 Avalon-Benchmark/avalon

A 3D video game environment and benchmark designed from scratch for...

33
Emerging
6 abhinavcreed13/ai-reinforcement-learning

This project will implement value iteration and Q-learning. It will first...

28
Experimental
7 i01000101/Q-Learning-Visualizer

An AI that learns to solve mazes with Q-Learning algorithm 🧠

24
Experimental
8 PhilippOesch/QuickRLGym.js

A small TypeScript/ JavaScript Framework for implementing, testing, and...

23
Experimental
9 F-a-b-r-i-z-i-o/maze-game

Q-learning project where an agent learns by himself to find the exit inside...

23
Experimental
10 LukaszMalucha/Pathfinder-AI

Flask RESTful App with Q-learning AI algorithm

22
Experimental
11 pcluteijn/MazeLearning

Q-Learing algorithm solves simple mazes.

22
Experimental
12 tristansokol/Bobcats

Code used for the OpenAI Retro Contest for team Bobcats, spring 2018

22
Experimental
13 vlfom/StepLearn

Applying Markov Decision Processes and Q-Learning to a robot movement model

18
Experimental
14 RodneyShag/GridWorldMDP

Uses Markov decision processes (MDPs) and Temporal Difference (TD)...

17
Experimental
15 htglvl/Final_prj

Auto CS2 bot using reinforcement learning

16
Experimental
16 bzznrc/rl-toybox

Tiny collection of minimal RL arcade-style games with shared training code

16
Experimental
17 EricChen0104/ppo-icm-maze-exploration

A curiosity-driven PPO + ICM reinforcement learning agent for autonomous...

15
Experimental
18 tctibbs/matchbox-rl

Matchbox-RL: A tangible reinforcement learning library for Python. Based on...

15
Experimental
19 cabe9/NLBProject

Reproducible NLB'21 mc_maze benchmark with lagged PCA latent regression and...

14
Experimental
20 thanhld94/Reinforcement-Learning

Reinforcement Learning and Adaptive Resonance Theory

14
Experimental
21 gaut2172/TreasureHuntGame

AI pathfinding project using deep reinforcement learning. Deep Q-learning...

14
Experimental
22 Hysperr/gridworld

Given infinite possibilities, an AI agent learns the optimal path towards...

14
Experimental
23 mbeps/custom-q-learning-agent

Custom Reinforcement Learning agent built using Q-Learning. This is a...

13
Experimental
24 LeoMartinezTAMUK/Markov_Decision_Process

This project implements a Markov Decision Process (MDP) using Reinforcement...

12
Experimental
25 ms-daniel/Reinforcement-Learning

Program created in java with swing interface.

12
Experimental
26 KriChau95/rl-cnn-maze-localization

This project implements and compares strategies for self-localization within...

11
Experimental
27 yaacov/qlearner

Q learning algorithm example in python

11
Experimental
28 gtoubassi/SimpleReinforcementLearning

A demonstration of table based, SARSA reinforcement learning for a simple...

11
Experimental
29 Styleflo/Gridword-DQN

Gridword pour l'entraînement d'un DQN, afin de comprendre son fonctionnement

11
Experimental
30 MariaEduardaSampaio/TP3_Fundamentos-IA

Reinforcement Learning project implementing Q-Learning and optimal policy...

11
Experimental
31 Panithecracker/Generic-Markov-decision-process-solver

Creation of a generic program capable of computing the optimal policy for an...

10
Experimental
32 HarunaHaju/RL-STG

Play STG with Reinforcement Learning

10
Experimental