Grid World Q-Learning ML Frameworks
Educational implementations of Q-learning and reinforcement learning algorithms applied to grid-based navigation environments. Includes projects demonstrating value iteration, SARSA, and related RL techniques in controlled simulations. Does NOT include general reinforcement learning frameworks, complex game AI, or non-grid-based environments.
There are 32 grid world q-learning frameworks tracked. 1 score above 50 (established tier). The highest-rated is Farama-Foundation/stable-retro at 54/100 with 354 stars.
Get all 32 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=grid-world-q-learning&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
Farama-Foundation/stable-retro
Retro games for Reinforcement Learning Research |
|
Established |
| 2 |
MatPoliquin/stable-retro-scripts
Train models on retro games. AI vs AI contest. Pytorch C++ plugin for... |
|
Emerging |
| 3 |
nadavbh12/Retro-Learning-Environment
The Retro Learning Environment (RLE) -- a learning framework for AI |
|
Emerging |
| 4 |
svpino/cs7641-assignment4
CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes |
|
Emerging |
| 5 |
Avalon-Benchmark/avalon
A 3D video game environment and benchmark designed from scratch for... |
|
Emerging |
| 6 |
abhinavcreed13/ai-reinforcement-learning
This project will implement value iteration and Q-learning. It will first... |
|
Experimental |
| 7 |
i01000101/Q-Learning-Visualizer
An AI that learns to solve mazes with Q-Learning algorithm 🧠 |
|
Experimental |
| 8 |
PhilippOesch/QuickRLGym.js
A small TypeScript/ JavaScript Framework for implementing, testing, and... |
|
Experimental |
| 9 |
F-a-b-r-i-z-i-o/maze-game
Q-learning project where an agent learns by himself to find the exit inside... |
|
Experimental |
| 10 |
LukaszMalucha/Pathfinder-AI
Flask RESTful App with Q-learning AI algorithm |
|
Experimental |
| 11 |
pcluteijn/MazeLearning
Q-Learing algorithm solves simple mazes. |
|
Experimental |
| 12 |
tristansokol/Bobcats
Code used for the OpenAI Retro Contest for team Bobcats, spring 2018 |
|
Experimental |
| 13 |
vlfom/StepLearn
Applying Markov Decision Processes and Q-Learning to a robot movement model |
|
Experimental |
| 14 |
RodneyShag/GridWorldMDP
Uses Markov decision processes (MDPs) and Temporal Difference (TD)... |
|
Experimental |
| 15 |
htglvl/Final_prj
Auto CS2 bot using reinforcement learning |
|
Experimental |
| 16 |
bzznrc/rl-toybox
Tiny collection of minimal RL arcade-style games with shared training code |
|
Experimental |
| 17 |
EricChen0104/ppo-icm-maze-exploration
A curiosity-driven PPO + ICM reinforcement learning agent for autonomous... |
|
Experimental |
| 18 |
tctibbs/matchbox-rl
Matchbox-RL: A tangible reinforcement learning library for Python. Based on... |
|
Experimental |
| 19 |
cabe9/NLBProject
Reproducible NLB'21 mc_maze benchmark with lagged PCA latent regression and... |
|
Experimental |
| 20 |
thanhld94/Reinforcement-Learning
Reinforcement Learning and Adaptive Resonance Theory |
|
Experimental |
| 21 |
gaut2172/TreasureHuntGame
AI pathfinding project using deep reinforcement learning. Deep Q-learning... |
|
Experimental |
| 22 |
Hysperr/gridworld
Given infinite possibilities, an AI agent learns the optimal path towards... |
|
Experimental |
| 23 |
mbeps/custom-q-learning-agent
Custom Reinforcement Learning agent built using Q-Learning. This is a... |
|
Experimental |
| 24 |
LeoMartinezTAMUK/Markov_Decision_Process
This project implements a Markov Decision Process (MDP) using Reinforcement... |
|
Experimental |
| 25 |
ms-daniel/Reinforcement-Learning
Program created in java with swing interface. |
|
Experimental |
| 26 |
KriChau95/rl-cnn-maze-localization
This project implements and compares strategies for self-localization within... |
|
Experimental |
| 27 |
yaacov/qlearner
Q learning algorithm example in python |
|
Experimental |
| 28 |
gtoubassi/SimpleReinforcementLearning
A demonstration of table based, SARSA reinforcement learning for a simple... |
|
Experimental |
| 29 |
Styleflo/Gridword-DQN
Gridword pour l'entraînement d'un DQN, afin de comprendre son fonctionnement |
|
Experimental |
| 30 |
MariaEduardaSampaio/TP3_Fundamentos-IA
Reinforcement Learning project implementing Q-Learning and optimal policy... |
|
Experimental |
| 31 |
Panithecracker/Generic-Markov-decision-process-solver
Creation of a generic program capable of computing the optimal policy for an... |
|
Experimental |
| 32 |
HarunaHaju/RL-STG
Play STG with Reinforcement Learning |
|
Experimental |