Grid World Q-Learning ML Frameworks

Educational implementations of Q-learning and reinforcement learning algorithms applied to grid-based navigation environments. Includes projects demonstrating value iteration, SARSA, and related RL techniques in controlled simulations. Does NOT include general reinforcement learning frameworks, complex game AI, or non-grid-based environments.

There are 32 grid world q-learning frameworks tracked. 1 score above 50 (established tier). The highest-rated is Farama-Foundation/stable-retro at 54/100 with 354 stars.

Get all 32 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=grid-world-q-learning&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Framework	Score	Tier	Stars	Language
1	Farama-Foundation/stable-retro Retro games for Reinforcement Learning Research	54	Established	354	C++
2	MatPoliquin/stable-retro-scripts Train models on retro games. AI vs AI contest. Pytorch C++ plugin for...	43	Emerging	39	C++
3	nadavbh12/Retro-Learning-Environment The Retro Learning Environment (RLE) -- a learning framework for AI	40	Emerging	186	C++
4	svpino/cs7641-assignment4 CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes	34	Emerging	14	Java
5	Avalon-Benchmark/avalon A 3D video game environment and benchmark designed from scratch for...	33	Emerging	190	Jupyter Notebook
6	abhinavcreed13/ai-reinforcement-learning This project will implement value iteration and Q-learning. It will first...	28	Experimental	4	Python
7	i01000101/Q-Learning-Visualizer An AI that learns to solve mazes with Q-Learning algorithm 🧠	24	Experimental	3	JavaScript
8	PhilippOesch/QuickRLGym.js A small TypeScript/ JavaScript Framework for implementing, testing, and...	23	Experimental	2	TypeScript
9	F-a-b-r-i-z-i-o/maze-game Q-learning project where an agent learns by himself to find the exit inside...	23	Experimental	5	Python
10	LukaszMalucha/Pathfinder-AI Flask RESTful App with Q-learning AI algorithm	22	Experimental	9	Python
11	pcluteijn/MazeLearning Q-Learing algorithm solves simple mazes.	22	Experimental	7	MATLAB
12	tristansokol/Bobcats Code used for the OpenAI Retro Contest for team Bobcats, spring 2018	22	Experimental	1	Jupyter Notebook
13	vlfom/StepLearn Applying Markov Decision Processes and Q-Learning to a robot movement model	18	Experimental	4	Java
14	RodneyShag/GridWorldMDP Uses Markov decision processes (MDPs) and Temporal Difference (TD)...	17	Experimental	3	Java
15	htglvl/Final_prj Auto CS2 bot using reinforcement learning	16	Experimental	2	Jupyter Notebook
16	bzznrc/rl-toybox Tiny collection of minimal RL arcade-style games with shared training code	16	Experimental	2	Python
17	EricChen0104/ppo-icm-maze-exploration A curiosity-driven PPO + ICM reinforcement learning agent for autonomous...	15	Experimental	8	Python
18	tctibbs/matchbox-rl Matchbox-RL: A tangible reinforcement learning library for Python. Based on...	15	Experimental	—	Python
19	cabe9/NLBProject Reproducible NLB'21 mc_maze benchmark with lagged PCA latent regression and...	14	Experimental	—	Python
20	thanhld94/Reinforcement-Learning Reinforcement Learning and Adaptive Resonance Theory	14	Experimental	1	Java
21	gaut2172/TreasureHuntGame AI pathfinding project using deep reinforcement learning. Deep Q-learning...	14	Experimental	7	Python
22	Hysperr/gridworld Given infinite possibilities, an AI agent learns the optimal path towards...	14	Experimental	1	Java
23	mbeps/custom-q-learning-agent Custom Reinforcement Learning agent built using Q-Learning. This is a...	13	Experimental	2	Python
24	LeoMartinezTAMUK/Markov_Decision_Process This project implements a Markov Decision Process (MDP) using Reinforcement...	12	Experimental	3	Python
25	ms-daniel/Reinforcement-Learning Program created in java with swing interface.	12	Experimental	3	Java
26	KriChau95/rl-cnn-maze-localization This project implements and compares strategies for self-localization within...	11	Experimental	—	Jupyter Notebook
27	yaacov/qlearner Q learning algorithm example in python	11	Experimental	—	Python
28	gtoubassi/SimpleReinforcementLearning A demonstration of table based, SARSA reinforcement learning for a simple...	11	Experimental	2	Java
29	Styleflo/Gridword-DQN Gridword pour l'entraînement d'un DQN, afin de comprendre son fonctionnement	11	Experimental	—	Jupyter Notebook
30	MariaEduardaSampaio/TP3_Fundamentos-IA Reinforcement Learning project implementing Q-Learning and optimal policy...	11	Experimental	—	Jupyter Notebook
31	Panithecracker/Generic-Markov-decision-process-solver Creation of a generic program capable of computing the optimal policy for an...	10	Experimental	1	Python
32	HarunaHaju/RL-STG Play STG with Reinforcement Learning	10	Experimental	1	Java