Reinforcement Learning Frameworks

Complete RL algorithm implementations and educational resources for training agents using policy gradient, Q-learning, actor-critic, and other methods. Does NOT include game-playing agents, robotics simulators, or domain-specific RL applications—only the core algorithmic frameworks and tutorials.

There are 290 reinforcement learning frameworks tracked. 3 score above 70 (verified tier). The highest-rated is google-deepmind/dm_control at 86/100 with 4,494 stars and 309,287 monthly downloads. 5 of the top 10 are actively maintained.

Get all 290 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=reinforcement-learning-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Framework	Score	Tier	Stars	Language
1	google-deepmind/dm_control Google DeepMind's software stack for physics-based simulation and...	86	Verified	4,494	Python
2	DLR-RM/stable-baselines3 PyTorch version of Stable Baselines, reliable implementations of...	76	Verified	12,878	Python
3	Denys88/rl_games RL implementations	73	Verified	1,310	Jupyter Notebook
4	pytorch/rl A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.	68	Established	3,335	Python
5	flatland-association/flatland-rl The Flatland Framework is a multi-purpose environment to tackle problems...	64	Established	58	Jupyter Notebook
6	yandexdataschool/Practical_RL A course in reinforcement learning in the wild	64	Established	6,460	Jupyter Notebook
7	takuseno/d3rlpy An offline deep reinforcement learning library	62	Established	1,644	Python
8	keras-rl/keras-rl Deep Reinforcement Learning for Keras.	60	Established	5,554	Python
9	MushroomRL/mushroom-rl Python library for Reinforcement Learning.	59	Established	921	Python
10	qzed/irl-maxent Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning...	58	Established	312	Jupyter Notebook
11	Stable-Baselines-Team/stable-baselines3-contrib Contrib package for Stable-Baselines3 - Experimental reinforcement learning...	57	Established	693	Python
12	PKU-Alignment/omnisafe JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.	56	Established	1,077	Python
13	huggingface/deep-rl-class This repo contains the Hugging Face Deep Reinforcement Learning Course.	55	Established	4,803	MDX
14	MyoHub/myosuite MyoSuite is a collection of environments/tasks to be solved by...	54	Established	1,116	Python
15	google-research/batch-ppo Efficient Batched Reinforcement Learning in TensorFlow	54	Established	975	Python
16	upb-lea/reinforcement_learning_course_materials Lecture notes, tutorial tasks including solutions as well as online videos...	54	Established	1,017	Jupyter Notebook
17	inoryy/reaver Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft...	54	Established	562	Python
18	tensorlayer/RLzoo A Comprehensive Reinforcement Learning Zoo for Simple Usage 🚀	53	Established	644	Python
19	lucidrains/streaming-deep-rl Explorations into the proposed Streaming Deep Reinforcement Learning, from...	52	Established	24	Python
20	rlcode/reinforcement-learning Minimal and Clean Reinforcement Learning Examples	51	Established	3,621	Python
21	MorvanZhou/Reinforcement-learning-with-tensorflow Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学	51	Established	9,435	Python
22	sweetice/Deep-reinforcement-learning-with-pytorch PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO,...	51	Established	4,589	Python
23	MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning This is the homepage of a new book entitled "Mathematical Foundations of...	51	Established	14,922	MATLAB
24	ikostrikov/pytorch-a2c-ppo-acktr-gail PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy...	51	Established	3,879	Python
25	ShangtongZhang/reinforcement-learning-an-introduction Python Implementation of Reinforcement Learning: An Introduction	51	Established	14,587	Python
26	SforAiDl/genrl A PyTorch reinforcement learning library for generalizable and reproducible...	50	Established	412	Python
27	iffiX/machin Reinforcement learning library(framework) designed for PyTorch, implements...	50	Established	419	Python
28	seungeunrho/minimalRL Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)	49	Emerging	3,164	Python
29	danaugrs/huskarl Deep Reinforcement Learning Framework + Algorithms	49	Emerging	415	Python
30	AdamStelmaszczyk/learning2run Our NIPS 2017: Learning to Run source code	48	Emerging	55	Python
31	vwxyzjn/cleanrl High-quality single file implementation of Deep Reinforcement Learning...	48	Emerging	9,286	Python
32	andri27-ts/Reinforcement-Learning Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python....	48	Emerging	4,696	Jupyter Notebook
33	Fraunhofer-IIS/fmugym Interface to connect Reinforcement Learning libraries to Functional Mock-up...	47	Emerging	29	Python
34	danijar/mindpark Testbed for deep reinforcement learning	47	Emerging	162	Python
35	fracapuano/robot-learning-tutorial All the source code for "Robot Learning: A Tutorial". Get involved to be...	47	Emerging	477	TeX
36	TuragaLab/flybody MuJoCo fruit fly body model and locomotion RL tasks	47	Emerging	503	Python
37	rl-tools/rl-tools The Fastest Deep Reinforcement Learning Library	47	Emerging	933	C++
38	CarperAI/trlx A repo for distributed training of language models with Reinforcement...	46	Emerging	4,738	Python
39	ikostrikov/pytorch-a3c PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from...	44	Emerging	1,317	Python
40	keon/deep-q-learning Minimal Deep Q Learning (DQN & DDQN) implementations in Keras	44	Emerging	1,316	Python
41	miyosuda/async_deep_reinforce Asynchronous Methods for Deep Reinforcement Learning	44	Emerging	591	Python
42	danijar/embodied Fast reinforcement learning research	44	Emerging	61	Python
43	RLE-Foundation/rllte Long-Term Evolution Project of Reinforcement Learning	44	Emerging	475	Python
44	stanfordnmbl/osim-rl Reinforcement learning environments with musculoskeletal models	44	Emerging	944	Python
45	archsyscall/DeepRL-TensorFlow2 🐋 Simple implementations of various popular Deep Reinforcement Learning...	44	Emerging	606	Python
46	ankonzoid/LearningX Deep & Classical Reinforcement Learning + Machine Learning Examples in Python	44	Emerging	370	Python
47	heronsystems/adeptRL Reinforcement learning framework to accelerate research	44	Emerging	206	Python
48	pathak22/noreward-rl [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep...	44	Emerging	1,471	Python
49	vmayoral/basic_reinforcement_learning An introductory series to Reinforcement Learning (RL) with comprehensive...	44	Emerging	1,213	Jupyter Notebook
50	dalmia/David-Silver-Reinforcement-learning Notes for the Reinforcement Learning course by David Silver along with...	44	Emerging	849	Jupyter Notebook
51	icoxfog417/baby-steps-of-rl-ja Pythonで学ぶ強化学習 -入門から実践まで- サンプルコード	44	Emerging	452	Jupyter Notebook
52	lucidrains/metacontroller Implementation of the MetaController proposed in "Emergent temporal...	44	Emerging	93	Jupyter Notebook
53	mimoralea/gdrl Grokking Deep Reinforcement Learning	44	Emerging	1,005	Jupyter Notebook
54	nikhilbarhate99/PPO-PyTorch Minimal implementation of clipped objective Proximal Policy Optimization...	43	Emerging	2,320	Python
55	Kaixhin/Rainbow Rainbow: Combining Improvements in Deep Reinforcement Learning	43	Emerging	1,661	Python
56	zyxue/sutton-barto-rl-exercises 📖Learning reinforcement learning by implementing the algorithms from...	43	Emerging	84	Jupyter Notebook
57	simoninithomas/Deep_reinforcement_learning_Course Implementations from the free course Deep Reinforcement Learning with...	43	Emerging	3,904	Jupyter Notebook
58	mimoralea/applied-reinforcement-learning Reinforcement Learning and Decision Making tutorials explained at an...	43	Emerging	331	Jupyter Notebook
59	pat-coady/trpo Trust Region Policy Optimization with TensorFlow and OpenAI Gym	43	Emerging	361	Jupyter Notebook
60	udacity/reinforcement-learning Reinforcement learning material, code and exercises for Udacity Nanodegree programs.	43	Emerging	89	Jupyter Notebook
61	rail-berkeley/softlearning Softlearning is a reinforcement learning framework for training maximum...	43	Emerging	1,413	Python
62	jingweiz/pytorch-rl Deep Reinforcement Learning with pytorch & visdom	43	Emerging	804	Python
63	ikostrikov/pytorch-trpo PyTorch implementation of Trust Region Policy Optimization	43	Emerging	450	Python
64	nrontsis/PILCO Bayesian Reinforcement Learning in Tensorflow	42	Emerging	335	Python
65	denisyarats/pytorch_sac PyTorch implementation of Soft Actor-Critic (SAC)	42	Emerging	591	Jupyter Notebook
66	opendilab/DI-engine-docs DI-engine docs (Chinese and English)	42	Emerging	321	Python
67	rmst/ddpg TensorFlow implementation of the DDPG algorithm from the paper Continuous...	42	Emerging	215	Jupyter Notebook
68	binary-husky/hmp2g Multiagent Reinforcement Learning Research Project	41	Emerging	228	Python
69	XinJingHao/DRL-Pytorch Clean, Robust, and Unified PyTorch implementation of popular Deep...	41	Emerging	3,306	Python
70	rl-language/rlc Bringing reinforcement learning to every day programmers	41	Emerging	62	C++
71	Stable-Baselines-Team/stable-baselines Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of...	41	Emerging	307	Python
72	alessiodm/drl-zh Deep Reinforcement Learning: Zero to Hero!	41	Emerging	2,265	Jupyter Notebook
73	Cloudslab/DLSF [TMC'20] Deep Learning based Scheduler for Stochastic Fog-Cloud computing...	40	Emerging	126	Java
74	HewlettPackard/dc-rl SustainDC is a set of Python environments for Data Center simulation and...	40	Emerging	95	HTML
75	ericyangyu/PPO-for-Beginners A simple and well styled PPO implementation. Based on my Medium series:...	40	Emerging	1,219	Python
76	adrianwix/pybasin pyBasin is a Python library for estimating basin stability in dynamical...	40	Emerging	4	Python
77	gordicaleksa/pytorch-learn-reinforcement-learning A collection of various RL algorithms like policy gradients, DQN and PPO....	40	Emerging	161	Python
78	godka/Pensieve-PPO The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art...	39	Emerging	87	DIGITAL Command Language
79	ItoMasaki/PixyzRL A Bayesian RL Framework with Probabilistic Generative Models	39	Emerging	10	Python
80	SuhailSama/MR_RL Gym Simulator for Magnetic Micro Robots	39	Emerging	6	Python
81	TianhongDai/reinforcement-learning-algorithms This repository contains most of pytorch implementation based classic deep...	39	Emerging	693	Python
82	sebastianbrzustowicz/Robot-Sumo-RL Python + PyTorch. Advanced Reinforcement Learning (SAC/PPO/A2C) for...	38	Emerging	14	Python
83	google-deepmind/dm_env A Python interface for reinforcement learning environments	38	Emerging	394	Python
84	medipixel/rl_algorithms Structural implementation of RL key algorithms	38	Emerging	516	Python
85	Anjum48/rl-examples Examples of published reinforcement learning algorithms in recent literature...	38	Emerging	103	Python
86	IBM/LOA Neuro-Symbolic Reinforcement Learning: Logical Optimal Action (LOA), a novel...	38	Emerging	56	Python
87	gabrielhuang/reptile-pytorch A PyTorch implementation of OpenAI's REPTILE algorithm	38	Emerging	220	Jupyter Notebook
88	UoA-CARES/cares_reinforcement_learning CARES Reinforcement Learning Package	38	Emerging	39	Python
89	yihaosun1124/OfflineRL-Kit An elegant PyTorch offline reinforcement learning library for researchers.	38	Emerging	384	Python
90	mahyaret/kuka_rl Reinforcement Learning Experiments using PyBullet	37	Emerging	136	Jupyter Notebook
91	huangwl18/modular-rl [ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular...	37	Emerging	232	Jupyter Notebook
92	denisyarats/drq DrQ: Data regularized Q	37	Emerging	419	Jupyter Notebook
93	khushhallchandra/pytorch-rl Pytorch Implementation of RL algorithms	37	Emerging	15	Python
94	DeNA/HandyRL HandyRL is a handy and simple framework based on Python and PyTorch for...	37	Emerging	304	Python
95	tayalmanan28/Safe_Reinforcement_Learning Repository containing the code for safe reinforcement learning in two custom...	36	Emerging	46	Python
96	yrlu/irl-imitation Implementation of Inverse Reinforcement Learning (IRL) algorithms in...	36	Emerging	667	Python
97	rlgraph/rlgraph RLgraph: Modular computation graphs for deep reinforcement learning	36	Emerging	323	Python
98	mohmdelsayed/streaming-drl Deep reinforcement learning without experience replay, target networks, or...	36	Emerging	279	Python
99	omerbsezer/Reinforcement_learning_tutorial_with_demo Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration),...	36	Emerging	790	Jupyter Notebook
100	asystemoffields/disco-torch A PyTorch port of DeepMind's Disco103 — the meta-learned reinforcement...	36	Emerging	9	Python
101	sudharsan13296/Deep-Reinforcement-Learning-With-Python Master classic RL, deep RL, distributional RL, inverse RL, and more using...	36	Emerging	464	Jupyter Notebook
102	Shaswat2001/maple-robotics MAPLE (Model and Policy Learning Evaluation) - A unified CLI daemon for...	36	Emerging	7	Python
103	dvalenciar/ReinforceUI-Studio ReinforceUI-Studio. A Python-based application designed to simplify the...	35	Emerging	76	Python
104	Bellman-devs/bellman Model-based reinforcement learning in TensorFlow	35	Emerging	56	Python
105	andrewliao11/Deep-Reinforcement-Learning-Survey My Exploration on Deep Reinforcement Learning Survey	35	Emerging	435	—
106	Learning4Optimization-HUST/H-TSP Official implementation of H-TSP (AAAI2023)	35	Emerging	56	Python
107	mitre/ilpyt ilpyt: imitation learning library with modular, baseline implementations in Pytorch	35	Emerging	18	Python
108	MaartenGr/ReinLife Creating Artificial Life with Reinforcement Learning	34	Emerging	84	Python
109	MarcoMeter/recurrent-ppo-truncated-bptt Baseline implementation of recurrent PPO using truncated BPTT	34	Emerging	160	Jupyter Notebook
110	Kaixhin/imitation-learning Imitation learning algorithms	34	Emerging	562	Python
111	NatLabRockies/graph-env Reinforcement learning for combinatorial optimization over directed graphs	34	Emerging	43	Python
112	airboxlab/rllib-energyplus Simple EnergyPlus environments for control optimization using reinforcement learning	34	Emerging	55	Python
113	denisyarats/proto Proto-RL: Reinforcement Learning with Prototypical Representations	34	Emerging	86	Python
114	rmst/rlrd PyTorch implementation of our paper Reinforcement Learning with Random...	34	Emerging	42	Python
115	thanhkaist/CCFDM1 CCFDM reinforcement learning	34	Emerging	40	Python
116	tirthajyoti/RL_basics Basic Reinforcement Learning algorithms	33	Emerging	19	Jupyter Notebook
117	rllab-snu/Deep-Reinforcement-Learning Introduction to Deep Reinforcement Learning	33	Emerging	88	Jupyter Notebook
118	nsidn98/NICE Combining Reinforcement Learning with Integer Programming for Robust Scheduling	33	Emerging	30	Python
119	whoiszyc/IntelliHealer IntelliHealer: An imitation and reinforcement learning platform for...	33	Emerging	32	Python
120	zuoxingdong/lagom lagom: A PyTorch infrastructure for rapid prototyping of reinforcement...	33	Emerging	378	Jupyter Notebook
121	TheoLvs/reinforcement-learning Personal experiments on Reinforcement Learning	33	Emerging	119	Jupyter Notebook
122	araffin/rl-handson-rlvs21 Stable-Baselines3 (SB3) reinforcement learning tutorial for the...	33	Emerging	58	Jupyter Notebook
123	MishaLaskin/rad RAD: Reinforcement Learning with Augmented Data	33	Emerging	416	Jupyter Notebook
124	antonpuz/DeROL Deep Reinforcement One-Shot Learning Framework for Artificially Intelligent...	33	Emerging	36	Python
125	RLE-Foundation/RLeXplore RLeXplore provides stable baselines of exploration methods in reinforcement...	32	Emerging	459	Jupyter Notebook
126	luisgarciar/3D-bin-packing Solving the 3D bin packing problem with reinforcement learning	32	Emerging	61	Jupyter Notebook
127	EsratMaria/Reinforcement-Learning_for_Energy_Minimization_Using_CLoudsim Implementation of RL in the cloud for energy minimization due to migration...	32	Emerging	30	HTML
128	UlisseMini/procgen-tools Tools for running experiments on RL agents in procgen environments	32	Emerging	20	Jupyter Notebook
129	921kiyo/symbolic-rl Symbolic Reinforcement Learning using Inductive Logic Programming	32	Emerging	63	Lasso
130	Zhenye-Na/advanced-deep-learning-and-reinforcement-learning-deepmind 🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind \|...	32	Emerging	158	Jupyter Notebook
131	sdpkjc/abcdrl Modular Single-file Reinfocement Learning Algorithms Library	32	Emerging	38	Python
132	YuvrajSingh-mist/NeatRL Repository of implementations of classic and sota rl algorithms from scratch...	31	Emerging	221	Python
133	AdamStelmaszczyk/rl-tutorial Source code for "A deep dive into reinforcement learning"	30	Emerging	13	Python
134	astier/model-free-episodic-control Model-Free-Episodic-Control implementation.	30	Emerging	17	Python
135	takuseno/minerva An out-of-the-box GUI tool for offline deep reinforcement learning	30	Emerging	102	JavaScript
136	affaan-m/Behavioral_RL Reinforcement Learning with human behavioral biases integration	30	Emerging	12	HTML
137	LAMDA-RL/ODIS The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent...	30	Emerging	46	Python
138	reward-scope-ai/reward-scope Real-time reward debugging and hacking detection for reinforcement learning	30	Emerging	18	Python
139	appgym/appgym Mobile Apps (Android) as Environment for Reinforcement Learning Agents	30	Emerging	10	Jupyter Notebook
140	anassinator/pddp WIP implementation of Probabilistic Differential Dynamic Programming in PyTorch	30	Emerging	16	Jupyter Notebook
141	dalmia/udacity-deep-reinforcement-learning My solutions to the projects (and mini-projects) of the Deep Reinforcement...	30	Emerging	63	Jupyter Notebook
142	VachanVY/Reinforcement-Learning PyTorch implementations of algorithms from "Reinforcement Learning: An...	30	Emerging	204	Python
143	chengxi600/RLStuff A collection of reinforcement learning algorithm implementations	30	Emerging	64	Jupyter Notebook
144	BY571/CQL PyTorch implementation of the Offline Reinforcement Learning algorithm CQL....	29	Experimental	148	Python
145	kochlisGit/Shadow-Hand-Controller Construction of controllers for Shadow-Hand in Mujoco environment, using...	29	Experimental	22	Python
146	saqib1707/RL-PPO-PyTorch Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch	29	Experimental	13	Python
147	denisyarats/exorl ExORL: Exploratory Data for Offline Reinforcement Learning	29	Experimental	129	Python
148	shehio/rl Implementing RL agents, one algorithm at a time	29	Experimental	9	Python
149	navneet-nmk/pytorch-rl This repository contains model-free deep reinforcement learning algorithms...	29	Experimental	452	Python
150	BNN-UPC/ENERO Code used in the paper "ENERO: Efficient real-time WAN routing optimization...	29	Experimental	33	Python
151	Alee08/multiagent-rl-rm The Multi-Agent RLRM (Reinforcement Learning with Reward Machines) Framework...	28	Experimental	7	Python
152	jayLEE0301/dhrl_official Official code for "DHRL: A Graph-Based Approach for Long-Horizon and Sparse...	28	Experimental	34	Python
153	lucadellalib/actorch Deep reinforcement learning framework for fast prototyping based on PyTorch	28	Experimental	14	Python
154	navneet-nmk/Pytorch-RL-CPP A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)	27	Experimental	101	C++
155	VladGavra98/SERL Safety-informed Evolutionary Reinforcement Learning applied to...	27	Experimental	10	Python
156	teepanis/nonlinear-pendulum Data and Code Availability -- Universal spectral structure in pendulum-like systems	27	Experimental	1	Jupyter Notebook
157	schmidtdominik/Rainbow Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient...	27	Experimental	44	Python
158	opium-sh/prl Open-source library for a reinforcement learning research.	27	Experimental	54	Python
159	harshaljanjani/taskschedulingdqn Designing energy-aware scheduling and task allocation algorithms for online...	27	Experimental	11	Jupyter Notebook
160	goktug97/PEPG-ES Python Implementation of Parameter-exploring Policy Gradients Evolution Strategy	27	Experimental	17	Python
161	amr-khaled164/GRLMSL 🚀 Optimize microservice instance selection and load balancing in edge...	27	Experimental	1	Python
162	Brownwang0426/Reversal-Generative-Reinforcement-Learning A simple model-free and value-function-free reinforcement learning model	26	Experimental	6	Python
163	nnaisense/pgpelib A mini library for Policy Gradients with Parameter-based Exploration, with...	26	Experimental	73	Python
164	AdamStelmaszczyk/dqn TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)	26	Experimental	40	Python
165	jimimvp/torch_rl Reinforcement learning library for PyTorch.	26	Experimental	11	Python
166	NYU-MLDA/ABC-RL This is work-in-progress (WIP) refactored implementation of...	26	Experimental	8	Verilog
167	matthieu637/ddrl Deep Developmental Reinforcement Learning	25	Experimental	29	C++
168	haron1100/Upside-Down-Reinforcement-Learning Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch	25	Experimental	27	Jupyter Notebook
169	lucaslingle/pytorch_rl2 Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'	25	Experimental	72	Python
170	ialexmp/DRL-Generalization Exploring Generalization in Deep Reinforcement Learning algorithms for...	25	Experimental	5	Python
171	cgel/DRL A collection of Deep Reinforcement Learning algorithms implemented in...	25	Experimental	29	Python
172	FabioMiguel2000/LOA-feat.Reinforcement-Learning Assigment 2 for Course L.EIC029 Artificial Intelligence, FEUP LEIC 3rd Year...	25	Experimental	3	Python
173	Asap7772/PTR This repository contains the implementation of the PTR algorithm described...	25	Experimental	32	Python
174	CLAIRE-Labo/no-representation-no-trust Codebase to fully reproduce the results of "No Representation, No Trust:...	25	Experimental	31	Python
175	mbchang/decentralized-rl Decentralized Reinforcment Learning: Global Decision-Making via Local...	24	Experimental	43	Python
176	0xnu/deep-reinforcement-learning Deep Reinforcement Learning (DRL)	24	Experimental	1	Jupyter Notebook
177	Skw3mdy/Reinforcement-Learning-Projects 🤖 Explore reinforcement learning techniques with projects including a taxi...	24	Experimental	2	Jupyter Notebook
178	linker81/Reinforcement-Learning-CheatSheet Cheatsheet of Reinforcement Learning (Based on Sutton-Barto Book - 2nd Edition)	24	Experimental	59	TeX
179	Jaehyun-Jeong/100LinesRL Clean RL algorithm implementations in under 100 lines each.	23	Experimental	1	Python
180	Defenser1337/Reinforcement-learning-for-Gradient-descent Application of reinforcement learning to train hyperparameters of gradient...	23	Experimental	1	Jupyter Notebook
181	voaneves/colab-rl Keras implementation of the latest Reinforcement Learning algorithms, ready...	23	Experimental	6	Jupyter Notebook
182	Naighten/track-simulator Код магистрантской дипломной работы студента НГТУ им Р.Е. Алексеева Жукова...	23	Experimental	1	Python
183	kyegomez/HindsightReplay My implementation of Hindsight replay in PyTorch: "Hindsight Experience Replay"	23	Experimental	6	Python
184	ProfessorNova/PPO-Humanoid PPO implementation for controlling a humanoid in Gymnasium's Mujoco...	23	Experimental	31	Python
185	WinDerek/reinforce-py Reinforcement learning agents in Python (dynamic programming,...	23	Experimental	2	Jupyter Notebook
186	Daraan/ray_utilities ray & RLlib tools for unified code across different repositories....	23	Experimental	6	Python
187	Now-Join-Us/V0 The code repository for "$V_0$: A Generalist Value Model for Any Policy at...	23	Experimental	5	Python
188	JeepWay/DeepPack Unofficial implementation of DeepPack in PyTorch. DeepPack is a deep...	23	Experimental	6	Python
189	enjeeneer/zero-shot-rl VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low...	23	Experimental	26	Python
190	mindspore-courses/Rainbow-MindSpore About Rainbow-MindSpore! A step-by-step tutorial from DQN to Rainbow	23	Experimental	6	Jupyter Notebook
191	davirenner88-rgb/LR-S 🚀 Emulate Arknights: Endfield servers with LR-S for seamless game...	23	Experimental	1	Zig
192	Space-Robotics-Laboratory/rlstar RL STaR is a platform for creating AI for robotic applications. Researchers...	22	Experimental	32	Python
193	motokiomura/Q-DOT [RLC 2025] Official code repository for "Offline Reinforcement Learning with...	22	Experimental	3	Python
194	KeepALifeUS/ml-dqn Rainbow DQN: Double, Dueling, PER, Noisy Nets. Atari benchmarks. PyTorch.	22	Experimental	3	Python
195	nunesma/reinforcement_learning Deep reinforcement learning techniques for artificial intelligence project	22	Experimental	1	Jupyter Notebook
196	32olaa/reward-scope 🔍 Detect reward hacking in RL training with RewardScope. Track reward...	22	Experimental	—	Python
197	HGVAbyte/rlhf-data-agent-full 🔍 Generate synthetic preference-ranked datasets for RLHF and DPO training,...	22	Experimental	—	Python
198	AlirezaShamsoshoara/RL-from-zero Comprehensive collection of reinforcement learning algorithms implemented...	22	Experimental	—	Python
199	mlnjsh/Reinforcement_Learning_Projects 20 RL basics notebooks + 10 advanced projects with Streamlit apps covering...	22	Experimental	—	Jupyter Notebook
200	Rudge0/DynaMO-RL Optimize policy learning by dynamically allocating rollouts and modulating...	22	Experimental	—	Python
201	mercurycontaminated-sandarac557/KnapsackRL 🎯 Optimize exploration budgets in Reinforcement Learning with KnapsackRL for...	22	Experimental	—	Python
202	rickstaa/stable-learning-control A framework for training theoretically stable (and robust) Reinforcement...	22	Experimental	7	Python
203	tartavull/alfredo Relentlessly learning, persistently failing, but never surrendering.	22	Experimental	9	Python
204	ErickRosete/tacorl TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning	22	Experimental	30	Python
205	rbosh/ml-adp Approximate dynamic programming for stochastic optimal control in Pytorch	22	Experimental	24	Python
206	zhuzhipeng-123/reinforce-study-for-mmm Reinforcement Learning Research - Exploring RL algorithms in practical scenarios	22	Experimental	—	—
207	Yuxing-Wang-THU/Surrogate-assisted-ERL A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning	21	Experimental	15	Python
208	dlb-rl/pulse-rl Code for PulseRL: Enabling Offline Reinforcement Learning for Digital...	21	Experimental	8	Python
209	NVlabs/RL-TNCO RL-TNCO: A reinforcement learning algorithm for solving the tensor network...	21	Experimental	10	Python
210	cubrink/mujoco-2.1-rl-project Implementing Deep Reinforcement Learning Algorithms in Python for use in the...	20	Experimental	17	TeX
211	MiscellaneousStuff/tlol-rl TLoL (Reinforcement Learning Python Module) - League of Legends RL Module...	20	Experimental	19	Python
212	JaydenTeoh/MORL-Generalization Benchmark for evaluating the generalization capabilities of Multi-Objective...	20	Experimental	26	Python
213	enginBozkurt/Deep-Reinforcement-Learning-for-Enterprise-Nanodegree Udacity Deep Reinforcement Learning for Enterprise Nanodegree Projects	20	Experimental	8	Jupyter Notebook
214	mohmdelsayed/TinyRL Real-Time Deep RL That Fits in Small Devices	20	Experimental	1	C++
215	shivakanthsujit/reducible-loss Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss	20	Experimental	12	Python
216	natetsang/open-rl Implementations of a large collection of reinforcement learning algorithms.	20	Experimental	28	Python
217	Uzi-gpu/reinforcement-learning Reinforcement Learning projects with Q-Learning, Actor-Critic, and REINFORCE...	19	Experimental	—	Jupyter Notebook
218	Axel-Bravo/19_udacity_drlnd Deep Reinforcement Learning Nanodregree from Udacity	19	Experimental	3	Jupyter Notebook
219	ugr-sail/paper-drl_building Supplementary material to the paper "An experimental evaluation of Deep...	19	Experimental	20	HTML
220	HzcIrving/DLRL-PlayGround The code repo contains multiple code reproduction processes of various SOTA...	19	Experimental	37	Jupyter Notebook
221	declanoller/cat_mouse_continuous_RL Using DDPG and A2C reinforcement learning algorithms to solve a math puzzle	19	Experimental	10	Python
222	sapanz/Udacity-deep-reinforcement-learning-solution This repo will cover most of machine learning algorithms with coding examples.	19	Experimental	4	Jupyter Notebook
223	prototwin/RLExamples PotoTwin Reinforcement Learning Examples	18	Experimental	40	Python
224	dalmia/P2_Continuous_Control My solution code for the second project of Udacity's Deep Reinforcement...	18	Experimental	5	ASP
225	trunghng/reinforcement_learning_an_introduction Python Implementation for problems in Reinforcement Learning - An Introduction book	17	Experimental	5	Python
226	iliasoroka1/GRU_Lyapunov_Spectrum Lyapunov Spectrum for Double Pendulum using GRU	17	Experimental	2	Jupyter Notebook
227	bmazoure/ppo_jax Jax implementation of Proximal Policy Optimization (PPO) specifically tuned...	17	Experimental	59	Python
228	rafelps/learning-recursive-goal-proposal Learning Recursive Goal Proposal: A hierarchical Reinforcement Learning Approach	17	Experimental	4	Python
229	dayyass/rllib Reinforcement Learning Library.	16	Experimental	29	Python
230	adaptive-intelligent-robotics/HTE This is the repository for the paper Hierarchical Quality-Diversity for...	16	Experimental	4	C++
231	soovittt/RL-Studio A full-stack platform for designing reinforcement learning environments,...	16	Experimental	1	TypeScript
232	ankitsharma-tech/Deep-Reinforcement-Learning-With-Pytorch PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3.	16	Experimental	12	Python
233	rorofaiz/awesome-RLVR-boundary 🔍 Explore curated resources on Reinforcement Learning with Verifiable...	16	Experimental	2	—
234	andranik-sahakyan/team-tron-rl Multi-Agent Reinforcement Learning project exploring the emergence and...	16	Experimental	3	Jupyter Notebook
235	mindspore-courses/Deep-Reinforcement-Learning-Algorithms-with-MindSpore MindSpore implementations of deep reinforcement learning algorithms and environments	16	Experimental	16	Python
236	rStar-RL/LoongRL LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts...	16	Experimental	13	Python
237	ARgruny/Deep-Reinforcement-Learning Build and test DRL algorithms in different environments	15	Experimental	2	Jupyter Notebook
238	ashworks1706/kaelum LATS-based inference with a reward model and online policy router across...	15	Experimental	6	Python
239	mlnjsh/rl-book-labs 🎮 Interactive browser-based labs for "Complete Reinforcement Learning...	15	Experimental	1	Jupyter Notebook
240	GTR-GAMES/Deep-Hierarchical-Planning 🔍 Implement efficient long-horizon task planning with this PyTorch...	15	Experimental	1	Python
241	victor369basu/MyosuiteDDQN In this repository, we try to solve musculoskeletal tasks with `Double DQN...	15	Experimental	17	Python
242	icaros-usc/dqd-rl Official implementation of "Approximating Gradients for Differentiable...	15	Experimental	22	Python
243	motokiomura/annealed-q-learning [ICML 2025] Official code repository for "Gradual Transition from Bellman...	15	Experimental	8	Python
244	liyan2015/SUMO-RL-MobiCharger OpenAI-gym-like Reinforcement Learning environment for Dispatching of Mobile...	15	Experimental	15	Python
245	xValentim/ReinforcementLearning_Zero_to_Hero_Course In this repository you will learn all the basic math about Reinforcement...	15	Experimental	6	Jupyter Notebook
246	fareskhlifi/Intelligent-Scheduling-using-Reinforcement-learning-and-Deep-Q-Networks Implementing a new environment in Gymnasium for intelligent schduling	15	Experimental	6	Jupyter Notebook
247	aminkhani/Deep-RL You can see a reference for Books, Articles, Courses and Educational...	15	Experimental	20	Jupyter Notebook
248	snthomps/rlhf-ppo-pipeline RLHF/PPO Training Pipeline with Performance Profiling and Optimization Demonstrations	15	Experimental	—	JavaScript
249	hmomin/PPO-Winter-Run Trains an agent with Proximal Policy Optimization (PPO) to beat Winter Run	15	Experimental	23	TypeScript
250	bay3s/ppo-parallel Parallelized implementation of Proximal Policy Optimization (PPO).	15	Experimental	1	Python
251	undextrois/reinforcement-learning RL Experiments and what not	15	Experimental	—	Python
252	silviomori/udacity-deep-reinforcement-learning-p2-continuous-control Create and train a double-jointed arm agent that is able to maintain its...	15	Experimental	1	Jupyter Notebook
253	mhahsler/Introduction_to_Reinforcement_Learning Material for an introduction course to reinforcement learning for compute scientists	15	Experimental	1	Jupyter Notebook
254	Lare1998/rl-for-robotics Reinforcement Learning applications for robotic control and task automation.	14	Experimental	—	Python
255	yelurebajrang/HeteroRL_GEPO ⚡ Optimize heterogeneous reinforcement learning with GEPO for decentralized...	14	Experimental	—	Python
256	Madid1976/reinforcement-learning-agents Implementations of various reinforcement learning algorithms and agents for...	14	Experimental	—	Python
257	kodok13/Label-Free-RLVR 📚 Explore a curated collection of research on Label-Free Reinforcement...	14	Experimental	—	—
258	a7med3laa/DRL-Books-resources Deep Reinforcement Learning Books and links for studying	14	Experimental	1	—
259	julia-bel/MAPF_G2RL Implementation of the G2RL approach in the POGEMA environment	14	Experimental	13	Jupyter Notebook
260	uzumstanley/DEEP-LEARNING UNIVERSITY OF ROEHAMPTON LONDON	14	Experimental	12	Jupyter Notebook
261	Jcorrieri/multiagent-gridworld Deep Reinforcement Learning for Multi-Robot Path Planning using PyTorch, Ray...	14	Experimental	4	Python
262	BackpropTools/BackpropTools A Fast, Portable Deep Reinforcement Learning Library for Continuous Control	14	Experimental	13	C++
263	TroddenSpade/Exhaustive-Reinforcement-Learning Exhaustive Implementation of Algorithms, Key Papers, and Well-Known Problems...	14	Experimental	12	Jupyter Notebook
264	PatrickSinger99/ReinforcementLearningInventoryManagement Repository for my bachelor thesis on inventory management in a logistics...	14	Experimental	9	Jupyter Notebook
265	Aryia-Behroziuan/Robot-learning In developmental robotics, robot learning algorithms generate their own...	14	Experimental	9	—
266	AndersonPeng/ppo_tutorial PPO pytorch tutorial for continuous control (BipedalWalker-v3)	13	Experimental	11	Jupyter Notebook
267	manjavacas/rl-temario Temario sobre aprendizaje por refuerzo en español.	13	Experimental	5	Typst
268	mbar0075/Advanced-Reinforcement-Learning Deliverables relating to the Advanced Reinforcement Learning University Unit	13	Experimental	6	Jupyter Notebook
269	gabotechs/lazaro Reinforcement learning framework for implementing custom models on custom...	12	Experimental	4	Python
270	PathumDilhara/RL-agent-for-CNN-hyper-parameter-optimization A reinforcement learning (RL) based agent that automatically tunes...	12	Experimental	1	Jupyter Notebook
271	brianspiering/rl-course Applied Reinforcement Learning course	12	Experimental	12	Jupyter Notebook
272	CarsonScott/Dual-Process-Reinforcement An intelligent agent that adaptively changes its thought processes to...	12	Experimental	12	—
273	openpsi-projects/srl SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores	12	Experimental	15	Python
274	TroddenSpade/Maximum-Entropy-Deep-IRL Implementations of Maximum Entropy Algorithms for solving Inverse...	12	Experimental	29	Jupyter Notebook
275	MatTheTab/GHOST_RL_materials Materials for Reinforcement Learning and Machine Learning in games for GHOST.	11	Experimental	—	Jupyter Notebook
276	ArdavanKhalij/RL-Seminar-Project This project is the project of RL course at Vrije Universiteit Brussels and...	11	Experimental	2	Python
277	Develop-Packt/Building-an-Artificial-Intelligence-Algorithm Learn how to build a machine learning mode and get started on the popular...	11	Experimental	—	Jupyter Notebook
278	TroddenSpade/Meta-Reinforcement-Learning Code snippets of Meta Reinforcement Learning algorithms	11	Experimental	39	Jupyter Notebook
279	baekbyte/NormLayer A Python SDK that enforces behavioral policies between agents at runtime in...	11	Experimental	—	Python
280	Tahernezhad/Continuous-Control-Workbench A clean PyTorch implementation of PPO, SAC, and TD3 made from scratch. It is...	11	Experimental	—	Python
281	zchuning/repo Resilient Model-Based RL by Regularizing Posterior Predictability	11	Experimental	22	Python
282	thevilledev/elements-of-ai-idea Project pitch on using reinforcement learning for resource scheduling	11	Experimental	—	—
283	yamatokataoka/learning-from-human-preferences Replication of Deep Reinforcement Learning from Human Preferences...	11	Experimental	2	TypeScript
284	Bonifatius94/rl-algos SOTA Reinforcement Learning Algorithms	11	Experimental	2	Python
285	ArdavanKhalij/MDP machine-learning reinforcement-learning artificial-intelligence...	11	Experimental	2	Jupyter Notebook
286	mohamedrxo/ppo A comprehensive repository for training OpenAI Gym environments using...	10	Experimental	3	Jupyter Notebook
287	Saifahmadkhan/PlugNPlay This library is a PlugNPlay version of our novel pipeline VacSIM. We have...	10	Experimental	1	Python
288	Talendar/pyderl Simple Deep Reinforcement Learning framework for Python.	10	Experimental	1	Python
289	micdestefano/micppo An implementation of Proximal Policy Optimization (PPO)	10	Experimental	1	Python
290	PieroMacaluso/collaboration-n-competition Implementation of Multi-Agent Deep Deterministic Policy Gradient (MADDPG)...	10	Experimental	1	TeX

Comparisons in this category

stable-baselines3 and stable-baselines3-contrib (76 vs 57) rl_games and Practical_RL (73 vs 64) rl_games and Deep-reinforcement-learning-with-pytorch (73 vs 51)