Reinforcement Learning Frameworks

Complete RL algorithm implementations and educational resources for training agents using policy gradient, Q-learning, actor-critic, and other methods. Does NOT include game-playing agents, robotics simulators, or domain-specific RL applications—only the core algorithmic frameworks and tutorials.

There are 290 reinforcement learning frameworks tracked. 3 score above 70 (verified tier). The highest-rated is google-deepmind/dm_control at 86/100 with 4,494 stars and 309,287 monthly downloads. 5 of the top 10 are actively maintained.

Get all 290 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=reinforcement-learning-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 google-deepmind/dm_control

Google DeepMind's software stack for physics-based simulation and...

86
Verified
2 DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of...

76
Verified
3 Denys88/rl_games

RL implementations

73
Verified
4 pytorch/rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

68
Established
5 flatland-association/flatland-rl

The Flatland Framework is a multi-purpose environment to tackle problems...

64
Established
6 yandexdataschool/Practical_RL

A course in reinforcement learning in the wild

64
Established
7 takuseno/d3rlpy

An offline deep reinforcement learning library

62
Established
8 keras-rl/keras-rl

Deep Reinforcement Learning for Keras.

60
Established
9 MushroomRL/mushroom-rl

Python library for Reinforcement Learning.

59
Established
10 qzed/irl-maxent

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning...

58
Established
11 Stable-Baselines-Team/stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning...

57
Established
12 PKU-Alignment/omnisafe

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

56
Established
13 huggingface/deep-rl-class

This repo contains the Hugging Face Deep Reinforcement Learning Course.

55
Established
14 MyoHub/myosuite

MyoSuite is a collection of environments/tasks to be solved by...

54
Established
15 google-research/batch-ppo

Efficient Batched Reinforcement Learning in TensorFlow

54
Established
16 upb-lea/reinforcement_learning_course_materials

Lecture notes, tutorial tasks including solutions as well as online videos...

54
Established
17 inoryy/reaver

Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft...

54
Established
18 tensorlayer/RLzoo

A Comprehensive Reinforcement Learning Zoo for Simple Usage 🚀

53
Established
19 lucidrains/streaming-deep-rl

Explorations into the proposed Streaming Deep Reinforcement Learning, from...

52
Established
20 rlcode/reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

51
Established
21 MorvanZhou/Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

51
Established
22 sweetice/Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO,...

51
Established
23 MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of...

51
Established
24 ikostrikov/pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy...

51
Established
25 ShangtongZhang/reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

51
Established
26 SforAiDl/genrl

A PyTorch reinforcement learning library for generalizable and reproducible...

50
Established
27 iffiX/machin

Reinforcement learning library(framework) designed for PyTorch, implements...

50
Established
28 seungeunrho/minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

49
Emerging
29 danaugrs/huskarl

Deep Reinforcement Learning Framework + Algorithms

49
Emerging
30 AdamStelmaszczyk/learning2run

Our NIPS 2017: Learning to Run source code

48
Emerging
31 vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning...

48
Emerging
32 andri27-ts/Reinforcement-Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python....

48
Emerging
33 Fraunhofer-IIS/fmugym

Interface to connect Reinforcement Learning libraries to Functional Mock-up...

47
Emerging
34 danijar/mindpark

Testbed for deep reinforcement learning

47
Emerging
35 fracapuano/robot-learning-tutorial

All the source code for "Robot Learning: A Tutorial". Get involved to be...

47
Emerging
36 TuragaLab/flybody

MuJoCo fruit fly body model and locomotion RL tasks

47
Emerging
37 rl-tools/rl-tools

The Fastest Deep Reinforcement Learning Library

47
Emerging
38 CarperAI/trlx

A repo for distributed training of language models with Reinforcement...

46
Emerging
39 ikostrikov/pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from...

44
Emerging
40 keon/deep-q-learning

Minimal Deep Q Learning (DQN & DDQN) implementations in Keras

44
Emerging
41 miyosuda/async_deep_reinforce

Asynchronous Methods for Deep Reinforcement Learning

44
Emerging
42 danijar/embodied

Fast reinforcement learning research

44
Emerging
43 RLE-Foundation/rllte

Long-Term Evolution Project of Reinforcement Learning

44
Emerging
44 stanfordnmbl/osim-rl

Reinforcement learning environments with musculoskeletal models

44
Emerging
45 archsyscall/DeepRL-TensorFlow2

🐋 Simple implementations of various popular Deep Reinforcement Learning...

44
Emerging
46 ankonzoid/LearningX

Deep & Classical Reinforcement Learning + Machine Learning Examples in Python

44
Emerging
47 heronsystems/adeptRL

Reinforcement learning framework to accelerate research

44
Emerging
48 pathak22/noreward-rl

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep...

44
Emerging
49 vmayoral/basic_reinforcement_learning

An introductory series to Reinforcement Learning (RL) with comprehensive...

44
Emerging
50 dalmia/David-Silver-Reinforcement-learning

Notes for the Reinforcement Learning course by David Silver along with...

44
Emerging
51 icoxfog417/baby-steps-of-rl-ja

Pythonで学ぶ強化学習 -入門から実践まで- サンプルコード

44
Emerging
52 lucidrains/metacontroller

Implementation of the MetaController proposed in "Emergent temporal...

44
Emerging
53 mimoralea/gdrl

Grokking Deep Reinforcement Learning

44
Emerging
54 nikhilbarhate99/PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization...

43
Emerging
55 Kaixhin/Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning

43
Emerging
56 zyxue/sutton-barto-rl-exercises

📖Learning reinforcement learning by implementing the algorithms from...

43
Emerging
57 simoninithomas/Deep_reinforcement_learning_Course

Implementations from the free course Deep Reinforcement Learning with...

43
Emerging
58 mimoralea/applied-reinforcement-learning

Reinforcement Learning and Decision Making tutorials explained at an...

43
Emerging
59 pat-coady/trpo

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

43
Emerging
60 udacity/reinforcement-learning

Reinforcement learning material, code and exercises for Udacity Nanodegree programs.

43
Emerging
61 rail-berkeley/softlearning

Softlearning is a reinforcement learning framework for training maximum...

43
Emerging
62 jingweiz/pytorch-rl

Deep Reinforcement Learning with pytorch & visdom

43
Emerging
63 ikostrikov/pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

43
Emerging
64 nrontsis/PILCO

Bayesian Reinforcement Learning in Tensorflow

42
Emerging
65 denisyarats/pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

42
Emerging
66 opendilab/DI-engine-docs

DI-engine docs (Chinese and English)

42
Emerging
67 rmst/ddpg

TensorFlow implementation of the DDPG algorithm from the paper Continuous...

42
Emerging
68 binary-husky/hmp2g

Multiagent Reinforcement Learning Research Project

41
Emerging
69 XinJingHao/DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep...

41
Emerging
70 rl-language/rlc

Bringing reinforcement learning to every day programmers

41
Emerging
71 Stable-Baselines-Team/stable-baselines

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of...

41
Emerging
72 alessiodm/drl-zh

Deep Reinforcement Learning: Zero to Hero!

41
Emerging
73 Cloudslab/DLSF

[TMC'20] Deep Learning based Scheduler for Stochastic Fog-Cloud computing...

40
Emerging
74 HewlettPackard/dc-rl

SustainDC is a set of Python environments for Data Center simulation and...

40
Emerging
75 ericyangyu/PPO-for-Beginners

A simple and well styled PPO implementation. Based on my Medium series:...

40
Emerging
76 adrianwix/pybasin

pyBasin is a Python library for estimating basin stability in dynamical...

40
Emerging
77 gordicaleksa/pytorch-learn-reinforcement-learning

A collection of various RL algorithms like policy gradients, DQN and PPO....

40
Emerging
78 godka/Pensieve-PPO

The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art...

39
Emerging
79 ItoMasaki/PixyzRL

A Bayesian RL Framework with Probabilistic Generative Models

39
Emerging
80 SuhailSama/MR_RL

Gym Simulator for Magnetic Micro Robots

39
Emerging
81 TianhongDai/reinforcement-learning-algorithms

This repository contains most of pytorch implementation based classic deep...

39
Emerging
82 sebastianbrzustowicz/Robot-Sumo-RL

Python + PyTorch. Advanced Reinforcement Learning (SAC/PPO/A2C) for...

38
Emerging
83 google-deepmind/dm_env

A Python interface for reinforcement learning environments

38
Emerging
84 medipixel/rl_algorithms

Structural implementation of RL key algorithms

38
Emerging
85 Anjum48/rl-examples

Examples of published reinforcement learning algorithms in recent literature...

38
Emerging
86 IBM/LOA

Neuro-Symbolic Reinforcement Learning: Logical Optimal Action (LOA), a novel...

38
Emerging
87 gabrielhuang/reptile-pytorch

A PyTorch implementation of OpenAI's REPTILE algorithm

38
Emerging
88 UoA-CARES/cares_reinforcement_learning

CARES Reinforcement Learning Package

38
Emerging
89 yihaosun1124/OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

38
Emerging
90 mahyaret/kuka_rl

Reinforcement Learning Experiments using PyBullet

37
Emerging
91 huangwl18/modular-rl

[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular...

37
Emerging
92 denisyarats/drq

DrQ: Data regularized Q

37
Emerging
93 khushhallchandra/pytorch-rl

Pytorch Implementation of RL algorithms

37
Emerging
94 DeNA/HandyRL

HandyRL is a handy and simple framework based on Python and PyTorch for...

37
Emerging
95 tayalmanan28/Safe_Reinforcement_Learning

Repository containing the code for safe reinforcement learning in two custom...

36
Emerging
96 yrlu/irl-imitation

Implementation of Inverse Reinforcement Learning (IRL) algorithms in...

36
Emerging
97 rlgraph/rlgraph

RLgraph: Modular computation graphs for deep reinforcement learning

36
Emerging
98 mohmdelsayed/streaming-drl

Deep reinforcement learning without experience replay, target networks, or...

36
Emerging
99 omerbsezer/Reinforcement_learning_tutorial_with_demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration),...

36
Emerging
100 asystemoffields/disco-torch

A PyTorch port of DeepMind's Disco103 — the meta-learned reinforcement...

36
Emerging
101 sudharsan13296/Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using...

36
Emerging
102 Shaswat2001/maple-robotics

MAPLE (Model and Policy Learning Evaluation) - A unified CLI daemon for...

36
Emerging
103 dvalenciar/ReinforceUI-Studio

ReinforceUI-Studio. A Python-based application designed to simplify the...

35
Emerging
104 Bellman-devs/bellman

Model-based reinforcement learning in TensorFlow

35
Emerging
105 andrewliao11/Deep-Reinforcement-Learning-Survey

My Exploration on Deep Reinforcement Learning Survey

35
Emerging
106 Learning4Optimization-HUST/H-TSP

Official implementation of H-TSP (AAAI2023)

35
Emerging
107 mitre/ilpyt

ilpyt: imitation learning library with modular, baseline implementations in Pytorch

35
Emerging
108 MaartenGr/ReinLife

Creating Artificial Life with Reinforcement Learning

34
Emerging
109 MarcoMeter/recurrent-ppo-truncated-bptt

Baseline implementation of recurrent PPO using truncated BPTT

34
Emerging
110 Kaixhin/imitation-learning

Imitation learning algorithms

34
Emerging
111 NatLabRockies/graph-env

Reinforcement learning for combinatorial optimization over directed graphs

34
Emerging
112 airboxlab/rllib-energyplus

Simple EnergyPlus environments for control optimization using reinforcement learning

34
Emerging
113 denisyarats/proto

Proto-RL: Reinforcement Learning with Prototypical Representations

34
Emerging
114 rmst/rlrd

PyTorch implementation of our paper Reinforcement Learning with Random...

34
Emerging
115 thanhkaist/CCFDM1

CCFDM reinforcement learning

34
Emerging
116 tirthajyoti/RL_basics

Basic Reinforcement Learning algorithms

33
Emerging
117 rllab-snu/Deep-Reinforcement-Learning

Introduction to Deep Reinforcement Learning

33
Emerging
118 nsidn98/NICE

Combining Reinforcement Learning with Integer Programming for Robust Scheduling

33
Emerging
119 whoiszyc/IntelliHealer

IntelliHealer: An imitation and reinforcement learning platform for...

33
Emerging
120 zuoxingdong/lagom

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement...

33
Emerging
121 TheoLvs/reinforcement-learning

Personal experiments on Reinforcement Learning

33
Emerging
122 araffin/rl-handson-rlvs21

Stable-Baselines3 (SB3) reinforcement learning tutorial for the...

33
Emerging
123 MishaLaskin/rad

RAD: Reinforcement Learning with Augmented Data

33
Emerging
124 antonpuz/DeROL

Deep Reinforcement One-Shot Learning Framework for Artificially Intelligent...

33
Emerging
125 RLE-Foundation/RLeXplore

RLeXplore provides stable baselines of exploration methods in reinforcement...

32
Emerging
126 luisgarciar/3D-bin-packing

Solving the 3D bin packing problem with reinforcement learning

32
Emerging
127 EsratMaria/Reinforcement-Learning_for_Energy_Minimization_Using_CLoudsim

Implementation of RL in the cloud for energy minimization due to migration...

32
Emerging
128 UlisseMini/procgen-tools

Tools for running experiments on RL agents in procgen environments

32
Emerging
129 921kiyo/symbolic-rl

Symbolic Reinforcement Learning using Inductive Logic Programming

32
Emerging
130 Zhenye-Na/advanced-deep-learning-and-reinforcement-learning-deepmind

🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind |...

32
Emerging
131 sdpkjc/abcdrl

Modular Single-file Reinfocement Learning Algorithms Library

32
Emerging
132 YuvrajSingh-mist/NeatRL

Repository of implementations of classic and sota rl algorithms from scratch...

31
Emerging
133 AdamStelmaszczyk/rl-tutorial

Source code for "A deep dive into reinforcement learning"

30
Emerging
134 astier/model-free-episodic-control

Model-Free-Episodic-Control implementation.

30
Emerging
135 takuseno/minerva

An out-of-the-box GUI tool for offline deep reinforcement learning

30
Emerging
136 affaan-m/Behavioral_RL

Reinforcement Learning with human behavioral biases integration

30
Emerging
137 LAMDA-RL/ODIS

The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent...

30
Emerging
138 reward-scope-ai/reward-scope

Real-time reward debugging and hacking detection for reinforcement learning

30
Emerging
139 appgym/appgym

Mobile Apps (Android) as Environment for Reinforcement Learning Agents

30
Emerging
140 anassinator/pddp

WIP implementation of Probabilistic Differential Dynamic Programming in PyTorch

30
Emerging
141 dalmia/udacity-deep-reinforcement-learning

My solutions to the projects (and mini-projects) of the Deep Reinforcement...

30
Emerging
142 VachanVY/Reinforcement-Learning

PyTorch implementations of algorithms from "Reinforcement Learning: An...

30
Emerging
143 chengxi600/RLStuff

A collection of reinforcement learning algorithm implementations

30
Emerging
144 BY571/CQL

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL....

29
Experimental
145 kochlisGit/Shadow-Hand-Controller

Construction of controllers for Shadow-Hand in Mujoco environment, using...

29
Experimental
146 saqib1707/RL-PPO-PyTorch

Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch

29
Experimental
147 denisyarats/exorl

ExORL: Exploratory Data for Offline Reinforcement Learning

29
Experimental
148 shehio/rl

Implementing RL agents, one algorithm at a time

29
Experimental
149 navneet-nmk/pytorch-rl

This repository contains model-free deep reinforcement learning algorithms...

29
Experimental
150 BNN-UPC/ENERO

Code used in the paper "ENERO: Efficient real-time WAN routing optimization...

29
Experimental
151 Alee08/multiagent-rl-rm

The Multi-Agent RLRM (Reinforcement Learning with Reward Machines) Framework...

28
Experimental
152 jayLEE0301/dhrl_official

Official code for "DHRL: A Graph-Based Approach for Long-Horizon and Sparse...

28
Experimental
153 lucadellalib/actorch

Deep reinforcement learning framework for fast prototyping based on PyTorch

28
Experimental
154 navneet-nmk/Pytorch-RL-CPP

A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)

27
Experimental
155 VladGavra98/SERL

Safety-informed Evolutionary Reinforcement Learning applied to...

27
Experimental
156 teepanis/nonlinear-pendulum

Data and Code Availability -- Universal spectral structure in pendulum-like systems

27
Experimental
157 schmidtdominik/Rainbow

Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient...

27
Experimental
158 opium-sh/prl

Open-source library for a reinforcement learning research.

27
Experimental
159 harshaljanjani/taskschedulingdqn

Designing energy-aware scheduling and task allocation algorithms for online...

27
Experimental
160 goktug97/PEPG-ES

Python Implementation of Parameter-exploring Policy Gradients Evolution Strategy

27
Experimental
161 amr-khaled164/GRLMSL

🚀 Optimize microservice instance selection and load balancing in edge...

27
Experimental
162 Brownwang0426/Reversal-Generative-Reinforcement-Learning

A simple model-free and value-function-free reinforcement learning model

26
Experimental
163 nnaisense/pgpelib

A mini library for Policy Gradients with Parameter-based Exploration, with...

26
Experimental
164 AdamStelmaszczyk/dqn

TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)

26
Experimental
165 jimimvp/torch_rl

Reinforcement learning library for PyTorch.

26
Experimental
166 NYU-MLDA/ABC-RL

This is work-in-progress (WIP) refactored implementation of...

26
Experimental
167 matthieu637/ddrl

Deep Developmental Reinforcement Learning

25
Experimental
168 haron1100/Upside-Down-Reinforcement-Learning

Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch

25
Experimental
169 lucaslingle/pytorch_rl2

Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'

25
Experimental
170 ialexmp/DRL-Generalization

Exploring Generalization in Deep Reinforcement Learning algorithms for...

25
Experimental
171 cgel/DRL

A collection of Deep Reinforcement Learning algorithms implemented in...

25
Experimental
172 FabioMiguel2000/LOA-feat.Reinforcement-Learning

Assigment 2 for Course L.EIC029 Artificial Intelligence, FEUP LEIC 3rd Year...

25
Experimental
173 Asap7772/PTR

This repository contains the implementation of the PTR algorithm described...

25
Experimental
174 CLAIRE-Labo/no-representation-no-trust

Codebase to fully reproduce the results of "No Representation, No Trust:...

25
Experimental
175 mbchang/decentralized-rl

Decentralized Reinforcment Learning: Global Decision-Making via Local...

24
Experimental
176 0xnu/deep-reinforcement-learning

Deep Reinforcement Learning (DRL)

24
Experimental
177 Skw3mdy/Reinforcement-Learning-Projects

🤖 Explore reinforcement learning techniques with projects including a taxi...

24
Experimental
178 linker81/Reinforcement-Learning-CheatSheet

Cheatsheet of Reinforcement Learning (Based on Sutton-Barto Book - 2nd Edition)

24
Experimental
179 Jaehyun-Jeong/100LinesRL

Clean RL algorithm implementations in under 100 lines each.

23
Experimental
180 Defenser1337/Reinforcement-learning-for-Gradient-descent

Application of reinforcement learning to train hyperparameters of gradient...

23
Experimental
181 voaneves/colab-rl

Keras implementation of the latest Reinforcement Learning algorithms, ready...

23
Experimental
182 Naighten/track-simulator

Код магистрантской дипломной работы студента НГТУ им Р.Е. Алексеева Жукова...

23
Experimental
183 kyegomez/HindsightReplay

My implementation of Hindsight replay in PyTorch: "Hindsight Experience Replay"

23
Experimental
184 ProfessorNova/PPO-Humanoid

PPO implementation for controlling a humanoid in Gymnasium's Mujoco...

23
Experimental
185 WinDerek/reinforce-py

Reinforcement learning agents in Python (dynamic programming,...

23
Experimental
186 Daraan/ray_utilities

ray & RLlib tools for unified code across different repositories....

23
Experimental
187 Now-Join-Us/V0

The code repository for "$V_0$: A Generalist Value Model for Any Policy at...

23
Experimental
188 JeepWay/DeepPack

Unofficial implementation of DeepPack in PyTorch. DeepPack is a deep...

23
Experimental
189 enjeeneer/zero-shot-rl

VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low...

23
Experimental
190 mindspore-courses/Rainbow-MindSpore

About Rainbow-MindSpore! A step-by-step tutorial from DQN to Rainbow

23
Experimental
191 davirenner88-rgb/LR-S

🚀 Emulate Arknights: Endfield servers with LR-S for seamless game...

23
Experimental
192 Space-Robotics-Laboratory/rlstar

RL STaR is a platform for creating AI for robotic applications. Researchers...

22
Experimental
193 motokiomura/Q-DOT

[RLC 2025] Official code repository for "Offline Reinforcement Learning with...

22
Experimental
194 KeepALifeUS/ml-dqn

Rainbow DQN: Double, Dueling, PER, Noisy Nets. Atari benchmarks. PyTorch.

22
Experimental
195 nunesma/reinforcement_learning

Deep reinforcement learning techniques for artificial intelligence project

22
Experimental
196 32olaa/reward-scope

🔍 Detect reward hacking in RL training with RewardScope. Track reward...

22
Experimental
197 HGVAbyte/rlhf-data-agent-full

🔍 Generate synthetic preference-ranked datasets for RLHF and DPO training,...

22
Experimental
198 AlirezaShamsoshoara/RL-from-zero

Comprehensive collection of reinforcement learning algorithms implemented...

22
Experimental
199 mlnjsh/Reinforcement_Learning_Projects

20 RL basics notebooks + 10 advanced projects with Streamlit apps covering...

22
Experimental
200 Rudge0/DynaMO-RL

Optimize policy learning by dynamically allocating rollouts and modulating...

22
Experimental
201 mercurycontaminated-sandarac557/KnapsackRL

🎯 Optimize exploration budgets in Reinforcement Learning with KnapsackRL for...

22
Experimental
202 rickstaa/stable-learning-control

A framework for training theoretically stable (and robust) Reinforcement...

22
Experimental
203 tartavull/alfredo

Relentlessly learning, persistently failing, but never surrendering.

22
Experimental
204 ErickRosete/tacorl

TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning

22
Experimental
205 rbosh/ml-adp

Approximate dynamic programming for stochastic optimal control in Pytorch

22
Experimental
206 zhuzhipeng-123/reinforce-study-for-mmm

Reinforcement Learning Research - Exploring RL algorithms in practical scenarios

22
Experimental
207 Yuxing-Wang-THU/Surrogate-assisted-ERL

A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning

21
Experimental
208 dlb-rl/pulse-rl

Code for PulseRL: Enabling Offline Reinforcement Learning for Digital...

21
Experimental
209 NVlabs/RL-TNCO

RL-TNCO: A reinforcement learning algorithm for solving the tensor network...

21
Experimental
210 cubrink/mujoco-2.1-rl-project

Implementing Deep Reinforcement Learning Algorithms in Python for use in the...

20
Experimental
211 MiscellaneousStuff/tlol-rl

TLoL (Reinforcement Learning Python Module) - League of Legends RL Module...

20
Experimental
212 JaydenTeoh/MORL-Generalization

Benchmark for evaluating the generalization capabilities of Multi-Objective...

20
Experimental
213 enginBozkurt/Deep-Reinforcement-Learning-for-Enterprise-Nanodegree

Udacity Deep Reinforcement Learning for Enterprise Nanodegree Projects

20
Experimental
214 mohmdelsayed/TinyRL

Real-Time Deep RL That Fits in Small Devices

20
Experimental
215 shivakanthsujit/reducible-loss

Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss

20
Experimental
216 natetsang/open-rl

Implementations of a large collection of reinforcement learning algorithms.

20
Experimental
217 Uzi-gpu/reinforcement-learning

Reinforcement Learning projects with Q-Learning, Actor-Critic, and REINFORCE...

19
Experimental
218 Axel-Bravo/19_udacity_drlnd

Deep Reinforcement Learning Nanodregree from Udacity

19
Experimental
219 ugr-sail/paper-drl_building

Supplementary material to the paper "An experimental evaluation of Deep...

19
Experimental
220 HzcIrving/DLRL-PlayGround

The code repo contains multiple code reproduction processes of various SOTA...

19
Experimental
221 declanoller/cat_mouse_continuous_RL

Using DDPG and A2C reinforcement learning algorithms to solve a math puzzle

19
Experimental
222 sapanz/Udacity-deep-reinforcement-learning-solution

This repo will cover most of machine learning algorithms with coding examples.

19
Experimental
223 prototwin/RLExamples

PotoTwin Reinforcement Learning Examples

18
Experimental
224 dalmia/P2_Continuous_Control

My solution code for the second project of Udacity's Deep Reinforcement...

18
Experimental
225 trunghng/reinforcement_learning_an_introduction

Python Implementation for problems in Reinforcement Learning - An Introduction book

17
Experimental
226 iliasoroka1/GRU_Lyapunov_Spectrum

Lyapunov Spectrum for Double Pendulum using GRU

17
Experimental
227 bmazoure/ppo_jax

Jax implementation of Proximal Policy Optimization (PPO) specifically tuned...

17
Experimental
228 rafelps/learning-recursive-goal-proposal

Learning Recursive Goal Proposal: A hierarchical Reinforcement Learning Approach

17
Experimental
229 dayyass/rllib

Reinforcement Learning Library.

16
Experimental
230 adaptive-intelligent-robotics/HTE

This is the repository for the paper Hierarchical Quality-Diversity for...

16
Experimental
231 soovittt/RL-Studio

A full-stack platform for designing reinforcement learning environments,...

16
Experimental
232 ankitsharma-tech/Deep-Reinforcement-Learning-With-Pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3.

16
Experimental
233 rorofaiz/awesome-RLVR-boundary

🔍 Explore curated resources on Reinforcement Learning with Verifiable...

16
Experimental
234 andranik-sahakyan/team-tron-rl

Multi-Agent Reinforcement Learning project exploring the emergence and...

16
Experimental
235 mindspore-courses/Deep-Reinforcement-Learning-Algorithms-with-MindSpore

MindSpore implementations of deep reinforcement learning algorithms and environments

16
Experimental
236 rStar-RL/LoongRL

LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts...

16
Experimental
237 ARgruny/Deep-Reinforcement-Learning

Build and test DRL algorithms in different environments

15
Experimental
238 ashworks1706/kaelum

LATS-based inference with a reward model and online policy router across...

15
Experimental
239 mlnjsh/rl-book-labs

🎮 Interactive browser-based labs for "Complete Reinforcement Learning...

15
Experimental
240 GTR-GAMES/Deep-Hierarchical-Planning

🔍 Implement efficient long-horizon task planning with this PyTorch...

15
Experimental
241 victor369basu/MyosuiteDDQN

In this repository, we try to solve musculoskeletal tasks with `Double DQN...

15
Experimental
242 icaros-usc/dqd-rl

Official implementation of "Approximating Gradients for Differentiable...

15
Experimental
243 motokiomura/annealed-q-learning

[ICML 2025] Official code repository for "Gradual Transition from Bellman...

15
Experimental
244 liyan2015/SUMO-RL-MobiCharger

OpenAI-gym-like Reinforcement Learning environment for Dispatching of Mobile...

15
Experimental
245 xValentim/ReinforcementLearning_Zero_to_Hero_Course

In this repository you will learn all the basic math about Reinforcement...

15
Experimental
246 fareskhlifi/Intelligent-Scheduling-using-Reinforcement-learning-and-Deep-Q-Networks

Implementing a new environment in Gymnasium for intelligent schduling

15
Experimental
247 aminkhani/Deep-RL

You can see a reference for Books, Articles, Courses and Educational...

15
Experimental
248 snthomps/rlhf-ppo-pipeline

RLHF/PPO Training Pipeline with Performance Profiling and Optimization Demonstrations

15
Experimental
249 hmomin/PPO-Winter-Run

Trains an agent with Proximal Policy Optimization (PPO) to beat Winter Run

15
Experimental
250 bay3s/ppo-parallel

Parallelized implementation of Proximal Policy Optimization (PPO).

15
Experimental
251 undextrois/reinforcement-learning

RL Experiments and what not

15
Experimental
252 silviomori/udacity-deep-reinforcement-learning-p2-continuous-control

Create and train a double-jointed arm agent that is able to maintain its...

15
Experimental
253 mhahsler/Introduction_to_Reinforcement_Learning

Material for an introduction course to reinforcement learning for compute scientists

15
Experimental
254 Lare1998/rl-for-robotics

Reinforcement Learning applications for robotic control and task automation.

14
Experimental
255 yelurebajrang/HeteroRL_GEPO

⚡ Optimize heterogeneous reinforcement learning with GEPO for decentralized...

14
Experimental
256 Madid1976/reinforcement-learning-agents

Implementations of various reinforcement learning algorithms and agents for...

14
Experimental
257 kodok13/Label-Free-RLVR

📚 Explore a curated collection of research on Label-Free Reinforcement...

14
Experimental
258 a7med3laa/DRL-Books-resources

Deep Reinforcement Learning Books and links for studying

14
Experimental
259 julia-bel/MAPF_G2RL

Implementation of the G2RL approach in the POGEMA environment

14
Experimental
260 uzumstanley/DEEP-LEARNING

UNIVERSITY OF ROEHAMPTON LONDON

14
Experimental
261 Jcorrieri/multiagent-gridworld

Deep Reinforcement Learning for Multi-Robot Path Planning using PyTorch, Ray...

14
Experimental
262 BackpropTools/BackpropTools

A Fast, Portable Deep Reinforcement Learning Library for Continuous Control

14
Experimental
263 TroddenSpade/Exhaustive-Reinforcement-Learning

Exhaustive Implementation of Algorithms, Key Papers, and Well-Known Problems...

14
Experimental
264 PatrickSinger99/ReinforcementLearningInventoryManagement

Repository for my bachelor thesis on inventory management in a logistics...

14
Experimental
265 Aryia-Behroziuan/Robot-learning

In developmental robotics, robot learning algorithms generate their own...

14
Experimental
266 AndersonPeng/ppo_tutorial

PPO pytorch tutorial for continuous control (BipedalWalker-v3)

13
Experimental
267 manjavacas/rl-temario

Temario sobre aprendizaje por refuerzo en español.

13
Experimental
268 mbar0075/Advanced-Reinforcement-Learning

Deliverables relating to the Advanced Reinforcement Learning University Unit

13
Experimental
269 gabotechs/lazaro

Reinforcement learning framework for implementing custom models on custom...

12
Experimental
270 PathumDilhara/RL-agent-for-CNN-hyper-parameter-optimization

A reinforcement learning (RL) based agent that automatically tunes...

12
Experimental
271 brianspiering/rl-course

Applied Reinforcement Learning course

12
Experimental
272 CarsonScott/Dual-Process-Reinforcement

An intelligent agent that adaptively changes its thought processes to...

12
Experimental
273 openpsi-projects/srl

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

12
Experimental
274 TroddenSpade/Maximum-Entropy-Deep-IRL

Implementations of Maximum Entropy Algorithms for solving Inverse...

12
Experimental
275 MatTheTab/GHOST_RL_materials

Materials for Reinforcement Learning and Machine Learning in games for GHOST.

11
Experimental
276 ArdavanKhalij/RL-Seminar-Project

This project is the project of RL course at Vrije Universiteit Brussels and...

11
Experimental
277 Develop-Packt/Building-an-Artificial-Intelligence-Algorithm

Learn how to build a machine learning mode and get started on the popular...

11
Experimental
278 TroddenSpade/Meta-Reinforcement-Learning

Code snippets of Meta Reinforcement Learning algorithms

11
Experimental
279 baekbyte/NormLayer

A Python SDK that enforces behavioral policies between agents at runtime in...

11
Experimental
280 Tahernezhad/Continuous-Control-Workbench

A clean PyTorch implementation of PPO, SAC, and TD3 made from scratch. It is...

11
Experimental
281 zchuning/repo

Resilient Model-Based RL by Regularizing Posterior Predictability

11
Experimental
282 thevilledev/elements-of-ai-idea

Project pitch on using reinforcement learning for resource scheduling

11
Experimental
283 yamatokataoka/learning-from-human-preferences

Replication of Deep Reinforcement Learning from Human Preferences...

11
Experimental
284 Bonifatius94/rl-algos

SOTA Reinforcement Learning Algorithms

11
Experimental
285 ArdavanKhalij/MDP

machine-learning reinforcement-learning artificial-intelligence...

11
Experimental
286 mohamedrxo/ppo

A comprehensive repository for training OpenAI Gym environments using...

10
Experimental
287 Saifahmadkhan/PlugNPlay

This library is a PlugNPlay version of our novel pipeline VacSIM. We have...

10
Experimental
288 Talendar/pyderl

Simple Deep Reinforcement Learning framework for Python.

10
Experimental
289 micdestefano/micppo

An implementation of Proximal Policy Optimization (PPO)

10
Experimental
290 PieroMacaluso/collaboration-n-competition

Implementation of Multi-Agent Deep Deterministic Policy Gradient (MADDPG)...

10
Experimental