Reinforcement Learning Frameworks
Complete RL algorithm implementations and educational resources for training agents using policy gradient, Q-learning, actor-critic, and other methods. Does NOT include game-playing agents, robotics simulators, or domain-specific RL applications—only the core algorithmic frameworks and tutorials.
There are 290 reinforcement learning frameworks tracked. 3 score above 70 (verified tier). The highest-rated is google-deepmind/dm_control at 86/100 with 4,494 stars and 309,287 monthly downloads. 5 of the top 10 are actively maintained.
Get all 290 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=reinforcement-learning-frameworks&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and... |
|
Verified |
| 2 |
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of... |
|
Verified |
| 3 |
Denys88/rl_games
RL implementations |
|
Verified |
| 4 |
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning. |
|
Established |
| 5 |
flatland-association/flatland-rl
The Flatland Framework is a multi-purpose environment to tackle problems... |
|
Established |
| 6 |
yandexdataschool/Practical_RL
A course in reinforcement learning in the wild |
|
Established |
| 7 |
takuseno/d3rlpy
An offline deep reinforcement learning library |
|
Established |
| 8 |
keras-rl/keras-rl
Deep Reinforcement Learning for Keras. |
|
Established |
| 9 |
MushroomRL/mushroom-rl
Python library for Reinforcement Learning. |
|
Established |
| 10 |
qzed/irl-maxent
Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning... |
|
Established |
| 11 |
Stable-Baselines-Team/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning... |
|
Established |
| 12 |
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research. |
|
Established |
| 13 |
huggingface/deep-rl-class
This repo contains the Hugging Face Deep Reinforcement Learning Course. |
|
Established |
| 14 |
MyoHub/myosuite
MyoSuite is a collection of environments/tasks to be solved by... |
|
Established |
| 15 |
google-research/batch-ppo
Efficient Batched Reinforcement Learning in TensorFlow |
|
Established |
| 16 |
upb-lea/reinforcement_learning_course_materials
Lecture notes, tutorial tasks including solutions as well as online videos... |
|
Established |
| 17 |
inoryy/reaver
Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft... |
|
Established |
| 18 |
tensorlayer/RLzoo
A Comprehensive Reinforcement Learning Zoo for Simple Usage 🚀 |
|
Established |
| 19 |
lucidrains/streaming-deep-rl
Explorations into the proposed Streaming Deep Reinforcement Learning, from... |
|
Established |
| 20 |
rlcode/reinforcement-learning
Minimal and Clean Reinforcement Learning Examples |
|
Established |
| 21 |
MorvanZhou/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学 |
|
Established |
| 22 |
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO,... |
|
Established |
| 23 |
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of... |
|
Established |
| 24 |
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy... |
|
Established |
| 25 |
ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction |
|
Established |
| 26 |
SforAiDl/genrl
A PyTorch reinforcement learning library for generalizable and reproducible... |
|
Established |
| 27 |
iffiX/machin
Reinforcement learning library(framework) designed for PyTorch, implements... |
|
Established |
| 28 |
seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based) |
|
Emerging |
| 29 |
danaugrs/huskarl
Deep Reinforcement Learning Framework + Algorithms |
|
Emerging |
| 30 |
AdamStelmaszczyk/learning2run
Our NIPS 2017: Learning to Run source code |
|
Emerging |
| 31 |
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning... |
|
Emerging |
| 32 |
andri27-ts/Reinforcement-Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python.... |
|
Emerging |
| 33 |
Fraunhofer-IIS/fmugym
Interface to connect Reinforcement Learning libraries to Functional Mock-up... |
|
Emerging |
| 34 |
danijar/mindpark
Testbed for deep reinforcement learning |
|
Emerging |
| 35 |
fracapuano/robot-learning-tutorial
All the source code for "Robot Learning: A Tutorial". Get involved to be... |
|
Emerging |
| 36 |
TuragaLab/flybody
MuJoCo fruit fly body model and locomotion RL tasks |
|
Emerging |
| 37 |
rl-tools/rl-tools
The Fastest Deep Reinforcement Learning Library |
|
Emerging |
| 38 |
CarperAI/trlx
A repo for distributed training of language models with Reinforcement... |
|
Emerging |
| 39 |
ikostrikov/pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from... |
|
Emerging |
| 40 |
keon/deep-q-learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras |
|
Emerging |
| 41 |
miyosuda/async_deep_reinforce
Asynchronous Methods for Deep Reinforcement Learning |
|
Emerging |
| 42 |
danijar/embodied
Fast reinforcement learning research |
|
Emerging |
| 43 |
RLE-Foundation/rllte
Long-Term Evolution Project of Reinforcement Learning |
|
Emerging |
| 44 |
stanfordnmbl/osim-rl
Reinforcement learning environments with musculoskeletal models |
|
Emerging |
| 45 |
archsyscall/DeepRL-TensorFlow2
🐋 Simple implementations of various popular Deep Reinforcement Learning... |
|
Emerging |
| 46 |
ankonzoid/LearningX
Deep & Classical Reinforcement Learning + Machine Learning Examples in Python |
|
Emerging |
| 47 |
heronsystems/adeptRL
Reinforcement learning framework to accelerate research |
|
Emerging |
| 48 |
pathak22/noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep... |
|
Emerging |
| 49 |
vmayoral/basic_reinforcement_learning
An introductory series to Reinforcement Learning (RL) with comprehensive... |
|
Emerging |
| 50 |
dalmia/David-Silver-Reinforcement-learning
Notes for the Reinforcement Learning course by David Silver along with... |
|
Emerging |
| 51 |
icoxfog417/baby-steps-of-rl-ja
Pythonで学ぶ強化学習 -入門から実践まで- サンプルコード |
|
Emerging |
| 52 |
lucidrains/metacontroller
Implementation of the MetaController proposed in "Emergent temporal... |
|
Emerging |
| 53 |
mimoralea/gdrl
Grokking Deep Reinforcement Learning |
|
Emerging |
| 54 |
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization... |
|
Emerging |
| 55 |
Kaixhin/Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning |
|
Emerging |
| 56 |
zyxue/sutton-barto-rl-exercises
📖Learning reinforcement learning by implementing the algorithms from... |
|
Emerging |
| 57 |
simoninithomas/Deep_reinforcement_learning_Course
Implementations from the free course Deep Reinforcement Learning with... |
|
Emerging |
| 58 |
mimoralea/applied-reinforcement-learning
Reinforcement Learning and Decision Making tutorials explained at an... |
|
Emerging |
| 59 |
pat-coady/trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym |
|
Emerging |
| 60 |
udacity/reinforcement-learning
Reinforcement learning material, code and exercises for Udacity Nanodegree programs. |
|
Emerging |
| 61 |
rail-berkeley/softlearning
Softlearning is a reinforcement learning framework for training maximum... |
|
Emerging |
| 62 |
jingweiz/pytorch-rl
Deep Reinforcement Learning with pytorch & visdom |
|
Emerging |
| 63 |
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization |
|
Emerging |
| 64 |
nrontsis/PILCO
Bayesian Reinforcement Learning in Tensorflow |
|
Emerging |
| 65 |
denisyarats/pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC) |
|
Emerging |
| 66 |
opendilab/DI-engine-docs
DI-engine docs (Chinese and English) |
|
Emerging |
| 67 |
rmst/ddpg
TensorFlow implementation of the DDPG algorithm from the paper Continuous... |
|
Emerging |
| 68 |
binary-husky/hmp2g
Multiagent Reinforcement Learning Research Project |
|
Emerging |
| 69 |
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep... |
|
Emerging |
| 70 |
rl-language/rlc
Bringing reinforcement learning to every day programmers |
|
Emerging |
| 71 |
Stable-Baselines-Team/stable-baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of... |
|
Emerging |
| 72 |
alessiodm/drl-zh
Deep Reinforcement Learning: Zero to Hero! |
|
Emerging |
| 73 |
Cloudslab/DLSF
[TMC'20] Deep Learning based Scheduler for Stochastic Fog-Cloud computing... |
|
Emerging |
| 74 |
HewlettPackard/dc-rl
SustainDC is a set of Python environments for Data Center simulation and... |
|
Emerging |
| 75 |
ericyangyu/PPO-for-Beginners
A simple and well styled PPO implementation. Based on my Medium series:... |
|
Emerging |
| 76 |
adrianwix/pybasin
pyBasin is a Python library for estimating basin stability in dynamical... |
|
Emerging |
| 77 |
gordicaleksa/pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO.... |
|
Emerging |
| 78 |
godka/Pensieve-PPO
The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art... |
|
Emerging |
| 79 |
ItoMasaki/PixyzRL
A Bayesian RL Framework with Probabilistic Generative Models |
|
Emerging |
| 80 |
SuhailSama/MR_RL
Gym Simulator for Magnetic Micro Robots |
|
Emerging |
| 81 |
TianhongDai/reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep... |
|
Emerging |
| 82 |
sebastianbrzustowicz/Robot-Sumo-RL
Python + PyTorch. Advanced Reinforcement Learning (SAC/PPO/A2C) for... |
|
Emerging |
| 83 |
google-deepmind/dm_env
A Python interface for reinforcement learning environments |
|
Emerging |
| 84 |
medipixel/rl_algorithms
Structural implementation of RL key algorithms |
|
Emerging |
| 85 |
Anjum48/rl-examples
Examples of published reinforcement learning algorithms in recent literature... |
|
Emerging |
| 86 |
IBM/LOA
Neuro-Symbolic Reinforcement Learning: Logical Optimal Action (LOA), a novel... |
|
Emerging |
| 87 |
gabrielhuang/reptile-pytorch
A PyTorch implementation of OpenAI's REPTILE algorithm |
|
Emerging |
| 88 |
UoA-CARES/cares_reinforcement_learning
CARES Reinforcement Learning Package |
|
Emerging |
| 89 |
yihaosun1124/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers. |
|
Emerging |
| 90 |
mahyaret/kuka_rl
Reinforcement Learning Experiments using PyBullet |
|
Emerging |
| 91 |
huangwl18/modular-rl
[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular... |
|
Emerging |
| 92 |
denisyarats/drq
DrQ: Data regularized Q |
|
Emerging |
| 93 |
khushhallchandra/pytorch-rl
Pytorch Implementation of RL algorithms |
|
Emerging |
| 94 |
DeNA/HandyRL
HandyRL is a handy and simple framework based on Python and PyTorch for... |
|
Emerging |
| 95 |
tayalmanan28/Safe_Reinforcement_Learning
Repository containing the code for safe reinforcement learning in two custom... |
|
Emerging |
| 96 |
yrlu/irl-imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in... |
|
Emerging |
| 97 |
rlgraph/rlgraph
RLgraph: Modular computation graphs for deep reinforcement learning |
|
Emerging |
| 98 |
mohmdelsayed/streaming-drl
Deep reinforcement learning without experience replay, target networks, or... |
|
Emerging |
| 99 |
omerbsezer/Reinforcement_learning_tutorial_with_demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration),... |
|
Emerging |
| 100 |
asystemoffields/disco-torch
A PyTorch port of DeepMind's Disco103 — the meta-learned reinforcement... |
|
Emerging |
| 101 |
sudharsan13296/Deep-Reinforcement-Learning-With-Python
Master classic RL, deep RL, distributional RL, inverse RL, and more using... |
|
Emerging |
| 102 |
Shaswat2001/maple-robotics
MAPLE (Model and Policy Learning Evaluation) - A unified CLI daemon for... |
|
Emerging |
| 103 |
dvalenciar/ReinforceUI-Studio
ReinforceUI-Studio. A Python-based application designed to simplify the... |
|
Emerging |
| 104 |
Bellman-devs/bellman
Model-based reinforcement learning in TensorFlow |
|
Emerging |
| 105 |
andrewliao11/Deep-Reinforcement-Learning-Survey
My Exploration on Deep Reinforcement Learning Survey |
|
Emerging |
| 106 |
Learning4Optimization-HUST/H-TSP
Official implementation of H-TSP (AAAI2023) |
|
Emerging |
| 107 |
mitre/ilpyt
ilpyt: imitation learning library with modular, baseline implementations in Pytorch |
|
Emerging |
| 108 |
MaartenGr/ReinLife
Creating Artificial Life with Reinforcement Learning |
|
Emerging |
| 109 |
MarcoMeter/recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT |
|
Emerging |
| 110 |
Kaixhin/imitation-learning
Imitation learning algorithms |
|
Emerging |
| 111 |
NatLabRockies/graph-env
Reinforcement learning for combinatorial optimization over directed graphs |
|
Emerging |
| 112 |
airboxlab/rllib-energyplus
Simple EnergyPlus environments for control optimization using reinforcement learning |
|
Emerging |
| 113 |
denisyarats/proto
Proto-RL: Reinforcement Learning with Prototypical Representations |
|
Emerging |
| 114 |
rmst/rlrd
PyTorch implementation of our paper Reinforcement Learning with Random... |
|
Emerging |
| 115 |
thanhkaist/CCFDM1
CCFDM reinforcement learning |
|
Emerging |
| 116 |
tirthajyoti/RL_basics
Basic Reinforcement Learning algorithms |
|
Emerging |
| 117 |
rllab-snu/Deep-Reinforcement-Learning
Introduction to Deep Reinforcement Learning |
|
Emerging |
| 118 |
nsidn98/NICE
Combining Reinforcement Learning with Integer Programming for Robust Scheduling |
|
Emerging |
| 119 |
whoiszyc/IntelliHealer
IntelliHealer: An imitation and reinforcement learning platform for... |
|
Emerging |
| 120 |
zuoxingdong/lagom
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement... |
|
Emerging |
| 121 |
TheoLvs/reinforcement-learning
Personal experiments on Reinforcement Learning |
|
Emerging |
| 122 |
araffin/rl-handson-rlvs21
Stable-Baselines3 (SB3) reinforcement learning tutorial for the... |
|
Emerging |
| 123 |
MishaLaskin/rad
RAD: Reinforcement Learning with Augmented Data |
|
Emerging |
| 124 |
antonpuz/DeROL
Deep Reinforcement One-Shot Learning Framework for Artificially Intelligent... |
|
Emerging |
| 125 |
RLE-Foundation/RLeXplore
RLeXplore provides stable baselines of exploration methods in reinforcement... |
|
Emerging |
| 126 |
luisgarciar/3D-bin-packing
Solving the 3D bin packing problem with reinforcement learning |
|
Emerging |
| 127 |
EsratMaria/Reinforcement-Learning_for_Energy_Minimization_Using_CLoudsim
Implementation of RL in the cloud for energy minimization due to migration... |
|
Emerging |
| 128 |
UlisseMini/procgen-tools
Tools for running experiments on RL agents in procgen environments |
|
Emerging |
| 129 |
921kiyo/symbolic-rl
Symbolic Reinforcement Learning using Inductive Logic Programming |
|
Emerging |
| 130 |
Zhenye-Na/advanced-deep-learning-and-reinforcement-learning-deepmind
🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind |... |
|
Emerging |
| 131 |
sdpkjc/abcdrl
Modular Single-file Reinfocement Learning Algorithms Library |
|
Emerging |
| 132 |
YuvrajSingh-mist/NeatRL
Repository of implementations of classic and sota rl algorithms from scratch... |
|
Emerging |
| 133 |
AdamStelmaszczyk/rl-tutorial
Source code for "A deep dive into reinforcement learning" |
|
Emerging |
| 134 |
astier/model-free-episodic-control
Model-Free-Episodic-Control implementation. |
|
Emerging |
| 135 |
takuseno/minerva
An out-of-the-box GUI tool for offline deep reinforcement learning |
|
Emerging |
| 136 |
affaan-m/Behavioral_RL
Reinforcement Learning with human behavioral biases integration |
|
Emerging |
| 137 |
LAMDA-RL/ODIS
The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent... |
|
Emerging |
| 138 |
reward-scope-ai/reward-scope
Real-time reward debugging and hacking detection for reinforcement learning |
|
Emerging |
| 139 |
appgym/appgym
Mobile Apps (Android) as Environment for Reinforcement Learning Agents |
|
Emerging |
| 140 |
anassinator/pddp
WIP implementation of Probabilistic Differential Dynamic Programming in PyTorch |
|
Emerging |
| 141 |
dalmia/udacity-deep-reinforcement-learning
My solutions to the projects (and mini-projects) of the Deep Reinforcement... |
|
Emerging |
| 142 |
VachanVY/Reinforcement-Learning
PyTorch implementations of algorithms from "Reinforcement Learning: An... |
|
Emerging |
| 143 |
chengxi600/RLStuff
A collection of reinforcement learning algorithm implementations |
|
Emerging |
| 144 |
BY571/CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL.... |
|
Experimental |
| 145 |
kochlisGit/Shadow-Hand-Controller
Construction of controllers for Shadow-Hand in Mujoco environment, using... |
|
Experimental |
| 146 |
saqib1707/RL-PPO-PyTorch
Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch |
|
Experimental |
| 147 |
denisyarats/exorl
ExORL: Exploratory Data for Offline Reinforcement Learning |
|
Experimental |
| 148 |
shehio/rl
Implementing RL agents, one algorithm at a time |
|
Experimental |
| 149 |
navneet-nmk/pytorch-rl
This repository contains model-free deep reinforcement learning algorithms... |
|
Experimental |
| 150 |
BNN-UPC/ENERO
Code used in the paper "ENERO: Efficient real-time WAN routing optimization... |
|
Experimental |
| 151 |
Alee08/multiagent-rl-rm
The Multi-Agent RLRM (Reinforcement Learning with Reward Machines) Framework... |
|
Experimental |
| 152 |
jayLEE0301/dhrl_official
Official code for "DHRL: A Graph-Based Approach for Long-Horizon and Sparse... |
|
Experimental |
| 153 |
lucadellalib/actorch
Deep reinforcement learning framework for fast prototyping based on PyTorch |
|
Experimental |
| 154 |
navneet-nmk/Pytorch-RL-CPP
A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch) |
|
Experimental |
| 155 |
VladGavra98/SERL
Safety-informed Evolutionary Reinforcement Learning applied to... |
|
Experimental |
| 156 |
teepanis/nonlinear-pendulum
Data and Code Availability -- Universal spectral structure in pendulum-like systems |
|
Experimental |
| 157 |
schmidtdominik/Rainbow
Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient... |
|
Experimental |
| 158 |
opium-sh/prl
Open-source library for a reinforcement learning research. |
|
Experimental |
| 159 |
harshaljanjani/taskschedulingdqn
Designing energy-aware scheduling and task allocation algorithms for online... |
|
Experimental |
| 160 |
goktug97/PEPG-ES
Python Implementation of Parameter-exploring Policy Gradients Evolution Strategy |
|
Experimental |
| 161 |
amr-khaled164/GRLMSL
🚀 Optimize microservice instance selection and load balancing in edge... |
|
Experimental |
| 162 |
Brownwang0426/Reversal-Generative-Reinforcement-Learning
A simple model-free and value-function-free reinforcement learning model |
|
Experimental |
| 163 |
nnaisense/pgpelib
A mini library for Policy Gradients with Parameter-based Exploration, with... |
|
Experimental |
| 164 |
AdamStelmaszczyk/dqn
TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay) |
|
Experimental |
| 165 |
jimimvp/torch_rl
Reinforcement learning library for PyTorch. |
|
Experimental |
| 166 |
NYU-MLDA/ABC-RL
This is work-in-progress (WIP) refactored implementation of... |
|
Experimental |
| 167 |
matthieu637/ddrl
Deep Developmental Reinforcement Learning |
|
Experimental |
| 168 |
haron1100/Upside-Down-Reinforcement-Learning
Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch |
|
Experimental |
| 169 |
lucaslingle/pytorch_rl2
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning' |
|
Experimental |
| 170 |
ialexmp/DRL-Generalization
Exploring Generalization in Deep Reinforcement Learning algorithms for... |
|
Experimental |
| 171 |
cgel/DRL
A collection of Deep Reinforcement Learning algorithms implemented in... |
|
Experimental |
| 172 |
FabioMiguel2000/LOA-feat.Reinforcement-Learning
Assigment 2 for Course L.EIC029 Artificial Intelligence, FEUP LEIC 3rd Year... |
|
Experimental |
| 173 |
Asap7772/PTR
This repository contains the implementation of the PTR algorithm described... |
|
Experimental |
| 174 |
CLAIRE-Labo/no-representation-no-trust
Codebase to fully reproduce the results of "No Representation, No Trust:... |
|
Experimental |
| 175 |
mbchang/decentralized-rl
Decentralized Reinforcment Learning: Global Decision-Making via Local... |
|
Experimental |
| 176 |
0xnu/deep-reinforcement-learning
Deep Reinforcement Learning (DRL) |
|
Experimental |
| 177 |
Skw3mdy/Reinforcement-Learning-Projects
🤖 Explore reinforcement learning techniques with projects including a taxi... |
|
Experimental |
| 178 |
linker81/Reinforcement-Learning-CheatSheet
Cheatsheet of Reinforcement Learning (Based on Sutton-Barto Book - 2nd Edition) |
|
Experimental |
| 179 |
Jaehyun-Jeong/100LinesRL
Clean RL algorithm implementations in under 100 lines each. |
|
Experimental |
| 180 |
Defenser1337/Reinforcement-learning-for-Gradient-descent
Application of reinforcement learning to train hyperparameters of gradient... |
|
Experimental |
| 181 |
voaneves/colab-rl
Keras implementation of the latest Reinforcement Learning algorithms, ready... |
|
Experimental |
| 182 |
Naighten/track-simulator
Код магистрантской дипломной работы студента НГТУ им Р.Е. Алексеева Жукова... |
|
Experimental |
| 183 |
kyegomez/HindsightReplay
My implementation of Hindsight replay in PyTorch: "Hindsight Experience Replay" |
|
Experimental |
| 184 |
ProfessorNova/PPO-Humanoid
PPO implementation for controlling a humanoid in Gymnasium's Mujoco... |
|
Experimental |
| 185 |
WinDerek/reinforce-py
Reinforcement learning agents in Python (dynamic programming,... |
|
Experimental |
| 186 |
Daraan/ray_utilities
ray & RLlib tools for unified code across different repositories.... |
|
Experimental |
| 187 |
Now-Join-Us/V0
The code repository for "$V_0$: A Generalist Value Model for Any Policy at... |
|
Experimental |
| 188 |
JeepWay/DeepPack
Unofficial implementation of DeepPack in PyTorch. DeepPack is a deep... |
|
Experimental |
| 189 |
enjeeneer/zero-shot-rl
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low... |
|
Experimental |
| 190 |
mindspore-courses/Rainbow-MindSpore
About Rainbow-MindSpore! A step-by-step tutorial from DQN to Rainbow |
|
Experimental |
| 191 |
davirenner88-rgb/LR-S
🚀 Emulate Arknights: Endfield servers with LR-S for seamless game... |
|
Experimental |
| 192 |
Space-Robotics-Laboratory/rlstar
RL STaR is a platform for creating AI for robotic applications. Researchers... |
|
Experimental |
| 193 |
motokiomura/Q-DOT
[RLC 2025] Official code repository for "Offline Reinforcement Learning with... |
|
Experimental |
| 194 |
KeepALifeUS/ml-dqn
Rainbow DQN: Double, Dueling, PER, Noisy Nets. Atari benchmarks. PyTorch. |
|
Experimental |
| 195 |
nunesma/reinforcement_learning
Deep reinforcement learning techniques for artificial intelligence project |
|
Experimental |
| 196 |
32olaa/reward-scope
🔍 Detect reward hacking in RL training with RewardScope. Track reward... |
|
Experimental |
| 197 |
HGVAbyte/rlhf-data-agent-full
🔍 Generate synthetic preference-ranked datasets for RLHF and DPO training,... |
|
Experimental |
| 198 |
AlirezaShamsoshoara/RL-from-zero
Comprehensive collection of reinforcement learning algorithms implemented... |
|
Experimental |
| 199 |
mlnjsh/Reinforcement_Learning_Projects
20 RL basics notebooks + 10 advanced projects with Streamlit apps covering... |
|
Experimental |
| 200 |
Rudge0/DynaMO-RL
Optimize policy learning by dynamically allocating rollouts and modulating... |
|
Experimental |
| 201 |
mercurycontaminated-sandarac557/KnapsackRL
🎯 Optimize exploration budgets in Reinforcement Learning with KnapsackRL for... |
|
Experimental |
| 202 |
rickstaa/stable-learning-control
A framework for training theoretically stable (and robust) Reinforcement... |
|
Experimental |
| 203 |
tartavull/alfredo
Relentlessly learning, persistently failing, but never surrendering. |
|
Experimental |
| 204 |
ErickRosete/tacorl
TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning |
|
Experimental |
| 205 |
rbosh/ml-adp
Approximate dynamic programming for stochastic optimal control in Pytorch |
|
Experimental |
| 206 |
zhuzhipeng-123/reinforce-study-for-mmm
Reinforcement Learning Research - Exploring RL algorithms in practical scenarios |
|
Experimental |
| 207 |
Yuxing-Wang-THU/Surrogate-assisted-ERL
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning |
|
Experimental |
| 208 |
dlb-rl/pulse-rl
Code for PulseRL: Enabling Offline Reinforcement Learning for Digital... |
|
Experimental |
| 209 |
NVlabs/RL-TNCO
RL-TNCO: A reinforcement learning algorithm for solving the tensor network... |
|
Experimental |
| 210 |
cubrink/mujoco-2.1-rl-project
Implementing Deep Reinforcement Learning Algorithms in Python for use in the... |
|
Experimental |
| 211 |
MiscellaneousStuff/tlol-rl
TLoL (Reinforcement Learning Python Module) - League of Legends RL Module... |
|
Experimental |
| 212 |
JaydenTeoh/MORL-Generalization
Benchmark for evaluating the generalization capabilities of Multi-Objective... |
|
Experimental |
| 213 |
enginBozkurt/Deep-Reinforcement-Learning-for-Enterprise-Nanodegree
Udacity Deep Reinforcement Learning for Enterprise Nanodegree Projects |
|
Experimental |
| 214 |
mohmdelsayed/TinyRL
Real-Time Deep RL That Fits in Small Devices |
|
Experimental |
| 215 |
shivakanthsujit/reducible-loss
Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss |
|
Experimental |
| 216 |
natetsang/open-rl
Implementations of a large collection of reinforcement learning algorithms. |
|
Experimental |
| 217 |
Uzi-gpu/reinforcement-learning
Reinforcement Learning projects with Q-Learning, Actor-Critic, and REINFORCE... |
|
Experimental |
| 218 |
Axel-Bravo/19_udacity_drlnd
Deep Reinforcement Learning Nanodregree from Udacity |
|
Experimental |
| 219 |
ugr-sail/paper-drl_building
Supplementary material to the paper "An experimental evaluation of Deep... |
|
Experimental |
| 220 |
HzcIrving/DLRL-PlayGround
The code repo contains multiple code reproduction processes of various SOTA... |
|
Experimental |
| 221 |
declanoller/cat_mouse_continuous_RL
Using DDPG and A2C reinforcement learning algorithms to solve a math puzzle |
|
Experimental |
| 222 |
sapanz/Udacity-deep-reinforcement-learning-solution
This repo will cover most of machine learning algorithms with coding examples. |
|
Experimental |
| 223 |
prototwin/RLExamples
PotoTwin Reinforcement Learning Examples |
|
Experimental |
| 224 |
dalmia/P2_Continuous_Control
My solution code for the second project of Udacity's Deep Reinforcement... |
|
Experimental |
| 225 |
trunghng/reinforcement_learning_an_introduction
Python Implementation for problems in Reinforcement Learning - An Introduction book |
|
Experimental |
| 226 |
iliasoroka1/GRU_Lyapunov_Spectrum
Lyapunov Spectrum for Double Pendulum using GRU |
|
Experimental |
| 227 |
bmazoure/ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned... |
|
Experimental |
| 228 |
rafelps/learning-recursive-goal-proposal
Learning Recursive Goal Proposal: A hierarchical Reinforcement Learning Approach |
|
Experimental |
| 229 |
dayyass/rllib
Reinforcement Learning Library. |
|
Experimental |
| 230 |
adaptive-intelligent-robotics/HTE
This is the repository for the paper Hierarchical Quality-Diversity for... |
|
Experimental |
| 231 |
soovittt/RL-Studio
A full-stack platform for designing reinforcement learning environments,... |
|
Experimental |
| 232 |
ankitsharma-tech/Deep-Reinforcement-Learning-With-Pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3. |
|
Experimental |
| 233 |
rorofaiz/awesome-RLVR-boundary
🔍 Explore curated resources on Reinforcement Learning with Verifiable... |
|
Experimental |
| 234 |
andranik-sahakyan/team-tron-rl
Multi-Agent Reinforcement Learning project exploring the emergence and... |
|
Experimental |
| 235 |
mindspore-courses/Deep-Reinforcement-Learning-Algorithms-with-MindSpore
MindSpore implementations of deep reinforcement learning algorithms and environments |
|
Experimental |
| 236 |
rStar-RL/LoongRL
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts... |
|
Experimental |
| 237 |
ARgruny/Deep-Reinforcement-Learning
Build and test DRL algorithms in different environments |
|
Experimental |
| 238 |
ashworks1706/kaelum
LATS-based inference with a reward model and online policy router across... |
|
Experimental |
| 239 |
mlnjsh/rl-book-labs
🎮 Interactive browser-based labs for "Complete Reinforcement Learning... |
|
Experimental |
| 240 |
GTR-GAMES/Deep-Hierarchical-Planning
🔍 Implement efficient long-horizon task planning with this PyTorch... |
|
Experimental |
| 241 |
victor369basu/MyosuiteDDQN
In this repository, we try to solve musculoskeletal tasks with `Double DQN... |
|
Experimental |
| 242 |
icaros-usc/dqd-rl
Official implementation of "Approximating Gradients for Differentiable... |
|
Experimental |
| 243 |
motokiomura/annealed-q-learning
[ICML 2025] Official code repository for "Gradual Transition from Bellman... |
|
Experimental |
| 244 |
liyan2015/SUMO-RL-MobiCharger
OpenAI-gym-like Reinforcement Learning environment for Dispatching of Mobile... |
|
Experimental |
| 245 |
xValentim/ReinforcementLearning_Zero_to_Hero_Course
In this repository you will learn all the basic math about Reinforcement... |
|
Experimental |
| 246 |
fareskhlifi/Intelligent-Scheduling-using-Reinforcement-learning-and-Deep-Q-Networks
Implementing a new environment in Gymnasium for intelligent schduling |
|
Experimental |
| 247 |
aminkhani/Deep-RL
You can see a reference for Books, Articles, Courses and Educational... |
|
Experimental |
| 248 |
snthomps/rlhf-ppo-pipeline
RLHF/PPO Training Pipeline with Performance Profiling and Optimization Demonstrations |
|
Experimental |
| 249 |
hmomin/PPO-Winter-Run
Trains an agent with Proximal Policy Optimization (PPO) to beat Winter Run |
|
Experimental |
| 250 |
bay3s/ppo-parallel
Parallelized implementation of Proximal Policy Optimization (PPO). |
|
Experimental |
| 251 |
undextrois/reinforcement-learning
RL Experiments and what not |
|
Experimental |
| 252 |
silviomori/udacity-deep-reinforcement-learning-p2-continuous-control
Create and train a double-jointed arm agent that is able to maintain its... |
|
Experimental |
| 253 |
mhahsler/Introduction_to_Reinforcement_Learning
Material for an introduction course to reinforcement learning for compute scientists |
|
Experimental |
| 254 |
Lare1998/rl-for-robotics
Reinforcement Learning applications for robotic control and task automation. |
|
Experimental |
| 255 |
yelurebajrang/HeteroRL_GEPO
⚡ Optimize heterogeneous reinforcement learning with GEPO for decentralized... |
|
Experimental |
| 256 |
Madid1976/reinforcement-learning-agents
Implementations of various reinforcement learning algorithms and agents for... |
|
Experimental |
| 257 |
kodok13/Label-Free-RLVR
📚 Explore a curated collection of research on Label-Free Reinforcement... |
|
Experimental |
| 258 |
a7med3laa/DRL-Books-resources
Deep Reinforcement Learning Books and links for studying |
|
Experimental |
| 259 |
julia-bel/MAPF_G2RL
Implementation of the G2RL approach in the POGEMA environment |
|
Experimental |
| 260 |
uzumstanley/DEEP-LEARNING
UNIVERSITY OF ROEHAMPTON LONDON |
|
Experimental |
| 261 |
Jcorrieri/multiagent-gridworld
Deep Reinforcement Learning for Multi-Robot Path Planning using PyTorch, Ray... |
|
Experimental |
| 262 |
BackpropTools/BackpropTools
A Fast, Portable Deep Reinforcement Learning Library for Continuous Control |
|
Experimental |
| 263 |
TroddenSpade/Exhaustive-Reinforcement-Learning
Exhaustive Implementation of Algorithms, Key Papers, and Well-Known Problems... |
|
Experimental |
| 264 |
PatrickSinger99/ReinforcementLearningInventoryManagement
Repository for my bachelor thesis on inventory management in a logistics... |
|
Experimental |
| 265 |
Aryia-Behroziuan/Robot-learning
In developmental robotics, robot learning algorithms generate their own... |
|
Experimental |
| 266 |
AndersonPeng/ppo_tutorial
PPO pytorch tutorial for continuous control (BipedalWalker-v3) |
|
Experimental |
| 267 |
manjavacas/rl-temario
Temario sobre aprendizaje por refuerzo en español. |
|
Experimental |
| 268 |
mbar0075/Advanced-Reinforcement-Learning
Deliverables relating to the Advanced Reinforcement Learning University Unit |
|
Experimental |
| 269 |
gabotechs/lazaro
Reinforcement learning framework for implementing custom models on custom... |
|
Experimental |
| 270 |
PathumDilhara/RL-agent-for-CNN-hyper-parameter-optimization
A reinforcement learning (RL) based agent that automatically tunes... |
|
Experimental |
| 271 |
brianspiering/rl-course
Applied Reinforcement Learning course |
|
Experimental |
| 272 |
CarsonScott/Dual-Process-Reinforcement
An intelligent agent that adaptively changes its thought processes to... |
|
Experimental |
| 273 |
openpsi-projects/srl
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores |
|
Experimental |
| 274 |
TroddenSpade/Maximum-Entropy-Deep-IRL
Implementations of Maximum Entropy Algorithms for solving Inverse... |
|
Experimental |
| 275 |
MatTheTab/GHOST_RL_materials
Materials for Reinforcement Learning and Machine Learning in games for GHOST. |
|
Experimental |
| 276 |
ArdavanKhalij/RL-Seminar-Project
This project is the project of RL course at Vrije Universiteit Brussels and... |
|
Experimental |
| 277 |
Develop-Packt/Building-an-Artificial-Intelligence-Algorithm
Learn how to build a machine learning mode and get started on the popular... |
|
Experimental |
| 278 |
TroddenSpade/Meta-Reinforcement-Learning
Code snippets of Meta Reinforcement Learning algorithms |
|
Experimental |
| 279 |
baekbyte/NormLayer
A Python SDK that enforces behavioral policies between agents at runtime in... |
|
Experimental |
| 280 |
Tahernezhad/Continuous-Control-Workbench
A clean PyTorch implementation of PPO, SAC, and TD3 made from scratch. It is... |
|
Experimental |
| 281 |
zchuning/repo
Resilient Model-Based RL by Regularizing Posterior Predictability |
|
Experimental |
| 282 |
thevilledev/elements-of-ai-idea
Project pitch on using reinforcement learning for resource scheduling |
|
Experimental |
| 283 |
yamatokataoka/learning-from-human-preferences
Replication of Deep Reinforcement Learning from Human Preferences... |
|
Experimental |
| 284 |
Bonifatius94/rl-algos
SOTA Reinforcement Learning Algorithms |
|
Experimental |
| 285 |
ArdavanKhalij/MDP
machine-learning reinforcement-learning artificial-intelligence... |
|
Experimental |
| 286 |
mohamedrxo/ppo
A comprehensive repository for training OpenAI Gym environments using... |
|
Experimental |
| 287 |
Saifahmadkhan/PlugNPlay
This library is a PlugNPlay version of our novel pipeline VacSIM. We have... |
|
Experimental |
| 288 |
Talendar/pyderl
Simple Deep Reinforcement Learning framework for Python. |
|
Experimental |
| 289 |
micdestefano/micppo
An implementation of Proximal Policy Optimization (PPO) |
|
Experimental |
| 290 |
PieroMacaluso/collaboration-n-competition
Implementation of Multi-Agent Deep Deterministic Policy Gradient (MADDPG)... |
|
Experimental |