CartPole Reinforcement Learning ML Frameworks
Educational implementations of reinforcement learning algorithms (DQN, SARSA, Q-Learning, A2C, DDPG) applied specifically to the CartPole control problem. Does NOT include general RL frameworks, other environments/benchmarks, or non-RL control methods.
34 CartPole reinforcement learning frameworks are tracked. The highest-rated is WilliamLwj/PyXAB, scoring 48/100 with 127 stars.
Get all 34 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=cartpole-reinforcement-learning&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
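The same request can be sketched in Python using only the standard library. The endpoint and query parameters are copied from the curl command above; the helper names `build_url` and `fetch_projects` are illustrative and not part of any published client. The shape of the JSON response is not documented here, so the sketch returns the parsed payload without assuming any fields. The curl example passes `limit=20`; whether a larger value is accepted is not stated on this page.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken verbatim from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="cartpole-reinforcement-learning",
              limit=20):
    """Build the request URL with the same query parameters as the curl call."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and parse the JSON payload (hypothetical helper).

    No response fields are assumed; the caller inspects the result.
    """
    with urllib.request.urlopen(build_url(limit=limit), timeout=timeout) as resp:
        return json.load(resp)


# Example usage (performs a network request, counts against the daily quota):
# data = fetch_projects()
# print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes the query easy to inspect or adjust before spending one of the 100 daily keyless requests.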
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 | WilliamLwj/PyXAB: PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization... | | Emerging |
| 2 | jekyllstein/Reinforcement-Learning-Sutton-Barto-Exercise-Solutions: Chapter notes and exercise solutions for Reinforcement Learning: An... | | Emerging |
| 3 | cfoh/Multi-Armed-Bandit-Example: Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB,... | | Emerging |
| 4 | hypnosapos/cartpole-rl-remote: CartPole game by Reinforcement Learning, a journey from training to inference | | Emerging |
| 5 | matteocasolari/reinforcement-learning-an-introduction-solutions: Implementations for solutions to programming exercises of Reinforcement... | | Emerging |
| 6 | BY571/Upside-Down-Reinforcement-Learning: Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on... | | Emerging |
| 7 | iamhectorotero/rlai-exercises: Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition] | | Emerging |
| 8 | gerdm/reinforcement-learning: Repository of notes, code and notebooks in Python for the book... | | Experimental |
| 9 | dynamicslab/MultiArm-Pendulum: This repository is for our paper: "The Experimental Multi-Arm Pendulum on a... | | Experimental |
| 10 | ocraft/rl-sandbox: Selected algorithms and exercises from the book Sutton, R. S. & Barton, A.:... | | Experimental |
| 11 | nicklashansen/reinforcement-learning-sutton-barto: Personal repository for course on reinforcement learning. Includes... | | Experimental |
| 12 | thetawom/mabby: A multi-armed bandit (MAB) simulation library in Python | | Experimental |
| 13 | Maskiabdo97/cartpole-ts: 🤖 Control a cart-pole system using TypeScript with a focus on implementing... | | Experimental |
| 14 | bprabhakar/upside-down-reinforcement-learning: Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by... | | Experimental |
| 15 | singhsidhukuldeep/contextual-bandits: A comprehensive Python library implementing a variety of contextual and... | | Experimental |
| 16 | Nth-iteration-labs/streamingbandit-ui: Client that handles the administration of StreamingBandit online, or... | | Experimental |
| 17 | marlesson/meta-bandit-selector: The Contextual Meta-Bandit (CMB) can be used to select models using the... | | Experimental |
| 18 | kambhampati-vijaya-sri-vyshnavi-devi89/dqn-rl-agent: DQN agent solving CartPole-v1 and LunarLander-v2 with Experience Replay,... | | Experimental |
| 19 | pacalab/rl_sutton_barto: Reinforcement Learning (Sutton, Barto) - solved exercises | | Experimental |
| 20 | victor-iyi/multi-armed-bandit-with-policy-gradient: A multi armed bandit Reinforcement learning problem using Policy Gradient. | | Experimental |
| 21 | mtichikawa/bandit-ab-testing: Multi-armed bandit framework for adaptive A/B testing (Thompson Sampling,... | | Experimental |
| 22 | oalvarobraz/pytorch-cartpole-rl: A from-scratch Deep Reinforcement Learning (DQN) agent built with PyTorch to... | | Experimental |
| 23 | navidadkhah/CartPole-V1: CartPole problem solved using two Reinforcement learning algorithms (DQN and... | | Experimental |
| 24 | SanketAgrawal/ReinforcementLearning: Chapter wise implementation & analysis of all the algorithms in RL : An... | | Experimental |
| 25 | cezarbulancea/CartPole-RL: Implementation of several RL algorithms on the CartPole-v1 environment. | | Experimental |
| 26 | rmitsuboshi/bandit: A small collection of Bandit algorithms, written in Rust 🦀. | | Experimental |
| 27 | shc33/Actor-critic-learning-for-cartpole-with-MATLAB: Actor-critic learning for the OpenAI Gym cartpole game with MATLAB | | Experimental |
| 28 | shc33/Deep-Q-Learning-for-cartpole-with-MATLAB: Deep Q-Learning for the OpenAI Gym cartpole game with MATLAB | | Experimental |
| 29 | shc33/Deep-Policy-Gradient-learning-for-cartpole-with-MATLAB: Deep Policy Gradient learning for the OpenAI Gym cartpole game with MATLAB | | Experimental |
| 30 | MikiTwenty/cart-pole-agent: Personal Project | | Experimental |
| 31 | cezarbulancea/Multi-Armed-Bandits: Implementation of several multi-armed bandit problems. | | Experimental |
| 32 | gunh0/reinforcement-learning-cartpole-balancing: 📢 2019 Microsoft Student Partners (MSP) Evangelism Seminar - 2019.03.31 | | Experimental |
| 33 | bcorfman/sb3-trial: Stable Baselines 3 Cartpole example configured with Rye as dependency manager. | | Experimental |
| 34 | iiShreya/cartPoleEnv_hillClimbingAlgo: Hill Climbing Algorithm implemented for the Cart Pole Environment. | | Experimental |