CartPole Reinforcement Learning ML Frameworks

Educational implementations of reinforcement learning algorithms (DQN, SARSA, Q-Learning, A2C, DDPG) applied specifically to the CartPole control problem. Does NOT include general RL frameworks, other environments/benchmarks, or non-RL control methods.

There are 34 cartpole reinforcement learning frameworks tracked. The highest-rated is WilliamLwj/PyXAB at 48/100 with 127 stars.

Get all 34 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=cartpole-reinforcement-learning&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Framework	Score	Tier	Stars	Language
1	WilliamLwj/PyXAB PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization...	48	Emerging	127	Python
2	jekyllstein/Reinforcement-Learning-Sutton-Barto-Exercise-Solutions Chapter notes and exercise solutions for Reinforcement Learning: An...	40	Emerging	49	Julia
3	cfoh/Multi-Armed-Bandit-Example Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB,...	37	Emerging	45	Python
4	hypnosapos/cartpole-rl-remote CartPole game by Reinforcement Learning, a journey from training to inference	36	Emerging	25	Python
5	matteocasolari/reinforcement-learning-an-introduction-solutions Implementations for solutions to programming exercises of Reinforcement...	34	Emerging	34	Python
6	BY571/Upside-Down-Reinforcement-Learning Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on...	34	Emerging	78	Jupyter Notebook
7	iamhectorotero/rlai-exercises Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]	32	Emerging	155	Jupyter Notebook
8	gerdm/reinforcement-learning Repository of notes, code and notebooks in Python for the book...	29	Experimental	37	Jupyter Notebook
9	dynamicslab/MultiArm-Pendulum This repository is for our paper: "The Experimental Multi-Arm Pendulum on a...	27	Experimental	19	MATLAB
10	ocraft/rl-sandbox Selected algorithms and exercises from the book Sutton, R. S. & Barton, A.:...	27	Experimental	6	Python
11	nicklashansen/reinforcement-learning-sutton-barto Personal repository for course on reinforcement learning. Includes...	23	Experimental	2	Jupyter Notebook
12	thetawom/mabby A multi-armed bandit (MAB) simulation library in Python	22	Experimental	9	Python
13	Maskiabdo97/cartpole-ts 🤖 Control a cart-pole system using TypeScript with a focus on implementing...	22	Experimental	—	TypeScript
14	bprabhakar/upside-down-reinforcement-learning Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by...	21	Experimental	11	Jupyter Notebook
15	singhsidhukuldeep/contextual-bandits A comprehensive Python library implementing a variety of contextual and...	20	Experimental	13	Python
16	Nth-iteration-labs/streamingbandit-ui Client that handles the administration of StreamingBandit online, or...	20	Experimental	8	JavaScript
17	marlesson/meta-bandit-selector The Contextual Meta-Bandit (CMB) can be used to select models using the...	20	Experimental	9	Jupyter Notebook
18	kambhampati-vijaya-sri-vyshnavi-devi89/dqn-rl-agent DQN agent solving CartPole-v1 and LunarLander-v2 with Experience Replay,...	19	Experimental	—	HTML
19	pacalab/rl_sutton_barto Reinforcement Learning (Sutton, Barto) - solved exercises	15	Experimental	2	Jupyter Notebook
20	victor-iyi/multi-armed-bandit-with-policy-gradient A multi armed bandit Reinforcement learning problem using Policy Gradient.	14	Experimental	9	Jupyter Notebook
21	mtichikawa/bandit-ab-testing Multi-armed bandit framework for adaptive A/B testing (Thompson Sampling,...	14	Experimental	—	Jupyter Notebook
22	oalvarobraz/pytorch-cartpole-rl A from-scratch Deep Reinforcement Learning (DQN) agent built with PyTorch to...	14	Experimental	—	Python
23	navidadkhah/CartPole-V1 CartPole problem solved using two Reinforcement learning algorithms (DQN and...	13	Experimental	6	Python
24	SanketAgrawal/ReinforcementLearning Chapter wise implementation & analysis of all the algorithms in RL : An...	12	Experimental	3	Jupyter Notebook
25	cezarbulancea/CartPole-RL Implementation of several RL algorithms on the CartPole-v1 environment.	12	Experimental	4	Python
26	rmitsuboshi/bandit A small collection of Bandit algorithms, written in Rust 🦀.	12	Experimental	3	Rust
27	shc33/Actor-critic-learning-for-cartpole-with-MATLAB Actor-critic learning for the OpenAI Gym cartpole game with MATLAB	11	Experimental	—	MATLAB
28	shc33/Deep-Q-Learning-for-cartpole-with-MATLAB Deep Q-Learning for the OpenAI Gym cartpole game with MATLAB	11	Experimental	—	MATLAB
29	shc33/Deep-Policy-Gradient-learning-for-cartpole-with-MATLAB Deep Policy Gradient learning for the OpenAI Gym cartpole game with MATLAB	11	Experimental	—	MATLAB
30	MikiTwenty/cart-pole-agent Personal Project	11	Experimental	—	Jupyter Notebook
31	cezarbulancea/Multi-Armed-Bandits Implementation of several multi-armed bandit problems.	11	Experimental	2	Python
32	gunh0/reinforcement-learning-cartpole-balancing 📢 2019 Microsoft Student Partners (MSP) Evangelism Seminar - 2019.03.31	11	Experimental	2	Jupyter Notebook
33	bcorfman/sb3-trial Stable Baselines 3 Cartpole example configured with Rye as dependency manager.	10	Experimental	1	Makefile
34	iiShreya/cartPoleEnv_hillClimbingAlgo Hill Climbing Algorithm implemented for the Cart Pole Environment.	10	Experimental	1	Jupyter Notebook

Comparisons in this category

reinforcement-learning-an-introduction-solutions and rlai-exercises (34 vs 32) rlai-exercises and rl-sandbox (32 vs 27) Upside-Down-Reinforcement-Learning and upside-down-reinforcement-learning (34 vs 21) reinforcement-learning-an-introduction-solutions and rl-sandbox (34 vs 27)