Aerial Robot Reinforcement Learning ML Frameworks
Projects applying reinforcement learning algorithms (DQN, Q-Learning, DDPG, Actor-Critic, TD3, SAC) to train autonomous aerial vehicles (quadcopters, drones, UAVs) for navigation, control, trajectory planning, and delivery tasks. Does NOT include general robotics RL, non-aerial vehicle control, or RL frameworks without an aerial/drone application focus.
There are 112 aerial robot reinforcement learning frameworks tracked. Three score above 50 (established tier). The highest-rated is LucasAlegre/sumo-rl at 69/100 with 1,002 stars. One of the top 10 is actively maintained.
Get all 112 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=aerial-robot-reinforcement-learning&limit=20"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
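The same request can be made from a script. The following is a minimal sketch using only the Python standard library. The endpoint and query parameters are copied from the curl command above; the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the shape of the returned JSON is not documented here, so the sketch returns the parsed payload without assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken verbatim from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(domain="ml-frameworks",
                      subcategory="aerial-robot-reinforcement-learning",
                      limit=20):
    """Build the request URL with the same query parameters as the curl call."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_quality(url, timeout=10):
    """Send a GET request and return the decoded JSON payload.

    The response schema is an unknown here, so the payload is returned
    as-is (whatever json.load produces) for the caller to inspect.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality(build_quality_url())` performs one request, which counts against the daily quota described above. Keeping URL construction separate from the network call makes it possible to check the query string without spending a request.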
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 | LucasAlegre/sumo-rl: Reinforcement Learning environments for Traffic Signal Control with SUMO.... | 69 | Established |
| 2 | hilo-mpc/hilo-mpc: HILO-MPC is a Python toolbox for easy, flexible and fast development of... | | Established |
| 3 | kyegomez/RoboCAT: Implementation of Deepmind's RoboCat: "Self-Improving Foundation Agent for... | | Established |
| 4 | reiniscimurs/DRL-robot-navigation: Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo... | | Emerging |
| 5 | OpenQuadruped/spot_mini_mini: Dynamics and Domain Randomized Gait Modulation with Bezier Curves for... | | Emerging |
| 6 | cbfinn/gps: Guided Policy Search | | Emerging |
| 7 | MishaLaskin/curl: CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient... | | Emerging |
| 8 | avisingh599/reward-learning-rl: [RSS 2019] End-to-End Robotic Reinforcement Learning without Reward Engineering | | Emerging |
| 9 | adham-elarabawy/open-quadruped: An open-source 3D-printed quadrupedal robot. Intuitive gait generation... | | Emerging |
| 10 | jiachenli94/Awesome-Decision-Making-Reinforcement-Learning: A selection of state-of-the-art research materials on decision making and... | | Emerging |
| 11 | jhu-lcsr/good_robot: "Good Robot! Now Watch This!": Repurposing Reinforcement Learning for... | | Emerging |
| 12 | Weizhe-Chen/PyPolo: A Python library for Robotic Information Gathering | | Emerging |
| 13 | fangvv/UAV-DDPG: Code for paper "Computation Offloading Optimization for UAV-assisted Mobile... | | Emerging |
| 14 | anassinator/dqn-obstacle-avoidance: Deep Reinforcement Learning for Fixed-Wing Flight Control with Deep Q-Network | | Emerging |
| 15 | MorvanZhou/train-robot-arm-from-scratch: Build environment and train a robot arm from scratch (Reinforcement Learning) | | Emerging |
| 16 | sacktock/MASA-Safe-RL: MASA-Safe-RL: A safe reinforcement library for providing a common interface... | | Emerging |
| 17 | UT-HCRL/LEGATO: Official codebase for LEGATO (Learning with a Handheld Grasping Tool) | | Emerging |
| 18 | udacity/RL-Quadcopter: Teach a Quadcopter How to Fly! | | Emerging |
| 19 | TRI-ML/RAP: This is the official code for the paper RAP: Risk-Aware Prediction for... | | Emerging |
| 20 | nikhilbarhate99/min-decision-transformer: Minimal implementation of Decision Transformer: Reinforcement Learning via... | | Emerging |
| 21 | UT-Austin-RPL/TRILL: Official codebase for TRILL (Teleoperation and Imitation Learning for... | | Emerging |
| 22 | reiniscimurs/DRL-Robot-Navigation-ROS2: Deep Reinforcement Learning for mobile robot navigation in ROS2 Gazebo... | | Emerging |
| 23 | roboterax/humanoid-gym: Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot... | | Emerging |
| 24 | JdeRobot/RL-Studio: Robotic library for the training of Reinforcement Learning algorithms | | Emerging |
| 25 | SatCom-TELMA/MA-DRL_Routing_Simulator: Multi-Agent Deep Reinforcement Learning (MA-DRL) Routing Simulator for... | | Emerging |
| 26 | John-Wendell/DDPG-AirSim-Drone-Obstacle-Avoidance: Using DDPG and ConvLSTM to control a drone to avoid obstacle in AirSim | | Emerging |
| 27 | mihdalal/raps: [NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning... | | Emerging |
| 28 | danijar/director: Deep Hierarchical Planning from Pixels | | Emerging |
| 29 | QuadCtrl/quad-ctrl: Quadcopter Controller with Deep Reinforcement Learning | | Emerging |
| 30 | jackvial/lerobot-data-studio: LeRobot Data Studio - Unofficial LeRobot Dataset Editor | | Emerging |
| 31 | samholt/NeuralLaplaceControl: Neural Laplace Control for Continuous-time Delayed Systems - an offline RL... | | Emerging |
| 32 | AlgTUDelft/AlwaysSafe: Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety... | | Emerging |
| 33 | Rolv-Arild/replay-pretraining: Rocket League pretraining from replay files | | Emerging |
| 34 | Zhang-Xuewen/Deep-DeePC: This project is source code of paper Deep DeePC: Data-enabled predictive... | | Emerging |
| 35 | DevMilk/UAV-Based-Cellular-Communication-Multi-Agent-DRL-Solution: UAV-based Cellular-Communication: Multi-Agent Deep Reinforcement Learning... | | Emerging |
| 36 | Armandpl/furuta: Building and Training a Rotary Inverted Pendulum robot | | Emerging |
| 37 | Jamesscn/Quadcopter: My master's degree project, titled Quadcopter Control with Deep... | | Experimental |
| 38 | harvard-edge/AirLearning: Public repository for Air Learning project | | Experimental |
| 39 | argmax-ai/drone-hardware: Robust Quadrotor Frame for Machine Learning and Control Research | | Experimental |
| 40 | smidmatej/mpc_quad_ros: Model Predictive Controller for a quadcopter model using online learning... | | Experimental |
| 41 | AiltonOliveir/RL-env-for-communications: Reinforcement learning environment for MIMO communications. | | Experimental |
| 42 | bytedance/GR-MG: Official implementation of GR-MG | | Experimental |
| 43 | dcaffo98/path-planning-cnn: Solving synthetic 2d path-planning problems with a convolutional neural network. | | Experimental |
| 44 | MehdiShahbazi/Webots-reinforcement-navigation: Implementing obstacle avoidance and path planning for the Pioneer 3-DX robot... | | Experimental |
| 45 | UT-Austin-RPL/PRELUDE: Official codebase for PRELUDE (Perceptive Locomotion Under Dynamic Environments) | | Experimental |
| 46 | letsgogeeky/QuadCopter-RL: Teaching a QuadCopter to TakeOff and Land using Reinforcement Learning | | Experimental |
| 47 | ami-iit/paper_romualdi_viceconte_2024_humanoids_dnn-mpc-walking: [Humanoids 2024 award finalist] Online DNN-Driven Nonlinear MPC for... | | Experimental |
| 48 | CJReinforce/RIME_ICML2024: Official code for ICML 2024 paper, "RIME: Robust Preference-based... | | Experimental |
| 49 | ifrunistuttgart/RL_Integrated-Updraft-Exploitation: This repository includes a reinforcement learning framework for end-to-end... | | Experimental |
| 50 | rameshvarun/NeuralKart: A Real-time Mario Kart 64 AI using ConvNets. | | Experimental |
| 51 | VanIseghemThomas/AI-Parking-Unity: A RL project focussed on autonomous parking, using Unity's MLAgents toolkit. | | Experimental |
| 52 | dion-jy/learning-based-navigation-papers: paper-list of learning-based mobile navigation(especially motion planning &... | | Experimental |
| 53 | BreenSammy/learning-corridor-selection: Deep-learning model for selection of high-level corridors in motion planning | | Experimental |
| 54 | MarkFzp/Deep-Whole-Body-Control: [CoRL 2022] Deep Whole-Body Control: Learning a Unified Policy for... | | Experimental |
| 55 | Tushar-ml/G2RL-Path-Planning: Code for G2RL to solve the multi-robot path planning problem in a fully... | | Experimental |
| 56 | Amos-Chen98/neo-planner: [IROS'25] Learning to Initialize Trajectory Optimization for Vision-Based... | | Experimental |
| 57 | DeepDynaSim/DeepLyapunovFunction: Efficient Computation of Lyapunov Functions Using Deep Neural Networks for... | | Experimental |
| 58 | AxelBcr/QDroneNav: Drone Project Using Q-Learning : Helping a Drone find a target. Core... | | Experimental |
| 59 | isri-aist/DataDrivenMPC: Model predictive control based on data-driven model | | Experimental |
| 60 | AGiannoutsos/car_racer_gym: Apply major Reinforcement Learning algorithms (DQN,PPO,A2C) to CarRacing-v0... | | Experimental |
| 61 | capstone-insper/drone-swarm-search-algorithms: Algorithms to solve the DSSE environment, focusing on optimizing drone swarm... | | Experimental |
| 62 | vishweshvhavle/deep-rl-navigation: Deep Reinforcement Learning for mobile robot navigation in ROS2 Gazebo... | | Experimental |
| 63 | WangXiaoMingo/TensorDL-MPC: DL-MPC(deep learning model predictive control) is a software toolkit... | | Experimental |
| 64 | SpaciousCoder78/congestion-aware-sdn: Adaptive Congestion-Aware Routing in Software-Defined Networks Using... | | Experimental |
| 65 | parachutel/Q-Learning-for-Intelligent-Driver: We propose a driver modeling process of an intelligent autonomous driving... | | Experimental |
| 66 | 7enTropy7/Racer_AI: Developed a highly customizable OpenAI gym environment and trained a... | | Experimental |
| 67 | TUMFTM/munich-sumo-scenario: Calibrated SUMO road network of Munich city center (11.5 km²)... | | Experimental |
| 68 | charbel-a-hC/SKIPP: Repository for the End-to-end Sketch-Guided Path Planning through Imitation... | | Experimental |
| 69 | Taaseen-Ali/OpenAI-Gym-Car-Race: A self-driving car OpenAI Gym environment | | Experimental |
| 70 | theguega/mpc_neural_surrogate: Behavior Cloning of MPC for 3-DOF Robotic Manipulator using NN | | Experimental |
| 71 | silaozgel/Autonomous-AI-Parking-Simulation: A comparison of A* and Q-Learning algorithms in a dynamic 15x15 parking environment. | | Experimental |
| 72 | ZikangXiong/mobrob: Mobile Robot Control via Goal-Conditioned Reinforcement Learning | | Experimental |
| 73 | lbnmahs/quadrrl: Quadruped Robot Locomotion Learning and Evaluation Suite. | | Experimental |
| 74 | RaffaeleGalliera/marlin-rlcc: Reinforcement Learning environment for Congestion Control with ContainerNet | | Experimental |
| 75 | RAIL-group/RAIL-group-software: Robot planning and learning research code repository from the RAIL Group at GMU | | Experimental |
| 76 | danliukuri/AutonomousParking: Simulation of car parking in different parking lots using Unity ML-Agents | | Experimental |
| 77 | SMARTlab-Purdue/SAN-FAPL: This repository contains the source code for our paper: "Feedback-efficient... | | Experimental |
| 78 | skypitcher/risk_aware_marl: Risk-aware multi-agent deep reinforcement learning for packet routing in... | | Experimental |
| 79 | antoineleeman/SLS_safety_filter: Matlab implementation of the paper "Predictive Safety Filter using System... | | Experimental |
| 80 | Scala-Robotics-Simulator/PPS-22-srs: A robotics simulator written in scala. | | Experimental |
| 81 | d3ac/MetaRL-for-UAV-Anti-jamming: Patent : An anti-jamming communication method for unmanned cluster based on... | | Experimental |
| 82 | kiwi-sherbet/PRESTO: Official codebase for PRESTO (Planning with Environment Representation,... | | Experimental |
| 83 | lambdavi/path_planning_drl: Researching new methods to leverage DeepRL for Coverage and Path Planning | | Experimental |
| 84 | Emmanuel-Naive/MATD3: Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3)... | | Experimental |
| 85 | zerosansan/dqn_qlearning_sarsa_mobile_robot_navigation: A Reinforcement Learning (RL) based navigation implementation for mobile... | | Experimental |
| 86 | our-projects-github/Safe-Deep-Learning-Based-Global-Path-Planning-Using-a-Fast-Collision-Free-Path-Generator: Implementation of "Safe Deep Learning-Based Global Path Planning Using a... | | Experimental |
| 87 | AlinaBaber/ReinforcementLearning-QLearning-based-self-tuned-PID-controller-for-AUV-MatLab: This repository showcases a hybrid control system combining Reinforcement... | | Experimental |
| 88 | sgawalsh/dqnTurtlebot: Implementing Deep-Q-Learning to train a bot to navigate an environment with obstacles | | Experimental |
| 89 | ZeinBarhoum/RL-quadrotor: Reinforcement Learning for quadrotor trajectory planning and control | | Experimental |
| 90 | gunjanmimo/DRONE-CONTROLLER: Drone Controller Physics Unity3D | | Experimental |
| 91 | MPC-Berkeley/Implicit-Game-Theoretic-MPC: Implicit Game-Theoretic MPC | | Experimental |
| 92 | newton-adhikari/rl_goal_nav_tb3: TurtleBot3 autonomous navigation using Deep Reinforcement Learning (DRL) in... | | Experimental |
| 93 | jeffdacpano28/self-driving-car-rl: 🏎️ Train autonomous racing cars with DQN and PPO in a realistic simulator... | | Experimental |
| 94 | sparkup/autonomous-spacecraft-rl: Autonomous spacecraft reinforcement learning project focused on mission... | | Experimental |
| 95 | AlessioLuciani/distributed-uav-rl-protocol: An implementation of a distributed protocol for cooperative sensing and... | | Experimental |
| 96 | utra-robosoccer/Bez_IsaacGym: Isaac Gym Reinforcement Learning Environments for humanoid robot Bez | | Experimental |
| 97 | ifrunistuttgart/RL_CrossCountrySoaring: This repository includes a reinforcement learning framework for solving the... | | Experimental |
| 98 | Amanuel-1/autonomous-drone: A 3D autonomous drone simulation with AI-powered flight capabilities using... | | Experimental |
| 99 | MickyasTA/DRL_robot_navigation_ros2: TD3-based Deep Reinforcement learning-based collision avoidance of mobile... | | Experimental |
| 100 | NikoRina/real2sim-eval: 🤖 Evaluate real robot policies using Gaussian splatting for soft-body... | | Experimental |
| 101 | BOODY2209/anymal_c_velocity: 🚶♂️ Train ANYmal C to walk and track body velocities by integrating a custom... | | Experimental |
| 102 | FerhatAkalan/DroneDeliverySystemQLearning: This project is an intelligent drone simulator that optimizes urban package... | | Experimental |
| 103 | Engineering-Geek/RL-UAV: Reinforcement Learning for Unmanned Aerial Vehicles | | Experimental |
| 104 | arjun7579/maddpg-drone-coverage: Implementation of a multi-agent UAV swarm system using MADDPG. Drones learn... | | Experimental |
| 105 | AndyRay1998/U-model-based-Adaptive-Sliding-Mode-Control-Using-a-Deep-Deterministic-Policy-Gradient: Personal research topic about U-model control and Sliding Mode Control tuned... | | Experimental |
| 106 | saqib1707/RL-Robot-Manipulation: Inverse Reinforcement Learning for Robot Hand Manipulation Task | | Experimental |
| 107 | erfan-ashtari/Path-planning: Implementation of "Safe Deep Learning-Based Global Path Planning Using a... | | Experimental |
| 108 | FerhatAkalan/DroneDeliverySystemDQN: This project is an intelligent drone simulator that optimizes urban package... | | Experimental |
| 109 | Aaronaferns/MBRL-DeepMPC: Efficient Model-Based Deep Reinforcement Learning with Predictive Control:... | | Experimental |
| 110 | Nikunj-Gupta/conformal-agent-modelling: CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning | | Experimental |
| 111 | AidenPQ/exoskeleton-high-level-control-ml: DNN + Gaussian Process Regression for hip/knee trajectory generation... | | Experimental |
| 112 | gregora/Drone-AI: A project for training a neural network using genetic algorithm to control a drone | | Experimental |