MahanVeisi8/RL_practices
Collection of my Reinforcement Learning (RL) practices including DQN, D3QN, and Adaptive Gamma, applied to the Lunar Lander and CartPole environments. đđšī¸
No commits in the last 6 months.
Stars
16
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Oct 21, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/MahanVeisi8/RL_practices"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
vietnh1009/Super-mario-bros-PPO-pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
taherfattahi/ppo-rocket-landing
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket...
anh-nn01/Lunar-Lander-Double-Deep-Q-Networks
An AI agent that use Double Deep Q-learning to teach itself to land a Lunar Lander on OpenAI universe
Itomigna2/Muesli-lunarlander
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
fvalka/atc-reinforcement-learning
Reinforcement learning for an air traffic control task. OpenAI gym based simulation.