vietnh1009/Super-mario-bros-PPO-pytorch

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

/ 100

Established

Implements a deep reinforcement learning agent using PyTorch with an actor-critic architecture, achieving 31/32 level completion across the full Super Mario Bros NES game by leveraging PPO's clipped objective function for stable policy updates. Integrates with the OpenAI Gym environment wrapper for NES emulation and employs CNN feature extraction from raw pixel inputs paired with separate policy and value networks. Training supports per-level hyperparameter tuning and Docker containerization for reproducible GPU-accelerated training workflows.

1,268 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25

How are scores calculated?

Stars

1,268

Forks

236

Language

Python

License

MIT

Related frameworks

taherfattahi/ppo-rocket-landing

Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket...

fvalka/atc-reinforcement-learning

Reinforcement learning for an air traffic control task. OpenAI gym based simulation.

sdsubhajitdas/Rocket_Lander_Gym

💥💥 This is a easy installable extension for OpenAi Gym Environment. This simulates SpaceX Falcon landing.

juliankappler/lunar-lander

Implementation of deep reinforcement learning algorithms for training an agent to play the game...

anh-nn01/Lunar-Lander-Double-Deep-Q-Networks

An AI agent that use Double Deep Q-learning to teach itself to land a Lunar Lander on OpenAI universe

Explore ML Frameworks

All categories Trending ML Framework directory Insights