werner-duvaud/muzero-general

MuZero

/ 100

Established

Implements the DeepMind MuZero algorithm using PyTorch with residual and fully-connected networks, enabling model-based RL without environment dynamics knowledge. Supports distributed training across multiple GPUs and Ray clusters with asynchronous self-play, plus real-time TensorBoard monitoring. Designed for quick adaptation to new environments—board games, Atari, and OpenAI Gym—by simply defining game classes and hyperparameters.

2,784 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

How are scores calculated?

Stars

2,784

Forks

670

Language

Python

License

MIT

Compare

muzero-general and muzero muzero-general and muzero-cpp

Related frameworks

jonathan-laurent/AlphaZero.jl

A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.

mokemokechicken/reversi-alpha-zero

Reversi reinforcement learning by AlphaGo Zero methods.

suragnair/alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial +...

NeymarL/ChineseChess-AlphaZero

Implement AlphaZero/AlphaGo Zero methods on Chinese chess.

DHDev0/Stochastic-muzero

Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of...

Explore ML Frameworks

All categories Trending ML Framework directory Insights