werner-duvaud/muzero-general
MuZero
Implements the DeepMind MuZero algorithm using PyTorch with residual and fully-connected networks, enabling model-based RL without environment dynamics knowledge. Supports distributed training across multiple GPUs and Ray clusters with asynchronous self-play, plus real-time TensorBoard monitoring. Designed for quick adaptation to new environments—board games, Atari, and OpenAI Gym—by simply defining game classes and hyperparameters.
2,784 stars. No commits in the last 6 months.
Stars
2,784
Forks
670
Language
Python
License
MIT
Category
Last pushed
Sep 03, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/werner-duvaud/muzero-general"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
jonathan-laurent/AlphaZero.jl
A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.
mokemokechicken/reversi-alpha-zero
Reversi reinforcement learning by AlphaGo Zero methods.
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial +...
NeymarL/ChineseChess-AlphaZero
Implement AlphaZero/AlphaGo Zero methods on Chinese chess.
DHDev0/Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of...