DHDev0/Stochastic-muzero
PyTorch implementation of Stochastic MuZero for Gym environments. It supports a wide range of action and observation spaces, both discrete and continuous.
Stars: 77
Forks: 12
Language: Python
License: GPL-3.0
Category:
Last pushed: Dec 31, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/DHDev0/Stochastic-muzero"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
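The same request can be sketched in Python using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the path layout (category, owner, repository) is inferred from the curl example above rather than from any documented specification. The response format is not described here, so the sketch returns the raw body as text instead of assuming a schema.

```python
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper; layout inferred from the curl example)."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Send a GET request and return the raw response body as text.

    The response schema is an unknown here, so no parsing is attempted.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `fetch_quality("ml-frameworks", "DHDev0", "Stochastic-muzero")` would issue the same request as the curl command and count against the daily request limit.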
Higher-rated alternatives
jonathan-laurent/AlphaZero.jl
A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.
mokemokechicken/reversi-alpha-zero
Reversi reinforcement learning by AlphaGo Zero methods.
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial +...
werner-duvaud/muzero-general
MuZero
NeymarL/ChineseChess-AlphaZero
Implement AlphaZero/AlphaGo Zero methods on Chinese chess.