kaesve/muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
168 stars. No commits in the last 6 months.
Stars
168
Forks
27
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Mar 28, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/kaesve/muzero"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial +...
werner-duvaud/muzero-general
MuZero
jonathan-laurent/AlphaZero.jl
A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.
mokemokechicken/reversi-alpha-zero
Reversi reinforcement learning by AlphaGo Zero methods.
NeymarL/ChineseChess-AlphaZero
Implement AlphaZero/AlphaGo Zero methods on Chinese chess.