XUNIK8/Reinforcement-Learning-in-Tsumego
Solves Go life-and-death (tsumego) problems using improved reinforcement learning and UCT algorithms (based on Monte Carlo Tree Search).
No commits in the last 6 months.
Stars: 15
Forks: 3
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 26, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/XUNIK8/Reinforcement-Learning-in-Tsumego"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
werner-duvaud/muzero-general
MuZero
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial +...
jonathan-laurent/AlphaZero.jl
A generic, simple and fast implementation of DeepMind's AlphaZero algorithm.
NeymarL/ChineseChess-AlphaZero
Implement AlphaZero/AlphaGo Zero methods on Chinese chess.
mokemokechicken/reversi-alpha-zero
Reversi reinforcement learning by AlphaGo Zero methods.