This website requires JavaScript.

Proof Number Based Monte-Carlo Tree Search

Elliot DoeMark H. M. WinandsJakub KowalskiDennis J. N. J. SoemersDaniel G\'orskiCameron Browne
Mar 2023
摘要
This paper proposes a new game search algorithm, PN-MCTS, that combinesMonte-Carlo Tree Search (MCTS) and Proof-Number Search (PNS). These twoalgorithms have been successfully applied for decision making in a range ofdomains. We define three areas where the additional knowledge provided by theproof and disproof numbers gathered in MCTS trees might be used: final moveselection, solving subtrees, and the UCT formula. We test all possiblecombinations on different time settings, playing against vanilla UCT MCTS onseveral games: Lines of Action ($7$$\times$$7$ and $8$$\times$$8$), MiniShogi,Knightthrough, Awari, and Gomoku. Furthermore, we extend this new algorithm toproperly address games with draws, like Awari, by adding an additional layer ofPNS on top of the MCTS tree. The experiments show that PN-MCTS confidentlyoutperforms MCTS in 5 out of 6 game domains (all except Gomoku), achieving winrates up to 96.2% for Lines of Action.
展开全部
图表提取

暂无人提供速读十问回答

论文十问由沈向洋博士提出,鼓励大家带着这十个问题去阅读论文,用有用的信息构建认知模型。写出自己的十问回答,还有机会在当前页面展示哦。

Q1论文试图解决什么问题?
Q2这是否是一个新的问题?
Q3这篇文章要验证一个什么科学假设?
0
被引用
笔记
问答