Abstract
In this article we investigate how three multi-player search policies, namely maxn, paranoid, and Best-Reply Search, can be embedded in the MCTS framework. The performance of these search policies is tested in four different deterministic multi-player games with perfect information by running self-play experiments. We show that MCTS with the maxn search policy overall performs best.
Furthermore, we introduce a multi-player variant of the MCTS-Solver. We propose three update rules for solving nodes in a multi-player MCTS tree. The experimental results show that the multi-player variant of the MCTS-Solver is a genuine improvement for MCTS in multi-player games.
Get full access to this article
View all access options for this article.
