Abstract

References
1. Cohen-Solal, Q. (2020). Learning to Play Two-Player Perfect-Information Games without Knowledge. arXiv preprint arXiv:2008.01188.
2. Cohen-Solal, Q., & Cazenave, T. (2021a). DESCENT wins five gold medals at the Computer Olympiad. ICGA Journal, 43(2), 132–134.
3. Cohen-Solal, Q., & Cazenave, T. (2021b). Minimax Strikes Back. In Reinforcement Learning in Games at AAAI.
4. Cohen-Solal, Q., & Cazenave, T. (2023a). Athénan wins sixteen gold medals at the Computer Olympiad. ICGA Journal, 45(3).
5. Cohen-Solal, Q., & Cazenave, T. (2023b). Minimax Strikes Back. In AAMAS (pp. 1923–1931).
6. Danihelka, I., Guez, A., Schrittwieser, J., & Silver, D. (2022). Policy improvement by planning with Gumbel. In International Conference on Learning Representations.
7. Korf, R. E., & Chickering, D. M. (1996). Best-first minimax search. Artificial Intelligence, 84(1–2), 299–337.
8. Silver, D., Hubert, T., Schrittwieser, J., Antonoglou, I., Lai, M., Guez, A., Lanctot, M., Sifre, L., Kumaran, D., Graepel, T., et al. (2018). A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science, 362(6419), 1140–1144.
9. Wu, T.-R., Guei, H., Peng, P.-C., Huang, P.-W., Wei, T. H., Shih, C.-C., & Tsai, Y.-J. (2024). MiniZero: Comparative analysis of AlphaZero and MuZero on Go, Othello, and Atari Games. IEEE Transactions on Games.
