Accelerating Self-Play Learning in Go

27 February 2019

Papers citing "Accelerating Self-Play Learning in Go"

19 / 19 papers shown

Title
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze Chunyu Xuan Yazhe Niu Yuan Pu Shuai Hu Yu Liu Jing Yang 73 0 0 03 Jan 2025
Mastering Chess with a Transformer Model Daniel Monroe The Leela Chess Zero Team 44 3 0 18 Sep 2024
Know your Enemy: Investigating Monte-Carlo Tree Search with Opponent Models in Pommerman Jannis Weil Johannes Czech Tobias Meuser Kristian Kersting 21 2 0 22 May 2023
Superhuman Artificial Intelligence Can Improve Human Decision Making by Increasing Novelty Minkyu Shin Jin Kim B. V. Opheusden Thomas Griffiths 32 47 0 13 Mar 2023
Targeted Search Control in AlphaZero for Effective Policy Improvement Alexandre Trudeau Michael Bowling 18 1 0 23 Feb 2023
Are AlphaZero-like Agents Robust to Adversarial Perturbations? Li-Cheng Lan Huan Zhang Ti-Rong Wu Meng-Yu Tsai I-Chen Wu Cho-Jui Hsieh AAML 27 10 0 07 Nov 2022
The ProfessionAl Go annotation datasEt (PAGE) Yifan Gao Danni Zhang Haoyue Li 20 0 0 03 Nov 2022
Adversarial Policies Beat Superhuman Go AIs T. T. Wang Adam Gleave Tom Tseng Kellin Pelrine Nora Belrose ... Michael Dennis Yawen Duan V. Pogrebniak Sergey Levine Stuart Russell AAML 13 21 0 01 Nov 2022
Graph Value Iteration Dieqiao Feng Carla P. Gomes B. Selman 6 0 0 20 Sep 2022
The cost of passing -- using deep learning AIs to expand our understanding of the ancient game of Go A. Egri-Nagy Antti Törmänen 22 0 0 24 Aug 2022
Impartial Games: A Challenge for Reinforcement Learning Bei Zhou Søren Riis 32 6 0 25 May 2022
PGD: A Large-scale Professional Go Dataset for Data-driven Analytics Yifan Gao AI4TS 16 3 0 30 Apr 2022
Score vs. Winrate in Score-Based Games: which Reward for Reinforcement Learning? Luca Pasqualini G. Amato Maurizio Parton R. Gini Alessandro Marchetti C. Metta F. Morandin M. Fantozzi 19 2 0 31 Jan 2022
Modeling Strong and Human-Like Gameplay with KL-Regularized Search Athul Paul Jacob David J. Wu Gabriele Farina Adam Lerer Hengyuan Hu A. Bakhtin Jacob Andreas Noam Brown 27 52 0 14 Dec 2021
Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess Gregory Clark 33 9 0 05 Oct 2021
Transfer of Fully Convolutional Policy-Value Networks Between Games and Game Variants Dennis J. N. J. Soemers Vegard Mella Éric Piette Matthew Stephenson C. Browne O. Teytaud OffRL 25 8 0 24 Feb 2021
Minimax Strikes Back Quentin Cohen-Solal Tristan Cazenave 41 13 0 19 Dec 2020
Self-Play Learning Without a Reward Metric Dan Schmidt N. Moran Jonathan S. Rosenfeld Jonathan Rosenthal J. Yedidia 14 4 0 16 Dec 2019
SAI: a Sensible Artificial Intelligence that plays with handicap and targets high scores in 9x9 Go (extended version) F. Morandin G. Amato M. Fantozzi R. Gini C. Metta Maurizio Parton LLMAG 24 8 0 26 May 2019