Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.10565
Cited By
Accelerating Self-Play Learning in Go
27 February 2019
David J. Wu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Accelerating Self-Play Learning in Go"
19 / 19 papers shown
Title
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
73
0
0
03 Jan 2025
Mastering Chess with a Transformer Model
Daniel Monroe
The Leela Chess Zero Team
44
3
0
18 Sep 2024
Know your Enemy: Investigating Monte-Carlo Tree Search with Opponent Models in Pommerman
Jannis Weil
Johannes Czech
Tobias Meuser
Kristian Kersting
21
2
0
22 May 2023
Superhuman Artificial Intelligence Can Improve Human Decision Making by Increasing Novelty
Minkyu Shin
Jin Kim
B. V. Opheusden
Thomas Griffiths
32
47
0
13 Mar 2023
Targeted Search Control in AlphaZero for Effective Policy Improvement
Alexandre Trudeau
Michael Bowling
18
1
0
23 Feb 2023
Are AlphaZero-like Agents Robust to Adversarial Perturbations?
Li-Cheng Lan
Huan Zhang
Ti-Rong Wu
Meng-Yu Tsai
I-Chen Wu
Cho-Jui Hsieh
AAML
27
10
0
07 Nov 2022
The ProfessionAl Go annotation datasEt (PAGE)
Yifan Gao
Danni Zhang
Haoyue Li
20
0
0
03 Nov 2022
Adversarial Policies Beat Superhuman Go AIs
T. T. Wang
Adam Gleave
Tom Tseng
Kellin Pelrine
Nora Belrose
...
Michael Dennis
Yawen Duan
V. Pogrebniak
Sergey Levine
Stuart Russell
AAML
13
21
0
01 Nov 2022
Graph Value Iteration
Dieqiao Feng
Carla P. Gomes
B. Selman
6
0
0
20 Sep 2022
The cost of passing -- using deep learning AIs to expand our understanding of the ancient game of Go
A. Egri-Nagy
Antti Törmänen
22
0
0
24 Aug 2022
Impartial Games: A Challenge for Reinforcement Learning
Bei Zhou
Søren Riis
32
6
0
25 May 2022
PGD: A Large-scale Professional Go Dataset for Data-driven Analytics
Yifan Gao
AI4TS
16
3
0
30 Apr 2022
Score vs. Winrate in Score-Based Games: which Reward for Reinforcement Learning?
Luca Pasqualini
G. Amato
Maurizio Parton
R. Gini
Alessandro Marchetti
C. Metta
F. Morandin
M. Fantozzi
19
2
0
31 Jan 2022
Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Athul Paul Jacob
David J. Wu
Gabriele Farina
Adam Lerer
Hengyuan Hu
A. Bakhtin
Jacob Andreas
Noam Brown
27
52
0
14 Dec 2021
Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess
Gregory Clark
33
9
0
05 Oct 2021
Transfer of Fully Convolutional Policy-Value Networks Between Games and Game Variants
Dennis J. N. J. Soemers
Vegard Mella
Éric Piette
Matthew Stephenson
C. Browne
O. Teytaud
OffRL
25
8
0
24 Feb 2021
Minimax Strikes Back
Quentin Cohen-Solal
Tristan Cazenave
41
13
0
19 Dec 2020
Self-Play Learning Without a Reward Metric
Dan Schmidt
N. Moran
Jonathan S. Rosenfeld
Jonathan Rosenthal
J. Yedidia
14
4
0
16 Dec 2019
SAI: a Sensible Artificial Intelligence that plays with handicap and targets high scores in 9x9 Go (extended version)
F. Morandin
G. Amato
M. Fantozzi
R. Gini
C. Metta
Maurizio Parton
LLMAG
24
8
0
26 May 2019
1