Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.01815
Cited By
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
5 December 2017
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
A. Guez
Marc Lanctot
Laurent Sifre
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm"
50 / 266 papers shown
Title
On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll
Rohin Shah
Mark K. Ho
Thomas Griffiths
S. Seshia
Pieter Abbeel
Anca Dragan
HAI
25
382
0
13 Oct 2019
Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models
Qingyang Wu
Yichi Zhang
Yu Li
Zhou Yu
VLM
22
63
0
09 Oct 2019
Harnessing Structures for Value-Based Planning and Reinforcement Learning
Yuzhe Yang
Guo Zhang
Zhi Xu
Dina Katabi
OffRL
27
31
0
26 Sep 2019
Emergent Tool Use From Multi-Agent Autocurricula
Bowen Baker
I. Kanitscheider
Todor Markov
Yi Wu
Glenn Powell
Bob McGrew
Igor Mordatch
LRM
54
647
0
17 Sep 2019
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
22
86
0
10 Sep 2019
No Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette
Yuchen Lu
Steven Bocco
Max O. Smith
Satya Ortiz-Gagné
Jonathan K. Kummerfeld
Satinder Singh
Joelle Pineau
Aaron Courville
33
57
0
04 Sep 2019
Playing a Strategy Game with Knowledge-Based Reinforcement Learning
Viktor Voss
L. Nechepurenko
Rudi Schaefer
Steffen Bauer
27
5
0
15 Aug 2019
Neural Simplex Architecture
Dung Phan
Radu Grosu
N. Jansen
Nicola Paoletti
S. Smolka
Scott D. Stoller
22
61
0
01 Aug 2019
SentiMATE: Learning to play Chess through Natural Language Processing
Isaac Kamlish
Isaac Bentata Chocron
Nicholas McCarthy
17
10
0
18 Jul 2019
General Board Game Playing for Education and Research in Generic AI Game Learning
W. Konen
LLMAG
19
25
0
11 Jul 2019
On Inductive Biases in Deep Reinforcement Learning
Matteo Hessel
H. V. Hasselt
Joseph Modayil
David Silver
AI4CE
33
41
0
05 Jul 2019
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
14
38
0
28 Jun 2019
Generalization to Novel Objects using Prior Relational Knowledge
V. Vijay
Abhinav Ganesh
Hanlin Tang
Arjun K. Bansal
GNN
19
6
0
26 Jun 2019
Modern Deep Reinforcement Learning Algorithms
Sergey Ivanov
A. Dýakonov
OffRL
29
39
0
24 Jun 2019
Inductive general game playing
Andrew Cropper
Richard Evans
Mark Law
AI4CE
20
27
0
23 Jun 2019
Defending Against Adversarial Examples with K-Nearest Neighbor
Chawin Sitawarin
David Wagner
AAML
11
29
0
23 Jun 2019
Planning With Uncertain Specifications (PUnS)
Ankit J. Shah
Shen Li
J. Shah
24
25
0
07 Jun 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
17
116
0
27 May 2019
Adversarial Policies: Attacking Deep Reinforcement Learning
Adam Gleave
Michael Dennis
Cody Wild
Neel Kant
Sergey Levine
Stuart J. Russell
AAML
27
349
0
25 May 2019
Ignorance-Aware Approaches and Algorithms for Prototype Selection in Machine Learning
V. Terziyan
A. Nikulin
19
4
0
15 May 2019
Benchmark and Survey of Automated Machine Learning Frameworks
Marc-André Zöller
Marco F. Huber
25
86
0
26 Apr 2019
Deep Policies for Width-Based Planning in Pixel Domains
Miquel Junyent
Anders Jonsson
Vicencc Gómez
41
10
0
12 Apr 2019
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
Thomas W. Anthony
Robert Nishihara
Philipp Moritz
Tim Salimans
John Schulman
37
30
0
07 Apr 2019
A Local Approach to Forward Model Learning: Results on the Game of Life Game
Simon Lucas
Alexander Dockhorn
Vanessa Volz
Chris Bamford
Raluca D. Gaina
Ivan Bravi
Diego Perez-Liebana
Sanaz Mostaghim
R. Kruse
24
17
0
29 Mar 2019
A cooperative game for automated learning of elasto-plasticity knowledge graphs and models with AI-guided experimentation
Kun Wang
WaiChing Sun
Q. Du
AI4CE
23
56
0
08 Mar 2019
Towards Understanding Chinese Checkers with Heuristics, Monte Carlo Tree Search, and Deep Reinforcement Learning
Ziyu Liu
Meng Zhou
Weiqing Cao
Qiang Qu
H. W. F. Yeung
Yuk Ying Chung
21
4
0
05 Mar 2019
Coloring Big Graphs with AlphaGoZero
Jiayi Huang
Md. Mostofa Ali Patwary
G. Diamos
AI4CE
GNN
20
49
0
26 Feb 2019
Planning in Hierarchical Reinforcement Learning: Guarantees for Using Local Policies
Tom Zahavy
Avinatan Hassidim
Haim Kaplan
Yishay Mansour
OffRL
22
7
0
26 Feb 2019
Challenges for an Ontology of Artificial Intelligence
Scott H. Hawley
21
11
0
25 Feb 2019
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations
Yuhui Wang
Hao He
Xiaoyang Tan
30
9
0
15 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
19
9
0
14 Feb 2019
Learning Position Evaluation Functions Used in Monte Carlo Softmax Search
H. Igarashi
Yuichi Morioka
Kazumasa Yamamoto
16
0
0
30 Jan 2019
Ablation Studies in Artificial Neural Networks
Richard Meyes
Melanie Lu
Constantin Waubert de Puiseau
Tobias Meisen
16
210
0
24 Jan 2019
The Oracle of DLphi
Dominik Alfke
W. Baines
J. Blechschmidt
Mauricio J. del Razo Sarmina
Amnon Drory
...
L. Thesing
Philipp Trunschke
Johannes von Lindheim
David Weber
Melanie Weber
39
0
0
17 Jan 2019
Malthusian Reinforcement Learning
Joel Z Leibo
Julien Perolat
Edward Hughes
S. Wheelwright
Adam H. Marblestone
Edgar A. Duénez-Guzmán
P. Sunehag
Iain Dunning
T. Graepel
AI4CE
33
37
0
17 Dec 2018
Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control
F. Ebert
Chelsea Finn
Sudeep Dasari
Annie Xie
Alex X. Lee
Sergey Levine
SSL
35
379
0
03 Dec 2018
On the Complexity of Reconnaissance Blind Chess
Jared Markowitz
Matthieu de Rochemonteix
Ashley J. Llorens
22
11
0
07 Nov 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
13
148
0
21 Oct 2018
Supervising strong learners by amplifying weak experts
Paul Christiano
Buck Shlegeris
Dario Amodei
27
114
0
19 Oct 2018
Transfer Learning versus Multi-agent Learning regarding Distributed Decision-Making in Highway Traffic
Mark Schutera
Niklas Goby
Dirk Neumann
Markus Reischl
21
5
0
19 Oct 2018
Finding the best design parameters for optical nanostructures using reinforcement learning
Iman Sajedian
Trevon Badloe
J. Rho
22
12
0
18 Oct 2018
Reinforcement Learning Decoders for Fault-Tolerant Quantum Computation
R. Sweke
Markus S. Kesselring
E. van Nieuwenburg
Jens Eisert
AI4CE
LRM
27
107
0
16 Oct 2018
The 30-Year Cycle In The AI Debate
J. Chauvet
32
8
0
08 Oct 2018
Towards Game-based Metrics for Computational Co-creativity
Rodrigo Canaan
Stefan Menzel
Julian Togelius
Andy Nealen
17
9
0
26 Sep 2018
SAI, a Sensible Artificial Intelligence that plays Go
F. Morandin
G. Amato
R. Gini
C. Metta
Maurizio Parton
G. Pascutto
LLMAG
21
13
0
11 Sep 2018
ViZDoom Competitions: Playing Doom from Pixels
Marek Wydmuch
Michal Kempka
Wojciech Ja'skowski
29
119
0
10 Sep 2018
Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States
Peter Wolf
Karl Kurzer
Tobias Wingert
Florian Kuhnt
Johann Marius Zöllner
30
55
0
10 Sep 2018
A proof that artificial neural networks overcome the curse of dimensionality in the numerical approximation of Black-Scholes partial differential equations
Philipp Grohs
F. Hornung
Arnulf Jentzen
Philippe von Wurstemberger
21
167
0
07 Sep 2018
Improving Hearthstone AI by Combining MCTS and Supervised Learning Algorithms
M. Świechowski
T. Tajmajer
Andrzej Janusz
BDL
66
59
0
14 Aug 2018
Recent Advances in Deep Learning: An Overview
Matiur Rahman Minar
Jibon Naher
VLM
29
116
0
21 Jul 2018
Previous
1
2
3
4
5
6
Next