Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.01815
Cited By
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
5 December 2017
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
A. Guez
Marc Lanctot
Laurent Sifre
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm"
17 / 267 papers shown
Title
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
42
643
0
01 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
33
213
0
20 Jun 2018
ML + FV =
♡
\heartsuit
♡
? A Survey on the Application of Machine Learning to Formal Verification
Moussa Amrani
L. Lucio
Adrien Bibal
30
5
0
10 Jun 2018
Re-evaluating Evaluation
David Balduzzi
K. Tuyls
Julien Perolat
T. Graepel
MoMe
30
97
0
07 Jun 2018
Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning
Ramtin Keramati
Jay Whang
Patrick Cho
Emma Brunskill
OffRL
29
7
0
01 Jun 2018
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
26
83
0
26 May 2018
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
16
48
0
24 May 2018
Bandit-Based Monte Carlo Optimization for Nearest Neighbors
Vivek Bagaria
Tavor Z. Baharav
G. Kamath
David Tse
13
12
0
21 May 2018
AGI Safety Literature Review
Tom Everitt
G. Lea
Marcus Hutter
AI4CE
36
115
0
03 May 2018
AI safety via debate
G. Irving
Paul Christiano
Dario Amodei
204
203
0
02 May 2018
Monte Carlo Q-learning for General Game Playing
Hui Wang
M. Emmerich
Aske Plaat
GP
13
20
0
16 Feb 2018
ProofWatch: Watchlist Guidance for Large Theories in E
Z. Goertzel
Jan Jakubuv
S. Schulz
Josef Urban
LRM
28
13
0
12 Feb 2018
Tunneling Neural Perception and Logic Reasoning through Abductive Learning
Wang-Zhou Dai
Qiu-Ling Xu
Yang Yu
Zhi-Hua Zhou
LRM
AI4CE
32
22
0
04 Feb 2018
Deep Reinforcement Learning using Capsules in Advanced Game Environments
Per-Arne Andersen
18
16
0
29 Jan 2018
Innateness, AlphaZero, and Artificial Intelligence
G. Marcus
30
73
0
17 Jan 2018
Building a Conversational Agent Overnight with Dialogue Self-Play
Pararth Shah
Dilek Z. Hakkani-Tür
Gokhan Tur
Abhinav Rastogi
Ankur Bapna
Neha Nayak Kennard
Larry Heck
45
193
0
15 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
Igor Adamski
R. Adamski
T. Grel
Adam Jedrych
Kamil Kaczmarek
Henryk Michalewski
OffRL
41
37
0
09 Jan 2018
Previous
1
2
3
4
5
6