Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.01815
Cited By
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
5 December 2017
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
A. Guez
Marc Lanctot
Laurent Sifre
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm"
50 / 266 papers shown
Title
REGRAD: A Large-Scale Relational Grasp Dataset for Safe and Object-Specific Robotic Grasping in Clutter
Hanbo Zhang
Deyu Yang
Han Wang
Binglei Zhao
Xuguang Lan
Jishiyu Ding
Nanning Zheng
46
40
0
29 Apr 2021
Sifting out the features by pruning: Are convolutional networks the winning lottery ticket of fully connected ones?
Franco Pellegrini
Giulio Biroli
54
6
0
27 Apr 2021
Qubit Routing using Graph Neural Network aided Monte Carlo Tree Search
Animesh Sinha
Utkarsh Azad
Harjinder Singh
47
27
0
01 Apr 2021
Self-adaptive Torque Vectoring Controller Using Reinforcement Learning
Shayan Taherian
Sampo Kuutti
Marco Visca
Saber Fallah
13
4
0
27 Mar 2021
Policy-Guided Heuristic Search with Guarantees
Laurent Orseau
Levi H. S. Lelis
37
26
0
21 Mar 2021
Neural Networks and Denotation
E. Allen
27
0
0
15 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
23
17
0
15 Mar 2021
Learning to run a Power Network Challenge: a Retrospective Analysis
Antoine Marot
Benjamin Donnot
Gabriel Dulac-Arnold
A. Kelly
A. O'Sullivan
J. Viebahn
M. Awad
Isabelle M Guyon
P. Panciatici
Camilo Romero
22
77
0
02 Mar 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
18
18
0
13 Feb 2021
Deep Reinforcement Learning for the Control of Robotic Manipulation: A Focussed Mini-Review
Rongrong Liu
F. Nageotte
P. Zanne
M. de Mathelin
Birgitta Dresp
53
143
0
08 Feb 2021
Differentiable Trust Region Layers for Deep Reinforcement Learning
Fabian Otto
P. Becker
Ngo Anh Vien
Hanna Ziesche
Gerhard Neumann
OffRL
41
19
0
22 Jan 2021
Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI OpenAI
Matthias Plappert
Raul Sampedro
Tao Xu
Ilge Akkaya
...
Hyeonwoo Noh
Lilian Weng
Qiming Yuan
Casey Chu
Wojciech Zaremba
SSL
82
76
0
13 Jan 2021
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z Leibo
Kate Larson
T. Graepel
42
200
0
15 Dec 2020
Relative Variational Intrinsic Control
Kate Baumli
David Warde-Farley
Steven Hansen
Volodymyr Mnih
26
42
0
14 Dec 2020
Hindsight and Sequential Rationality of Correlated Play
Dustin Morrill
Ryan DÓrazio
Reca Sarfati
Marc Lanctot
James Wright
Amy Greenwald
Michael Bowling
29
29
0
10 Dec 2020
Ensemble Squared: A Meta AutoML System
Jason Yoo
Tony Joseph
Dylan Yung
S. Nasseri
Frank Wood
MoE
27
8
0
10 Dec 2020
On the Binding Problem in Artificial Neural Networks
Klaus Greff
Sjoerd van Steenkiste
Jürgen Schmidhuber
OCL
233
255
0
09 Dec 2020
EvoCraft: A New Challenge for Open-Endedness
Djordje Grbic
Rasmus Berg Palm
Elias Najarro
Claire Glanois
S. Risi
27
30
0
08 Dec 2020
Deep Reinforcement Learning for Resource Constrained Multiclass Scheduling in Wireless Networks
Apostolos Avranas
Marios Kountouris
P. Ciblat
24
7
0
27 Nov 2020
Experimental design for MRI by greedy policy search
Tim Bakker
H. V. Hoof
Max Welling
29
56
0
30 Oct 2020
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving
Ming Zhou
Jun Luo
Julian Villela
Yaodong Yang
David Rusu
...
H. Ammar
Hongbo Zhang
Wulong Liu
Jianye Hao
Jun Wang
139
193
0
19 Oct 2020
OR-Gym: A Reinforcement Learning Library for Operations Research Problems
Christian D. Hubbs
Hector D. Perez
Owais Sarwar
N. Sahinidis
I. Grossmann
J. Wassick
OffRL
AI4CE
27
74
0
14 Aug 2020
Learning to Play Two-Player Perfect-Information Games without Knowledge
Quentin Cohen-Solal
OffRL
47
13
0
03 Aug 2020
Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang
Wenjie Lu
H. Yu
Dikai Liu
41
1
0
29 Jul 2020
Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill
Florent Altché
Yunhao Tang
Thomas Hubert
Michal Valko
Ioannis Antonoglou
Rémi Munos
27
73
0
24 Jul 2020
Adaptive Discretization for Model-Based Reinforcement Learning
Sean R. Sinclair
Tianyu Wang
Gauri Jain
Siddhartha Banerjee
Chao Yu
OffRL
19
21
0
01 Jul 2020
Circuit Routing Using Monte Carlo Tree Search and Deep Neural Networks
Youbiao He
F. S. Bao
15
13
0
24 Jun 2020
Rinascimento: using event-value functions for playing Splendor
Ivan Bravi
Simon Lucas
27
2
0
10 Jun 2020
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
Weichao Mao
Kaipeng Zhang
Qiaomin Xie
Tamer Basar
21
14
0
08 Jun 2020
The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems
Eric M. S. P. Veith
Nils Wenninghoff
Emilie Frost
28
5
0
27 May 2020
Reassessing Claims of Human Parity and Super-Human Performance in Machine Translation at WMT 2019
Antonio Toral
27
43
0
12 May 2020
A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical Systems
Anthony Corso
Robert J. Moss
Mark Koren
Ritchie Lee
Mykel J. Kochenderfer
21
173
0
06 May 2020
Improving Movement Predictions of Traffic Actors in Bird's-Eye View Models using GANs and Differentiable Trajectory Rasterization
E. Wang
Henggang Cui
S. Yalamanchi
M. Moorthy
Fang-Chieh Chou
Nemanja Djuric
47
22
0
14 Apr 2020
How Do You Act? An Empirical Study to Understand Behavior of Deep Reinforcement Learning Agents
Richard Meyes
Moritz Schneider
Tobias Meisen
28
2
0
07 Apr 2020
Weakly-Supervised Reinforcement Learning for Controllable Behavior
Lisa Lee
Benjamin Eysenbach
Ruslan Salakhutdinov
S. Gu
Chelsea Finn
SSL
22
26
0
06 Apr 2020
A Survey of End-to-End Driving: Architectures and Training Methods
Ardi Tampuu
Maksym Semikin
Naveed Muhammad
D. Fishman
Tambet Matiisen
3DV
23
229
0
13 Mar 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
21
11
0
25 Feb 2020
Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence
S. Raschka
Joshua Patterson
Corey J. Nolet
AI4CE
29
485
0
12 Feb 2020
Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration with Application to Autonomous Sequential Repair Problems
Sushmita Bhattacharya
Sahil Badyal
Thomas Wheeler
Stephanie Gil
Dimitri Bertsekas
30
34
0
11 Feb 2020
Compositional ADAM: An Adaptive Compositional Solver
Rasul Tutunov
Minne Li
Alexander I. Cowen-Rivers
Jun Wang
Haitham Bou-Ammar
ODL
59
16
0
10 Feb 2020
Towards Learning Multi-agent Negotiations via Self-Play
Yichuan Tang
25
33
0
28 Jan 2020
Algorithms in Multi-Agent Systems: A Holistic Perspective from Reinforcement Learning and Game Theory
Yunlong Lu
Kai Yan
AI4CE
15
13
0
17 Jan 2020
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning
Eivind Meyer
Haakon Robinson
Adil Rasheed
Omer San
33
65
0
18 Dec 2019
Learning Generalizable Visual Representations via Interactive Gameplay
Luca Weihs
Aniruddha Kembhavi
Kiana Ehsani
Sarah M Pratt
Winson Han
Alvaro Herrasti
Eric Kolve
Dustin Schwenk
Roozbeh Mottaghi
Ali Farhadi
21
9
0
17 Dec 2019
Self-Play Learning Without a Reward Metric
Dan Schmidt
N. Moran
Jonathan S. Rosenfeld
Jonathan Rosenthal
J. Yedidia
19
4
0
16 Dec 2019
Scratch that! An Evolution-based Adversarial Attack against Neural Networks
Malhar Jere
Loris Rossi
Briland Hitaj
Gabriela F. Cretu-Ciocarlie
Giacomo Boracchi
F. Koushanfar
AAML
14
18
0
05 Dec 2019
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering
Yuval Heffetz
Roman Vainshtein
Gilad Katz
Lior Rokach
25
39
0
31 Oct 2019
Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
Mo Yu
Shiyu Chang
Yang Zhang
Tommi Jaakkola
21
140
0
29 Oct 2019
Optimal Immunization Policy Using Dynamic Programming
A. Alaeddini
Daniel J. Klein
17
1
0
19 Oct 2019
Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optimization use case
Paul Almasan
J. Suárez-Varela
Krzysztof Rusek
Pere Barlet-Ros
A. Cabellos-Aparicio
GNN
AI4CE
61
186
0
16 Oct 2019
Previous
1
2
3
4
5
6
Next