Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.06560
Cited By
Deep Reinforcement Learning that Matters
19 September 2017
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning that Matters"
50 / 379 papers shown
Title
Behaviour Suite for Reinforcement Learning
Ian Osband
Yotam Doron
Matteo Hessel
John Aslanides
Eren Sezener
...
Satinder Singh
Benjamin Van Roy
R. Sutton
David Silver
H. V. Hasselt
OffRL
32
178
0
09 Aug 2019
explAIner: A Visual Analytics Framework for Interactive and Explainable Machine Learning
Thilo Spinner
U. Schlegel
H. Schäfer
Mennatallah El-Assady
HAI
20
234
0
29 Jul 2019
Deep Lagrangian Networks for end-to-end learning of energy-based control for under-actuated systems
M. Lutter
Kim D. Listmann
Jan Peters
PINN
16
71
0
10 Jul 2019
AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates
Ning Liu
Xiaolong Ma
Zhiyuan Xu
Yanzhi Wang
Jian Tang
Jieping Ye
43
185
0
06 Jul 2019
Co-training for Policy Learning
Jialin Song
Ravi Lanka
Yisong Yue
M. Ono
OffRL
18
19
0
03 Jul 2019
Hyp-RL : Hyperparameter Optimization by Reinforcement Learning
H. Jomaa
Josif Grabocka
Lars Schmidt-Thieme
25
65
0
27 Jun 2019
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu
Qi Cai
Zhuoran Yang
Zhaoran Wang
30
108
0
25 Jun 2019
Modern Deep Reinforcement Learning Algorithms
Sergey Ivanov
A. Dýakonov
OffRL
29
39
0
24 Jun 2019
RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration
Brahma S. Pavse
F. Torabi
Josiah P. Hanna
Garrett A. Warnell
Peter Stone
27
33
0
18 Jun 2019
Tackling Climate Change with Machine Learning
David Rolnick
P. Donti
L. Kaack
K. Kochanski
Alexandre Lacoste
...
Demis Hassabis
John C. Platt
F. Creutzig
J. Chayes
Yoshua Bengio
AI4Cl
AI4CE
38
788
0
10 Jun 2019
Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains
Matthieu Zimmer
Paul Weng
24
7
0
10 Jun 2019
Multimodal End-to-End Autonomous Driving
Yi Xiao
Felipe Codevilla
A. Gurram
O. Urfalioglu
Antonio M. López
19
241
0
07 Jun 2019
An Empirical Study on Hyperparameters and their Interdependence for RL Generalization
Xingyou Song
Yilun Du
Jacob Jackson
AI4CE
27
8
0
02 Jun 2019
On Network Design Spaces for Visual Recognition
Ilija Radosavovic
Justin Johnson
Saining Xie
Wan-Yen Lo
Piotr Dollár
27
134
0
30 May 2019
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima
Qi Cai
Zhuoran Yang
Jason D. Lee
Zhaoran Wang
42
29
0
24 May 2019
REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning
Brian Yang
Jesse Zhang
Vitchyr H. Pong
Sergey Levine
Dinesh Jayaraman
27
37
0
17 May 2019
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
29
51
0
05 May 2019
How You Act Tells a Lot: Privacy-Leakage Attack on Deep Reinforcement Learning
Xinlei Pan
Weiyao Wang
Xiaoshuai Zhang
Bo-wen Li
Jinfeng Yi
D. Song
MIACV
69
26
0
24 Apr 2019
The Scientific Method in the Science of Machine Learning
Jessica Zosa Forde
Michela Paganini
24
35
0
24 Apr 2019
Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems
Ran A. Wang
Karthikeya S. Parunandi
Dan Yu
D. Kalathil
S. Chakravorty
23
11
0
17 Apr 2019
Differentiable Sampling with Flexible Reference Word Order for Neural Machine Translation
Weijia Xu
Xing Niu
Marine Carpuat
24
10
0
04 Apr 2019
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
OffRL
LRM
35
25
0
03 Apr 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
44
28
0
25 Mar 2019
Deep learning for molecular design - a review of the state of the art
Daniel C. Elton
Zois Boukouvalas
M. Fuge
Peter W. Chung
AI4CE
3DV
29
327
0
11 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
26
17
0
11 Mar 2019
Adaptive Power System Emergency Control using Deep Reinforcement Learning
Qiuhua Huang
Renke Huang
Weituo Hao
Jie Tan
Rui Fan
Zhenyu Huang
17
270
0
09 Mar 2019
The AI Driving Olympics at NeurIPS 2018
J. Zilly
J. Tani
Breandan Considine
Bhairav Mehta
Andrea F. Daniele
...
R. Hristov
S. Mallya
Emilio Frazzoli
A. Censi
Liam Paull
21
14
0
06 Mar 2019
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning
Ruo-Ze Liu
Haifeng Guo
Xiaozhong Ji
Yang Yu
Zhen-Jia Pang
Zitai Xiao
Yuzhou Wu
Tong Lu
OffRL
19
13
0
02 Mar 2019
Neural Packet Classification
Eric Liang
Hang Zhu
Xin Jin
Ion Stoica
OffRL
37
120
0
27 Feb 2019
Investigating Generalisation in Continuous Deep Reinforcement Learning
Chenyang Zhao
Olivier Sigaud
F. Stulp
Timothy M. Hospedales
OffRL
22
48
0
19 Feb 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients
Supratik Paul
Vitaly Kurin
Shimon Whiteson
22
32
0
18 Feb 2019
Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning
Andrew Silva
Matthew C. Gombolay
OffRL
27
20
0
15 Feb 2019
Ten ways to fool the masses with machine learning
F. Minhas
Amina Asif
Asa Ben-Hur
FedML
HAI
33
5
0
07 Jan 2019
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
54
433
0
26 Dec 2018
TD-Regularized Actor-Critic Methods
Simone Parisi
Voot Tangkaratt
Jan Peters
Mohammad Emtiyaz Khan
OffRL
30
32
0
19 Dec 2018
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
28
276
0
14 Dec 2018
Distilling Information from a Flood: A Possibility for the Use of Meta-Analysis and Systematic Review in Machine Learning Research
Peter Henderson
Emma Brunskill
AI4CE
37
3
0
03 Dec 2018
An initial attempt of combining visual selective attention with deep reinforcement learning
Liu Yuezhang
Ruohan Zhang
D. Ballard
23
20
0
11 Nov 2018
A Closer Look at Deep Policy Gradients
Andrew Ilyas
Logan Engstrom
Shibani Santurkar
Dimitris Tsipras
Firdaus Janoos
Larry Rudolph
Aleksander Madry
30
50
0
06 Nov 2018
Temporal Regularization in Markov Decision Process
Pierre Thodoroff
A. Durand
Joelle Pineau
Doina Precup
16
15
0
01 Nov 2018
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
D. Song
OffRL
18
233
0
29 Oct 2018
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
Michael Schaarschmidt
Sven Mika
Kai Fricke
Eiko Yoneki
OffRL
23
5
0
21 Oct 2018
O2A: One-shot Observational learning with Action vectors
Leo Pauly
Wisdom C. Agboh
David C. Hogg
R. Fuentes
57
9
0
17 Oct 2018
Learning Socially Appropriate Robot Approaching Behavior Toward Groups using Deep Reinforcement Learning
Yuan Gao
Fangkai Yang
Martin Frisk
Daniel Hernández
Christopher E. Peters
Ginevra Castellano
27
5
0
16 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
48
553
0
12 Oct 2018
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen
Amin Babadi
Xiaoxiao Ma
J. Lehtinen
32
62
0
05 Oct 2018
CEM-RL: Combining evolutionary and gradient-based methods for policy search
Aloïs Pourchot
Olivier Sigaud
32
160
0
02 Oct 2018
SmartChoices: Hybridizing Programming and Machine Learning
Victor Carbune
Thierry Coppey
A. Daryin
Thomas Deselaers
Nikhil Sarda
J. Yagnik
24
2
0
01 Oct 2018
Generalization and Regularization in DQN
Jesse Farebrother
Marlos C. Machado
Michael Bowling
30
204
0
29 Sep 2018
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Yunhao Tang
Shipra Agrawal
TPM
39
29
0
27 Sep 2018
Previous
1
2
3
4
5
6
7
8
Next