Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.01387
Cited By
Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
3 September 2019
T. Paine
Çağlar Gülçehre
Bobak Shahriari
Misha Denil
Matt Hoffman
Hubert Soyer
Richard Tanburn
Steven Kapturowski
Neil C. Rabinowitz
Duncan Williams
Gabriel Barth-Maron
Ziyun Wang
Nando de Freitas
Worlds Team
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Making Efficient Use of Demonstrations to Solve Hard Exploration Problems"
31 / 31 papers shown
Title
Task-Relevant Adversarial Imitation Learning
Konrad Zolna
Scott E. Reed
Alexander Novikov
Sergio Gomez Colmenarejo
David Budden
Serkan Cabi
Misha Denil
Nando de Freitas
Ziyun Wang
GAN
159
61
0
02 Oct 2019
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
64
41
0
06 Aug 2019
Benchmarking Model-Based Reinforcement Learning
Tingwu Wang
Xuchan Bao
I. Clavera
Jerrick Hoang
Yeming Wen
Eric D. Langlois
Matthew Shunshi Zhang
Guodong Zhang
Pieter Abbeel
Jimmy Ba
OffRL
71
364
0
03 Jul 2019
Learning Belief Representations for Imitation Learning in POMDPs
Tanmay Gangwani
Joel Lehman
Qiang Liu
Jian Peng
57
37
0
22 Jun 2019
Reinforced Imitation in Heterogeneous Action Space
Konrad Zolna
Negar Rostamzadeh
Yoshua Bengio
Sungjin Ahn
Pedro H. O. Pinheiro
61
11
0
06 Apr 2019
Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning
Arthur Juliani
Ahmed Khalifa
Vincent-Pierre Berges
Jonathan Harper
Ervin Teng
Hunter Henry
A. Crespi
Julian Togelius
Danny Lange
64
144
0
04 Feb 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
97
370
0
30 Jan 2019
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
112
139
0
08 Dec 2018
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
246
1,624
0
07 Dec 2018
One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
T. Paine
Sergio Gomez Colmenarejo
Ziyun Wang
Scott E. Reed
Y. Aytar
...
Matthew W. Hoffman
Gabriel Barth-Maron
Serkan Cabi
David Budden
Nando de Freitas
OffRL
70
25
0
11 Oct 2018
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
95
260
0
09 Sep 2018
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCV
BDL
78
380
0
08 Jun 2018
Observe and Look Further: Achieving Consistent Performance on Atari
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
...
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
53
121
0
29 May 2018
Playing hard exploration games by watching YouTube
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
65
271
0
29 May 2018
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
151
741
0
02 Mar 2018
Divide-and-Conquer Reinforcement Learning
Dibya Ghosh
Avi Singh
Aravind Rajeswaran
Vikash Kumar
Sergey Levine
OffRL
85
127
0
27 Nov 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
109
2,270
0
06 Oct 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
102
788
0
28 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
97
557
0
18 Sep 2017
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Matej Vecerík
Todd Hester
Jonathan Scholz
Fumin Wang
Olivier Pietquin
Bilal Piot
N. Heess
Thomas Rothörl
Thomas Lampe
Martin Riedmiller
OffRL
97
667
0
27 Jul 2017
Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration
Rouhollah Rahmatizadeh
P. Abolghasemi
Ladislau Bölöni
Sergey Levine
105
259
0
10 Jul 2017
Learning human behaviors from motion capture by adversarial imitation
J. Merel
Yuval Tassa
TB Dhruva
S. Srinivasan
Jay Lemmon
Ziyun Wang
Greg Wayne
N. Heess
GAN
67
202
0
07 Jul 2017
Automated Curriculum Learning for Neural Networks
Alex Graves
Marc G. Bellemare
Jacob Menick
Rémi Munos
Koray Kavukcuoglu
93
531
0
10 Apr 2017
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
111
1,229
0
16 Nov 2016
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
159
3,119
0
10 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
179
1,483
0
06 Jun 2016
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Zhiwen Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zhang
GNN
AI4CE
433
18,361
0
27 May 2016
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
91
3,768
0
20 Nov 2015
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
231
3,797
0
18 Nov 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,312
0
22 Dec 2014
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
120
3,021
0
19 Jul 2012
1