2007.03328
Guided Exploration with Proximal Policy Optimization using a Single Demonstration
7 July 2020
Gabriele Libardi
Gianni De Fabritiis
Papers citing "Guided Exploration with Proximal Policy Optimization using a Single Demonstration"
10 / 10 papers shown
Never Give Up: Learning Directed Exploration Strategies
Adria Puigdomenech Badia
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Bilal Piot
...
O. Tieleman
Martín Arjovsky
Alexander Pritzel
Andrew Bolt
Charles Blundell
50
294
0
14 Feb 2020
Watch, Try, Learn: Meta-Learning from Demonstrations and Reward
Allan Zhou
Eric Jang
Daniel Kappler
Alexander Herzog
Mohi Khansari
Paul Wohlhart
Yunfei Bai
Mrinal Kalakrishnan
Sergey Levine
Chelsea Finn
54
49
0
07 Jun 2019
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
72
137
0
08 Dec 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
149
1,584
0
05 Feb 2018
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
96
2,423
0
15 May 2017
One-Shot Imitation Learning
Yan Duan
Marcin Andrychowicz
Bradly C. Stadie
Jonathan Ho
Jonas Schneider
Ilya Sutskever
Pieter Abbeel
Wojciech Zaremba
OffRL
66
684
0
21 Mar 2017
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
114
3,089
0
10 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
179
5,056
0
05 Jun 2016
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
198
3,781
0
18 Nov 2015
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
80
2,992
0
19 Jul 2012