Guided Exploration with Proximal Policy Optimization using a Single Demonstration

7 July 2020

Papers citing "Guided Exploration with Proximal Policy Optimization using a Single Demonstration"

10 / 10 papers shown

Title
Never Give Up: Learning Directed Exploration Strategies Adria Puigdomenech Badia Pablo Sprechmann Alex Vitvitskyi Daniel Guo Bilal Piot ... O. Tieleman Martín Arjovsky Alexander Pritzel Andew Bolt Charles Blundell 50 294 0 14 Feb 2020
Watch, Try, Learn: Meta-Learning from Demonstrations and Reward Allan Zhou Eric Jang Daniel Kappler Alexander Herzog Mohi Khansari Paul Wohlhart Yunfei Bai Mrinal Kalakrishnan Sergey Levine Chelsea Finn 54 49 0 07 Jun 2019
Learning Montezuma's Revenge from a Single Demonstration Tim Salimans Richard J. Chen 72 137 0 08 Dec 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures L. Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymyr Mnih ... Vlad Firoiu Tim Harley Iain Dunning Shane Legg Koray Kavukcuoglu 149 1,584 0 05 Feb 2018
Curiosity-driven Exploration by Self-supervised Prediction Deepak Pathak Pulkit Agrawal Alexei A. Efros Trevor Darrell LRM SSL 96 2,423 0 15 May 2017
One-Shot Imitation Learning Yan Duan Marcin Andrychowicz Bradly C. Stadie Jonathan Ho Jonas Schneider Ilya Sutskever Pieter Abbeel Wojciech Zaremba OffRL 66 684 0 21 Mar 2017
Generative Adversarial Imitation Learning Jonathan Ho Stefano Ermon GAN 114 3,089 0 10 Jun 2016
OpenAI Gym Greg Brockman Vicki Cheung Ludwig Pettersson Jonas Schneider John Schulman Jie Tang Wojciech Zaremba OffRL ODL 179 5,056 0 05 Jun 2016
Prioritized Experience Replay Tom Schaul John Quan Ioannis Antonoglou David Silver OffRL 198 3,781 0 18 Nov 2015
The Arcade Learning Environment: An Evaluation Platform for General Agents Marc G. Bellemare Yavar Naddaf J. Veness Michael Bowling 80 2,992 0 19 Jul 2012