Integrating Behavior Cloning and Reinforcement Learning for Improved
Performance in Dense and Sparse Reward Environments

9 October 2019

Vinicius G. Goecks

Gregory M. Gremillion

Vernon J. Lawhern

Nicholas R. Waytowich

Papers citing "Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments"

Title
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning Tom Zahavy Matan Haroush Nadav Merlis D. Mankowitz Shie Mannor 102 191 0 06 Sep 2018
Cycle-of-Learning for Autonomous Systems from Human Interaction Nicholas R. Waytowich Vinicius G. Goecks Vernon J. Lawhern 45 20 0 28 Aug 2018
Learning Dexterous In-Hand Manipulation OpenAI Marcin Andrychowicz Bowen Baker Maciek Chociej Rafal Jozefowicz ... Szymon Sidor Joshua Tobin Peter Welinder Lilian Weng Wojciech Zaremba 169 1,884 0 01 Aug 2018
Observe and Look Further: Achieving Consistent Performance on Atari Tobias Pohlen Bilal Piot Todd Hester M. G. Azar Dan Horgan ... John Quan Mel Vecerík Matteo Hessel Rémi Munos Olivier Pietquin 63 121 0 29 May 2018
Reinforcement Learning from Imperfect Demonstrations Yang Gao Huazhe Xu Ji Lin Feng Yu Sergey Levine Trevor Darrell 74 202 0 14 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine 317 8,432 0 04 Jan 2018
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces Garrett A. Warnell Nicholas R. Waytowich Vernon J. Lawhern Peter Stone 72 272 0 28 Sep 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations Ashvin Nair Bob McGrew Marcin Andrychowicz Wojciech Zaremba Pieter Abbeel OffRL 102 789 0 28 Sep 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations Aravind Rajeswaran Vikash Kumar Abhishek Gupta Giulia Vezzani John Schulman E. Todorov Sergey Levine 146 1,101 0 28 Sep 2017
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards Matej Vecerík Todd Hester Jonathan Scholz Fumin Wang Olivier Pietquin Bilal Piot N. Heess Thomas Rothörl Thomas Lampe Martin Riedmiller OffRL 99 669 0 27 Jul 2017
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention William Saunders Girish Sastry Andreas Stuhlmuller Owain Evans OffRL 72 231 0 17 Jul 2017
AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles S. Shah Debadeepta Dey Chris Lovett Ashish Kapoor 96 2,007 0 15 May 2017
Accurately and Efficiently Interpreting Human-Robot Instructions of Varying Granularities Dilip Arumugam Siddharth Karamcheti N. Gopalan Lawson L. S. Wong Stefanie Tellex LM&Ro 61 60 0 21 Apr 2017
Interactive Learning from Policy-Dependent Human Feedback J. MacGlashan Mark K. Ho R. Loftin Bei Peng Guan Wang David L. Roberts Matthew E. Taylor Michael L. Littman 97 306 0 21 Jan 2017
Generative Adversarial Imitation Learning Jonathan Ho Stefano Ermon GAN 165 3,125 0 10 Jun 2016
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) Djork-Arné Clevert Thomas Unterthiner Sepp Hochreiter 311 5,539 0 23 Nov 2015
Prioritized Experience Replay Tom Schaul John Quan Ioannis Antonoglou David Silver OffRL 233 3,796 0 18 Nov 2015
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 330 13,295 0 09 Sep 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 2.1K 150,433 0 22 Dec 2014