Multimodal Reward Shaping for Efficient Exploration in Reinforcement
Learning

Multimodal Reward Shaping for Efficient Exploration in Reinforcement Learning

19 July 2021

Papers citing "Multimodal Reward Shaping for Efficient Exploration in Reinforcement Learning"

19 / 19 papers shown

Title
State Entropy Maximization with Random Encoders for Efficient Exploration Younggyo Seo Lili Chen Jinwoo Shin Honglak Lee Pieter Abbeel Kimin Lee 41 123 0 18 Feb 2021
Intrinsic Reward Driven Imitation Learning via Generative Model Xingrui Yu Yueming Lyu Ivor W. Tsang 19 54 0 26 Jun 2020
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments Roberta Raileanu Tim Rocktaschel 44 171 0 27 Feb 2020
Never Give Up: Learning Directed Exploration Strategies Adria Puigdomenech Badia Pablo Sprechmann Alex Vitvitskyi Daniel Guo Bilal Piot ... O. Tieleman Martín Arjovsky Alexander Pritzel Andew Bolt Charles Blundell 46 294 0 14 Feb 2020
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods Riashat Islam Raihan Seraj Pierre-Luc Bacon Doina Precup 18 8 0 11 Dec 2019
Efficient Exploration via State Marginal Matching Lisa Lee Benjamin Eysenbach Emilio Parisotto Eric Xing Sergey Levine Ruslan Salakhutdinov 99 242 0 12 Jun 2019
Go-Explore: a New Approach for Hard-Exploration Problems Adrien Ecoffet Joost Huizinga Joel Lehman Kenneth O. Stanley Jeff Clune AI4TS 64 365 0 30 Jan 2019
Exploration by Random Network Distillation Yuri Burda Harrison Edwards Amos Storkey Oleg Klimov 88 1,310 0 30 Oct 2018
Episodic Curiosity through Reachability Nikolay Savinov Anton Raichuk Raphaël Marinier Damien Vincent Marc Pollefeys Timothy Lillicrap Sylvain Gelly 37 267 0 04 Oct 2018
Large-Scale Study of Curiosity-Driven Learning Yuri Burda Harrison Edwards Deepak Pathak Amos Storkey Trevor Darrell Alexei A. Efros LRM 49 700 0 13 Aug 2018
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning Amy Zhang Nicolas Ballas Joelle Pineau CLL OffRL 49 177 0 20 Jun 2018
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 208 18,685 0 20 Jul 2017
Curiosity-driven Exploration by Self-supervised Prediction Deepak Pathak Pulkit Agrawal Alexei A. Efros Trevor Darrell LRM SSL 96 2,423 0 15 May 2017
Count-Based Exploration with Neural Density Models Georg Ostrovski Marc G. Bellemare Aaron van den Oord Rémi Munos 74 616 0 03 Mar 2017
Unifying Count-Based Exploration and Intrinsic Motivation Marc G. Bellemare S. Srinivasan Georg Ostrovski Tom Schaul D. Saxton Rémi Munos 159 1,465 0 06 Jun 2016
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models Bradly C. Stadie Sergey Levine Pieter Abbeel 73 502 0 03 Jul 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation John Schulman Philipp Moritz Sergey Levine Michael I. Jordan Pieter Abbeel OffRL 38 3,368 0 08 Jun 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift Sergey Ioffe Christian Szegedy OOD 298 43,154 0 11 Feb 2015
Auto-Encoding Variational Bayes Diederik P. Kingma Max Welling BDL 358 16,962 0 20 Dec 2013