Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.08888
Cited By
Multimodal Reward Shaping for Efficient Exploration in Reinforcement Learning
19 July 2021
Mingqi Yuan
Mon-on Pun
Dong Wang
Yi Chen
Haojun Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multimodal Reward Shaping for Efficient Exploration in Reinforcement Learning"
19 / 19 papers shown
Title
State Entropy Maximization with Random Encoders for Efficient Exploration
Younggyo Seo
Lili Chen
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
44
123
0
18 Feb 2021
Intrinsic Reward Driven Imitation Learning via Generative Model
Xingrui Yu
Yueming Lyu
Ivor W. Tsang
19
54
0
26 Jun 2020
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments
Roberta Raileanu
Tim Rocktaschel
46
171
0
27 Feb 2020
Never Give Up: Learning Directed Exploration Strategies
Adria Puigdomenech Badia
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Bilal Piot
...
O. Tieleman
Martín Arjovsky
Alexander Pritzel
Andew Bolt
Charles Blundell
46
294
0
14 Feb 2020
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam
Raihan Seraj
Pierre-Luc Bacon
Doina Precup
23
8
0
11 Dec 2019
Efficient Exploration via State Marginal Matching
Lisa Lee
Benjamin Eysenbach
Emilio Parisotto
Eric Xing
Sergey Levine
Ruslan Salakhutdinov
99
242
0
12 Jun 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
64
365
0
30 Jan 2019
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
95
1,310
0
30 Oct 2018
Episodic Curiosity through Reachability
Nikolay Savinov
Anton Raichuk
Raphaël Marinier
Damien Vincent
Marc Pollefeys
Timothy Lillicrap
Sylvain Gelly
37
267
0
04 Oct 2018
Large-Scale Study of Curiosity-Driven Learning
Yuri Burda
Harrison Edwards
Deepak Pathak
Amos Storkey
Trevor Darrell
Alexei A. Efros
LRM
54
700
0
13 Aug 2018
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL
OffRL
52
177
0
20 Jun 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
236
18,685
0
20 Jul 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
96
2,423
0
15 May 2017
Count-Based Exploration with Neural Density Models
Georg Ostrovski
Marc G. Bellemare
Aaron van den Oord
Rémi Munos
74
616
0
03 Mar 2017
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
162
1,465
0
06 Jun 2016
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
Bradly C. Stadie
Sergey Levine
Pieter Abbeel
76
502
0
03 Jul 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
43
3,368
0
08 Jun 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
328
43,154
0
11 Feb 2015
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
367
16,962
0
20 Dec 2013
1