Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2208.13125
Cited By
Normality-Guided Distributional Reinforcement Learning for Continuous Control
28 August 2022
Ju-Seung Byun
Andrew Perrault
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Normality-Guided Distributional Reinforcement Learning for Continuous Control"
32 / 32 papers shown
Title
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Vincent Mai
Kaustubh Mani
Liam Paull
61
35
0
05 Jan 2022
Implicit Distributional Reinforcement Learning
Yuguang Yue
Zhendong Wang
Mingyuan Zhou
OffRL
45
16
0
13 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
44
201
0
09 Jul 2020
Sample-based Distributional Policy Gradient
Rahul Singh
Keuntaek Lee
Yongxin Chen
41
19
0
08 Jan 2020
Regularization Matters in Policy Optimization
Zhuang Liu
Xuanlin Li
Bingyi Kang
Trevor Darrell
OffRL
50
33
0
21 Oct 2019
Estimating Risk and Uncertainty in Deep Reinforcement Learning
W. Clements
B. V. Delft
Benoît-Marie Robaglia
Reda Bahi Slaoui
Sébastien Toth
49
96
0
23 May 2019
Distributional Reinforcement Learning for Efficient Exploration
B. Mavrin
Shangtong Zhang
Hengshuai Yao
Linglong Kong
Kaiwen Wu
Yaoliang Yu
OOD
OffRL
41
87
0
13 May 2019
Understanding the impact of entropy on policy optimization
Zafarali Ahmed
Nicolas Le Roux
Mohammad Norouzi
Dale Schuurmans
73
230
0
27 Nov 2018
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
109
1,310
0
30 Oct 2018
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
82
479
0
23 Apr 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
164
5,121
0
26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
241
8,236
0
04 Jan 2018
Efficient exploration with Double Uncertain Value Networks
Thomas M. Moerland
Joost Broekens
Catholijn M. Jonker
40
42
0
29 Nov 2017
Distributional Reinforcement Learning with Quantile Regression
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
87
756
0
27 Oct 2017
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
82
1,497
0
21 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
285
18,685
0
20 Jul 2017
The Cramer Distance as a Solution to Biased Wasserstein Gradients
Marc G. Bellemare
Ivo Danihelka
Will Dabney
S. Mohamed
Balaji Lakshminarayanan
Stephan Hoyer
Rémi Munos
GAN
53
344
0
30 May 2017
What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision?
Alex Kendall
Y. Gal
BDL
OOD
UD
UQCV
PER
312
4,667
0
15 Mar 2017
EX2: Exploration with Exemplar Models for Deep Reinforcement Learning
Justin Fu
John D. Co-Reyes
Sergey Levine
OffRL
52
155
0
03 Mar 2017
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
555
5,748
0
05 Dec 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
86
764
0
15 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
76
344
0
07 Nov 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
167
1,465
0
06 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
188
5,056
0
05 Jun 2016
VIME: Variational Information Maximizing Exploration
Rein Houthooft
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
55
78
0
31 May 2016
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
78
1,302
0
15 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
170
8,805
0
04 Feb 2016
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
60
3,368
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
254
6,722
0
19 Feb 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.1K
149,474
0
22 Dec 2014
Sparse Quantile Huber Regression for Efficient and Robust Estimation
Aleksandr Aravkin
Anju Kambadur
A. Lozano
Ronny Luss
48
17
0
19 Feb 2014
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
92
2,992
0
19 Jul 2012
1