Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.06257
Cited By
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
10 March 2021
Benjamin Eysenbach
Sergey Levine
OOD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Maximum Entropy RL (Provably) Solves Some Robust RL Problems"
50 / 114 papers shown
Title
Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization
Uri Gadot
E. Derman
Navdeep Kumar
Maxence Mohamed Elfatihi
Kfir Y. Levy
Shie Mannor
30
5
0
03 Sep 2023
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
26
5
0
26 Jul 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
40
19
0
17 Jul 2023
Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity
Runyu Zhang
Yang Hu
Na Li
38
5
0
20 Jun 2023
Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Miguel Suau
M. Spaan
F. Oliehoek
CML
19
4
0
04 Jun 2023
Solving Robust MDPs through No-Regret Dynamics
E. Guha
30
0
0
30 May 2023
Reinforcement Learning with Simple Sequence Priors
Tankred Saanum
N. Éltető
Peter Dayan
Marcel Binz
Eric Schulz
OffRL
23
7
0
26 May 2023
Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies
Hanna Ziesche
Leonel Rozo
26
5
0
17 May 2023
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
21
2
0
15 May 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
25
0
0
22 Mar 2023
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization
E. Derman
Yevgeniy Men
M. Geist
Shie Mannor
39
1
0
12 Mar 2023
Decision-Making Under Uncertainty: Beyond Probabilities
Thom S. Badings
T. D. Simão
Marnix Suilen
N. Jansen
UD
PER
31
12
0
10 Mar 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
29
3
0
08 Mar 2023
Bounding the Optimal Value Function in Compositional Reinforcement Learning
Jacob Adamczyk
Volodymyr Makarenko
A. Arriojas
Stas Tiomkin
R. Kulkarni
OffRL
32
2
0
05 Mar 2023
Multi-Start Team Orienteering Problem for UAS Mission Re-Planning with Data-Efficient Deep Reinforcement Learning
Dong Ho Lee
Jaemyung Ahn
22
6
0
02 Mar 2023
Minimax-Bayes Reinforcement Learning
Thomas Kleine Buening
Christos Dimitrakakis
Hannes Eriksson
Divya Grover
Emilio Jorge
OffRL
16
5
0
21 Feb 2023
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided Bounds on the Value Function
Jacob Adamczyk
Stas Tiomkin
R. Kulkarni
OffRL
22
0
0
19 Feb 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Zuxin Liu
Zijian Guo
Yi-Fan Yao
Zhepeng Cen
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
31
46
0
14 Feb 2023
A general Markov decision process formalism for action-state entropy-regularized reward maximization
D. Grytskyy
Jorge Ramírez-Ruiz
R. Moreno-Bote
22
3
0
02 Feb 2023
An Efficient Solution to s-Rectangular Robust Markov Decision Processes
Navdeep Kumar
Kfir Y. Levy
Kaixin Wang
Shie Mannor
36
2
0
31 Jan 2023
Policy Gradient for Rectangular Robust Markov Decision Processes
Navdeep Kumar
E. Derman
M. Geist
Kfir Y. Levy
Shie Mannor
20
19
0
31 Jan 2023
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Mengdi Wang
Furong Huang
Dinesh Manocha
24
7
0
28 Jan 2023
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training
Philipp Altmann
Thomy Phan
Fabian Ritz
Thomas Gabor
Claudia Linnhoff-Popien
OffRL
29
1
0
18 Jan 2023
Robust Average-Reward Markov Decision Processes
Yue Wang
Alvaro Velasquez
George K. Atia
Ashley Prater-Bennette
Shaofeng Zou
33
11
0
02 Jan 2023
Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning
Ronghui Mu
Wenjie Ruan
Leandro Soriano Marcolino
Gaojie Jin
Q. Ni
40
5
0
22 Dec 2022
Risk-Sensitive Reinforcement Learning with Exponential Criteria
Erfaun Noorani
Christos N. Mavridis
John S. Baras
30
8
0
18 Dec 2022
Resilience Evaluation of Entropy Regularized Logistic Networks with Probabilistic Cost
Koshi Oishi
Yota Hashizume
Tomohiko Jimbo
Hirotaka Kaji
Kenji Kashima
15
2
0
05 Dec 2022
Utilizing Prior Solutions for Reward Shaping and Composition in Entropy-Regularized Reinforcement Learning
Jacob Adamczyk
A. Arriojas
Stas Tiomkin
R. Kulkarni
37
8
0
02 Dec 2022
Path Planning Using Wassertein Distributionally Robust Deep Q-learning
Cem Alptürk
Venkatraman Renganathan
OOD
16
0
0
04 Nov 2022
Latent State Marginalization as a Low-cost Approach for Improving Exploration
Dinghuai Zhang
Aaron Courville
Yoshua Bengio
Qinqing Zheng
Amy Zhang
Ricky T. Q. Chen
OOD
25
9
0
03 Oct 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind
T. D. Simão
Tal Kachman
N. Jansen
16
10
0
02 Oct 2022
On the convex formulations of robust Markov decision processes
Julien Grand-Clément
Marek Petrik
53
10
0
21 Sep 2022
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning
Xianfu Chen
Zhifeng Zhao
S. Mao
Celimuge Wu
Honggang Zhang
M. Bennis
OffRL
20
3
0
19 Sep 2022
Example When Local Optimal Policies Contain Unstable Control
B. Song
Jean-Jacques E. Slotine
Quang-Cuong Pham
46
1
0
15 Sep 2022
A Gaussian variational inference approach to motion planning
Hongzhe Yu
Yongxin Chen
34
16
0
13 Sep 2022
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL
AI4CE
30
31
0
13 Jul 2022
Conditional Energy-Based Models for Implicit Policies: The Gap between Theory and Practice
Duy-Nguyen Ta
Eric A. Cousineau
Huihua Zhao
Siyuan Feng
26
3
0
12 Jul 2022
Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization
Yuan Zhang
Jianhong Wang
Joschka Boedecker
36
3
0
05 Jul 2022
Robust Reinforcement Learning with Distributional Risk-averse formulation
Pierre Clavier
S. Allassonnière
E. L. Pennec
OOD
33
7
0
14 Jun 2022
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu
Zijian Guo
Zhepeng Cen
Huan Zhang
Jie Tan
Bo-wen Li
Ding Zhao
OOD
OffRL
42
35
0
29 May 2022
Efficient Policy Iteration for Robust Markov Decision Processes via Regularization
Navdeep Kumar
Kfir Y. Levy
Kaixin Wang
Shie Mannor
21
18
0
28 May 2022
Complex behavior from intrinsic motivation to occupy action-state path space
Jorge Ramírez-Ruiz
D. Grytskyy
Chiara Mastrogiuseppe
Yamen Habib
R. Moreno-Bote
29
7
0
20 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang
Shaofeng Zou
81
67
0
15 May 2022
SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics
Yannis Flet-Berliac
D. Basu
AAML
20
8
0
20 Apr 2022
Divide & Conquer Imitation Learning
Alexandre Chenu
Nicolas Perrin-Gilbert
Olivier Sigaud
10
5
0
15 Apr 2022
Maximum entropy optimal density control of discrete-time linear systems and Schrödinger bridges
Kaito Ito
Kenji Kashima
11
12
0
11 Apr 2022
Your Policy Regularizer is Secretly an Adversary
Rob Brekelmans
Tim Genewein
Jordi Grau-Moya
Grégoire Delétang
M. Kunesch
Shane Legg
Pedro A. Ortega
AAML
18
12
0
23 Mar 2022
Do You Need the Entropy Reward (in Practice)?
Haonan Yu
Haichao Zhang
Wei-ping Xu
28
7
0
28 Jan 2022
A Statistical Analysis of Polyak-Ruppert Averaged Q-learning
Xiang Li
Wenhao Yang
Jiadong Liang
Zhihua Zhang
Michael I. Jordan
37
15
0
29 Dec 2021
Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Dailin Hu
Pieter Abbeel
Roy Fox
24
1
0
28 Nov 2021
Previous
1
2
3
Next