2110.06267
Twice regularized MDPs and the equivalence between robustness and regularization
12 October 2021
E. Derman
Matthieu Geist
Shie Mannor
Papers citing "Twice regularized MDPs and the equivalence between robustness and regularization"
27 / 27 papers shown
Title
Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning
Chi Zhang
Ziying Jia
George Atia
Sihong He
Yue Wang
31
0
0
24 May 2025
Dual Formulation for Non-Rectangular Lp Robust Markov Decision Processes
Navdeep Kumar
Adarsh Gupta
Maxence Mohamed Elfatihi
Giorgia Ramponi
Kfir Y. Levy
Shie Mannor
70
0
0
13 Feb 2025
Decoding Game: On Minimax Optimality of Heuristic Text Generation Strategies
Sijin Chen
Omar Hagrass
Jason M. Klusowski
65
4
0
04 Oct 2024
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Toshinori Kitamura
Tadashi Kozuno
Wataru Kumagai
Kenta Hoshino
Y. Hosoe
Kazumi Kasaura
Masashi Hamaya
Paavo Parmas
Yutaka Matsuo
79
2
0
29 Aug 2024
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
70
2
0
26 Jan 2024
User-Oriented Robust Reinforcement Learning
Haoyi You
Beichen Yu
Haiming Jin
Zhaoxing Yang
Jiahui Sun
OffRL
40
0
0
15 Feb 2022
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
60
179
0
10 Mar 2021
Regularized Policies are Reward Robust
Hisham Husain
K. Ciosek
Ryota Tomioka
19
23
0
18 Jan 2021
Scalable First-Order Methods for Robust MDPs
Julien Grand-Clément
Christian Kroer
24
28
0
11 May 2020
Distributional Robustness and Regularization in Reinforcement Learning
E. Derman
Shie Mannor
38
44
0
05 Mar 2020
Reinforcement Learning via Fenchel-Rockafellar Duality
Ofir Nachum
Bo Dai
OffRL
64
119
0
07 Jan 2020
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs
Lior Shani
Yonathan Efroni
Shie Mannor
32
175
0
06 Sep 2019
Wasserstein Distributionally Robust Optimization: Theory and Applications in Machine Learning
Daniel Kuhn
Peyman Mohajerin Esfahani
Viet Anh Nguyen
Soroosh Shafieezadeh-Abadeh
OOD
37
392
0
23 Aug 2019
A Theory of Regularized Markov Decision Processes
Matthieu Geist
B. Scherrer
Olivier Pietquin
84
317
0
31 Jan 2019
Action Robust Reinforcement Learning and Applications in Continuous Control
Chen Tessler
Yonathan Efroni
Shie Mannor
44
232
0
26 Jan 2019
Soft-Robust Actor-Critic Policy-Gradient
E. Derman
D. Mankowitz
Timothy A. Mann
Shie Mannor
23
62
0
11 Mar 2018
Differentiable Dynamic Programming for Structured Prediction and Attention
A. Mensch
Mathieu Blondel
40
129
0
11 Feb 2018
Learning Robust Options
D. Mankowitz
Timothy A. Mann
Pierre-Luc Bacon
Doina Precup
Shie Mannor
24
48
0
09 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
186
8,236
0
04 Jan 2018
Sparse Markov Decision Processes with Causal Sparse Tsallis Entropy Regularization for Reinforcement Learning
Kyungjae Lee
Sungjoon Choi
Songhwai Oh
38
67
0
19 Sep 2017
Reinforcement Learning under Model Mismatch
Aurko Roy
Huan Xu
Sebastian Pokutta
OOD
34
80
0
15 Jun 2017
Robust Adversarial Reinforcement Learning
Lerrel Pinto
James Davidson
Rahul Sukthankar
Abhinav Gupta
OOD
73
848
0
08 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
55
1,329
0
27 Feb 2017
Distributionally Robust Logistic Regression
Soroosh Shafieezadeh-Abadeh
Peyman Mohajerin Esfahani
Daniel Kuhn
OOD
37
304
0
30 Sep 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
227
6,722
0
19 Feb 2015
Lightning Does Not Strike Twice: Robust MDPs with Coupled Uncertainty
Shie Mannor
O. Mebel
Huan Xu
45
67
0
18 Jun 2012
Robustness and Regularization of Support Vector Machines
Huan Xu
Constantine Caramanis
Shie Mannor
91
471
0
25 Mar 2008