Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.01041
Cited By
v1
v2
v3 (latest)
Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity
4 January 2021
Kai Zhang
Xiangyuan Zhang
Bin Hu
Tamer Bacsar
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity"
35 / 35 papers shown
Title
Robust Policy Gradient against Strong Data Corruption
Xuezhou Zhang
Yiding Chen
Xiaojin Zhu
Wen Sun
AAML
90
39
0
11 Feb 2021
Independent Policy Gradient Methods for Competitive Reinforcement Learning
C. Daskalakis
Dylan J. Foster
Noah Golowich
236
163
0
11 Jan 2021
Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon
B. Hambly
Renyuan Xu
Huining Yang
63
63
0
20 Nov 2020
Efficient Methods for Structured Nonconvex-Nonconcave Min-Max Optimization
Jelena Diakonikolas
C. Daskalakis
Michael I. Jordan
84
145
0
31 Oct 2020
RAT iLQR: A Risk Auto-Tuning Controller to Optimally Account for Stochastic Model Mismatch
Haruki Nishimura
Negar Mehr
Adrien Gaidon
Mac Schwager
81
13
0
16 Oct 2020
Gradient Descent-Ascent Provably Converges to Strict Local Minmax Equilibria with a Finite Timescale Separation
Tanner Fiez
Lillian J. Ratliff
37
16
0
30 Sep 2020
The Complexity of Constrained Min-Max Optimization
C. Daskalakis
Stratis Skoulakis
Manolis Zampetakis
122
137
0
21 Sep 2020
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Kai Zhang
Sham Kakade
Tamer Bacsar
Lin F. Yang
134
123
0
15 Jul 2020
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
Yingjie Fei
Zhuoran Yang
Yudong Chen
Zhaoran Wang
Qiaomin Xie
56
67
0
22 Jun 2020
Policy Learning of MDPs with Mixed Continuous/Discrete Variables: A Case Study on Model-Free Control of Markovian Jump Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
57
16
0
04 Jun 2020
Global Convergence and Variance-Reduced Optimization for a Class of Nonconvex-Nonconcave Minimax Problems
Junchi Yang
Negar Kiyavash
Niao He
88
84
0
22 Feb 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
62
35
0
10 Feb 2020
Naive Exploration is Optimal for Online LQR
Max Simchowitz
Dylan J. Foster
78
185
0
27 Jan 2020
Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem
Hesameddin Mohammadi
A. Zare
Mahdi Soltanolkotabi
M. Jovanović
72
124
0
26 Dec 2019
Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach
Yingying Li
Yujie Tang
Runyu Zhang
Na Li
78
102
0
19 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
219
1,225
0
24 Nov 2019
Poincaré Recurrence, Cycles and Spurious Equilibria in Gradient-Descent-Ascent for Non-Convex Non-Concave Zero-Sum Games
Lampros Flokas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Georgios Piliouras
MLT
79
41
0
28 Oct 2019
Policy Optimization for
H
2
\mathcal{H}_2
H
2
Linear Control with
H
∞
\mathcal{H}_\infty
H
∞
Robustness Guarantee: Implicit Regularization and Global Convergence
Kai Zhang
Bin Hu
Tamer Basar
62
121
0
21 Oct 2019
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Kai Zhang
Alec Koppel
Haoqi Zhu
Tamer Basar
72
191
0
19 Jun 2019
Robust Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
Nir Levine
Rae Jeong
Yuanyuan Shi
Jackie Kay
A. Abdolmaleki
Jost Tobias Springenberg
Timothy A. Mann
Todd Hester
Martin Riedmiller
OOD
120
122
0
18 Jun 2019
On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems
Tianyi Lin
Chi Jin
Michael I. Jordan
129
508
0
02 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kai Zhang
Zhuoran Yang
Tamer Basar
85
127
0
31 May 2019
Solving a Class of Non-Convex Min-Max Games Using Iterative First Order Methods
Maher Nouiehed
Maziar Sanjabi
Tianjian Huang
Jason D. Lee
Meisam Razaviyayn
96
344
0
21 Feb 2019
Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems
Dhruv Malik
A. Pananjady
Kush S. Bhatia
K. Khamaru
Peter L. Bartlett
Martin J. Wainwright
51
199
0
20 Dec 2018
The Gap Between Model-Based and Model-Free Methods on the Linear Quadratic Regulator: An Asymptotic Viewpoint
Stephen Tu
Benjamin Recht
OffRL
75
151
0
09 Dec 2018
Finding Mixed Nash Equilibria of Generative Adversarial Networks
Ya-Ping Hsieh
Chen Liu
S. Chakrabartty
GAN
82
93
0
23 Oct 2018
The Limit Points of (Optimistic) Gradient Descent in Min-Max Optimization
C. Daskalakis
Ioannis Panageas
83
256
0
11 Jul 2018
A Tour of Reinforcement Learning: The View from Continuous Control
Benjamin Recht
124
631
0
25 Jun 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi
100
610
0
15 Jan 2018
On the Sample Complexity of the Linear Quadratic Regulator
Sarah Dean
Horia Mania
Nikolai Matni
Benjamin Recht
Stephen Tu
77
580
0
04 Oct 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
547
19,296
0
20 Jul 2017
Robust Adversarial Reinforcement Learning
Lerrel Pinto
James Davidson
Rahul Sukthankar
Abhinav Gupta
OOD
109
861
0
08 Mar 2017
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
330
13,289
0
09 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
133
3,439
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
281
6,801
0
19 Feb 2015
1