1812.08305
Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems
20 December 2018
Dhruv Malik
A. Pananjady
Kush S. Bhatia
K. Khamaru
Peter L. Bartlett
Martin J. Wainwright
Papers citing
"Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems"
42 / 42 papers shown
Title
Multiscale Adaptive Conflict-Balancing Model For Multimedia Deepfake Detection
Zihan Xiong
Xiaohua Wu
Lei Chen
Fangqi Lou
11
0
0
19 May 2025
Learning Stabilizing Policies via an Unstable Subspace Representation
Leonardo F. Toso
Lintao Ye
James Anderson
34
0
0
02 May 2025
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
Xuyang Chen
Jingliang Duan
Lin Zhao
62
1
0
02 May 2025
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning
Donglin Zhan
Leonardo F. Toso
James Anderson
104
1
0
04 Feb 2025
Building Socially-Equitable Public Models
Yejia Liu
Jianyi Yang
Pengfei Li
Tongxin Li
Shaolei Ren
OffRL
46
0
0
04 Jun 2024
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective
Muhammad Aneeq uz Zaman
Alec Koppel
Mathieu Laurière
Tamer Basar
44
3
0
17 Mar 2024
Model-Free $\mu$-Synthesis: A Nonsmooth Optimization Perspective
Darioush Keivan
Xing-ming Guo
Peter M. Seiler
Geir Dullerud
Bin Hu
36
0
0
18 Feb 2024
Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method
Elissa Mhanna
Mohamad Assaad
57
1
0
30 Jan 2024
Oracle Complexity Reduction for Model-free LQR: A Stochastic Variance-Reduced Policy Gradient Approach
Leonardo F. Toso
Han Wang
James Anderson
37
2
0
19 Sep 2023
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators
Yin-Huan Han
Meisam Razaviyayn
Renyuan Xu
27
5
0
15 Mar 2023
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient
Xiangyuan Zhang
Tamer Basar
36
19
0
25 Feb 2023
Learning the Kalman Filter with Fine-Grained Sample Complexity
Xiangyuan Zhang
Bin Hu
Tamer Basar
26
16
0
30 Jan 2023
Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
Xing-ming Guo
Bin Hu
41
12
0
20 Oct 2022
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Bin Hu
Kaipeng Zhang
Na Li
M. Mesbahi
Maryam Fazel
Tamer Basar
87
27
0
10 Oct 2022
Rate-Optimal Online Convex Optimization in Adaptive Linear Control
Asaf B. Cassel
Alon Cohen
Google Research
34
9
0
03 Jun 2022
Learning Mixtures of Linear Dynamical Systems
Yanxi Chen
H. Vincent Poor
22
17
0
26 Jan 2022
On the Sample Complexity of Decentralized Linear Quadratic Regulator with Partially Nested Information Structure
Lintao Ye
Haoqi Zhu
V. Gupta
33
14
0
14 Oct 2021
Stabilizing Dynamical Systems via Policy Gradient Methods
Juan C. Perdomo
Jack Umenberger
Max Simchowitz
40
44
0
13 Oct 2021
Regret Analysis of Distributed Online LQR Control for Unknown LTI Systems
Ting-Jui Chang
Shahin Shahrampour
32
8
0
15 May 2021
Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret
Asaf B. Cassel
Tomer Koren
OffRL
36
17
0
25 Feb 2021
Data-Driven System Level Synthesis
Anton Xue
Nikolai Matni
24
41
0
20 Nov 2020
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Tengyu Xu
Yingbin Liang
Guanghui Lan
52
122
0
11 Nov 2020
Sample Efficient Reinforcement Learning with REINFORCE
Junzi Zhang
Jongho Kim
Brendan O'Donoghue
Stephen P. Boyd
42
101
0
22 Oct 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
21
42
0
02 Aug 2020
Cooperative Multi-Agent Reinforcement Learning with Partial Observations
Yan Zhang
Michael M. Zavlanos
OffRL
32
22
0
18 Jun 2020
A New One-Point Residual-Feedback Oracle For Black-Box Learning and Control
Yan Zhang
Yi Zhou
Kaiyi Ji
Michael M. Zavlanos
23
40
0
18 Jun 2020
A Primer on Zeroth-Order Optimization in Signal Processing and Machine Learning
Sijia Liu
Pin-Yu Chen
B. Kailkhura
Gaoyuan Zhang
A. Hero III
P. Varshney
26
224
0
11 Jun 2020
Non-asymptotic Convergence Analysis of Two Time-scale (Natural) Actor-Critic Algorithms
Tengyu Xu
Zhe Wang
Yingbin Liang
26
57
0
07 May 2020
Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms
Tengyu Xu
Zhe Wang
Yingbin Liang
27
25
0
27 Apr 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
25
35
0
10 Feb 2020
Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem
Hesameddin Mohammadi
A. Zare
Mahdi Soltanolkotabi
M. Jovanović
32
122
0
26 Dec 2019
Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach
Yingying Li
Yujie Tang
Runyu Zhang
Na Li
24
101
0
19 Dec 2019
Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator
Yuwei Luo
Zhuoran Yang
Zhaoran Wang
Mladen Kolar
26
9
0
14 Dec 2019
Policy Optimization for $\mathcal{H}_2$ Linear Control with $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence
Kaipeng Zhang
Bin Hu
Tamer Basar
24
119
0
21 Oct 2019
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games
Zuyue Fu
Zhuoran Yang
Yongxin Chen
Zhaoran Wang
27
54
0
16 Oct 2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Zhuoran Yang
Yongxin Chen
Mingyi Hong
Zhaoran Wang
37
39
0
14 Jul 2019
From self-tuning regulators to reinforcement learning and back again
Nikolai Matni
Alexandre Proutiere
Anders Rantzer
Stephen Tu
27
88
0
27 Jun 2019
Robust exploration in linear quadratic reinforcement learning
Jack Umenberger
Mina Ferizbegovic
Thomas B. Schon
H. Hjalmarsson
23
38
0
04 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kaipeng Zhang
Zhuoran Yang
Tamer Basar
32
125
0
31 May 2019
Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator
K. Krauth
Stephen Tu
Benjamin Recht
27
57
0
30 May 2019
The Gap Between Model-Based and Model-Free Methods on the Linear Quadratic Regulator: An Asymptotic Viewpoint
Stephen Tu
Benjamin Recht
OffRL
24
150
0
09 Dec 2018
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition
Hamed Karimi
J. Nutini
Mark Schmidt
139
1,205
0
16 Aug 2016