ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.08305
  4. Cited By
Derivative-Free Methods for Policy Optimization: Guarantees for Linear
  Quadratic Systems

Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

20 December 2018
Dhruv Malik
A. Pananjady
Kush S. Bhatia
K. Khamaru
Peter L. Bartlett
Martin J. Wainwright
ArXivPDFHTML

Papers citing "Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems"

42 / 42 papers shown
Title
Multiscale Adaptive Conflict-Balancing Model For Multimedia Deepfake Detection
Multiscale Adaptive Conflict-Balancing Model For Multimedia Deepfake Detection
Zihan Xiong
Xiaohua Wu
Lei Chen
Fangqi Lou
9
0
0
19 May 2025
Learning Stabilizing Policies via an Unstable Subspace Representation
Learning Stabilizing Policies via an Unstable Subspace Representation
Leonardo F. Toso
Lintao Ye
James Anderson
34
0
0
02 May 2025
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
Xuyang Chen
Jingliang Duan
Lin Zhao
62
1
0
02 May 2025
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning
Donglin Zhan
Leonardo F. Toso
James Anderson
104
1
0
04 Feb 2025
Building Socially-Equitable Public Models
Building Socially-Equitable Public Models
Yejia Liu
Jianyi Yang
Pengfei Li
Tongxin Li
Shaolei Ren
OffRL
46
0
0
04 Jun 2024
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective
Muhammad Aneeq uz Zaman
Alec Koppel
Mathieu Laurière
Tamer Basar
41
3
0
17 Mar 2024
Model-Free $μ$-Synthesis: A Nonsmooth Optimization Perspective
Model-Free μμμ-Synthesis: A Nonsmooth Optimization Perspective
Darioush Keivan
Xing-ming Guo
Peter M. Seiler
Geir Dullerud
Bin Hu
36
0
0
18 Feb 2024
Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method
Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method
Elissa Mhanna
Mohamad Assaad
55
1
0
30 Jan 2024
Oracle Complexity Reduction for Model-free LQR: A Stochastic
  Variance-Reduced Policy Gradient Approach
Oracle Complexity Reduction for Model-free LQR: A Stochastic Variance-Reduced Policy Gradient Approach
Leonardo F. Toso
Han Wang
James Anderson
37
2
0
19 Sep 2023
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators
Yin-Huan Han
Meisam Razaviyayn
Renyuan Xu
27
5
0
15 Mar 2023
Revisiting LQR Control from the Perspective of Receding-Horizon Policy
  Gradient
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient
Xiangyuan Zhang
Tamer Basar
36
19
0
25 Feb 2023
Learning the Kalman Filter with Fine-Grained Sample Complexity
Learning the Kalman Filter with Fine-Grained Sample Complexity
Xiangyuan Zhang
Bin Hu
Tamer Bacsar
26
16
0
30 Jan 2023
Global Convergence of Direct Policy Search for State-Feedback
  $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with
  Goldstein Subdifferential
Global Convergence of Direct Policy Search for State-Feedback H∞\mathcal{H}_\inftyH∞​ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
Xing-ming Guo
Bin Hu
41
12
0
20 Oct 2022
Towards a Theoretical Foundation of Policy Optimization for Learning
  Control Policies
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Bin Hu
Kaipeng Zhang
Na Li
M. Mesbahi
Maryam Fazel
Tamer Bacsar
87
27
0
10 Oct 2022
Rate-Optimal Online Convex Optimization in Adaptive Linear Control
Rate-Optimal Online Convex Optimization in Adaptive Linear Control
Asaf B. Cassel
Alon Cohen
Google Research
34
9
0
03 Jun 2022
Learning Mixtures of Linear Dynamical Systems
Learning Mixtures of Linear Dynamical Systems
Yanxi Chen
H. Vincent Poor
20
17
0
26 Jan 2022
On the Sample Complexity of Decentralized Linear Quadratic Regulator
  with Partially Nested Information Structure
On the Sample Complexity of Decentralized Linear Quadratic Regulator with Partially Nested Information Structure
Lintao Ye
Haoqi Zhu
V. Gupta
33
14
0
14 Oct 2021
Stabilizing Dynamical Systems via Policy Gradient Methods
Stabilizing Dynamical Systems via Policy Gradient Methods
Juan C. Perdomo
Jack Umenberger
Max Simchowitz
40
44
0
13 Oct 2021
Regret Analysis of Distributed Online LQR Control for Unknown LTI
  Systems
Regret Analysis of Distributed Online LQR Control for Unknown LTI Systems
Ting-Jui Chang
Shahin Shahrampour
29
8
0
15 May 2021
Online Policy Gradient for Model Free Learning of Linear Quadratic
  Regulators with $\sqrt{T}$ Regret
Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with T\sqrt{T}T​ Regret
Asaf B. Cassel
Tomer Koren
OffRL
36
17
0
25 Feb 2021
Data-Driven System Level Synthesis
Data-Driven System Level Synthesis
Anton Xue
Nikolai Matni
24
41
0
20 Nov 2020
CRPO: A New Approach for Safe Reinforcement Learning with Convergence
  Guarantee
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Tengyu Xu
Yingbin Liang
Guanghui Lan
52
122
0
11 Nov 2020
Sample Efficient Reinforcement Learning with REINFORCE
Sample Efficient Reinforcement Learning with REINFORCE
Junzi Zhang
Jongho Kim
Brendan O'Donoghue
Stephen P. Boyd
42
101
0
22 Oct 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
21
42
0
02 Aug 2020
Cooperative Multi-Agent Reinforcement Learning with Partial Observations
Cooperative Multi-Agent Reinforcement Learning with Partial Observations
Yan Zhang
Michael M. Zavlanos
OffRL
32
22
0
18 Jun 2020
A New One-Point Residual-Feedback Oracle For Black-Box Learning and
  Control
A New One-Point Residual-Feedback Oracle For Black-Box Learning and Control
Yan Zhang
Yi Zhou
Kaiyi Ji
Michael M. Zavlanos
23
40
0
18 Jun 2020
A Primer on Zeroth-Order Optimization in Signal Processing and Machine
  Learning
A Primer on Zeroth-Order Optimization in Signal Processing and Machine Learning
Sijia Liu
Pin-Yu Chen
B. Kailkhura
Gaoyuan Zhang
A. Hero III
P. Varshney
26
224
0
11 Jun 2020
Non-asymptotic Convergence Analysis of Two Time-scale (Natural)
  Actor-Critic Algorithms
Non-asymptotic Convergence Analysis of Two Time-scale (Natural) Actor-Critic Algorithms
Tengyu Xu
Zhe Wang
Yingbin Liang
26
57
0
07 May 2020
Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms
Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms
Tengyu Xu
Zhe Wang
Yingbin Liang
27
25
0
27 Apr 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump
  Linear Systems
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
25
35
0
10 Feb 2020
Convergence and sample complexity of gradient methods for the model-free
  linear quadratic regulator problem
Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem
Hesameddin Mohammadi
A. Zare
Mahdi Soltanolkotabi
M. Jovanović
32
122
0
26 Dec 2019
Distributed Reinforcement Learning for Decentralized Linear Quadratic
  Control: A Derivative-Free Policy Optimization Approach
Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach
Yingying Li
Yujie Tang
Runyu Zhang
Na Li
24
101
0
19 Dec 2019
Natural Actor-Critic Converges Globally for Hierarchical Linear
  Quadratic Regulator
Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator
Yuwei Luo
Zhuoran Yang
Zhaoran Wang
Mladen Kolar
26
9
0
14 Dec 2019
Policy Optimization for $\mathcal{H}_2$ Linear Control with
  $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global
  Convergence
Policy Optimization for H2\mathcal{H}_2H2​ Linear Control with H∞\mathcal{H}_\inftyH∞​ Robustness Guarantee: Implicit Regularization and Global Convergence
Kaipeng Zhang
Bin Hu
Tamer Basar
24
119
0
21 Oct 2019
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic
  Mean-Field Games
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games
Zuyue Fu
Zhuoran Yang
Yongxin Chen
Zhaoran Wang
27
54
0
16 Oct 2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic
  Regulator with Ergodic Cost
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Zhuoran Yang
Yongxin Chen
Mingyi Hong
Zhaoran Wang
32
39
0
14 Jul 2019
From self-tuning regulators to reinforcement learning and back again
From self-tuning regulators to reinforcement learning and back again
Nikolai Matni
Alexandre Proutiere
Anders Rantzer
Stephen Tu
27
88
0
27 Jun 2019
Robust exploration in linear quadratic reinforcement learning
Robust exploration in linear quadratic reinforcement learning
Jack Umenberger
Mina Ferizbegovic
Thomas B. Schon
H. Hjalmarsson
23
38
0
04 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum
  Linear Quadratic Games
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kaipeng Zhang
Zhuoran Yang
Tamer Basar
32
125
0
31 May 2019
Finite-time Analysis of Approximate Policy Iteration for the Linear
  Quadratic Regulator
Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator
K. Krauth
Stephen Tu
Benjamin Recht
27
57
0
30 May 2019
The Gap Between Model-Based and Model-Free Methods on the Linear
  Quadratic Regulator: An Asymptotic Viewpoint
The Gap Between Model-Based and Model-Free Methods on the Linear Quadratic Regulator: An Asymptotic Viewpoint
Stephen Tu
Benjamin Recht
OffRL
13
150
0
09 Dec 2018
Linear Convergence of Gradient and Proximal-Gradient Methods Under the
  Polyak-Łojasiewicz Condition
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition
Hamed Karimi
J. Nutini
Mark Schmidt
139
1,205
0
16 Aug 2016
1