Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

20 December 2018

Papers citing "Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems"

42 / 42 papers shown

Title
Multiscale Adaptive Conflict-Balancing Model For Multimedia Deepfake Detection Zihan Xiong Xiaohua Wu Lei Chen Fangqi Lou 11 0 0 19 May 2025
Learning Stabilizing Policies via an Unstable Subspace Representation Leonardo F. Toso Lintao Ye James Anderson 34 0 0 02 May 2025
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator Xuyang Chen Jingliang Duan Lin Zhao 62 1 0 02 May 2025
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning Donglin Zhan Leonardo F. Toso James Anderson 104 1 0 04 Feb 2025
Building Socially-Equitable Public Models Yejia Liu Jianyi Yang Pengfei Li Tongxin Li Shaolei Ren OffRL 46 0 0 04 Jun 2024
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective Muhammad Aneeq uz Zaman Alec Koppel Mathieu Laurière Tamer Basar 44 3 0 17 Mar 2024
Model-Free $μ$ -Synthesis: A Nonsmooth Optimization Perspective Darioush Keivan Xing-ming Guo Peter M. Seiler Geir Dullerud Bin Hu 36 0 0 18 Feb 2024
Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method Elissa Mhanna Mohamad Assaad 57 1 0 30 Jan 2024
Oracle Complexity Reduction for Model-free LQR: A Stochastic Variance-Reduced Policy Gradient Approach Leonardo F. Toso Han Wang James Anderson 37 2 0 19 Sep 2023
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators Yin-Huan Han Meisam Razaviyayn Renyuan Xu 27 5 0 15 Mar 2023
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient Xiangyuan Zhang Tamer Basar 36 19 0 25 Feb 2023
Learning the Kalman Filter with Fine-Grained Sample Complexity Xiangyuan Zhang Bin Hu Tamer Bacsar 26 16 0 30 Jan 2023
$Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential$ Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential Xing-ming Guo Bin Hu 41 12 0 20 Oct 2022
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies Bin Hu Kaipeng Zhang Na Li M. Mesbahi Maryam Fazel Tamer Bacsar 87 27 0 10 Oct 2022
Rate-Optimal Online Convex Optimization in Adaptive Linear Control Asaf B. Cassel Alon Cohen Google Research 34 9 0 03 Jun 2022
Learning Mixtures of Linear Dynamical Systems Yanxi Chen H. Vincent Poor 22 17 0 26 Jan 2022
On the Sample Complexity of Decentralized Linear Quadratic Regulator with Partially Nested Information Structure Lintao Ye Haoqi Zhu V. Gupta 33 14 0 14 Oct 2021
Stabilizing Dynamical Systems via Policy Gradient Methods Juan C. Perdomo Jack Umenberger Max Simchowitz 40 44 0 13 Oct 2021
Regret Analysis of Distributed Online LQR Control for Unknown LTI Systems Ting-Jui Chang Shahin Shahrampour 32 8 0 15 May 2021
$Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret$ Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret Asaf B. Cassel Tomer Koren OffRL 36 17 0 25 Feb 2021
Data-Driven System Level Synthesis Anton Xue Nikolai Matni 24 41 0 20 Nov 2020
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee Tengyu Xu Yingbin Liang Guanghui Lan 52 122 0 11 Nov 2020
Sample Efficient Reinforcement Learning with REINFORCE Junzi Zhang Jongho Kim Brendan O'Donoghue Stephen P. Boyd 42 101 0 22 Oct 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy Zuyue Fu Zhuoran Yang Zhaoran Wang 21 42 0 02 Aug 2020
Cooperative Multi-Agent Reinforcement Learning with Partial Observations Yan Zhang Michael M. Zavlanos OffRL 32 22 0 18 Jun 2020
A New One-Point Residual-Feedback Oracle For Black-Box Learning and Control Yan Zhang Yi Zhou Kaiyi Ji Michael M. Zavlanos 23 40 0 18 Jun 2020
A Primer on Zeroth-Order Optimization in Signal Processing and Machine Learning Sijia Liu Pin-Yu Chen B. Kailkhura Gaoyuan Zhang A. Hero III P. Varshney 26 224 0 11 Jun 2020
Non-asymptotic Convergence Analysis of Two Time-scale (Natural) Actor-Critic Algorithms Tengyu Xu Zhe Wang Yingbin Liang 26 57 0 07 May 2020
Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms Tengyu Xu Zhe Wang Yingbin Liang 27 25 0 27 Apr 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems Joao Paulo Jansch-Porto Bin Hu Geir Dullerud 25 35 0 10 Feb 2020
Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem Hesameddin Mohammadi A. Zare Mahdi Soltanolkotabi M. Jovanović 32 122 0 26 Dec 2019
Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach Yingying Li Yujie Tang Runyu Zhang Na Li 24 101 0 19 Dec 2019
Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator Yuwei Luo Zhuoran Yang Zhaoran Wang Mladen Kolar 26 9 0 14 Dec 2019
$Policy Optimization for $\mathcal{H}_2$ Linear Control with $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence$ Policy Optimization for $\mathcal{H}_2$ Linear Control with $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence Kaipeng Zhang Bin Hu Tamer Basar 24 119 0 21 Oct 2019
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games Zuyue Fu Zhuoran Yang Yongxin Chen Zhaoran Wang 27 54 0 16 Oct 2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost Zhuoran Yang Yongxin Chen Mingyi Hong Zhaoran Wang 37 39 0 14 Jul 2019
From self-tuning regulators to reinforcement learning and back again Nikolai Matni Alexandre Proutiere Anders Rantzer Stephen Tu 27 88 0 27 Jun 2019
Robust exploration in linear quadratic reinforcement learning Jack Umenberger Mina Ferizbegovic Thomas B. Schon H. Hjalmarsson 23 38 0 04 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games Kaipeng Zhang Zhuoran Yang Tamer Basar 32 125 0 31 May 2019
Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator K. Krauth Stephen Tu Benjamin Recht 27 57 0 30 May 2019
The Gap Between Model-Based and Model-Free Methods on the Linear Quadratic Regulator: An Asymptotic Viewpoint Stephen Tu Benjamin Recht OffRL 24 150 0 09 Dec 2018
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition Hamed Karimi J. Nutini Mark Schmidt 139 1,205 0 16 Aug 2016