Policy Learning of MDPs with Mixed Continuous/Discrete Variables: A Case Study on Model-Free Control of Markovian Jump Systems

4 June 2020

Joao Paulo Jansch-Porto

Geir Dullerud

Papers citing "Policy Learning of MDPs with Mixed Continuous/Discrete Variables: A Case Study on Model-Free Control of Markovian Jump Systems"

20 / 20 papers shown

Title
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems Joao Paulo Jansch-Porto Bin Hu Geir Dullerud 47 35 0 10 Feb 2020
$Policy Optimization for $\mathcal{H}_2$ Linear Control with $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence$ Policy Optimization for $\mathcal{H}_2$ Linear Control with $\mathcal{H}_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence Kai Zhang Bin Hu Tamer Basar 45 119 0 21 Oct 2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost Zhuoran Yang Yongxin Chen Mingyi Hong Zhaoran Wang 80 39 0 14 Jul 2019
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory Bin Hu U. Syed 58 58 0 16 Jun 2019
Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems Dhruv Malik A. Pananjady Kush S. Bhatia K. Khamaru Peter L. Bartlett Martin J. Wainwright 48 198 0 20 Dec 2018
The Gap Between Model-Based and Model-Free Methods on the Linear Quadratic Regulator: An Asymptotic Viewpoint Stephen Tu Benjamin Recht OffRL 52 150 0 09 Dec 2018
Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator Sarah Dean Horia Mania Nikolai Matni Benjamin Recht Stephen Tu 33 283 0 23 May 2018
Model-Free Linear Quadratic Control via Reduction to Expert Prediction Yasin Abbasi-Yadkori N. Lazić Csaba Szepesvári OffRL 40 94 0 17 Apr 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator Maryam Fazel Rong Ge Sham Kakade M. Mesbahi 77 599 0 15 Jan 2018
On the Sample Complexity of the Linear Quadratic Regulator Sarah Dean Horia Mania Nikolai Matni Benjamin Recht Stephen Tu 63 574 0 04 Oct 2017
Deep Reinforcement Learning that Matters Peter Henderson Riashat Islam Philip Bachman Joelle Pineau Doina Precup David Meger OffRL 112 1,940 0 19 Sep 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 259 18,685 0 20 Jul 2017
A Unified Analysis of Stochastic Optimization Methods Using Jump System Theory and Quadratic Constraints Bin Hu Peter M. Seiler Anders Rantzer 99 35 0 25 Jun 2017
Towards Generalization and Simplicity in Continuous Control Aravind Rajeswaran Kendall Lowrey E. Todorov Sham Kakade OffRL 84 276 0 08 Mar 2017
Benchmarking Deep Reinforcement Learning for Continuous Control Yan Duan Xi Chen Rein Houthooft John Schulman Pieter Abbeel OffRL 76 1,689 0 22 Apr 2016
High-Dimensional Continuous Control Using Generalized Advantage Estimation John Schulman Philipp Moritz Sergey Levine Michael I. Jordan Pieter Abbeel OffRL 60 3,368 0 08 Jun 2015
End-to-End Training of Deep Visuomotor Policies Sergey Levine Chelsea Finn Trevor Darrell Pieter Abbeel BDL 249 3,418 0 02 Apr 2015
Trust Region Policy Optimization John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel 254 6,722 0 19 Feb 2015
Solving Factored MDPs with Continuous and Discrete Variables Carlos Guestrin Milos Hauskrecht Branislav Kveton 72 76 0 11 Jul 2012
Bayesian Nonparametric Inference of Switching Linear Dynamical Systems E. Fox Erik B. Sudderth Michael I. Jordan A. Willsky 74 244 0 19 Mar 2010