1712.08642
Cited By
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
22 December 2017
Stephen Tu
Benjamin Recht
OffRL
Papers citing
"Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator"
28 / 28 papers shown
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
Xuyang Chen
Jingliang Duan
Lin Zhao
54
1
0
02 May 2025
Achieving Tighter Finite-Time Rates for Heterogeneous Federated Stochastic Approximation under Markovian Sampling
Feng Zhu
Aritra Mitra
Robert W. Heath
FedML
36
0
0
15 Apr 2025
Coordinating Planning and Tracking in Layered Control Policies via Actor-Critic Learning
Fengjun Yang
Nikolai Matni
OffRL
31
0
0
03 Aug 2024
Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range
Yuzi Yan
Yuan-Chung Shen
34
0
0
05 Mar 2024
The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation
P. Amortila
Nan Jiang
Csaba Szepesvári
OffRL
29
3
0
25 Jul 2023
Learning and Concentration for High Dimensional Linear Gaussians: an Invariant Subspace Approach
Muhammad Naeem
29
2
0
04 Apr 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
Zhifa Ke
Junyu Zhang
Zaiwen Wen
24
0
0
25 Feb 2023
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Zichen Zhang
Johannes Kirschner
Junxi Zhang
Francesco Zanini
Alex Ayoub
Masood Dehghan
Dale Schuurmans
OffRL
24
3
0
17 Dec 2022
Transportation-Inequalities, Lyapunov Stability and Sampling for Dynamical Systems on Continuous State Space
Muhammad Naeem
Miroslav Pajic
22
3
0
25 May 2022
A Complete Characterization of Linear Estimators for Offline Policy Evaluation
Juan C. Perdomo
A. Krishnamurthy
Peter L. Bartlett
Sham Kakade
OffRL
27
3
0
08 Mar 2022
Single Time-scale Actor-critic Method to Solve the Linear Quadratic Regulator with Convergence Guarantees
Mo Zhou
Jianfeng Lu
29
13
0
31 Jan 2022
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
25
35
0
10 Feb 2020
Probabilistic Safety Constraints for Learned High Relative Degree System Dynamics
M. J. Khojasteh
Vikas Dhiman
M. Franceschetti
Nikolay Atanasov
34
73
0
20 Dec 2019
Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator
Yuwei Luo
Zhuoran Yang
Zhaoran Wang
Mladen Kolar
26
9
0
14 Dec 2019
Statistical Learning for Analysis of Networked Control Systems over Unknown Channels
Konstantinos Gatsis
George J. Pappas
14
11
0
08 Nov 2019
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games
Zuyue Fu
Zhuoran Yang
Yongxin Chen
Zhaoran Wang
14
54
0
16 Oct 2019
Finite-Time Performance of Distributed Temporal Difference Learning with Linear Function Approximation
Thinh T. Doan
S. T. Maguluri
Justin Romberg
30
41
0
25 Jul 2019
Alice's Adventures in the Markovian World
Zhanzhan Zhao
Haoran Sun
22
0
0
21 Jul 2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Zhuoran Yang
Yongxin Chen
Mingyi Hong
Zhaoran Wang
32
39
0
14 Jul 2019
From self-tuning regulators to reinforcement learning and back again
Nikolai Matni
Alexandre Proutière
Anders Rantzer
Stephen Tu
16
88
0
27 Jun 2019
Finite-Sample Analysis for SARSA with Linear Function Approximation
Shaofeng Zou
Tengyu Xu
Yingbin Liang
13
146
0
06 Feb 2019
On the Global Convergence of Imitation Learning: A Case for Linear Quadratic Regulator
Qi Cai
Mingyi Hong
Yongxin Chen
Zhaoran Wang
21
34
0
11 Jan 2019
Simple Regret Minimization for Contextual Bandits
A. Deshmukh
Srinagesh Sharma
J. Cutler
M. Moldwin
Clayton Scott
14
24
0
17 Oct 2018
A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation
Jalaj Bhandari
Daniel Russo
Raghav Singal
16
334
0
06 Jun 2018
Learning convex bounds for linear quadratic control policy synthesis
Jack Umenberger
Thomas B. Schon
24
12
0
01 Jun 2018
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Yasin Abbasi-Yadkori
N. Lazić
Csaba Szepesvári
OffRL
13
94
0
17 Apr 2018
Structured Control Nets for Deep Reinforcement Learning
Mario Srouji
Jian Zhang
Ruslan Salakhutdinov
30
43
0
22 Feb 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi
35
598
0
15 Jan 2018