ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.08642
  4. Cited By
Least-Squares Temporal Difference Learning for the Linear Quadratic
  Regulator

Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator

22 December 2017
Stephen Tu
Benjamin Recht
    OffRL
ArXivPDFHTML

Papers citing "Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator"

28 / 28 papers shown
Title
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
Xuyang Chen
Jingliang Duan
Lin Zhao
54
1
0
02 May 2025
Achieving Tighter Finite-Time Rates for Heterogeneous Federated Stochastic Approximation under Markovian Sampling
Achieving Tighter Finite-Time Rates for Heterogeneous Federated Stochastic Approximation under Markovian Sampling
Feng Zhu
Aritra Mitra
Robert W. Heath
FedML
36
0
0
15 Apr 2025
Coordinating Planning and Tracking in Layered Control Policies via
  Actor-Critic Learning
Coordinating Planning and Tracking in Layered Control Policies via Actor-Critic Learning
Fengjun Yang
Nikolai Matni
OffRL
31
0
0
03 Aug 2024
Distributed Policy Gradient for Linear Quadratic Networked Control with
  Limited Communication Range
Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range
Yuzi Yan
Yuan-Chung Shen
34
0
0
05 Mar 2024
The Optimal Approximation Factors in Misspecified Off-Policy Value
  Function Estimation
The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation
P. Amortila
Nan Jiang
Csaba Szepesvári
OffRL
29
3
0
25 Jul 2023
Learning and Concentration for High Dimensional Linear Gaussians: an
  Invariant Subspace Approach
Learning and Concentration for High Dimensional Linear Gaussians: an Invariant Subspace Approach
Muhammad Naeem
29
2
0
04 Apr 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function
  Approximation
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
Zhifa Ke
Junyu Zhang
Zaiwen Wen
24
0
0
25 Feb 2023
Managing Temporal Resolution in Continuous Value Estimation: A
  Fundamental Trade-off
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Zichen Zhang
Johannes Kirschner
Junxi Zhang
Francesco Zanini
Alex Ayoub
Masood Dehghan
Dale Schuurmans
OffRL
24
3
0
17 Dec 2022
Transportation-Inequalities, Lyapunov Stability and Sampling for
  Dynamical Systems on Continuous State Space
Transportation-Inequalities, Lyapunov Stability and Sampling for Dynamical Systems on Continuous State Space
Muhammad Naeem
Miroslav Pajic
22
3
0
25 May 2022
A Complete Characterization of Linear Estimators for Offline Policy
  Evaluation
A Complete Characterization of Linear Estimators for Offline Policy Evaluation
Juan C. Perdomo
A. Krishnamurthy
Peter L. Bartlett
Sham Kakade
OffRL
27
3
0
08 Mar 2022
Single Time-scale Actor-critic Method to Solve the Linear Quadratic
  Regulator with Convergence Guarantees
Single Time-scale Actor-critic Method to Solve the Linear Quadratic Regulator with Convergence Guarantees
Mo Zhou
Jianfeng Lu
29
13
0
31 Jan 2022
Convergence Guarantees of Policy Optimization Methods for Markovian Jump
  Linear Systems
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
25
35
0
10 Feb 2020
Probabilistic Safety Constraints for Learned High Relative Degree System
  Dynamics
Probabilistic Safety Constraints for Learned High Relative Degree System Dynamics
M. J. Khojasteh
Vikas Dhiman
M. Franceschetti
Nikolay Atanasov
34
73
0
20 Dec 2019
Natural Actor-Critic Converges Globally for Hierarchical Linear
  Quadratic Regulator
Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator
Yuwei Luo
Zhuoran Yang
Zhaoran Wang
Mladen Kolar
26
9
0
14 Dec 2019
Statistical Learning for Analysis of Networked Control Systems over
  Unknown Channels
Statistical Learning for Analysis of Networked Control Systems over Unknown Channels
Konstantinos Gatsis
George J. Pappas
14
11
0
08 Nov 2019
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic
  Mean-Field Games
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games
Zuyue Fu
Zhuoran Yang
Yongxin Chen
Zhaoran Wang
14
54
0
16 Oct 2019
Finite-Time Performance of Distributed Temporal Difference Learning with
  Linear Function Approximation
Finite-Time Performance of Distributed Temporal Difference Learning with Linear Function Approximation
Thinh T. Doan
S. T. Maguluri
Justin Romberg
30
41
0
25 Jul 2019
Alice's Adventures in the Markovian World
Alice's Adventures in the Markovian World
Zhanzhan Zhao
Haoran Sun
22
0
0
21 Jul 2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic
  Regulator with Ergodic Cost
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Zhuoran Yang
Yongxin Chen
Mingyi Hong
Zhaoran Wang
32
39
0
14 Jul 2019
From self-tuning regulators to reinforcement learning and back again
From self-tuning regulators to reinforcement learning and back again
Nikolai Matni
Alexandre Proutière
Anders Rantzer
Stephen Tu
16
88
0
27 Jun 2019
Finite-Sample Analysis for SARSA with Linear Function Approximation
Finite-Sample Analysis for SARSA with Linear Function Approximation
Shaofeng Zou
Tengyu Xu
Yingbin Liang
13
146
0
06 Feb 2019
On the Global Convergence of Imitation Learning: A Case for Linear
  Quadratic Regulator
On the Global Convergence of Imitation Learning: A Case for Linear Quadratic Regulator
Qi Cai
Mingyi Hong
Yongxin Chen
Zhaoran Wang
21
34
0
11 Jan 2019
Simple Regret Minimization for Contextual Bandits
Simple Regret Minimization for Contextual Bandits
A. Deshmukh
Srinagesh Sharma
J. Cutler
M. Moldwin
Clayton Scott
14
24
0
17 Oct 2018
A Finite Time Analysis of Temporal Difference Learning With Linear
  Function Approximation
A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation
Jalaj Bhandari
Daniel Russo
Raghav Singal
16
334
0
06 Jun 2018
Learning convex bounds for linear quadratic control policy synthesis
Learning convex bounds for linear quadratic control policy synthesis
Jack Umenberger
Thomas B. Schon
24
12
0
01 Jun 2018
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Yasin Abbasi-Yadkori
N. Lazić
Csaba Szepesvári
OffRL
13
94
0
17 Apr 2018
Structured Control Nets for Deep Reinforcement Learning
Structured Control Nets for Deep Reinforcement Learning
Mario Srouji
Jian Zhang
Ruslan Salakhutdinov
30
43
0
22 Feb 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic
  Regulator
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi
35
598
0
15 Jan 2018
1