Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon

20 November 2020
B. Hambly
Renyuan Xu
Huining Yang

Papers citing "Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon"

15 / 15 papers shown
From Deep Learning to LLMs: A survey of AI in Quantitative Investment
Bokai Cao
Saizhuo Wang
Xinyi Lin
Xiaojun Wu
Haohan Zhang
L. Ni
Jian Guo
AIFin
59
1
0
27 Mar 2025
Model-Free $\mu$-Synthesis: A Nonsmooth Optimization Perspective
Darioush Keivan
Xingang Guo
Peter M. Seiler
Geir Dullerud
Bin Hu
36
0
0
18 Feb 2024
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
33
0
0
22 Mar 2023
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators
Yinbin Han
Meisam Razaviyayn
Renyuan Xu
27
5
0
15 Mar 2023
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient
Xiangyuan Zhang
Tamer Basar
36
19
0
25 Feb 2023
Learning the Kalman Filter with Fine-Grained Sample Complexity
Xiangyuan Zhang
Bin Hu
Tamer Başar
26
16
0
30 Jan 2023
Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
Xingang Guo
Bin Hu
41
12
0
20 Oct 2022
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Bin Hu
Kaiqing Zhang
Na Li
M. Mesbahi
Maryam Fazel
Tamer Başar
87
27
0
10 Oct 2022
Statistical Learning Theory for Control: A Finite Sample Perspective
Anastasios Tsiamis
Ingvar M. Ziemann
Nikolai Matni
George J. Pappas
28
73
0
12 Sep 2022
Linear convergence of a policy gradient method for some finite horizon continuous time control problems
C. Reisinger
Wolfgang Stockinger
Yufei Zhang
21
5
0
22 Mar 2022
A Small Gain Analysis of Single Timescale Actor Critic
Alexander Olshevsky
Bahman Gharesifard
33
20
0
04 Mar 2022
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
29
167
0
08 Dec 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
31
6
0
13 Sep 2021
Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games
B. Hambly
Renyuan Xu
Huining Yang
23
25
0
27 Jul 2021
Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret
Asaf B. Cassel
Tomer Koren
OffRL
33
17
0
25 Feb 2021