ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.06781
  4. Cited By
Characterizing the Exact Behaviors of Temporal Difference Learning
  Algorithms Using Markov Jump Linear System Theory

Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory

16 June 2019
Bin Hu
U. Syed
ArXivPDFHTML

Papers citing "Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory"

20 / 20 papers shown
Title
Rates of Convergence in the Central Limit Theorem for Markov Chains, with an Application to TD Learning
Rates of Convergence in the Central Limit Theorem for Markov Chains, with an Application to TD Learning
R. Srikant
46
5
0
28 Jan 2024
Finite-Time Analysis of Asynchronous Q-learning under Diminishing
  Step-Size from Control-Theoretic View
Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View
Han-Dong Lim
Dong-hwan Lee
30
1
0
25 Jul 2022
Exact Formulas for Finite-Time Estimation Errors of Decentralized
  Temporal Difference Learning with Linear Function Approximation
Exact Formulas for Finite-Time Estimation Errors of Decentralized Temporal Difference Learning with Linear Function Approximation
Xing-ming Guo
Bin Hu
13
2
0
20 Apr 2022
Convex Programs and Lyapunov Functions for Reinforcement Learning: A
  Unified Perspective on the Analysis of Value-Based Methods
Convex Programs and Lyapunov Functions for Reinforcement Learning: A Unified Perspective on the Analysis of Value-Based Methods
Xing-ming Guo
Bin Hu
OffRL
27
3
0
14 Feb 2022
Control Theoretic Analysis of Temporal Difference Learning
Dong-hwan Lee
Do Wan Kim
18
1
0
29 Dec 2021
Global Convergence Using Policy Gradient Methods for Model-free
  Markovian Jump Linear Quadratic Control
Global Convergence Using Policy Gradient Methods for Model-free Markovian Jump Linear Quadratic Control
Santanu Rathod
Manoj Bhadu
A. De
19
8
0
30 Nov 2021
PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning
  Method
PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method
Ziwei Guan
Tengyu Xu
Yingbin Liang
13
4
0
13 Oct 2021
Online Bootstrap Inference For Policy Evaluation in Reinforcement
  Learning
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
50
27
0
08 Aug 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear
  Function Approximation
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation
Zaiwei Chen
S. Khodadadian
S. T. Maguluri
OffRL
63
29
0
26 May 2021
On the Linear convergence of Natural Policy Gradient Algorithm
On the Linear convergence of Natural Policy Gradient Algorithm
S. Khodadadian
P. Jhunjhunwala
Sushil Mahavir Varma
S. T. Maguluri
40
56
0
04 May 2021
Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and
  Finite-Time Performance
Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance
Thinh T. Doan
14
45
0
03 Nov 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
90
146
0
04 May 2020
Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms
Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms
Tengyu Xu
Zhe Wang
Yingbin Liang
21
25
0
27 Apr 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump
  Linear Systems
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
25
35
0
10 Feb 2020
Explicit Mean-Square Error Bounds for Monte-Carlo and Linear Stochastic
  Approximation
Explicit Mean-Square Error Bounds for Monte-Carlo and Linear Stochastic Approximation
Shuhang Chen
Adithya M. Devraj
Ana Bušić
Sean P. Meyn
8
31
0
07 Feb 2020
A frequency-domain analysis of inexact gradient methods
A frequency-domain analysis of inexact gradient methods
Oran Gannot
24
25
0
31 Dec 2019
Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale
  Stochastic Approximation
Finite-Time Analysis and Restarting Scheme for Linear Two-Time-Scale Stochastic Approximation
Thinh T. Doan
21
36
0
23 Dec 2019
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased
  Stochastic Approximation
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation
Gang Wang
Bingcong Li
G. Giannakis
29
28
0
10 Sep 2019
Finite-Time Performance of Distributed Temporal Difference Learning with
  Linear Function Approximation
Finite-Time Performance of Distributed Temporal Difference Learning with Linear Function Approximation
Thinh T. Doan
S. T. Maguluri
Justin Romberg
30
41
0
25 Jul 2019
Finite-Sample Analysis of Nonlinear Stochastic Approximation with
  Applications in Reinforcement Learning
Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning
Zaiwei Chen
Sheng Zhang
Thinh T. Doan
John-Paul Clarke
S. T. Maguluri
17
58
0
27 May 2019
1