ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.04299
  4. Cited By
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased
  Stochastic Approximation

A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation

10 September 2019
Gang Wang
Bingcong Li
G. Giannakis
ArXivPDFHTML

Papers citing "A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation"

19 / 19 papers shown
Title
A Concentration Bound for TD(0) with Function Approximation
A Concentration Bound for TD(0) with Function Approximation
Siddharth Chandak
Vivek Borkar
29
0
0
16 Dec 2023
A primal-dual perspective for distributed TD-learning
A primal-dual perspective for distributed TD-learning
Han-Dong Lim
Donghwan Lee
19
1
0
01 Oct 2023
On Rademacher Complexity-based Generalization Bounds for Deep Learning
On Rademacher Complexity-based Generalization Bounds for Deep Learning
Lan V. Truong
MLT
41
13
0
08 Aug 2022
A Single-Timescale Analysis For Stochastic Approximation With Multiple
  Coupled Sequences
A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences
Han Shen
Tianyi Chen
42
15
0
21 Jun 2022
Generalization Error Bounds on Deep Learning with Markov Datasets
Generalization Error Bounds on Deep Learning with Markov Datasets
Lan V. Truong
29
8
0
23 Dec 2021
Finite-Time Error Bounds for Distributed Linear Stochastic Approximation
Finite-Time Error Bounds for Distributed Linear Stochastic Approximation
Yixuan Lin
V. Gupta
Ji Liu
29
3
0
24 Nov 2021
A Concentration Bound for LSPE($λ$)
A Concentration Bound for LSPE(λλλ)
Siddharth Chandak
Vivek Borkar
H. Dolhare
35
0
0
04 Nov 2021
Concentration of Contractive Stochastic Approximation and Reinforcement
  Learning
Concentration of Contractive Stochastic Approximation and Reinforcement Learning
Siddharth Chandak
Vivek Borkar
Parth Dodhia
40
17
0
27 Jun 2021
Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved
  Complexity
Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity
Shaocong Ma
Ziyi Chen
Yi Zhou
Shaofeng Zou
15
11
0
30 Mar 2021
Multi-Agent Off-Policy TD Learning: Finite-Time Analysis with
  Near-Optimal Sample Complexity and Communication Complexity
Multi-Agent Off-Policy TD Learning: Finite-Time Analysis with Near-Optimal Sample Complexity and Communication Complexity
Ziyi Chen
Yi Zhou
Rongrong Chen
OffRL
19
7
0
24 Mar 2021
On Convergence of Gradient Expected Sarsa($λ$)
On Convergence of Gradient Expected Sarsa(λλλ)
Long Yang
Gang Zheng
Yu Zhang
Qian Zheng
Pengfei Li
Gang Pan
21
2
0
14 Dec 2020
Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence
  Analysis
Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis
Shaocong Ma
Yi Zhou
Shaofeng Zou
OffRL
6
14
0
26 Oct 2020
A priori guarantees of finite-time convergence for Deep Neural Networks
A priori guarantees of finite-time convergence for Deep Neural Networks
Anushree Rankawat
M. Rankawat
Harshal B. Oza
12
0
0
16 Sep 2020
Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth
  Nonlinear TD Learning
Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning
Shuang Qiu
Zhuoran Yang
Xiaohan Wei
Jieping Ye
Zhaoran Wang
31
38
0
23 Aug 2020
Reinforcement Learning for Caching with Space-Time Popularity Dynamics
Reinforcement Learning for Caching with Space-Time Popularity Dynamics
A. Sadeghi
G. Giannakis
Gang Wang
Fatemeh Sheikholeslami
16
1
0
19 May 2020
Reanalysis of Variance Reduced Temporal Difference Learning
Reanalysis of Variance Reduced Temporal Difference Learning
Tengyu Xu
Zhe Wang
Yi Zhou
Yingbin Liang
OffRL
24
38
0
07 Jan 2020
A Finite-Time Analysis of Q-Learning with Neural Network Function
  Approximation
A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation
Pan Xu
Quanquan Gu
13
66
0
10 Dec 2019
Finite-Sample Analysis of Decentralized Temporal-Difference Learning
  with Linear Function Approximation
Finite-Sample Analysis of Decentralized Temporal-Difference Learning with Linear Function Approximation
Jun Sun
Gang Wang
G. Giannakis
Qinmin Yang
Zaiyue Yang
OffRL
11
20
0
03 Nov 2019
Characterizing the Exact Behaviors of Temporal Difference Learning
  Algorithms Using Markov Jump Linear System Theory
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory
Bin Hu
U. Syed
10
58
0
16 Jun 2019
1