Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.04299
Cited By
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation
10 September 2019
Gang Wang
Bingcong Li
G. Giannakis
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation"
19 / 19 papers shown
Title
A Concentration Bound for TD(0) with Function Approximation
Siddharth Chandak
Vivek Borkar
29
0
0
16 Dec 2023
A primal-dual perspective for distributed TD-learning
Han-Dong Lim
Donghwan Lee
19
1
0
01 Oct 2023
On Rademacher Complexity-based Generalization Bounds for Deep Learning
Lan V. Truong
MLT
41
13
0
08 Aug 2022
A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences
Han Shen
Tianyi Chen
42
15
0
21 Jun 2022
Generalization Error Bounds on Deep Learning with Markov Datasets
Lan V. Truong
29
8
0
23 Dec 2021
Finite-Time Error Bounds for Distributed Linear Stochastic Approximation
Yixuan Lin
V. Gupta
Ji Liu
29
3
0
24 Nov 2021
A Concentration Bound for LSPE(
λ
λ
λ
)
Siddharth Chandak
Vivek Borkar
H. Dolhare
35
0
0
04 Nov 2021
Concentration of Contractive Stochastic Approximation and Reinforcement Learning
Siddharth Chandak
Vivek Borkar
Parth Dodhia
40
17
0
27 Jun 2021
Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity
Shaocong Ma
Ziyi Chen
Yi Zhou
Shaofeng Zou
15
11
0
30 Mar 2021
Multi-Agent Off-Policy TD Learning: Finite-Time Analysis with Near-Optimal Sample Complexity and Communication Complexity
Ziyi Chen
Yi Zhou
Rongrong Chen
OffRL
19
7
0
24 Mar 2021
On Convergence of Gradient Expected Sarsa(
λ
λ
λ
)
Long Yang
Gang Zheng
Yu Zhang
Qian Zheng
Pengfei Li
Gang Pan
21
2
0
14 Dec 2020
Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis
Shaocong Ma
Yi Zhou
Shaofeng Zou
OffRL
6
14
0
26 Oct 2020
A priori guarantees of finite-time convergence for Deep Neural Networks
Anushree Rankawat
M. Rankawat
Harshal B. Oza
12
0
0
16 Sep 2020
Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning
Shuang Qiu
Zhuoran Yang
Xiaohan Wei
Jieping Ye
Zhaoran Wang
31
38
0
23 Aug 2020
Reinforcement Learning for Caching with Space-Time Popularity Dynamics
A. Sadeghi
G. Giannakis
Gang Wang
Fatemeh Sheikholeslami
16
1
0
19 May 2020
Reanalysis of Variance Reduced Temporal Difference Learning
Tengyu Xu
Zhe Wang
Yi Zhou
Yingbin Liang
OffRL
24
38
0
07 Jan 2020
A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation
Pan Xu
Quanquan Gu
13
66
0
10 Dec 2019
Finite-Sample Analysis of Decentralized Temporal-Difference Learning with Linear Function Approximation
Jun Sun
Gang Wang
G. Giannakis
Qinmin Yang
Zaiyue Yang
OffRL
11
20
0
03 Nov 2019
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory
Bin Hu
U. Syed
10
58
0
16 Jun 2019
1