ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.09652
  4. Cited By
On Convergence of some Gradient-based Temporal-Differences Algorithms
  for Off-Policy Learning

On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning

27 December 2017
Huizhen Yu
    OffRL
ArXivPDFHTML

Papers citing "On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning"

4 / 4 papers shown
Title
Finite-Time Error Bounds for Greedy-GQ
Finite-Time Error Bounds for Greedy-GQ
Yue Wang
Yi Zhou
Shaofeng Zou
31
1
0
06 Sep 2022
Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved
  Complexity
Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity
Shaocong Ma
Ziyi Chen
Yi Zhou
Shaofeng Zou
17
11
0
30 Mar 2021
Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth
  Nonlinear TD Learning
Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning
Shuang Qiu
Zhuoran Yang
Xiaohan Wei
Jieping Ye
Zhaoran Wang
33
38
0
23 Aug 2020
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic
  Mean-Field Games
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games
Zuyue Fu
Zhuoran Yang
Yongxin Chen
Zhaoran Wang
19
54
0
16 Oct 2019
1