ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.05397
16
5

On the Performance of Temporal Difference Learning With Neural Networks

8 December 2023
Haoxing Tian
I. Paschalidis
Alexander Olshevsky
ArXivPDFHTML
Abstract

Neural Temporal Difference (TD) Learning is an approximate temporal difference method for policy evaluation that uses a neural network for function approximation. Analysis of Neural TD Learning has proven to be challenging. In this paper we provide a convergence analysis of Neural TD Learning with a projection onto B(θ0,ω)B(\theta_0, \omega)B(θ0​,ω), a ball of fixed radius ω\omegaω around the initial point θ0\theta_0θ0​. We show an approximation bound of O(ϵ)+O~(1/m)O(\epsilon) + \tilde{O} (1/\sqrt{m})O(ϵ)+O~(1/m​) where ϵ\epsilonϵ is the approximation quality of the best neural network in B(θ0,ω)B(\theta_0, \omega)B(θ0​,ω) and mmm is the width of all hidden layers in the network.

View on arXiv
Comments on this paper