ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.14308
78
18
v1v2v3v4 (latest)

Concentration of Contractive Stochastic Approximation and Reinforcement Learning

27 June 2021
Siddharth Chandak
Vivek Borkar
ArXiv (abs)PDFHTML
Abstract

Using a martingale concentration inequality, concentration bounds `from time n0n_0n0​ on' are derived for stochastic approximation algorithms with contractive maps and both martingale difference and Markov noises. These are applied to reinforcement learning algorithms, in particular to asynchronous Q-learning and TD(0).

View on arXiv
Comments on this paper