ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.02786
  4. Cited By
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

6 July 2020
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
ArXivPDFHTML

Papers citing "TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?"

2 / 2 papers shown
Title
Why Target Networks Stabilise Temporal Difference Methods
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
21
7
0
24 Feb 2023
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation
  Perspective
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
24
54
0
11 May 2021
1