Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.02786
Cited By
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
6 July 2020
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
Re-assign community
ArXiv
PDF
HTML
Papers citing
"TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?"
2 / 2 papers shown
Title
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
21
7
0
24 Feb 2023
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
24
54
0
11 May 2021
1