ResearchTrend.AI
Why Target Networks Stabilise Temporal Difference Methods

24 February 2023
Matt Fellows
Matthew Smith
Shimon Whiteson
    OOD
    AAML

Papers citing "Why Target Networks Stabilise Temporal Difference Methods"

6 / 6 papers shown
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
Han-Dong Lim
Donghwan Lee
16
0
0
15 Apr 2025
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
62
14
0
05 Jul 2024
A Bayesian Solution To The Imitation Gap
Risto Vuorio
Mattie Fellows
Cong Lu
Clémence Grislain
Shimon Whiteson
30
1
0
29 Jun 2024
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
Fengdi Che
Chenjun Xiao
Jincheng Mei
Bo Dai
Ramki Gummadi
Oscar A Ramirez
Christopher K Harris
A. R. Mahmood
Dale Schuurmans
30
5
0
31 May 2024
RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient Minimum Radiation Exposure Pathway
B. Sadhu
Trijit Sadhu
S. Anand
AI4CE
22
0
0
01 Feb 2024
Bayesian Exploration Networks
Matt Fellows
Brandon Kaplowitz
Christian Schroeder de Witt
Shimon Whiteson
BDL
31
3
0
24 Aug 2023