ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.12772
  4. Cited By
Intentionally-underestimated Value Function at Terminal State for
  Temporal-difference Learning with Mis-designed Reward

Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward

24 August 2023
Taisuke Kobayashi
ArXivPDFHTML

Papers citing "Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward"

2 / 2 papers shown
Title
Improvements of Dark Experience Replay and Reservoir Sampling towards Better Balance between Consolidation and Plasticity
Improvements of Dark Experience Replay and Reservoir Sampling towards Better Balance between Consolidation and Plasticity
Taisuke Kobayashi
CLL
41
0
0
29 Apr 2025
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
43
3
0
08 Mar 2023
1