ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.01865
  4. Cited By
Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms
  via Batch Prioritized Experience Replay

Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay

2 November 2021
Dogan C. Cicek
Enes Duran
Baturay Saglam
Furkan B. Mutlu
Suleyman Serdar Kozat
    OffRL
ArXivPDFHTML

Papers citing "Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay"

2 / 2 papers shown
Title
Augmenting Offline RL with Unlabeled Data
Augmenting Offline RL with Unlabeled Data
Zhao Wang
Briti Gangopadhyay
Jia-Fong Yeh
Shingo Takamatsu
OffRL
33
0
0
11 Jun 2024
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step
  Q-learning: A Novel Correction Approach
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman Serdar Kozat
OffRL
OnRL
26
1
0
01 Aug 2022
1