ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.12239
  4. Cited By
Off-Policy Average Reward Actor-Critic with Deterministic Policy Search

Off-Policy Average Reward Actor-Critic with Deterministic Policy Search

20 May 2023
Naman Saxena
Subhojyoti Khastagir
Shishir Kolathaya
S. Bhatnagar
    OffRL
ArXivPDFHTML

Papers citing "Off-Policy Average Reward Actor-Critic with Deterministic Policy Search"

8 / 8 papers shown
Title
Towards Optimal Offline Reinforcement Learning
Towards Optimal Offline Reinforcement Learning
Mengmeng Li
Daniel Kuhn
Tobias Sutter
OffRL
67
0
0
15 Mar 2025
Average-Reward Reinforcement Learning with Entropy Regularization
Average-Reward Reinforcement Learning with Entropy Regularization
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OOD
61
2
0
17 Jan 2025
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks
Yi Wan
D. Korenkevych
Zheqing Zhu
OffRL
CLL
55
0
0
12 Jan 2025
Average Reward Reinforcement Learning for Wireless Radio Resource Management
Average Reward Reinforcement Learning for Wireless Radio Resource Management
Kun Yang
Jing Yang
Cong Shen
59
1
0
12 Jan 2025
NeoRL: Efficient Exploration for Nonepisodic RL
NeoRL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
Lenart Treven
Florian Dorfler
Stelian Coros
Andreas Krause
OffRL
41
0
0
03 Jun 2024
Adaptive Advantage-Guided Policy Regularization for Offline
  Reinforcement Learning
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu
Yang Li
Yixing Lan
Hao Gao
Wei Pan
Xin Xu
OffRL
41
5
0
30 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent
  Baseline
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
36
0
0
04 May 2024
When Do Off-Policy and On-Policy Policy Gradient Methods Align?
When Do Off-Policy and On-Policy Policy Gradient Methods Align?
Davide Mambelli
Stephan Bongers
O. Zoeter
M. Spaan
F. Oliehoek
OffRL
26
0
0
19 Feb 2024
1