Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.03442
Cited By
Average-Reward Reinforcement Learning with Trust Region Methods
7 June 2021
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Average-Reward Reinforcement Learning with Trust Region Methods"
10 / 10 papers shown
Title
Average-Reward Reinforcement Learning with Entropy Regularization
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OOD
61
2
0
17 Jan 2025
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks
Yi Wan
D. Korenkevych
Zheqing Zhu
OffRL
CLL
47
0
0
12 Jan 2025
Average Reward Reinforcement Learning for Wireless Radio Resource Management
Kun Yang
Jing Yang
Cong Shen
46
1
0
12 Jan 2025
RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning
Yukinari Hisaki
Isao Ono
24
2
0
04 Aug 2024
Joint Admission Control and Resource Allocation of Virtual Network Embedding via Hierarchical Deep Reinforcement Learning
Tianfu Wang
Li Shen
Qilin Fan
Tong Xu
Tongliang Liu
Hui Xiong
26
4
0
25 Jun 2024
NeoRL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
Lenart Treven
Florian Dorfler
Stelian Coros
Andreas Krause
OffRL
33
0
0
03 Jun 2024
Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report
Jerrod Wigmore
B. Shrader
E. Modiano
OffRL
32
1
0
05 Apr 2024
Provable Policy Gradient Methods for Average-Reward Markov Potential Games
Min Cheng
Ruida Zhou
P. R. Kumar
Chao Tian
54
2
0
09 Mar 2024
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri
R. Jain
Haipeng Luo
21
2
0
02 Feb 2023
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning
Xiaoteng Ma
Shuai Ma
Li Xia
Qianchuan Zhao
16
3
0
15 Jun 2022
1