arXiv:2310.17245
CROP: Conservative Reward for Model-based Offline Policy Optimization
26 October 2023
Hao Li
Xiaohu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
Xiao-Yin Liu
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Bo-Xian Yao
Zeng-Guang Hou
OffRL
Papers citing "CROP: Conservative Reward for Model-based Offline Policy Optimization"
4 / 4 papers shown
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
102
262
0
04 Oct 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
219
413
0
16 Feb 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,955
0
04 May 2020