Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.12399
Cited By
ConQUR: Mitigating Delusional Bias in Deep Q-learning
27 February 2020
A. Su
Jayden Ooi
Tyler Lu
Dale Schuurmans
Craig Boutilier
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ConQUR: Mitigating Delusional Bias in Deep Q-learning"
3 / 3 papers shown
Title
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
93
2
0
07 Sep 2024
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning
Xianfu Chen
Zhifeng Zhao
S. Mao
Celimuge Wu
Honggang Zhang
M. Bennis
OffRL
81
3
0
19 Sep 2022
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
77
38
0
17 Mar 2021
1