ConQUR: Mitigating Delusional Bias in Deep Q-learning

27 February 2020

Papers citing "ConQUR: Mitigating Delusional Bias in Deep Q-learning"

3 / 3 papers shown

Title
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn Hongyao Tang Glen Berseth OffRL 93 2 0 07 Sep 2024
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning Xianfu Chen Zhifeng Zhao S. Mao Celimuge Wu Honggang Zhang M. Bennis OffRL 81 3 0 19 Sep 2022
Regularized Behavior Value Estimation Çağlar Gülçehre Sergio Gomez Colmenarejo Ziyun Wang Jakub Sygnowski T. Paine Konrad Zolna Yutian Chen Matthew W. Hoffman Razvan Pascanu Nando de Freitas OffRL 77 38 0 17 Mar 2021