A Kernel Loss for Solving the Bellman Equation

25 May 2019

Papers citing "A Kernel Loss for Solving the Bellman Equation"

Neural Network Approximation for Pessimistic Offline Reinforcement Learning Di Wu Yuling Jiao Li Shen Haizhao Yang Xiliang Lu OffRL 29 1 0 19 Dec 2023
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning Outongyi Lv Bingxin Zhou OffRL 41 0 0 05 Jul 2023
Distributional Offline Policy Evaluation with Predictive Error Guarantees Runzhe Wu Masatoshi Uehara Wen Sun OffRL 38 13 0 19 Feb 2023
Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability Hanlin Zhu Amy Zhang OffRL 21 2 0 07 Feb 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning Hanlin Zhu Paria Rashidinejad Jiantao Jiao OffRL 38 15 0 30 Jan 2023
Inference on Strongly Identified Functionals of Weakly Identified Functions Andrew Bennett Nathan Kallus Xiaojie Mao Whitney Newey Vasilis Syrgkanis Masatoshi Uehara 35 15 0 17 Aug 2022
Robust Losses for Learning Value Functions Andrew Patterson Victor Liao Martha White 25 12 0 17 May 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error Scott Fujimoto D. Meger Doina Precup Ofir Nachum S. Gu 30 32 0 28 Jan 2022
Hyperparameter Selection Methods for Fitted Q-Evaluation with Error Guarantee Kohei Miyaguchi OffRL 38 1 0 07 Jan 2022
Optimal policy evaluation using kernel-based temporal difference methods Yaqi Duan Mengdi Wang Martin J. Wainwright OffRL 22 26 0 24 Sep 2021
Convergent and Efficient Deep Q Network Algorithm Zhikang T. Wang Masahito Ueda 18 12 0 29 Jun 2021
Bayesian Bellman Operators M. Fellows Kristian Hartikainen Shimon Whiteson OffRL 37 15 0 09 Jun 2021
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings Ming Yin Yu-Xiang Wang OffRL 32 19 0 13 May 2021
Logistic Q-Learning Joan Bas-Serrano Sebastian Curi Andreas Krause Gergely Neu 11 40 0 21 Oct 2020
Minimax Value Interval for Off-Policy Evaluation and Policy Optimization Nan Jiang Jiawei Huang OffRL 25 17 0 06 Feb 2020
Minimax Weight and Q-Function Learning for Off-Policy Evaluation Masatoshi Uehara Jiawei Huang Nan Jiang OffRL 22 183 0 28 Oct 2019
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation Ziyang Tang Yihao Feng Lihong Li Dengyong Zhou Qiang Liu OffRL 19 67 0 16 Oct 2019
A Kernel Test of Goodness of Fit Kacper P. Chwialkowski Heiko Strathmann A. Gretton BDL 107 324 0 09 Feb 2016