arXiv: 1905.10506
A Kernel Loss for Solving the Bellman Equation

25 May 2019
Yihao Feng
Lihong Li
Qiang Liu

Papers citing "A Kernel Loss for Solving the Bellman Equation"

18 / 18 papers shown
Neural Network Approximation for Pessimistic Offline Reinforcement Learning
Di Wu
Yuling Jiao
Li Shen
Haizhao Yang
Xiliang Lu
OffRL
29
1
0
19 Dec 2023
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning
Outongyi Lv
Bingxin Zhou
OffRL
41
0
0
05 Jul 2023
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu
Masatoshi Uehara
Wen Sun
OffRL
38
13
0
19 Feb 2023
Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability
Hanlin Zhu
Amy Zhang
OffRL
18
2
0
07 Feb 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Hanlin Zhu
Paria Rashidinejad
Jiantao Jiao
OffRL
38
15
0
30 Jan 2023
Inference on Strongly Identified Functionals of Weakly Identified Functions
Andrew Bennett
Nathan Kallus
Xiaojie Mao
Whitney Newey
Vasilis Syrgkanis
Masatoshi Uehara
32
15
0
17 Aug 2022
Robust Losses for Learning Value Functions
Andrew Patterson
Victor Liao
Martha White
23
12
0
17 May 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
D. Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Hyperparameter Selection Methods for Fitted Q-Evaluation with Error Guarantee
Kohei Miyaguchi
OffRL
38
1
0
07 Jan 2022
Optimal policy evaluation using kernel-based temporal difference methods
Yaqi Duan
Mengdi Wang
Martin J. Wainwright
OffRL
20
26
0
24 Sep 2021
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
14
12
0
29 Jun 2021
Bayesian Bellman Operators
M. Fellows
Kristian Hartikainen
Shimon Whiteson
OffRL
37
15
0
09 Jun 2021
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings
Ming Yin
Yu-Xiang Wang
OffRL
32
19
0
13 May 2021
Logistic Q-Learning
Joan Bas-Serrano
Sebastian Curi
Andreas Krause
Gergely Neu
9
40
0
21 Oct 2020
Minimax Value Interval for Off-Policy Evaluation and Policy Optimization
Nan Jiang
Jiawei Huang
OffRL
22
17
0
06 Feb 2020
Minimax Weight and Q-Function Learning for Off-Policy Evaluation
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
20
183
0
28 Oct 2019
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation
Ziyang Tang
Yihao Feng
Lihong Li
Dengyong Zhou
Qiang Liu
OffRL
17
67
0
16 Oct 2019
A Kernel Test of Goodness of Fit
Kacper P. Chwialkowski
Heiko Strathmann
A. Gretton
BDL
107
324
0
09 Feb 2016