Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.05837
Cited By
Learning Bellman Complete Representations for Offline Policy Evaluation
12 July 2022
Jonathan D. Chang
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning Bellman Complete Representations for Offline Policy Evaluation"
12 / 12 papers shown
Title
Generalized Linear Markov Decision Process
Sinian Zhang
Kaicheng Zhang
Ziping Xu
Tianxi Cai
D. Zhou
46
0
0
01 Jun 2025
Primal-Dual Spectral Representation for Off-policy Evaluation
Yang Hu
Tianyi Chen
Na Li
Kai Wang
Bo Dai
OffRL
85
0
0
23 Oct 2024
The Central Role of the Loss Function in Reinforcement Learning
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
305
10
0
19 Sep 2024
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
Allen Nie
Yash Chandak
Christina J. Yuan
Anirudhan Badrinath
Yannis Flet-Berliac
Emma Brunskil
OffRL
99
0
0
27 May 2024
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes
Andrew Bennett
Nathan Kallus
Miruna Oprescu
Wen Sun
Kaiwen Wang
AAML
OffRL
89
1
0
29 Mar 2024
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
125
12
0
11 Feb 2024
State-Action Similarity-Based Representations for Off-Policy Evaluation
Brahma S. Pavse
Josiah P. Hanna
OffRL
73
4
0
27 Oct 2023
π
2
vec
\pi2\text{vec}
π
2
vec
: Policy Representations with Successor Features
Gianluca Scarpellini
Ksenia Konyushkova
Claudio Fantacci
T. Paine
Yutian Chen
Misha Denil
OffRL
62
0
0
16 Jun 2023
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Kaiwen Wang
Kevin Zhou
Runzhe Wu
Nathan Kallus
Wen Sun
OffRL
84
19
0
25 May 2023
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu
Masatoshi Uehara
Wen Sun
OffRL
75
14
0
19 Feb 2023
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
Andrea Zanette
OffRL
65
15
0
10 Nov 2022
Oracle Inequalities for Model Selection in Offline Reinforcement Learning
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
Emma Brunskill
OffRL
83
13
0
03 Nov 2022
1