Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.04518
Cited By
Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning
11 February 2020
Nathan Kallus
Angela Zhou
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning"
18 / 18 papers shown
Title
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
141
0
0
01 May 2025
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
16
6
0
01 Jun 2023
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning
Yang Xu
C. Shi
S. Luo
Lan Wang
R. Song
OffRL
29
4
0
29 Dec 2022
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Zuyue Fu
Zhengling Qi
Zhuoran Yang
Zhaoran Wang
Lan Wang
OffRL
20
0
0
23 Dec 2022
Offline Policy Evaluation and Optimization under Confounding
Chinmaya Kausik
Yangyi Lu
Kevin Tan
Maggie Makar
Yixin Wang
Ambuj Tewari
OffRL
23
8
0
29 Nov 2022
Causal Deep Reinforcement Learning Using Observational Data
Wenxuan Zhu
Chao Yu
Q. Zhang
CML
OffRL
26
5
0
28 Nov 2022
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models
Rui Miao
Zhengling Qi
Xiaoke Zhang
OffRL
30
10
0
21 Sep 2022
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments
Mengxin Yu
Zhuoran Yang
Jianqing Fan
OffRL
21
8
0
23 Aug 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
54
22
0
26 May 2022
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain
David Bruns-Smith
CML
ELM
OffRL
24
12
0
02 Apr 2022
Stochastic Causal Programming for Bounding Treatment Effects
Kirtan Padh
Jakob Zeitler
David S. Watson
Matt J. Kusner
Ricardo M. A. Silva
Niki Kilbertus
CML
30
26
0
22 Feb 2022
Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process
C. Shi
Jin Zhu
Ye Shen
S. Luo
Hong Zhu
R. Song
OffRL
25
30
0
22 Feb 2022
Partial Counterfactual Identification from Observational and Experimental Data
Junzhe Zhang
Jin Tian
Elias Bareinboim
24
60
0
12 Oct 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
32
52
0
26 Apr 2021
Estimating and Improving Dynamic Treatment Regimes With a Time-Varying Instrumental Variable
Shuxiao Chen
B. Zhang
27
19
0
15 Apr 2021
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Luofeng Liao
Zuyue Fu
Zhuoran Yang
Yixin Wang
Mladen Kolar
Zhaoran Wang
OffRL
18
34
0
19 Feb 2021
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders
Andrew Bennett
Nathan Kallus
Lihong Li
Ali Mousavi
OffRL
35
43
0
27 Jul 2020
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding
Hongseok Namkoong
Ramtin Keramati
Steve Yadlowsky
Emma Brunskill
OffRL
8
63
0
12 Mar 2020
1