Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.13589
Cited By
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
26 May 2022
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes"
15 / 15 papers shown
Title
Automatic Reward Shaping from Confounded Offline Data
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRL
OnRL
28
1
0
16 May 2025
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
141
0
0
01 May 2025
Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing
Jitao Wang
C. Shi
John D. Piette
Joshua R. Loftus
Donglin Zeng
Zhenke Wu
OffRL
64
0
0
10 Jan 2025
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
Yuheng Zhang
Nan Jiang
OffRL
29
4
0
22 Feb 2024
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
16
6
0
01 Jun 2023
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage
Jose H. Blanchet
Miao Lu
Tong Zhang
Han Zhong
OffRL
42
29
0
16 May 2023
Implicit Anatomical Rendering for Medical Image Segmentation with Stochastic Experts
Chenyu You
Weicheng Dai
Yifei Min
Lawrence H. Staib
James S. Duncan
MedIm
69
27
0
06 Apr 2023
One Neuron Saved Is One Neuron Earned: On Parametric Efficiency of Quadratic Networks
Fenglei Fan
Hangcheng Dong
Zhongming Wu
Lecheng Ruan
T. Zeng
Yiming Cui
Jing-Xiao Liao
59
8
0
11 Mar 2023
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Zuyue Fu
Zhengling Qi
Zhuoran Yang
Zhaoran Wang
Lan Wang
OffRL
20
0
0
23 Dec 2022
Offline Policy Evaluation and Optimization under Confounding
Chinmaya Kausik
Yangyi Lu
Kevin Tan
Maggie Makar
Yixin Wang
Ambuj Tewari
OffRL
23
8
0
29 Nov 2022
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
Miao Lu
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
OffRL
34
1
0
12 Sep 2022
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments
Mengxin Yu
Zhuoran Yang
Jianqing Fan
OffRL
21
8
0
23 Aug 2022
TF-Blender: Temporal Feature Blender for Video Object Detection
Yiming Cui
Liqi Yan
Zhiwen Cao
Dongfang Liu
ViT
53
100
0
12 Aug 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
219
413
0
16 Feb 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
1