arXiv:2212.09900
Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality
19 December 2022
Ying Jin
Zhimei Ren
Zhuoran Yang
Zhaoran Wang
OffRL
Papers citing
"Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality"
25 / 25 papers shown
Title
Clustered KL-barycenter design for policy evaluation
Simon Weissmann
Till Freihaut
Claire Vernade
Giorgia Ramponi
Leif Döring
OffRL
66
0
0
04 Mar 2025
Off-policy estimation with adaptively collected data: the power of online learning
Jeonghwan Lee
Cong Ma
OffRL
76
0
0
19 Nov 2024
Robust Offline Policy Learning with Observational Data from Multiple Sources
Aldo G. Carranza
Susan Athey
OffRL
21
2
0
11 Oct 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
45
0
0
04 Jun 2024
Combining Experimental and Historical Data for Policy Evaluation
Ting Li
Chengchun Shi
Qianglin Wen
Yang Sui
Yongli Qin
Chunbo Lai
Hongtu Zhu
OffRL
46
0
0
01 Jun 2024
Applied Causal Inference Powered by ML and AI
Victor Chernozhukov
Christian Hansen
Nathan Kallus
Martin Spindler
Vasilis Syrgkanis
CML
36
29
0
04 Mar 2024
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond
Thanh Nguyen-Tang
Raman Arora
OffRL
33
3
0
06 Jan 2024
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity
Guhao Feng
Han Zhong
OffRL
73
2
0
28 Dec 2023
Individualized Policy Evaluation and Learning under Clustered Network Interference
Yi Zhang
Kosuke Imai
OffRL
34
1
0
04 Nov 2023
Pessimistic Off-Policy Multi-Objective Optimization
S. Alizadeh
Aniruddha Bhargava
Karthick Gopalswamy
Lalit P. Jain
B. Kveton
Ge Liu
OffRL
35
0
0
28 Oct 2023
Positivity-free Policy Learning with Observational Data
Pan Zhao
Antoine Chambaz
Julie Josse
Shu Yang
34
6
0
10 Oct 2023
Importance-Weighted Offline Learning Done Right
Germano Gabbianelli
Gergely Neu
Matteo Papini
OffRL
21
5
0
27 Sep 2023
Bayesian Safe Policy Learning with Chance Constrained Optimization: Application to Military Security Assessment during the Vietnam War
Zeyang Jia
Eli Ben-Michael
Kosuke Imai
24
4
0
17 Jul 2023
Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization
Sanath Kumar Krishnamurthy
Ruohan Zhan
Susan Athey
Emma Brunskill
30
6
0
05 Jul 2023
Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data
Sunil Madhow
Dan Xiao
Ming Yin
Yu-Xiang Wang
OffRL
27
0
0
24 Jun 2023
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits
Lequn Wang
A. Krishnamurthy
Aleksandrs Slivkins
OffRL
35
8
0
13 Jun 2023
Off-policy evaluation beyond overlap: partial identification through smoothness
Samir Khan
Martin Saveski
J. Ugander
OffRL
33
5
0
19 May 2023
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning
Gen Li
Yuling Yan
Yuxin Chen
Jianqing Fan
OffRL
73
12
0
14 Apr 2023
Efficient and robust transfer learning of optimal individualized treatment regimes with right-censored survival data
Pan Zhao
Julie Josse
Shu Yang
OffRL
29
3
0
13 Jan 2023
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian
Paria Rashidinejad
Hanlin Zhu
Kunhe Yang
Stuart J. Russell
Jiantao Jiao
OffRL
35
26
0
01 Nov 2022
Anytime-valid off-policy inference for contextual bandits
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
43
25
0
19 Oct 2022
Semi-supervised Batch Learning From Logged Data
Gholamali Aminian
Armin Behnamnia
R. Vega
Laura Toni
Chengchun Shi
Hamid R. Rabiee
Omar Rivasplata
Miguel R. D. Rodrigues
OffRL
26
0
0
15 Sep 2022
Upper bounds on the Natarajan dimensions of some function classes
Ying Jin
36
6
0
15 Sep 2022
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
96
144
0
13 Jul 2021
Estimating means of bounded random variables by betting
Ian Waudby-Smith
Aaditya Ramdas
59
148
0
19 Oct 2020