ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.09900
  4. Cited By
Policy learning "without'' overlap: Pessimism and generalized empirical
  Bernstein's inequality

Policy learning "without'' overlap: Pessimism and generalized empirical Bernstein's inequality

19 December 2022
Ying Jin
Zhimei Ren
Zhuoran Yang
Zhaoran Wang
    OffRL
ArXivPDFHTML

Papers citing "Policy learning "without'' overlap: Pessimism and generalized empirical Bernstein's inequality"

25 / 25 papers shown
Title
Clustered KL-barycenter design for policy evaluation
Simon Weissmann
Till Freihaut
Claire Vernade
Giorgia Ramponi
Leif Döring
OffRL
66
0
0
04 Mar 2025
Off-policy estimation with adaptively collected data: the power of
  online learning
Off-policy estimation with adaptively collected data: the power of online learning
Jeonghwan Lee
Cong Ma
OffRL
76
0
0
19 Nov 2024
Robust Offline Policy Learning with Observational Data from Multiple
  Sources
Robust Offline Policy Learning with Observational Data from Multiple Sources
Aldo G. Carranza
Susan Athey
OffRL
21
2
0
11 Oct 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in
  Tabular MDP
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
45
0
0
04 Jun 2024
Combining Experimental and Historical Data for Policy Evaluation
Combining Experimental and Historical Data for Policy Evaluation
Ting Li
Chengchun Shi
Qianglin Wen
Yang Sui
Yongli Qin
Chunbo Lai
Hongtu Zhu
OffRL
46
0
0
01 Jun 2024
Applied Causal Inference Powered by ML and AI
Applied Causal Inference Powered by ML and AI
Victor Chernozhukov
Christian Hansen
Nathan Kallus
Martin Spindler
Vasilis Syrgkanis
CML
36
29
0
04 Mar 2024
On Sample-Efficient Offline Reinforcement Learning: Data Diversity,
  Posterior Sampling, and Beyond
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond
Thanh Nguyen-Tang
Raman Arora
OffRL
33
3
0
06 Jan 2024
Rethinking Model-based, Policy-based, and Value-based Reinforcement
  Learning via the Lens of Representation Complexity
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity
Guhao Feng
Han Zhong
OffRL
73
2
0
28 Dec 2023
Individualized Policy Evaluation and Learning under Clustered Network Interference
Individualized Policy Evaluation and Learning under Clustered Network Interference
Yi Zhang
Kosuke Imai
OffRL
34
1
0
04 Nov 2023
Pessimistic Off-Policy Multi-Objective Optimization
Pessimistic Off-Policy Multi-Objective Optimization
S. Alizadeh
Aniruddha Bhargava
Karthick Gopalswamy
Lalit P. Jain
B. Kveton
Ge Liu
OffRL
35
0
0
28 Oct 2023
Positivity-free Policy Learning with Observational Data
Positivity-free Policy Learning with Observational Data
Pan Zhao
Antoine Chambaz
Julie Josse
Shu Yang
34
6
0
10 Oct 2023
Importance-Weighted Offline Learning Done Right
Importance-Weighted Offline Learning Done Right
Germano Gabbianelli
Gergely Neu
Matteo Papini
OffRL
21
5
0
27 Sep 2023
Bayesian Safe Policy Learning with Chance Constrained Optimization:
  Application to Military Security Assessment during the Vietnam War
Bayesian Safe Policy Learning with Chance Constrained Optimization: Application to Military Security Assessment during the Vietnam War
Zeyang Jia
Eli Ben-Michael
Kosuke Imai
24
4
0
17 Jul 2023
Proportional Response: Contextual Bandits for Simple and Cumulative
  Regret Minimization
Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization
Sanath Kumar Krishnamurthy
Ruohan Zhan
Susan Athey
Emma Brunskill
30
6
0
05 Jul 2023
Offline Policy Evaluation for Reinforcement Learning with Adaptively
  Collected Data
Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data
Sunil Madhow
Dan Xiao
Ming Yin
Yu-Xiang Wang
OffRL
27
0
0
24 Jun 2023
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual
  Bandits
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits
Lequn Wang
A. Krishnamurthy
Aleksandrs Slivkins
OffRL
35
8
0
13 Jun 2023
Off-policy evaluation beyond overlap: partial identification through
  smoothness
Off-policy evaluation beyond overlap: partial identification through smoothness
Samir Khan
Martin Saveski
J. Ugander
OffRL
33
5
0
19 May 2023
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning
Gen Li
Yuling Yan
Yuxin Chen
Jianqing Fan
OffRL
73
12
0
14 Apr 2023
Efficient and robust transfer learning of optimal individualized
  treatment regimes with right-censored survival data
Efficient and robust transfer learning of optimal individualized treatment regimes with right-censored survival data
Pan Zhao
Julie Josse
Shu Yang
OffRL
29
3
0
13 Jan 2023
Optimal Conservative Offline RL with General Function Approximation via
  Augmented Lagrangian
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian
Paria Rashidinejad
Hanlin Zhu
Kunhe Yang
Stuart J. Russell
Jiantao Jiao
OffRL
35
26
0
01 Nov 2022
Anytime-valid off-policy inference for contextual bandits
Anytime-valid off-policy inference for contextual bandits
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
43
25
0
19 Oct 2022
Semi-supervised Batch Learning From Logged Data
Semi-supervised Batch Learning From Logged Data
Gholamali Aminian
Armin Behnamnia
R. Vega
Laura Toni
Chengchun Shi
Hamid R. Rabiee
Omar Rivasplata
Miguel R. D. Rodrigues
OffRL
26
0
0
15 Sep 2022
Upper bounds on the Natarajan dimensions of some function classes
Upper bounds on the Natarajan dimensions of some function classes
Yingji Jin
36
6
0
15 Sep 2022
Pessimistic Model-based Offline Reinforcement Learning under Partial
  Coverage
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
96
144
0
13 Jul 2021
Estimating means of bounded random variables by betting
Estimating means of bounded random variables by betting
Ian Waudby-Smith
Aaditya Ramdas
59
148
0
19 Oct 2020
1