ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.01120
21
2

Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation

3 September 2023
Jan Malte Lichtenberg
Alexander K. Buchholz
Giuseppe Di Benedetto
M. Ruffini
Ben London
    OffRL
ArXivPDFHTML
Abstract

"Clipping" (a.k.a. importance weight truncation) is a widely used variance-reduction technique for counterfactual off-policy estimators. Like other variance-reduction techniques, clipping reduces variance at the cost of increased bias. However, unlike other techniques, the bias introduced by clipping is always a downward bias (assuming non-negative rewards), yielding a lower bound on the true expected reward. In this work we propose a simple extension, called double clipping\textit{double clipping}double clipping, which aims to compensate this downward bias and thus reduce the overall bias, while maintaining the variance reduction properties of the original estimator.

View on arXiv
Comments on this paper