ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.15501
  4. Cited By
Doubly Robust Interval Estimation for Optimal Policy Evaluation in
  Online Learning

Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning

29 October 2021
Ye Shen
Hengrui Cai
Rui Song
    OffRL
ArXivPDFHTML

Papers citing "Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning"

2 / 2 papers shown
Title
Anytime-valid off-policy inference for contextual bandits
Anytime-valid off-policy inference for contextual bandits
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
43
25
0
19 Oct 2022
Online Bootstrap Inference For Policy Evaluation in Reinforcement
  Learning
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
50
27
0
08 Aug 2021
1