ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.10768
  4. Cited By
Anytime-valid off-policy inference for contextual bandits

Anytime-valid off-policy inference for contextual bandits

19 October 2022
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
    OffRL
ArXivPDFHTML

Papers citing "Anytime-valid off-policy inference for contextual bandits"

5 / 5 papers shown
Title
Inference with the Upper Confidence Bound Algorithm
Inference with the Upper Confidence Bound Algorithm
K. Khamaru
Cun-Hui Zhang
37
0
0
08 Aug 2024
Auditing Fairness by Betting
Auditing Fairness by Betting
Ben Chugg
Santiago Cortes-Gomez
Bryan Wilder
Aaditya Ramdas
MLAU
45
7
0
27 May 2023
Game-theoretic statistics and safe anytime-valid inference
Game-theoretic statistics and safe anytime-valid inference
Aaditya Ramdas
Peter Grünwald
V. Vovk
Glenn Shafer
38
118
0
04 Oct 2022
Online Bootstrap Inference For Policy Evaluation in Reinforcement
  Learning
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
50
26
0
08 Aug 2021
Estimating means of bounded random variables by betting
Estimating means of bounded random variables by betting
Ian Waudby-Smith
Aaditya Ramdas
51
148
0
19 Oct 2020
1