ResearchTrend.AI
Post-Contextual-Bandit Inference
arXiv: 2106.00418

1 June 2021
Aurélien F. Bibaut
Antoine Chambaz
Maria Dimakopoulou
Nathan Kallus
Mark van der Laan

Papers citing "Post-Contextual-Bandit Inference"

10 / 10 papers shown
SNPL: Simultaneous Policy Learning and Evaluation for Safe Multi-Objective Policy Improvement
Brian Cho
Ana-Roxana Pop
Ariel Evnine
Nathan Kallus
OffRL
51
0
0
17 Mar 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
74
1
0
22 Feb 2025
Inference with the Upper Confidence Bound Algorithm
K. Khamaru
Cun-Hui Zhang
53
0
0
08 Aug 2024
Online learning in bandits with predicted context
Yongyi Guo
Ziping Xu
Susan Murphy
26
4
0
26 Jul 2023
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling
Susobhan Ghosh
Raphael Kim
Prasidh Chhabria
Raaz Dwivedi
Predrag Klasnja
Peng Liao
Kelly Zhang
Susan Murphy
OffRL
38
8
0
11 Apr 2023
Near-Optimal Non-Parametric Sequential Tests and Confidence Sequences with Possibly Dependent Observations
Aurélien F. Bibaut
Nathan Kallus
Michael Lindon
35
9
0
29 Dec 2022
Anytime-valid off-policy inference for contextual bandits
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
45
25
0
19 Oct 2022
Best Arm Identification with Contextual Information under a Small Gap
Masahiro Kato
Masaaki Imaizumi
Takuya Ishihara
T. Kitagawa
27
2
0
15 Sep 2022
Entropy Regularization for Population Estimation
Ben Chugg
Peter Henderson
Jacob Goldin
Daniel E. Ho
33
3
0
24 Aug 2022
OpenML Benchmarking Suites
B. Bischl
Giuseppe Casalicchio
Matthias Feurer
Pieter Gijsbers
Frank Hutter
Michel Lang
R. G. Mantovani
Jan N. van Rijn
Joaquin Vanschoren
VLM
ELM
43
152
0
11 Aug 2017