ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.04204
  4. Cited By
Semiparametric Contextual Bandits

Semiparametric Contextual Bandits

12 March 2018
A. Krishnamurthy
Zhiwei Steven Wu
Vasilis Syrgkanis
ArXivPDFHTML

Papers citing "Semiparametric Contextual Bandits"

10 / 10 papers shown
Title
Zero-Inflated Bandits
Zero-Inflated Bandits
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
42
0
0
25 Dec 2023
Selective Uncertainty Propagation in Offline RL
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy
Shrey Modi
Tanmay Gangwani
S. Katariya
B. Kveton
A. Rangi
OffRL
61
0
0
01 Feb 2023
GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation
GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation
Mubarrat Chowdhury
Elkhan Ismayilzada
Khalequzzaman Sayem
Gi-Soo Kim
20
0
0
20 Jan 2023
Statistical Estimation of Confounded Linear MDPs: An Instrumental
  Variable Approach
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
Miao Lu
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
OffRL
40
1
0
12 Sep 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment
  Effect Oracles
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
19
1
0
30 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal
  Adaptive Interventions
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
30
5
0
04 Mar 2022
Bandits with adversarial scaling
Bandits with adversarial scaling
Thodoris Lykouris
Vahab Mirrokni
R. Leme
8
14
0
04 Mar 2020
Personalized HeartSteps: A Reinforcement Learning Algorithm for
  Optimizing Physical Activity
Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity
Peng Liao
Kristjan Greenewald
P. Klasnja
S. Murphy
17
83
0
08 Sep 2019
Model selection for contextual bandits
Model selection for contextual bandits
Dylan J. Foster
A. Krishnamurthy
Haipeng Luo
OffRL
29
89
0
03 Jun 2019
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model
Gi-Soo Kim
M. Paik
17
14
0
31 Jan 2019
1