ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.02553
  4. Cited By
Smooth Contextual Bandits: Bridging the Parametric and
  Non-differentiable Regret Regimes

Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes

5 September 2019
Yichun Hu
Nathan Kallus
Xiaojie Mao
ArXivPDFHTML

Papers citing "Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes"

10 / 10 papers shown
Title
Contextual Bandits for Unbounded Context Distributions
Contextual Bandits for Unbounded Context Distributions
Puning Zhao
Xiaogang Xu
Zhe Liu
Huiwen Wu
Qin Zhang
Zong Ke
Tianhang Zheng
74
4
0
19 Aug 2024
Batched Nonparametric Contextual Bandits
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
39
1
0
27 Feb 2024
Utility Fairness in Contextual Dynamic Pricing with Demand Learning
Utility Fairness in Contextual Dynamic Pricing with Demand Learning
Xi Chen
David Simchi-Levi
Yining Wang
24
2
0
28 Nov 2023
Smooth Non-Stationary Bandits
Smooth Non-Stationary Bandits
S. Jia
Qian Xie
Nathan Kallus
P. Frazier
106
9
0
29 Jan 2023
Transfer Learning for Contextual Multi-armed Bandits
Transfer Learning for Contextual Multi-armed Bandits
Changxiao Cai
T. Tony Cai
Hongzhe Li
47
16
0
22 Nov 2022
Optimal Contextual Bandits with Knapsacks under Realizability via
  Regression Oracles
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles
Yuxuan Han
Jialin Zeng
Yang Wang
Yangzhen Xiang
Jiheng Zhang
59
9
0
21 Oct 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment
  Effect Oracles
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
24
1
0
30 Mar 2022
Analysis of Thompson Sampling for Partially Observable Contextual
  Multi-Armed Bandits
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits
Yash J. Patel
Mohamad Kazem Shirani Faradonbeh
16
15
0
23 Oct 2021
Fast Rates for the Regret of Offline Reinforcement Learning
Fast Rates for the Regret of Offline Reinforcement Learning
Yichun Hu
Nathan Kallus
Masatoshi Uehara
OffRL
24
30
0
31 Jan 2021
Fast Rates for Contextual Linear Optimization
Fast Rates for Contextual Linear Optimization
Yichun Hu
Nathan Kallus
Xiaojie Mao
OffRL
34
41
0
05 Nov 2020
1