Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.06321
Cited By
Sequential Batch Learning in Finite-Action Linear Contextual Bandits
14 April 2020
Yanjun Han
Zhengqing Zhou
Zhengyuan Zhou
Jose H. Blanchet
Peter Glynn
Yinyu Ye
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sequential Batch Learning in Finite-Action Linear Contextual Bandits"
18 / 18 papers shown
Title
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Nan Lu
Ethan X. Fang
Junwei Lu
260
0
0
27 Apr 2025
Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits
Jiabin Lin
Shana Moothedath
Namrata Vaswani
69
4
0
08 Jan 2025
Batched Stochastic Bandit for Nondegenerate Functions
Yu Liu
Yunlu Shu
Tianyu Wang
57
0
0
09 May 2024
Generalized Linear Bandits with Limited Adaptivity
Ayush Sawarni
Nirjhar Das
Siddharth Barman
Gaurav Sinha
42
3
0
10 Apr 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
Yi Xu
Weiran Shen
Xiao Zhang
Jun Xu
OffRL
57
0
0
24 Mar 2024
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
49
1
0
27 Feb 2024
Harnessing the Power of Federated Learning in Federated Contextual Bandits
Chengshuai Shi
Ruida Zhou
Kun Yang
Cong Shen
FedML
35
0
0
26 Dec 2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
Han Zhong
Tong Zhang
40
26
0
15 May 2023
Sequential Counterfactual Risk Minimization
Houssam Zenati
Eustache Diemert
Matthieu Martin
Julien Mairal
Pierre Gaillard
OffRL
35
3
0
23 Feb 2023
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms
Osama A. Hanna
Lin F. Yang
Christina Fragouli
32
11
0
08 Nov 2022
Reward Imputation with Sketching for Contextual Batched Bandits
Xiao Zhang
Ninglu Shao
Zihua Si
Jun Xu
Wen Wang
Hanjing Su
Jirong Wen
OffRL
28
1
0
13 Oct 2022
Efficient Real-world Testing of Causal Decision Making via Bayesian Experimental Design for Contextual Optimisation
Desi R. Ivanova
Joel Jennings
Cheng Zhang
Adam Foster
CML
32
2
0
12 Jul 2022
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
Hui Yuan
Chengzhuo Ni
Huazheng Wang
Xuezhou Zhang
Le Cong
Csaba Szepesvári
Mengdi Wang
28
2
0
05 Jun 2022
Privacy Amplification via Shuffling for Linear Contextual Bandits
Evrard Garcelon
Kamalika Chaudhuri
Vianney Perchet
Matteo Pirotta
FedML
45
18
0
11 Dec 2021
Online Learning of Energy Consumption for Navigation of Electric Vehicles
Niklas Åkerblom
Yuxin Chen
M. Chehreghani
32
12
0
03 Nov 2021
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
122
167
0
06 Jan 2021
Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design
Yufei Ruan
Jiaqi Yang
Yuanshuo Zhou
OffRL
113
51
0
04 Jul 2020
Inference for Batched Bandits
Kelly W. Zhang
Lucas Janson
Susan Murphy
35
81
0
08 Feb 2020
1