ResearchTrend.AI
Sequential Batch Learning in Finite-Action Linear Contextual Bandits


14 April 2020
Yanjun Han
Zhengqing Zhou
Zhengyuan Zhou
Jose H. Blanchet
Peter Glynn
Yinyu Ye
    OffRL

Papers citing "Sequential Batch Learning in Finite-Action Linear Contextual Bandits"

18 / 18 papers shown
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Nan Lu
Ethan X. Fang
Junwei Lu
260
0
0
27 Apr 2025
Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits
Jiabin Lin
Shana Moothedath
Namrata Vaswani
69
4
0
08 Jan 2025
Batched Stochastic Bandit for Nondegenerate Functions
Yu Liu
Yunlu Shu
Tianyu Wang
57
0
0
09 May 2024
Generalized Linear Bandits with Limited Adaptivity
Ayush Sawarni
Nirjhar Das
Siddharth Barman
Gaurav Sinha
42
3
0
10 Apr 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
Yi Xu
Weiran Shen
Xiao Zhang
Jun Xu
OffRL
57
0
0
24 Mar 2024
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
49
1
0
27 Feb 2024
Harnessing the Power of Federated Learning in Federated Contextual Bandits
Chengshuai Shi
Ruida Zhou
Kun Yang
Cong Shen
FedML
35
0
0
26 Dec 2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
Han Zhong
Tong Zhang
40
26
0
15 May 2023
Sequential Counterfactual Risk Minimization
Houssam Zenati
Eustache Diemert
Matthieu Martin
Julien Mairal
Pierre Gaillard
OffRL
35
3
0
23 Feb 2023
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms
Osama A. Hanna
Lin F. Yang
Christina Fragouli
32
11
0
08 Nov 2022
Reward Imputation with Sketching for Contextual Batched Bandits
Xiao Zhang
Ninglu Shao
Zihua Si
Jun Xu
Wen Wang
Hanjing Su
Jirong Wen
OffRL
28
1
0
13 Oct 2022
Efficient Real-world Testing of Causal Decision Making via Bayesian Experimental Design for Contextual Optimisation
Desi R. Ivanova
Joel Jennings
Cheng Zhang
Adam Foster
CML
32
2
0
12 Jul 2022
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
Hui Yuan
Chengzhuo Ni
Huazheng Wang
Xuezhou Zhang
Le Cong
Csaba Szepesvári
Mengdi Wang
28
2
0
05 Jun 2022
Privacy Amplification via Shuffling for Linear Contextual Bandits
Evrard Garcelon
Kamalika Chaudhuri
Vianney Perchet
Matteo Pirotta
FedML
45
18
0
11 Dec 2021
Online Learning of Energy Consumption for Navigation of Electric Vehicles
Niklas Åkerblom
Yuxin Chen
M. Chehreghani
32
12
0
03 Nov 2021
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
122
167
0
06 Jan 2021
Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design
Yufei Ruan
Jiaqi Yang
Yuanshuo Zhou
OffRL
113
51
0
04 Jul 2020
Inference for Batched Bandits
Kelly W. Zhang
Lucas Janson
Susan Murphy
35
81
0
08 Feb 2020