Linear Bandits with Stochastic Delayed Feedback

v1v2v3 (latest)

Linear Bandits with Stochastic Delayed Feedback

5 July 2018

Alexandra Carpentier

Giovanni Zappella

ArXiv (abs)PDF HTML

Papers citing "Linear Bandits with Stochastic Delayed Feedback"

17 / 17 papers shown

Title
Contextual Linear Bandits with Delay as Payoff Mengxiao Zhang Yingfei Wang Haipeng Luo 193 2 0 18 Feb 2025
Biased Dueling Bandits with Stochastic Delayed Feedback Bongsoo Yi Yue Kang Yao Li 87 1 0 26 Aug 2024
Reinforcement Learning with Delayed, Composite, and Partially Anonymous Reward Washim Uddin Mondal Vaneet Aggarwal 79 2 0 04 May 2023
Multi-Armed Bandits with Generalized Temporally-Partitioned Rewards Ronald C. van den Broek Rik Litjens Tobias Sagis Luc Siecker Nina Verbeeke Pratik Gajane 89 0 0 01 Mar 2023
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback Yunchang Yang Hangshi Zhong Tianhao Wu B. Liu Liwei Wang S. Du OffRL 119 8 0 03 Feb 2023
Dynamical Linear Bandits Marco Mussi Alberto Maria Metelli Marcello Restelli 71 2 0 16 Nov 2022
Delayed Feedback in Generalised Linear Bandits Revisited Benjamin Howson Ciara Pike-Burke Sarah Filippi 61 16 0 21 Jul 2022
Bayesian Optimization under Stochastic Delayed Feedback Arun Verma Zhongxiang Dai Bryan Kian Hsiang Low 80 12 0 19 Jun 2022
Multi-Armed Bandit Problem with Temporally-Partitioned Rewards: When Partial Feedback Counts Giulia Romano Andrea Agostini F. Trovò N. Gatti Marcello Restelli 34 2 0 01 Jun 2022
Thompson Sampling with Unrestricted Delays Hang Wu Stefan Wager 73 8 0 24 Feb 2022
Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences Aadirupa Saha Pierre Gaillard 71 7 0 14 Feb 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback Tiancheng Jin Tal Lancewicki Haipeng Luo Yishay Mansour Aviv A. Rosenberg 131 22 0 31 Jan 2022
Optimism and Delays in Episodic Reinforcement Learning Benjamin Howson Ciara Pike-Burke Sarah Filippi 56 6 0 15 Nov 2021
Smooth Sequential Optimisation with Delayed Feedback S. Chennu Jamie Martin P. Liyanagama Phil Mohr 29 2 0 21 Jun 2021
Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or Bayesian? S. Santosh S. Darak 53 0 0 05 Jun 2021
Leveraging Post Hoc Context for Faster Learning in Bandit Settings with Applications in Robot-Assisted Feeding E. Gordon Sumegh Roychowdhury Tapomayukh Bhattacharjee Kevin Jamieson S. Srinivasa 145 20 0 05 Nov 2020
Cooperative Multi-Agent Bandits with Heavy Tails Abhimanyu Dubey Alex Pentland 54 50 0 14 Aug 2020