Nonstochastic Multiarmed Bandits with Unrestricted Delays

3 June 2019

Papers citing "Nonstochastic Multiarmed Bandits with Unrestricted Delays"

16 / 16 papers shown

Title
Contextual Linear Bandits with Delay as Payoff Mengxiao Zhang Yingfei Wang Haipeng Luo 46 0 0 18 Feb 2025
Biased Dueling Bandits with Stochastic Delayed Feedback Bongsoo Yi Yue Kang Yao Li 38 1 0 26 Aug 2024
A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays Saeed Masoudian Julian Zimmert Yevgeny Seldin 45 3 0 21 Aug 2023
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback Yunchang Yang Hangshi Zhong Tianhao Wu B. Liu Liwei Wang S. Du OffRL 27 8 0 03 Feb 2023
Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning Jiatai Huang Yan Dai Longbo Huang 24 6 0 25 Jan 2023
On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits Jialin Yi Milan Vojnović 21 3 0 30 Nov 2022
Dynamical Linear Bandits Marco Mussi Alberto Maria Metelli Marcello Restelli 41 2 0 16 Nov 2022
Lazy Queries Can Reduce Variance in Zeroth-order Optimization Quan-Wu Xiao Qing Ling Tianyi Chen 41 0 0 14 Jun 2022
Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback Tianyi Lin Aldo Pacchiano Yaodong Yu Michael I. Jordan 29 0 0 15 May 2022
Partial Likelihood Thompson Sampling Han Wu Stefan Wager LM&MA 30 1 0 02 Mar 2022
Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences Aadirupa Saha Pierre Gaillard 36 8 0 14 Feb 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback Tiancheng Jin Tal Lancewicki Haipeng Luo Yishay Mansour Aviv A. Rosenberg 74 21 0 31 Jan 2022
Isotuning With Applications To Scale-Free Online Learning Laurent Orseau Marcus Hutter 13 5 0 29 Dec 2021
Nonstochastic Bandits with Composite Anonymous Feedback Nicolò Cesa-Bianchi Tommaso Cesari Roberto Colomboni Claudio Gentile Yishay Mansour 108 39 0 06 Dec 2021
Learning Adversarial Markov Decision Processes with Delayed Feedback Tal Lancewicki Aviv A. Rosenberg Yishay Mansour 38 32 0 29 Dec 2020
Stochastic bandits with arm-dependent delays Anne Gael Manegueu Claire Vernade Alexandra Carpentier Michal Valko 21 44 0 18 Jun 2020