Delay and Cooperation in Nonstochastic Bandits

15 February 2016

Papers citing "Delay and Cooperation in Nonstochastic Bandits"

27 / 27 papers shown

Title
Communication Bounds for the Distributed Experts Problem Zhihao Jia Qi Pang Trung Tran David Woodruff Zhihao Zhang Wenting Zheng 68 0 0 06 Jan 2025
Biased Dueling Bandits with Stochastic Delayed Feedback Bongsoo Yi Yue Kang Yao Li 38 1 0 26 Aug 2024
Multi-Player Approaches for Dueling Bandits Or Raveh Junya Honda Masashi Sugiyama 46 1 0 25 May 2024
A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays Saeed Masoudian Julian Zimmert Yevgeny Seldin 45 3 0 21 Aug 2023
Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs L. Yang Xuchuang Wang Mohammad Hajiesmaili Lijun Zhang John C. S. Lui Don Towsley 36 5 0 08 Aug 2023
Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning Jiatai Huang Yan Dai Longbo Huang 27 6 0 25 Jan 2023
Decision Market Based Learning For Multi-agent Contextual Bandit Problems Wenlong Wang T. Pfeiffer 29 1 0 01 Dec 2022
On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits Jialin Yi Milan Vojnović 27 3 0 30 Nov 2022
A survey on multi-player bandits Etienne Boursier Vianney Perchet 32 13 0 29 Nov 2022
Asynchronous Gradient Play in Zero-Sum Multi-agent Games Ruicheng Ao Shicong Cen Yuejie Chi 48 5 0 16 Nov 2022
Learning in Stackelberg Games with Non-myopic Agents Nika Haghtalab Thodoris Lykouris Sloan Nietert Alexander Wei 28 29 0 19 Aug 2022
Differentially Private Linear Bandits with Partial Distributed Feedback Fengjiao Li Xingyu Zhou Bo Ji FedML 34 13 0 12 Jul 2022
Private and Byzantine-Proof Cooperative Decision-Making Abhimanyu Dubey Alex Pentland 19 24 0 27 May 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback Tiancheng Jin Tal Lancewicki Haipeng Luo Yishay Mansour Aviv A. Rosenberg 74 21 0 31 Jan 2022
One More Step Towards Reality: Cooperative Bandits with Imperfect Communication Udari Madhushani Abhimanyu Dubey Naomi Ehrich Leonard Alex Pentland 28 25 0 24 Nov 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models Runzhe Wan Linjuan Ge Rui Song 36 28 0 13 Aug 2021
Cooperative Stochastic Multi-agent Multi-armed Bandits Robust to Adversarial Corruptions Junyan Liu Shuai Li Dapeng Li 23 6 0 08 Jun 2021
Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or Bayesian? S. Santosh S. Darak 19 0 0 05 Jun 2021
Learning Adversarial Markov Decision Processes with Delayed Feedback Tal Lancewicki Aviv A. Rosenberg Yishay Mansour 43 32 0 29 Dec 2020
On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications Chengshuai Shi Cong Shen AAML 19 9 0 02 Nov 2020
Robust Multi-Agent Multi-Armed Bandits Daniel Vial Sanjay Shakkottai R. Srikant 19 36 0 07 Jul 2020
Distributed No-Regret Learning in Multi-Agent Systems Xiao Xu Qing Zhao 19 12 0 20 Feb 2020
Regret Bounds for Batched Bandits Hossein Esfandiari Amin Karbasi Abbas Mehrabian Vahab Mirrokni 28 61 0 11 Oct 2019
Social Learning in Multi Agent Multi Armed Bandits Abishek Sankararaman A. Ganesh Sanjay Shakkottai 28 84 0 04 Oct 2019
Individual Regret in Cooperative Nonstochastic Multi-Armed Bandits Yogev Bar-On Yishay Mansour 13 41 0 07 Jul 2019
Nonstochastic Multiarmed Bandits with Unrestricted Delays Tobias Sommer Thune Nicolò Cesa-Bianchi Yevgeny Seldin 28 52 0 03 Jun 2019
Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach Orly Avner Shie Mannor 21 28 0 14 Aug 2018