ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.04741
  4. Cited By
Delay and Cooperation in Nonstochastic Bandits

Delay and Cooperation in Nonstochastic Bandits

15 February 2016
Nicolò Cesa-Bianchi
Claudio Gentile
Yishay Mansour
Alberto Minora
ArXivPDFHTML

Papers citing "Delay and Cooperation in Nonstochastic Bandits"

27 / 27 papers shown
Title
Communication Bounds for the Distributed Experts Problem
Zhihao Jia
Qi Pang
Trung Tran
David Woodruff
Zhihao Zhang
Wenting Zheng
68
0
0
06 Jan 2025
Biased Dueling Bandits with Stochastic Delayed Feedback
Biased Dueling Bandits with Stochastic Delayed Feedback
Bongsoo Yi
Yue Kang
Yao Li
38
1
0
26 Aug 2024
Multi-Player Approaches for Dueling Bandits
Multi-Player Approaches for Dueling Bandits
Or Raveh
Junya Honda
Masashi Sugiyama
46
1
0
25 May 2024
A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with
  Robustness to Excessive Delays
A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays
Saeed Masoudian
Julian Zimmert
Yevgeny Seldin
45
3
0
21 Aug 2023
Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal
  Individual Regret and Constant Communication Costs
Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs
L. Yang
Xuchuang Wang
Mohammad Hajiesmaili
Lijun Zhang
John C. S. Lui
Don Towsley
36
5
0
08 Aug 2023
Banker Online Mirror Descent: A Universal Approach for Delayed Online
  Bandit Learning
Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning
Jiatai Huang
Yan Dai
Longbo Huang
27
6
0
25 Jan 2023
Decision Market Based Learning For Multi-agent Contextual Bandit
  Problems
Decision Market Based Learning For Multi-agent Contextual Bandit Problems
Wenlong Wang
T. Pfeiffer
29
1
0
01 Dec 2022
On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits
On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits
Jialin Yi
Milan Vojnović
27
3
0
30 Nov 2022
A survey on multi-player bandits
A survey on multi-player bandits
Etienne Boursier
Vianney Perchet
32
13
0
29 Nov 2022
Asynchronous Gradient Play in Zero-Sum Multi-agent Games
Asynchronous Gradient Play in Zero-Sum Multi-agent Games
Ruicheng Ao
Shicong Cen
Yuejie Chi
48
5
0
16 Nov 2022
Learning in Stackelberg Games with Non-myopic Agents
Learning in Stackelberg Games with Non-myopic Agents
Nika Haghtalab
Thodoris Lykouris
Sloan Nietert
Alexander Wei
28
29
0
19 Aug 2022
Differentially Private Linear Bandits with Partial Distributed Feedback
Differentially Private Linear Bandits with Partial Distributed Feedback
Fengjiao Li
Xingyu Zhou
Bo Ji
FedML
34
13
0
12 Jul 2022
Private and Byzantine-Proof Cooperative Decision-Making
Private and Byzantine-Proof Cooperative Decision-Making
Abhimanyu Dubey
Alex Pentland
19
24
0
27 May 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
Tiancheng Jin
Tal Lancewicki
Haipeng Luo
Yishay Mansour
Aviv A. Rosenberg
74
21
0
31 Jan 2022
One More Step Towards Reality: Cooperative Bandits with Imperfect
  Communication
One More Step Towards Reality: Cooperative Bandits with Imperfect Communication
Udari Madhushani
Abhimanyu Dubey
Naomi Ehrich Leonard
Alex Pentland
28
25
0
24 Nov 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Runzhe Wan
Linjuan Ge
Rui Song
36
28
0
13 Aug 2021
Cooperative Stochastic Multi-agent Multi-armed Bandits Robust to
  Adversarial Corruptions
Cooperative Stochastic Multi-agent Multi-armed Bandits Robust to Adversarial Corruptions
Junyan Liu
Shuai Li
Dapeng Li
23
6
0
08 Jun 2021
Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or
  Bayesian?
Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or Bayesian?
S. Santosh
S. Darak
19
0
0
05 Jun 2021
Learning Adversarial Markov Decision Processes with Delayed Feedback
Learning Adversarial Markov Decision Processes with Delayed Feedback
Tal Lancewicki
Aviv A. Rosenberg
Yishay Mansour
43
32
0
29 Dec 2020
On No-Sensing Adversarial Multi-player Multi-armed Bandits with
  Collision Communications
On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications
Chengshuai Shi
Cong Shen
AAML
19
9
0
02 Nov 2020
Robust Multi-Agent Multi-Armed Bandits
Robust Multi-Agent Multi-Armed Bandits
Daniel Vial
Sanjay Shakkottai
R. Srikant
19
36
0
07 Jul 2020
Distributed No-Regret Learning in Multi-Agent Systems
Distributed No-Regret Learning in Multi-Agent Systems
Xiao Xu
Qing Zhao
19
12
0
20 Feb 2020
Regret Bounds for Batched Bandits
Regret Bounds for Batched Bandits
Hossein Esfandiari
Amin Karbasi
Abbas Mehrabian
Vahab Mirrokni
28
61
0
11 Oct 2019
Social Learning in Multi Agent Multi Armed Bandits
Social Learning in Multi Agent Multi Armed Bandits
Abishek Sankararaman
A. Ganesh
Sanjay Shakkottai
28
84
0
04 Oct 2019
Individual Regret in Cooperative Nonstochastic Multi-Armed Bandits
Individual Regret in Cooperative Nonstochastic Multi-Armed Bandits
Yogev Bar-On
Yishay Mansour
13
41
0
07 Jul 2019
Nonstochastic Multiarmed Bandits with Unrestricted Delays
Nonstochastic Multiarmed Bandits with Unrestricted Delays
Tobias Sommer Thune
Nicolò Cesa-Bianchi
Yevgeny Seldin
28
52
0
03 Jun 2019
Multi-user Communication Networks: A Coordinated Multi-armed Bandit
  Approach
Multi-user Communication Networks: A Coordinated Multi-armed Bandit Approach
Orly Avner
Shie Mannor
21
28
0
14 Aug 2018
1