ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.00973
92
2

Federated Linear Bandits with Finite Adversarial Actions

2 November 2023
Li Fan
Ruida Zhou
Chao Tian
Cong Shen
    FedML
ArXivPDFHTML
Abstract

We study a federated linear bandits model, where MMM clients communicate with a central server to solve a linear contextual bandits problem with finite adversarial action sets that may be different across clients. To address the unique challenges of adversarial finite action sets, we propose the FedSupLinUCB algorithm, which extends the principles of SupLinUCB and OFUL algorithms in linear contextual bandits. We prove that FedSupLinUCB achieves a total regret of O~(dT)\tilde{O}(\sqrt{d T})O~(dT​), where TTT is the total number of arm pulls from all clients, and ddd is the ambient dimension of the linear model. This matches the minimax lower bound and thus is order-optimal (up to polylog terms). We study both asynchronous and synchronous cases and show that the communication cost can be controlled as O(dM2log⁡(d)log⁡(T))O(d M^2 \log(d)\log(T))O(dM2log(d)log(T)) and O(d3M3log⁡(d))O(\sqrt{d^3 M^3} \log(d))O(d3M3​log(d)), respectively. The FedSupLinUCB design is further extended to two scenarios: (1) variance-adaptive, where a total regret of O~(d∑t=1Tσt2)\tilde{O} (\sqrt{d \sum \nolimits_{t=1}^{T} \sigma_t^2})O~(d∑t=1T​σt2​​) can be achieved with σt2\sigma_t^2σt2​ being the noise variance of round ttt; and (2) adversarial corruption, where a total regret of O~(dT+dCp)\tilde{O}(\sqrt{dT} + d C_p)O~(dT​+dCp​) can be achieved with CpC_pCp​ being the total corruption budget. Experiment results corroborate the theoretical analysis and demonstrate the effectiveness of FedSupLinUCB on both synthetic and real-world datasets.

View on arXiv
Comments on this paper