Fair Algorithms with Probing for Multi-Agent Multi-Armed Bandits

17 June 2025

Tianyi Xu

Jiaxin Liu

Zizhan Zheng

FaML

ArXiv (abs)PDF HTML

Main:8 Pages

2 Figures

Bibliography:2 Pages

Appendix:16 Pages

Abstract

We propose a multi-agent multi-armed bandit (MA-MAB) framework aimed at ensuring fair outcomes across agents while maximizing overall system performance. A key challenge in this setting is decision-making under limited information about arm rewards. To address this, we introduce a novel probing framework that strategically gathers information about selected arms before allocation. In the offline setting, where reward distributions are known, we leverage submodular properties to design a greedy probing algorithm with a provable performance bound. For the more complex online setting, we develop an algorithm that achieves sublinear regret while maintaining fairness. Extensive experiments on synthetic and real-world datasets show that our approach outperforms baseline methods, achieving better fairness and efficiency.

View on arXiv

@article{xu2025_2506.14988,
  title={ Fair Algorithms with Probing for Multi-Agent Multi-Armed Bandits },
  author={ Tianyi Xu and Jiaxin Liu and Zizhan Zheng },
  journal={arXiv preprint arXiv:2506.14988},
  year={ 2025 }
}

Comments on this paper