Thompson Sampling for Combinatorial Semi-Bandits

13 March 2018

Papers citing "Thompson Sampling for Combinatorial Semi-Bandits"

50 / 79 papers shown

Title
Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback Vaneet Aggarwal Shweta Jain Subham Pokhriyal Christopher J. Quinn 274 0 0 15 Mar 2025
ATA: Adaptive Task Allocation for Efficient Resource Management in Distributed Machine Learning Artavazd Maranjyan El Mehdi Saad Peter Richtárik Francesco Orabona 57 0 0 02 Feb 2025
Dynamic Information Sub-Selection for Decision Support Hung-Tien Huang M. Lennon Shreyas Bhat Brahmavar Sean Sylvia Junier B. Oliva 40 0 0 30 Oct 2024
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation Peiwen Sun Sitong Cheng Xin Li Zhen Ye Huadai Liu Honggang Zhang Wei Xue Yike Guo DiffM 28 3 0 14 Oct 2024
Stochastic Bandits for Egalitarian Assignment Eugene Lim Vincent Y. F. Tan Harold Soh 21 0 0 08 Oct 2024
Thompson Sampling For Combinatorial Bandits: Polynomial Regret and Mismatched Sampling Paradox Raymond Zhang Richard Combes 20 0 0 07 Oct 2024
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits Tatsuhiro Shimizu Koichi Tanaka Ren Kishimoto Haruka Kiyohara Masahiro Nomura Yuta Saito CML OffRL 47 0 0 20 Aug 2024
Thompson Sampling Itself is Differentially Private Tingting Ou Marco Avella Medina Rachel Cummings 11 1 0 20 Jul 2024
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond Xutong Liu Siwei Wang Jinhang Zuo Han Zhong Xuchuang Wang Zhiyong Wang Shuai Li Mohammad Hajiesmaili J. C. Lui Wei Chen 85 1 0 03 Jun 2024
Matroid Semi-Bandits in Sublinear Time Ruo-Chun Tzeng Naoto Ohsaka Kaito Ariu 22 0 0 28 May 2024
Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement Ruiqi Zhang Yuexiang Zhai Andrea Zanette 51 0 0 24 Feb 2024
Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo Haoyang Zheng Wei Deng Christian Moya Guang Lin 27 6 0 22 Jan 2024
Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis Phevos Paschalidis Runyu Zhang Na Li 28 0 0 18 Jan 2024
Zero-Inflated Bandits Haoyu Wei Runzhe Wan Lei Shi Rui Song 42 0 0 25 Dec 2023
Online Influence Maximization: Concept and Algorithm Jianxiong Guo 36 0 0 30 Nov 2023
Bandit Learning to Rank with Position-Based Click Models: Personalized and Equal Treatments Tianchen Zhou Jia-Wei Liu Yang Jiao Chaosheng Dong Yetian Chen Yan Gao Yi Sun OffRL 33 4 0 08 Nov 2023
Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Arman Rahbar Niklas Åkerblom M. Chehreghani 28 0 0 21 Aug 2023
Constant or logarithmic regret in asynchronous multiplayer bandits Hugo Richard Etienne Boursier Vianney Perchet 42 1 0 31 May 2023
A reinforced learning approach to optimal design under model uncertainty Mingyao Ai Holger Dette Zhenghao Liu Jun Yu 25 0 0 28 Mar 2023
When Combinatorial Thompson Sampling meets Approximation Regret Pierre Perrault 62 6 0 22 Feb 2023
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits Yue Kang Cho-Jui Hsieh T. C. Lee 26 1 0 18 Feb 2023
Multiplier Bootstrap-based Exploration Runzhe Wan Haoyu Wei B. Kveton R. Song 18 3 0 03 Feb 2023
A Combinatorial Semi-Bandit Approach to Charging Station Selection for Electric Vehicles Niklas Åkerblom M. Chehreghani 25 0 0 17 Jan 2023
Thompson Sampling with Diffusion Generative Prior Yu-Guan Hsieh S. Kasiviswanathan B. Kveton Patrick Blobaum DiffM 37 7 0 12 Jan 2023
A survey on multi-player bandits Etienne Boursier Vianney Perchet 32 13 0 29 Nov 2022
BORA: Bayesian Optimization for Resource Allocation Antonio Candelieri Andrea Ponti Francesco Archetti 8 0 0 12 Oct 2022
Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms or Independent Arms Xutong Liu Jinhang Zuo Siwei Wang Carlee Joe-Wong John C. S. Lui Wei Chen 38 16 0 31 Aug 2022
Unimodal Mono-Partite Matching in a Bandit Setting Romaric Gaudel Matthieu Rodet 36 0 0 02 Aug 2022
Differentially Private Federated Combinatorial Bandits with Constraints Sambhav Solanki Samhita Kanaparthy Sankarshan Damle Sujit Gujar FedML 29 4 0 27 Jun 2022
A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification F. Hoseini Niklas Åkerblom M. Chehreghani 32 3 0 16 Jun 2022
Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits Tianyuan Jin Pan Xu X. Xiao Anima Anandkumar 36 12 0 07 Jun 2022
Incentivizing Combinatorial Bandit Exploration Xinyan Hu Dung Daniel Ngo Aleksandrs Slivkins Zhiwei Steven Wu 15 11 0 01 Jun 2022
Thompson Sampling for Bandit Learning in Matching Markets Fang-yuan Kong Junming Yin Shuaiqi Li 11 15 0 26 Apr 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework Runzhe Wan Linjuan Ge Rui Song 18 13 0 26 Feb 2022
Budgeted Combinatorial Multi-Armed Bandits Debojit Das Shweta Jain Sujit Gujar 17 7 0 08 Feb 2022
Risk-Aware Algorithms for Combinatorial Semi-Bandits Ranga Shaarad Ayyagari Ambedkar Dukkipati 13 1 0 02 Dec 2021
Contextual Combinatorial Multi-output GP Bandits with Group Constraints Sepehr Elahi Baran Atalar Sevda Öğüt Cem Tekin 30 2 0 29 Nov 2021
The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle Fang-yuan Kong Yueran Yang Wei Chen Shuai Li 48 7 0 08 Nov 2021
Online Learning of Energy Consumption for Navigation of Electric Vehicles Niklas Åkerblom Yuxin Chen M. Chehreghani 28 12 0 03 Nov 2021
Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization Chengshuai Shi Wei Xiong Cong Shen Jing Yang 25 23 0 27 Oct 2021
Online Learning of Network Bottlenecks via Minimax Paths Niklas Åkerblom F. Hoseini M. Chehreghani 29 10 0 17 Sep 2021
Pure Exploration and Regret Minimization in Matching Bandits Flore Sentenac Jialin Yi Clément Calauzènes Vianney Perchet Milan Vojnović 11 6 0 31 Jul 2021
Simple Combinatorial Algorithms for Combinatorial Bandits: Corruptions and Approximations Haike Xu Jian Li 16 4 0 12 Jun 2021
Multi-layered Network Exploration via Random Walks: From Offline Optimization to Online Learning Xutong Liu Jinhang Zuo Xiaowei Chen Wei Chen John C. S. Lui OffRL 15 12 0 09 Jun 2021
Sleeping Combinatorial Bandits Kumar Abhishek Ganesh Ghalme Sujit Gujar Y. Narahari 16 0 0 03 Jun 2021
Censored Semi-Bandits for Resource Allocation Arun Verma M. Hanawal A. Rajkumar Raman Sankaran 9 3 0 12 Apr 2021
Asymptotically Optimal Strategies For Combinatorial Semi-Bandits in Polynomial Time Thibaut Cuvelier Richard Combes É. Gourdin 18 8 0 14 Feb 2021
On the Suboptimality of Thompson Sampling in High Dimensions Raymond Zhang Richard Combes 11 4 0 10 Feb 2021
Adversarial Combinatorial Bandits with General Non-linear Reward Functions Xi Chen Yanjun Han Yining Wang 30 16 0 05 Jan 2021
Accurate and Fast Federated Learning via Combinatorial Multi-Armed Bandits Taehyeon Kim Sangmin Bae Jin-woo Lee Se-Young Yun FedML 29 15 0 06 Dec 2020