ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.04623
  4. Cited By
Thompson Sampling for Combinatorial Semi-Bandits

Thompson Sampling for Combinatorial Semi-Bandits

13 March 2018
Siwei Wang
Wei Chen
ArXivPDFHTML

Papers citing "Thompson Sampling for Combinatorial Semi-Bandits"

50 / 79 papers shown
Title
Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback
Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback
Vaneet Aggarwal
Shweta Jain
Subham Pokhriyal
Christopher J. Quinn
274
0
0
15 Mar 2025
ATA: Adaptive Task Allocation for Efficient Resource Management in Distributed Machine Learning
ATA: Adaptive Task Allocation for Efficient Resource Management in Distributed Machine Learning
Artavazd Maranjyan
El Mehdi Saad
Peter Richtárik
Francesco Orabona
57
0
0
02 Feb 2025
Dynamic Information Sub-Selection for Decision Support
Dynamic Information Sub-Selection for Decision Support
Hung-Tien Huang
M. Lennon
Shreyas Bhat Brahmavar
Sean Sylvia
Junier B. Oliva
40
0
0
30 Oct 2024
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Peiwen Sun
Sitong Cheng
Xin Li
Zhen Ye
Huadai Liu
Honggang Zhang
Wei Xue
Yike Guo
DiffM
28
3
0
14 Oct 2024
Stochastic Bandits for Egalitarian Assignment
Stochastic Bandits for Egalitarian Assignment
Eugene Lim
Vincent Y. F. Tan
Harold Soh
21
0
0
08 Oct 2024
Thompson Sampling For Combinatorial Bandits: Polynomial Regret and
  Mismatched Sampling Paradox
Thompson Sampling For Combinatorial Bandits: Polynomial Regret and Mismatched Sampling Paradox
Raymond Zhang
Richard Combes
20
0
0
07 Oct 2024
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial
  Bandits
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
Tatsuhiro Shimizu
Koichi Tanaka
Ren Kishimoto
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
CML
OffRL
47
0
0
20 Aug 2024
Thompson Sampling Itself is Differentially Private
Thompson Sampling Itself is Differentially Private
Tingting Ou
Marco Avella Medina
Rachel Cummings
11
1
0
20 Jul 2024
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
Xutong Liu
Siwei Wang
Jinhang Zuo
Han Zhong
Xuchuang Wang
Zhiyong Wang
Shuai Li
Mohammad Hajiesmaili
J. C. Lui
Wei Chen
85
1
0
03 Jun 2024
Matroid Semi-Bandits in Sublinear Time
Matroid Semi-Bandits in Sublinear Time
Ruo-Chun Tzeng
Naoto Ohsaka
Kaito Ariu
22
0
0
28 May 2024
Is Offline Decision Making Possible with Only Few Samples? Reliable
  Decisions in Data-Starved Bandits via Trust Region Enhancement
Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement
Ruiqi Zhang
Yuexiang Zhai
Andrea Zanette
51
0
0
24 Feb 2024
Accelerating Approximate Thompson Sampling with Underdamped Langevin
  Monte Carlo
Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo
Haoyang Zheng
Wei Deng
Christian Moya
Guang Lin
27
6
0
22 Jan 2024
Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis
Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis
Phevos Paschalidis
Runyu Zhang
Na Li
28
0
0
18 Jan 2024
Zero-Inflated Bandits
Zero-Inflated Bandits
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
42
0
0
25 Dec 2023
Online Influence Maximization: Concept and Algorithm
Online Influence Maximization: Concept and Algorithm
Jianxiong Guo
36
0
0
30 Nov 2023
Bandit Learning to Rank with Position-Based Click Models: Personalized
  and Equal Treatments
Bandit Learning to Rank with Position-Based Click Models: Personalized and Equal Treatments
Tianchen Zhou
Jia-Wei Liu
Yang Jiao
Chaosheng Dong
Yetian Chen
Yan Gao
Yi Sun
OffRL
33
4
0
08 Nov 2023
Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach
Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach
Arman Rahbar
Niklas Åkerblom
M. Chehreghani
28
0
0
21 Aug 2023
Constant or logarithmic regret in asynchronous multiplayer bandits
Constant or logarithmic regret in asynchronous multiplayer bandits
Hugo Richard
Etienne Boursier
Vianney Perchet
42
1
0
31 May 2023
A reinforced learning approach to optimal design under model uncertainty
A reinforced learning approach to optimal design under model uncertainty
Mingyao Ai
Holger Dette
Zhenghao Liu
Jun Yu
25
0
0
28 Mar 2023
When Combinatorial Thompson Sampling meets Approximation Regret
When Combinatorial Thompson Sampling meets Approximation Regret
Pierre Perrault
62
6
0
22 Feb 2023
Online Continuous Hyperparameter Optimization for Generalized Linear
  Contextual Bandits
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits
Yue Kang
Cho-Jui Hsieh
T. C. Lee
26
1
0
18 Feb 2023
Multiplier Bootstrap-based Exploration
Multiplier Bootstrap-based Exploration
Runzhe Wan
Haoyu Wei
B. Kveton
R. Song
18
3
0
03 Feb 2023
A Combinatorial Semi-Bandit Approach to Charging Station Selection for
  Electric Vehicles
A Combinatorial Semi-Bandit Approach to Charging Station Selection for Electric Vehicles
Niklas Åkerblom
M. Chehreghani
25
0
0
17 Jan 2023
Thompson Sampling with Diffusion Generative Prior
Thompson Sampling with Diffusion Generative Prior
Yu-Guan Hsieh
S. Kasiviswanathan
B. Kveton
Patrick Blobaum
DiffM
37
7
0
12 Jan 2023
A survey on multi-player bandits
A survey on multi-player bandits
Etienne Boursier
Vianney Perchet
32
13
0
29 Nov 2022
BORA: Bayesian Optimization for Resource Allocation
BORA: Bayesian Optimization for Resource Allocation
Antonio Candelieri
Andrea Ponti
Francesco Archetti
8
0
0
12 Oct 2022
Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with
  Probabilistically Triggered Arms or Independent Arms
Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms or Independent Arms
Xutong Liu
Jinhang Zuo
Siwei Wang
Carlee Joe-Wong
John C. S. Lui
Wei Chen
38
16
0
31 Aug 2022
Unimodal Mono-Partite Matching in a Bandit Setting
Unimodal Mono-Partite Matching in a Bandit Setting
Romaric Gaudel
Matthieu Rodet
36
0
0
02 Aug 2022
Differentially Private Federated Combinatorial Bandits with Constraints
Differentially Private Federated Combinatorial Bandits with Constraints
Sambhav Solanki
Samhita Kanaparthy
Sankarshan Damle
Sujit Gujar
FedML
29
4
0
27 Jun 2022
A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck
  Identification
A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification
F. Hoseini
Niklas Åkerblom
M. Chehreghani
32
3
0
16 Jun 2022
Finite-Time Regret of Thompson Sampling Algorithms for Exponential
  Family Multi-Armed Bandits
Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits
Tianyuan Jin
Pan Xu
X. Xiao
Anima Anandkumar
36
12
0
07 Jun 2022
Incentivizing Combinatorial Bandit Exploration
Incentivizing Combinatorial Bandit Exploration
Xinyan Hu
Dung Daniel Ngo
Aleksandrs Slivkins
Zhiwei Steven Wu
15
11
0
01 Jun 2022
Thompson Sampling for Bandit Learning in Matching Markets
Thompson Sampling for Bandit Learning in Matching Markets
Fang-yuan Kong
Junming Yin
Shuaiqi Li
11
15
0
26 Apr 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning
  Framework
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Runzhe Wan
Linjuan Ge
Rui Song
18
13
0
26 Feb 2022
Budgeted Combinatorial Multi-Armed Bandits
Budgeted Combinatorial Multi-Armed Bandits
Debojit Das
Shweta Jain
Sujit Gujar
17
7
0
08 Feb 2022
Risk-Aware Algorithms for Combinatorial Semi-Bandits
Risk-Aware Algorithms for Combinatorial Semi-Bandits
Ranga Shaarad Ayyagari
Ambedkar Dukkipati
13
1
0
02 Dec 2021
Contextual Combinatorial Multi-output GP Bandits with Group Constraints
Contextual Combinatorial Multi-output GP Bandits with Group Constraints
Sepehr Elahi
Baran Atalar
Sevda Öğüt
Cem Tekin
30
2
0
29 Nov 2021
The Hardness Analysis of Thompson Sampling for Combinatorial
  Semi-bandits with Greedy Oracle
The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle
Fang-yuan Kong
Yueran Yang
Wei Chen
Shuai Li
48
7
0
08 Nov 2021
Online Learning of Energy Consumption for Navigation of Electric
  Vehicles
Online Learning of Energy Consumption for Navigation of Electric Vehicles
Niklas Åkerblom
Yuxin Chen
M. Chehreghani
28
12
0
03 Nov 2021
Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and
  Generalization
Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization
Chengshuai Shi
Wei Xiong
Cong Shen
Jing Yang
25
23
0
27 Oct 2021
Online Learning of Network Bottlenecks via Minimax Paths
Online Learning of Network Bottlenecks via Minimax Paths
Niklas Åkerblom
F. Hoseini
M. Chehreghani
29
10
0
17 Sep 2021
Pure Exploration and Regret Minimization in Matching Bandits
Pure Exploration and Regret Minimization in Matching Bandits
Flore Sentenac
Jialin Yi
Clément Calauzènes
Vianney Perchet
Milan Vojnović
11
6
0
31 Jul 2021
Simple Combinatorial Algorithms for Combinatorial Bandits: Corruptions
  and Approximations
Simple Combinatorial Algorithms for Combinatorial Bandits: Corruptions and Approximations
Haike Xu
Jian Li
16
4
0
12 Jun 2021
Multi-layered Network Exploration via Random Walks: From Offline
  Optimization to Online Learning
Multi-layered Network Exploration via Random Walks: From Offline Optimization to Online Learning
Xutong Liu
Jinhang Zuo
Xiaowei Chen
Wei Chen
John C. S. Lui
OffRL
15
12
0
09 Jun 2021
Sleeping Combinatorial Bandits
Sleeping Combinatorial Bandits
Kumar Abhishek
Ganesh Ghalme
Sujit Gujar
Y. Narahari
16
0
0
03 Jun 2021
Censored Semi-Bandits for Resource Allocation
Censored Semi-Bandits for Resource Allocation
Arun Verma
M. Hanawal
A. Rajkumar
Raman Sankaran
9
3
0
12 Apr 2021
Asymptotically Optimal Strategies For Combinatorial Semi-Bandits in
  Polynomial Time
Asymptotically Optimal Strategies For Combinatorial Semi-Bandits in Polynomial Time
Thibaut Cuvelier
Richard Combes
É. Gourdin
18
8
0
14 Feb 2021
On the Suboptimality of Thompson Sampling in High Dimensions
On the Suboptimality of Thompson Sampling in High Dimensions
Raymond Zhang
Richard Combes
11
4
0
10 Feb 2021
Adversarial Combinatorial Bandits with General Non-linear Reward
  Functions
Adversarial Combinatorial Bandits with General Non-linear Reward Functions
Xi Chen
Yanjun Han
Yining Wang
30
16
0
05 Jan 2021
Accurate and Fast Federated Learning via Combinatorial Multi-Armed
  Bandits
Accurate and Fast Federated Learning via Combinatorial Multi-Armed Bandits
Taehyeon Kim
Sangmin Bae
Jin-woo Lee
Se-Young Yun
FedML
29
15
0
06 Dec 2020
12
Next