Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1507.04208
Cited By
Combinatorial Cascading Bandits
15 July 2015
B. Kveton
Zheng Wen
Azin Ashkan
Csaba Szepesvári
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Combinatorial Cascading Bandits"
50 / 65 papers shown
Title
Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference
Fateme Jamshidi
Mohammad Shahverdikondori
Negar Kiyavash
44
0
0
10 Mar 2025
Stochastic Bandits for Egalitarian Assignment
Eugene Lim
Vincent Y. F. Tan
Harold Soh
21
0
0
08 Oct 2024
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
Xutong Liu
Siwei Wang
Jinhang Zuo
Han Zhong
Xuchuang Wang
Zhiyong Wang
Shuai Li
Mohammad Hajiesmaili
J. C. Lui
Wei Chen
85
1
0
03 Jun 2024
Cost-Effective Online Multi-LLM Selection with Versatile Reward Models
Xiangxiang Dai
Jin Li
Xutong Liu
Anqi Yu
J. C. Lui
41
5
0
26 May 2024
No-Regret M
♮
{}^{\natural}
♮
-Concave Function Maximization: Stochastic Bandit Algorithms and NP-Hardness of Adversarial Full-Information Setting
Taihei Oki
Shinsaku Sakaue
25
0
0
21 May 2024
Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
Hantao Yang
Xutong Liu
Zhiyong Wang
Hong Xie
J. C. Lui
Defu Lian
Enhong Chen
FedML
46
4
0
26 Feb 2024
Multi-Armed Bandits with Interference
Su Jia
P. Frazier
Nathan Kallus
24
3
0
02 Feb 2024
Cascading Reinforcement Learning
Yihan Du
R. Srikant
Wei Chen
19
0
0
17 Jan 2024
Bandit Learning to Rank with Position-Based Click Models: Personalized and Equal Treatments
Tianchen Zhou
Jia-Wei Liu
Yang Jiao
Chaosheng Dong
Yetian Chen
Yan Gao
Yi Sun
OffRL
25
4
0
08 Nov 2023
Adversarial Attacks on Online Learning to Rank with Stochastic Click Models
Zichen Wang
R. Balasubramanian
Hui Yuan
Chenyu Song
Mengdi Wang
Huazheng Wang
AAML
34
2
0
30 May 2023
Adversarial Attacks on Online Learning to Rank with Click Feedback
Jinhang Zuo
Zhiyao Zhang
Zhiyong Wang
Shuai Li
Mohammad Hajiesmaili
Adam Wierman
AAML
24
3
0
26 May 2023
Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback
Yiliu Wang
Wei Chen
Milan Vojnović
14
2
0
25 May 2023
Exploration of Unranked Items in Safe Online Learning to Re-Rank
Hiroaki Shiino
Kaito Ariu
Kenshi Abe
Togashi Riku
OnRL
20
0
0
02 May 2023
Contextual Combinatorial Bandits with Probabilistically Triggered Arms
Xutong Liu
Jinhang Zuo
Siwei Wang
John C. S. Lui
Mohammad Hajiesmaili
Adam Wierman
Wei Chen
14
15
0
30 Mar 2023
Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms or Independent Arms
Xutong Liu
Jinhang Zuo
Siwei Wang
Carlee Joe-Wong
John C. S. Lui
Wei Chen
38
16
0
31 Aug 2022
UniRank: Unimodal Bandit Algorithm for Online Ranking
Camille-Sovanneary Gauthier
Romaric Gaudel
Elisa Fromont
18
2
0
02 Aug 2022
Learning to Sell a Focal-ancillary Combination
Hanrui Wang
Xiaocheng Li
K. Talluri
19
0
0
23 Jul 2022
Multiple-Play Stochastic Bandits with Shareable Finite-Capacity Arms
Xuchuang Wang
Hong Xie
John C. S. Lui
27
6
0
17 Jun 2022
Minimax Regret for Cascading Bandits
Daniel Vial
Sujay Sanghavi
Sanjay Shakkottai
R. Srikant
13
12
0
23 Mar 2022
Learning Neural Ranking Models Online from Implicit User Feedback
Yiling Jia
Hongning Wang
11
6
0
17 Jan 2022
Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization
Chengshuai Shi
Wei Xiong
Cong Shen
Jing Yang
20
23
0
27 Oct 2021
Contextual Combinatorial Bandits with Changing Action Sets via Gaussian Processes
Andi Nika
Sepehr Elahi
Cem Tekin
24
2
0
05 Oct 2021
Online Learning of Independent Cascade Models with Node-level Feedback
Shuoguang Yang
Van-Anh Truong
21
2
0
06 Sep 2021
No Regrets for Learning the Prior in Bandits
Soumya Basu
B. Kveton
Manzil Zaheer
Csaba Szepesvári
41
33
0
13 Jul 2021
Contextual Recommendations and Low-Regret Cutting-Plane Algorithms
Sreenivas Gollapudi
Guru Guruganesh
Kostas Kollias
Pasin Manurangsi
R. Leme
Jon Schneider
18
3
0
09 Jun 2021
On Learning to Rank Long Sequences with Contextual Bandits
Anirban Santara
Claudio Gentile
Gaurav Aggarwal
Shuai Li
23
0
0
07 Jun 2021
Sleeping Combinatorial Bandits
Kumar Abhishek
Ganesh Ghalme
Sujit Gujar
Y. Narahari
11
0
0
03 Jun 2021
Cascading Bandit under Differential Privacy
Kun Wang
Jing Dong
Baoxiang Wang
Shuai Li
Shuo Shao
27
1
0
24 May 2021
Combinatorial Blocking Bandits with Stochastic Delays
Alexia Atsidakou
O. Papadigenopoulos
Soumya Basu
C. Caramanis
Sanjay Shakkottai
21
8
0
22 May 2021
Recurrent Submodular Welfare and Matroid Blocking Bandits
O. Papadigenopoulos
C. Caramanis
21
2
0
30 Jan 2021
Revenue Maximization and Learning in Products Ranking
Ningyuan Chen
Anran Li
Shuoguang Yang
CML
14
0
0
07 Dec 2020
Position-Based Multiple-Play Bandits with Thompson Sampling
Camille-Sovanneary Gauthier
Romaric Gaudel
Elisa Fromont
19
2
0
28 Sep 2020
Fatigue-aware Bandits for Dependent Click Models
Junyu Cao
Wei-Ju Sun
Zuo-Jun
Z. Shen
M. Ettl
17
13
0
22 Aug 2020
Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation
Xu He
Bo An
Yanghua Li
Haikai Chen
Qingyu Guo
Xuzhao Li
Zhirong Wang
4
11
0
21 Aug 2020
Variable Selection via Thompson Sampling
Yi Liu
Veronika Rockova
11
15
0
01 Jul 2020
Tight Lower Bounds for Combinatorial Multi-Armed Bandits
Nadav Merlis
Shie Mannor
6
16
0
13 Feb 2020
Combinatorial Semi-Bandit in the Non-Stationary Environment
Wei Chen
Liwei Wang
Haoyu Zhao
Kai Zheng
29
18
0
10 Feb 2020
Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting
Zixin Zhong
Wang Chi Cheung
Vincent Y. F. Tan
23
8
0
23 Jan 2020
Observe Before Play: Multi-armed Bandit with Pre-observations
Jinhang Zuo
Xiaoxi Zhang
Carlee Joe-Wong
18
16
0
21 Nov 2019
Thompson Sampling for Combinatorial Network Optimization in Unknown Environments
Alihan Huyuk
Cem Tekin
25
16
0
07 Jul 2019
Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem
Nadav Merlis
Shie Mannor
17
26
0
08 May 2019
Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit Problem
Junyu Cao
Wei-Ju Sun
13
2
0
29 Apr 2019
Waterfall Bandits: Learning to Sell Ads Online
B. Kveton
Saied Mahdian
S. Muthukrishnan
Zheng Wen
Yikun Xian
20
4
0
20 Apr 2019
Introduction to Multi-Armed Bandits
Aleksandrs Slivkins
11
989
0
15 Apr 2019
Online Diverse Learning to Rank from Partial-Click Feedback
Prakhar Gupta
G. Hiranandani
Harvineet Singh
B. Kveton
Zheng Wen
I. Burhanuddin
13
0
0
01 Nov 2018
Thompson Sampling Algorithms for Cascading Bandits
Zixin Zhong
Wang Chi Cheung
Vincent Y. F. Tan
12
14
0
02 Oct 2018
BubbleRank: Safe Online Learning to Re-Rank via Implicit Click Feedback
Chang Li
B. Kveton
Tor Lattimore
Ilya Markov
Maarten de Rijke
Csaba Szepesvári
M. Zoghi
OffRL
13
11
0
15 Jun 2018
TopRank: A practical algorithm for online stochastic ranking
Tor Lattimore
B. Kveton
Shuai Li
Csaba Szepesvári
LRM
14
70
0
06 Jun 2018
Cost-aware Cascading Bandits
Ruida Zhou
Chao Gan
Jing Yang
Cong Shen
14
18
0
22 May 2018
Thompson Sampling for Combinatorial Semi-Bandits
Siwei Wang
Wei Chen
13
125
0
13 Mar 2018
1
2
Next