1706.00977
Cited By
Thompson Sampling for the MNL-Bandit
3 June 2017
Shipra Agrawal
Vashist Avadhanula
Vineet Goyal
A. Zeevi
Papers citing "Thompson Sampling for the MNL-Bandit"
19 / 19 papers shown
Title
Learning an Optimal Assortment Policy under Observational Data
Yuxuan Han
Han Zhong
Miao Lu
Jose H. Blanchet
Zhengyuan Zhou
OffRL
73
0
0
10 Feb 2025
Online Joint Assortment-Inventory Optimization under MNL Choices
Yong Liang
Xiaojie Mao
Shiyuan Wang
56
0
0
03 Jan 2025
Harm Mitigation in Recommender Systems under User Preference Dynamics
Jerry Chee
Shankar Kalyanaraman
S. Ernala
Udi Weinsberg
Sarah Dean
Stratis Ioannidis
57
5
0
14 Jun 2024
MNL-Bandit in non-stationary environments
Ayoub Foussoul
Vineet Goyal
Varun Gupta
39
2
0
04 Mar 2023
Multiplier Bootstrap-based Exploration
Runzhe Wan
Haoyu Wei
Branislav Kveton
R. Song
21
3
0
03 Feb 2023
Combinatorial Inference on the Optimal Assortment in Multinomial Logit Models
Shuting Shen
Xi Chen
Ethan X. Fang
Junwei Lu
27
2
0
28 Jan 2023
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Runzhe Wan
Linjuan Ge
Rui Song
23
13
0
26 Feb 2022
Online Learning of Independent Cascade Models with Node-level Feedback
Shuoguang Yang
Van-Anh Truong
25
3
0
06 Sep 2021
Pure Exploration with Structured Preference Feedback
Shubham Gupta
Aadirupa Saha
S. Katariya
35
0
0
12 Apr 2021
UCB-based Algorithms for Multinomial Logistic Regression Bandits
Sanae Amani
Christos Thrampoulidis
34
10
0
21 Mar 2021
Online Multi-Armed Bandits with Adaptive Inference
Maria Dimakopoulou
Zhimei Ren
Zhengyuan Zhou
40
34
0
25 Feb 2021
On the Suboptimality of Thompson Sampling in High Dimensions
Raymond Zhang
Richard Combes
19
4
0
10 Feb 2021
Fully Gap-Dependent Bounds for Multinomial Logit Bandit
Jiaqi Yang
16
2
0
19 Nov 2020
Near-Optimal MNL Bandits Under Risk Criteria
Guangyu Xi
Chao Tao
Yuanshuo Zhou
19
3
0
26 Sep 2020
Online Learning and Optimization for Revenue Management Problems with Add-on Discounts
D. Simchi-Levi
Rui Sun
Huanan Zhang
16
11
0
02 May 2020
Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit Problem
Junyu Cao
Wei-Ju Sun
21
2
0
29 Apr 2019
Dynamic Assortment Selection under the Nested Logit Models
Xi Chen
Chao Shi
Yining Wang
Yuanshuo Zhou
22
13
0
27 Jun 2018
An Optimal Policy for Dynamic Assortment Planning Under Uncapacitated Multinomial Logit Models
Xi Chen
Yining Wang
Yuanshuo Zhou
22
4
0
12 May 2018
MNL-Bandit: A Dynamic Learning Approach to Assortment Selection
Shipra Agrawal
Vashist Avadhanula
Vineet Goyal
A. Zeevi
39
154
0
13 Jun 2017