Thompson Sampling for the MNL-Bandit

3 June 2017

Vashist Avadhanula

Papers citing "Thompson Sampling for the MNL-Bandit"

Title | Authors | Topic | Counts | Date
Learning an Optimal Assortment Policy under Observational Data | Yuxuan Han, Han Zhong, Miao Lu, Jose H. Blanchet, Zhengyuan Zhou | OffRL | 73 0 0 | 10 Feb 2025
Online Joint Assortment-Inventory Optimization under MNL Choices | Yong Liang, Xiaojie Mao, Shiyuan Wang | - | 56 0 0 | 03 Jan 2025
Harm Mitigation in Recommender Systems under User Preference Dynamics | Jerry Chee, Shankar Kalyanaraman, S. Ernala, Udi Weinsberg, Sarah Dean, Stratis Ioannidis | - | 57 5 0 | 14 Jun 2024
MNL-Bandit in non-stationary environments | Ayoub Foussoul, Vineet Goyal, Varun Gupta | - | 39 2 0 | 04 Mar 2023
Multiplier Bootstrap-based Exploration | Runzhe Wan, Haoyu Wei, Branislav Kveton, R. Song | - | 21 3 0 | 03 Feb 2023
Combinatorial Inference on the Optimal Assortment in Multinomial Logit Models | Shuting Shen, Xi Chen, Ethan X. Fang, Junwei Lu | - | 27 2 0 | 28 Jan 2023
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework | Runzhe Wan, Linjuan Ge, Rui Song | - | 23 13 0 | 26 Feb 2022
Online Learning of Independent Cascade Models with Node-level Feedback | Shuoguang Yang, Van-Anh Truong | - | 25 3 0 | 06 Sep 2021
Pure Exploration with Structured Preference Feedback | Shubham Gupta, Aadirupa Saha, S. Katariya | - | 35 0 0 | 12 Apr 2021
UCB-based Algorithms for Multinomial Logistic Regression Bandits | Sanae Amani, Christos Thrampoulidis | - | 34 10 0 | 21 Mar 2021
Online Multi-Armed Bandits with Adaptive Inference | Maria Dimakopoulou, Zhimei Ren, Zhengyuan Zhou | - | 40 34 0 | 25 Feb 2021
On the Suboptimality of Thompson Sampling in High Dimensions | Raymond Zhang, Richard Combes | - | 19 4 0 | 10 Feb 2021
Fully Gap-Dependent Bounds for Multinomial Logit Bandit | Jiaqi Yang | - | 16 2 0 | 19 Nov 2020
Near-Optimal MNL Bandits Under Risk Criteria | Guangyu Xi, Chao Tao, Yuanshuo Zhou | - | 19 3 0 | 26 Sep 2020
Online Learning and Optimization for Revenue Management Problems with Add-on Discounts | D. Simchi-Levi, Rui Sun, Huanan Zhang | - | 16 11 0 | 02 May 2020
Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit Problem | Junyu Cao, Wei-Ju Sun | - | 21 2 0 | 29 Apr 2019
Dynamic Assortment Selection under the Nested Logit Models | Xi Chen, Chao Shi, Yining Wang, Yuanshuo Zhou | - | 22 13 0 | 27 Jun 2018
An Optimal Policy for Dynamic Assortment Planning Under Uncapacitated Multinomial Logit Models | Xi Chen, Yining Wang, Yuanshuo Zhou | - | 22 4 0 | 12 May 2018
MNL-Bandit: A Dynamic Learning Approach to Assortment Selection | Shipra Agrawal, Vashist Avadhanula, Vineet Goyal, A. Zeevi | - | 39 154 0 | 13 Jun 2017