Shrinking the Upper Confidence Bound: A Dynamic Product Selection Problem for Urban Warehouses

19 March 2019

Papers citing "Shrinking the Upper Confidence Bound: A Dynamic Product Selection Problem for Urban Warehouses"

8 / 8 papers shown

Title
Introduction to Multi-Armed Bandits Aleksandrs Slivkins 38 999 0 15 Apr 2019
Online Learning and Decision-Making under Generalized Linear Model with High-Dimensional Data Xue Wang Mike Mingcheng Wei Tao Yao 73 4 0 07 Dec 2018
Efficient Learning in Large-Scale Combinatorial Semi-Bandits Zheng Wen Branislav Kveton Azin Ashkan OffRL 61 96 0 28 Jun 2014
Learning to Optimize Via Posterior Sampling Daniel Russo Benjamin Van Roy 60 697 0 11 Jan 2013
Representation Learning: A Review and New Perspectives Yoshua Bengio Aaron Courville Pascal Vincent OOD SSL 81 12,370 0 24 Jun 2012
Towards minimax policies for online linear optimization with bandit feedback Sébastien Bubeck Nicolò Cesa-Bianchi Sham Kakade OffRL 58 149 0 14 Feb 2012
Combinatorial Network Optimization with Unknown Variables: Multi-Armed Bandits with Linear Rewards Yi Gai Bhaskar Krishnamachari Rahul Jain 57 261 0 22 Nov 2010
Linearly Parameterized Bandits Paat Rusmevichientong J. Tsitsiklis 45 558 0 18 Dec 2008