1903.07844
Cited By
Shrinking the Upper Confidence Bound: A Dynamic Product Selection Problem for Urban Warehouses
19 March 2019
Rong Jin
D. Simchi-Levi
Liwen Wang
Xinshang Wang
Sen Yang
Papers citing "Shrinking the Upper Confidence Bound: A Dynamic Product Selection Problem for Urban Warehouses"
8 / 8 papers shown
Title
Introduction to Multi-Armed Bandits
Aleksandrs Slivkins
38
999
0
15 Apr 2019
Online Learning and Decision-Making under Generalized Linear Model with High-Dimensional Data
Xue Wang
Mike Mingcheng Wei
Tao Yao
73
4
0
07 Dec 2018
Efficient Learning in Large-Scale Combinatorial Semi-Bandits
Zheng Wen
Branislav Kveton
Azin Ashkan
OffRL
61
96
0
28 Jun 2014
Learning to Optimize Via Posterior Sampling
Daniel Russo
Benjamin Van Roy
60
697
0
11 Jan 2013
Representation Learning: A Review and New Perspectives
Yoshua Bengio
Aaron Courville
Pascal Vincent
OOD
SSL
81
12,370
0
24 Jun 2012
Towards minimax policies for online linear optimization with bandit feedback
Sébastien Bubeck
Nicolò Cesa-Bianchi
Sham Kakade
OffRL
58
149
0
14 Feb 2012
Combinatorial Network Optimization with Unknown Variables: Multi-Armed Bandits with Linear Rewards
Yi Gai
Bhaskar Krishnamachari
Rahul Jain
57
261
0
22 Nov 2010
Linearly Parameterized Bandits
Paat Rusmevichientong
J. Tsitsiklis
45
558
0
18 Dec 2008