Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.09943
Cited By
Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay
19 July 2023
Thomas M. McDonald
Lucas Maystre
M. Lalmas
Daniel Russo
K. Ciosek
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay"
8 / 8 papers shown
Title
Partial Likelihood Thompson Sampling
Han Wu
Stefan Wager
LM&MA
39
2
0
02 Mar 2022
No Regrets for Learning the Prior in Bandits
Soumya Basu
Branislav Kveton
Manzil Zaheer
Csaba Szepesvári
80
33
0
13 Jul 2021
Bayesian decision-making under misspecified priors with applications to meta-learning
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire
76
50
0
03 Jul 2021
Targeting for long-term outcomes
Jeremy Yang
Dean Eckles
Paramveer S. Dhillon
Sinan Aral
OffRL
34
52
0
29 Oct 2020
Quantifying the Carbon Emissions of Machine Learning
Alexandre Lacoste
A. Luccioni
Victor Schmidt
Thomas Dandres
86
695
0
21 Oct 2019
Meta Dynamic Pricing: Transfer Learning Across Experiments
Hamsa Bastani
D. Simchi-Levi
Ruihao Zhu
91
91
0
28 Feb 2019
Best arm identification in multi-armed bandits with delayed feedback
Aditya Grover
Todor Markov
Patrick Attia
Norman Jin
Nicholas Perkins
...
M. Chen
Zi Yang
Stephen J. Harris
W. Chueh
Stefano Ermon
48
74
0
29 Mar 2018
Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization
Lisha Li
Kevin Jamieson
Giulia DeSalvo
Afshin Rostamizadeh
Ameet Talwalkar
213
2,321
0
21 Mar 2016
1