1307.6887
Cited By
Sequential Transfer in Multi-armed Bandit with Finite Set of Models
25 July 2013
M. G. Azar
A. Lazaric
Emma Brunskill
OffRL
Papers citing
"Sequential Transfer in Multi-armed Bandit with Finite Set of Models"
28 / 28 papers shown
Title
Causally Abstracted Multi-armed Bandits
Fabio Massimo Zennaro
Nicholas Bishop
Joel Dyer
Yorgos Felekis
Anisoara Calinescu
Michael Wooldridge
Theodoros Damoulas
38
2
0
26 Apr 2024
Meta Learning in Bandits within Shared Affine Subspaces
Steven Bilaj
Sofien Dhouib
S. Maghsudi
41
2
0
31 Mar 2024
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
28
2
0
05 Jul 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
30
10
0
09 Dec 2022
Hypothesis Transfer in Bandits by Weighted Models
Steven Bilaj
Sofien Dhouib
S. Maghsudi
17
2
0
14 Nov 2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Alekh Agarwal
Yuda Song
Wen Sun
Kaiwen Wang
Mengdi Wang
Xuezhou Zhang
OffRL
23
33
0
29 May 2022
Multi-Environment Meta-Learning in Stochastic Linear Bandits
Ahmadreza Moradipari
Mohammad Ghavamzadeh
Taha Rajabzadeh
Christos Thrampoulidis
M. Alizadeh
19
4
0
12 May 2022
Representation Learning for Context-Dependent Decision-Making
Yuzhen Qin
Tommaso Menara
Samet Oymak
ShiNung Ching
Fabio Pasqualetti
40
3
0
12 May 2022
Modeling Attrition in Recommender Systems with Departing Bandits
Omer Ben-Porat
Lee Cohen
Liu Leqi
Zachary Chase Lipton
Yishay Mansour
13
11
0
25 Mar 2022
PAC-Bayesian Lifelong Learning For Multi-Armed Bandits
H. Flynn
David Reeb
M. Kandemir
Jan Peters
34
7
0
07 Mar 2022
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Javad Azizi
T. Duong
Yasin Abbasi-Yadkori
András György
Claire Vernade
Mohammad Ghavamzadeh
34
8
0
25 Feb 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Non-Stationary Representation Learning in Sequential Linear Bandits
Yuzhen Qin
Tommaso Menara
Samet Oymak
ShiNung Ching
Fabio Pasqualetti
OffRL
40
17
0
13 Jan 2022
Chronological Causal Bandits
Neil Dhir
18
0
0
03 Dec 2021
Hierarchical Bayesian Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
FedML
47
38
0
12 Nov 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Runzhe Wan
Linjuan Ge
Rui Song
36
28
0
13 Aug 2021
No Regrets for Learning the Prior in Bandits
Soumya Basu
B. Kveton
Manzil Zaheer
Csaba Szepesvári
41
33
0
13 Jul 2021
Bayesian decision-making under misspecified priors with applications to meta-learning
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire
40
49
0
03 Jul 2021
Meta-Thompson Sampling
B. Kveton
Mikhail Konobeev
Manzil Zaheer
Chih-Wei Hsu
Martin Mladenov
Craig Boutilier
Csaba Szepesvári
50
61
0
11 Feb 2021
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
15
10
0
15 Jun 2020
Meta-learning with Stochastic Linear Bandits
Leonardo Cella
A. Lazaric
Massimiliano Pontil
FedML
22
56
0
18 May 2020
Relative Error Tensor Low Rank Approximation
Zhao Song
David P. Woodruff
Peilin Zhong
30
122
0
26 Apr 2017
On Context-Dependent Clustering of Bandits
Claudio Gentile
Shuai Li
Purushottam Kar
Alexandros Karatzoglou
Evans Etrue
Giovanni Zappella
15
138
0
06 Aug 2016
A PAC RL Algorithm for Episodic POMDPs
Z. Guo
Shayan Doroudi
Emma Brunskill
24
55
0
25 May 2016
Latent Contextual Bandits and their Application to Personalized Recommendations for New Users
Li Zhou
Emma Brunskill
14
62
0
22 Apr 2016
Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret
Haitham Bou-Ammar
Rasul Tutunov
Eric Eaton
OffRL
CLL
26
64
0
21 May 2015
Online Clustering of Bandits
Claudio Gentile
Shuai Li
Giovanni Zappella
26
263
0
31 Jan 2014
Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations
Finale Doshi-Velez
George Konidaris
41
128
0
15 Aug 2013