ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1307.6887
  4. Cited By
Sequential Transfer in Multi-armed Bandit with Finite Set of Models

Sequential Transfer in Multi-armed Bandit with Finite Set of Models

25 July 2013
M. G. Azar
A. Lazaric
Emma Brunskill
    OffRL
ArXivPDFHTML

Papers citing "Sequential Transfer in Multi-armed Bandit with Finite Set of Models"

28 / 28 papers shown
Title
Causally Abstracted Multi-armed Bandits
Causally Abstracted Multi-armed Bandits
Fabio Massimo Zennaro
Nicholas Bishop
Joel Dyer
Yorgos Felekis
Anisoara Calinescu
Michael Wooldridge
Theodoros Damoulas
38
2
0
26 Apr 2024
Meta Learning in Bandits within Shared Affine Subspaces
Meta Learning in Bandits within Shared Affine Subspaces
Steven Bilaj
Sofien Dhouib
S. Maghsudi
41
2
0
31 Mar 2024
Meta-Learning Adversarial Bandit Algorithms
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
28
2
0
05 Jul 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
30
10
0
09 Dec 2022
Hypothesis Transfer in Bandits by Weighted Models
Hypothesis Transfer in Bandits by Weighted Models
Steven Bilaj
Sofien Dhouib
S. Maghsudi
17
2
0
14 Nov 2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Provable Benefits of Representational Transfer in Reinforcement Learning
Alekh Agarwal
Yuda Song
Wen Sun
Kaiwen Wang
Mengdi Wang
Xuezhou Zhang
OffRL
23
33
0
29 May 2022
Multi-Environment Meta-Learning in Stochastic Linear Bandits
Multi-Environment Meta-Learning in Stochastic Linear Bandits
Ahmadreza Moradipari
Mohammad Ghavamzadeh
Taha Rajabzadeh
Christos Thrampoulidis
M. Alizadeh
19
4
0
12 May 2022
Representation Learning for Context-Dependent Decision-Making
Representation Learning for Context-Dependent Decision-Making
Yuzhen Qin
Tommaso Menara
Samet Oymak
ShiNung Ching
Fabio Pasqualetti
40
3
0
12 May 2022
Modeling Attrition in Recommender Systems with Departing Bandits
Modeling Attrition in Recommender Systems with Departing Bandits
Omer Ben-Porat
Lee Cohen
Liu Leqi
Zachary Chase Lipton
Yishay Mansour
13
11
0
25 Mar 2022
PAC-Bayesian Lifelong Learning For Multi-Armed Bandits
PAC-Bayesian Lifelong Learning For Multi-Armed Bandits
H. Flynn
David Reeb
M. Kandemir
Jan Peters
34
7
0
07 Mar 2022
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal
  Arms
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Javad Azizi
T. Duong
Yasin Abbasi-Yadkori
András Gyorgy
Claire Vernade
Mohammad Ghavamzadeh
34
8
0
25 Feb 2022
Meta-Learning for Simple Regret Minimization
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Non-Stationary Representation Learning in Sequential Linear Bandits
Non-Stationary Representation Learning in Sequential Linear Bandits
Yuzhen Qin
Tommaso Menara
Samet Oymak
ShiNung Ching
Fabio Pasqualetti
OffRL
40
17
0
13 Jan 2022
Chronological Causal Bandits
Chronological Causal Bandits
Neil Dhir
18
0
0
03 Dec 2021
Hierarchical Bayesian Bandits
Hierarchical Bayesian Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
FedML
47
38
0
12 Nov 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Runzhe Wan
Linjuan Ge
Rui Song
36
28
0
13 Aug 2021
No Regrets for Learning the Prior in Bandits
No Regrets for Learning the Prior in Bandits
Soumya Basu
B. Kveton
Manzil Zaheer
Csaba Szepesvári
41
33
0
13 Jul 2021
Bayesian decision-making under misspecified priors with applications to
  meta-learning
Bayesian decision-making under misspecified priors with applications to meta-learning
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire
40
49
0
03 Jul 2021
Meta-Thompson Sampling
Meta-Thompson Sampling
B. Kveton
Mikhail Konobeev
Manzil Zaheer
Chih-Wei Hsu
Martin Mladenov
Craig Boutilier
Csaba Szepesvári
50
61
0
11 Feb 2021
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic
  Optimality
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
15
10
0
15 Jun 2020
Meta-learning with Stochastic Linear Bandits
Meta-learning with Stochastic Linear Bandits
Leonardo Cella
A. Lazaric
Massimiliano Pontil
FedML
22
56
0
18 May 2020
Relative Error Tensor Low Rank Approximation
Relative Error Tensor Low Rank Approximation
Zhao Song
David P. Woodruff
Peilin Zhong
30
122
0
26 Apr 2017
On Context-Dependent Clustering of Bandits
On Context-Dependent Clustering of Bandits
Claudio Gentile
Shuai Li
Purushottam Kar
Alexandros Karatzoglou
Evans Etrue
Giovanni Zappella
15
138
0
06 Aug 2016
A PAC RL Algorithm for Episodic POMDPs
A PAC RL Algorithm for Episodic POMDPs
Z. Guo
Shayan Doroudi
Emma Brunskill
24
55
0
25 May 2016
Latent Contextual Bandits and their Application to Personalized
  Recommendations for New Users
Latent Contextual Bandits and their Application to Personalized Recommendations for New Users
Li Zhou
Emma Brunskill
14
62
0
22 Apr 2016
Safe Policy Search for Lifelong Reinforcement Learning with Sublinear
  Regret
Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret
Haitham Bou-Ammar
Rasul Tutunov
Eric Eaton
OffRL
CLL
26
64
0
21 May 2015
Online Clustering of Bandits
Online Clustering of Bandits
Claudio Gentile
Shuai Li
Giovanni Zappella
26
263
0
31 Jan 2014
Hidden Parameter Markov Decision Processes: A Semiparametric Regression
  Approach for Discovering Latent Task Parametrizations
Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations
Finale Doshi-Velez
George Konidaris
41
128
0
15 Aug 2013
1