Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2102.06129
Cited By
Meta-Thompson Sampling
11 February 2021
B. Kveton
Mikhail Konobeev
Manzil Zaheer
Chih-Wei Hsu
Martin Mladenov
Craig Boutilier
Csaba Szepesvári
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Meta-Thompson Sampling"
46 / 46 papers shown
Title
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
23
0
0
06 Apr 2025
Multi-Task Dynamic Pricing in Credit Market with Contextual Information
Adel Javanmard
Jingwei Ji
Renyuan Xu
39
1
0
18 Oct 2024
Minimax-optimal trust-aware multi-armed bandits
Changxiao Cai
Jiacheng Zhang
23
0
0
04 Oct 2024
Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis
Hao Li
Dong Liang
Zheng Xie
28
0
0
10 Sep 2024
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
40
2
0
10 Aug 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Meta Learning in Bandits within Shared Affine Subspaces
Steven Bilaj
Sofien Dhouib
S. Maghsudi
39
2
0
31 Mar 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
27
4
0
15 Feb 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
34
2
0
08 Feb 2024
Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis
Sharu Theresa Jose
Shana Moothedath
30
2
0
21 Jan 2024
Adaptive Interventions with User-Defined Goals for Health Behavior Change
Aishwarya Mandyam
Matthew Joerke
William Denton
Barbara E. Engelhardt
Emma Brunskill
29
1
0
16 Nov 2023
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Mirco Mutti
Ric De Santi
Marcello Restelli
Alexander Marx
Giorgia Ramponi
CML
30
4
0
11 Oct 2023
Interactive Graph Convolutional Filtering
Jin Zhang
Defu Lian
Hong Xie
Yawen Li
Enhong Chen
27
0
0
04 Sep 2023
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
26
2
0
05 Jul 2023
Context-lumpable stochastic bandits
Chung-Wei Lee
Qinghua Liu
Yasin Abbasi-Yadkori
Chi Jin
Tor Lattimore
Csaba Szepesvári
OffRL
100
2
0
22 Jun 2023
Finite-Time Logarithmic Bayes Regret Upper Bounds
Alexia Atsidakou
B. Kveton
S. Katariya
C. Caramanis
Sujay Sanghavi
29
0
0
15 Jun 2023
Sequential Best-Arm Identification with Application to Brain-Computer Interface
Xiaoping Zhou
Botao Hao
Jian Kang
Tor Lattimore
Lexin Li
29
2
0
17 May 2023
Neural Exploitation and Exploration of Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
42
8
0
05 May 2023
Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling
Aadirupa Saha
B. Kveton
36
1
0
16 Mar 2023
Leveraging Demonstrations to Improve Online Learning: Quality Matters
Botao Hao
Rahul Jain
Tor Lattimore
Benjamin Van Roy
Zheng Wen
26
8
0
07 Feb 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Thompson Sampling with Diffusion Generative Prior
Yu-Guan Hsieh
S. Kasiviswanathan
B. Kveton
Patrick Blobaum
DiffM
27
7
0
12 Jan 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
30
10
0
09 Dec 2022
On the Sample Complexity of Representation Learning in Multi-task Bandits with Global and Local structure
Alessio Russo
Alexandre Proutière
26
2
0
28 Nov 2022
Robust Contextual Linear Bandits
Rong Zhu
B. Kveton
22
3
0
26 Oct 2022
Tractable Optimality in Episodic Latent MABs
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
50
3
0
05 Oct 2022
Hierarchical Conversational Preference Elicitation with Bandit Feedback
Jinhang Zuo
Songwen Hu
Tong Yu
Shuai Li
Handong Zhao
Carlee Joe-Wong
34
10
0
06 Sep 2022
Online Bayesian Meta-Learning for Cognitive Tracking Radar
C. Thornton
R. M. Buehrer
A. Martone
26
5
0
07 Jul 2022
Online Meta-Learning in Adversarial Multi-Armed Bandits
Ilya Osadchiy
Kfir Y. Levy
Ron Meir
26
3
0
31 May 2022
Mixed-Effect Thompson Sampling
Imad Aouali
B. Kveton
S. Katariya
OffRL
45
11
0
30 May 2022
Meta Representation Learning with Contextual Linear Bandits
Leonardo Cella
Karim Lounici
Massimiliano Pontil
39
5
0
30 May 2022
Meta-Learning Adversarial Bandits
Maria-Florina Balcan
Keegan Harris
M. Khodak
Zhiwei Steven Wu
FedML
AAML
35
7
0
27 May 2022
Multi-Environment Meta-Learning in Stochastic Linear Bandits
Ahmadreza Moradipari
Mohammad Ghavamzadeh
Taha Rajabzadeh
Christos Thrampoulidis
M. Alizadeh
19
4
0
12 May 2022
Safe Exploration for Efficient Policy Evaluation and Comparison
Runzhe Wan
B. Kveton
Rui Song
OffRL
20
10
0
26 Feb 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Runzhe Wan
Linjuan Ge
Rui Song
18
13
0
26 Feb 2022
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Javad Azizi
T. Duong
Yasin Abbasi-Yadkori
András Gyorgy
Claire Vernade
Mohammad Ghavamzadeh
34
8
0
25 Feb 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Multi-task Representation Learning with Stochastic Linear Bandits
Leonardo Cella
Karim Lounici
Grégoire Pacreau
Massimiliano Pontil
16
21
0
21 Feb 2022
Synthetically Controlled Bandits
Vivek Farias
C. Moallemi
Tianyi Peng
Andrew Zheng
30
13
0
14 Feb 2022
Deep Hierarchy in Bandits
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
27
20
0
03 Feb 2022
Multitask Learning and Bandits via Robust Statistics
Kan Xu
Hamsa Bastani
35
5
0
28 Dec 2021
Hierarchical Bayesian Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
FedML
47
37
0
12 Nov 2021
Safe Data Collection for Offline and Online Policy Learning
Ruihao Zhu
B. Kveton
OffRL
11
5
0
08 Nov 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Runzhe Wan
Linjuan Ge
Rui Song
30
28
0
13 Aug 2021
No Regrets for Learning the Prior in Bandits
Soumya Basu
B. Kveton
Manzil Zaheer
Csaba Szepesvári
41
33
0
13 Jul 2021
Bayesian decision-making under misspecified priors with applications to meta-learning
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire
17
49
0
03 Jul 2021
1