ResearchTrend.AI
Meta-Thompson Sampling


11 February 2021
B. Kveton
Mikhail Konobeev
Manzil Zaheer
Chih-Wei Hsu
Martin Mladenov
Craig Boutilier
Csaba Szepesvári

Papers citing "Meta-Thompson Sampling"

46 / 46 papers shown
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
23
0
0
06 Apr 2025
Multi-Task Dynamic Pricing in Credit Market with Contextual Information
Adel Javanmard
Jingwei Ji
Renyuan Xu
39
1
0
18 Oct 2024
Minimax-optimal trust-aware multi-armed bandits
Changxiao Cai
Jiacheng Zhang
23
0
0
04 Oct 2024
Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis
Hao Li
Dong Liang
Zheng Xie
28
0
0
10 Sep 2024
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
40
2
0
10 Aug 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Meta Learning in Bandits within Shared Affine Subspaces
Steven Bilaj
Sofien Dhouib
S. Maghsudi
39
2
0
31 Mar 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
27
4
0
15 Feb 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
34
2
0
08 Feb 2024
Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis
Sharu Theresa Jose
Shana Moothedath
30
2
0
21 Jan 2024
Adaptive Interventions with User-Defined Goals for Health Behavior Change
Aishwarya Mandyam
Matthew Joerke
William Denton
Barbara E. Engelhardt
Emma Brunskill
29
1
0
16 Nov 2023
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Mirco Mutti
Ric De Santi
Marcello Restelli
Alexander Marx
Giorgia Ramponi
CML
30
4
0
11 Oct 2023
Interactive Graph Convolutional Filtering
Jin Zhang
Defu Lian
Hong Xie
Yawen Li
Enhong Chen
27
0
0
04 Sep 2023
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
26
2
0
05 Jul 2023
Context-lumpable stochastic bandits
Chung-Wei Lee
Qinghua Liu
Yasin Abbasi-Yadkori
Chi Jin
Tor Lattimore
Csaba Szepesvári
OffRL
100
2
0
22 Jun 2023
Finite-Time Logarithmic Bayes Regret Upper Bounds
Alexia Atsidakou
B. Kveton
S. Katariya
C. Caramanis
Sujay Sanghavi
29
0
0
15 Jun 2023
Sequential Best-Arm Identification with Application to Brain-Computer Interface
Xiaoping Zhou
Botao Hao
Jian Kang
Tor Lattimore
Lexin Li
29
2
0
17 May 2023
Neural Exploitation and Exploration of Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
42
8
0
05 May 2023
Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling
Aadirupa Saha
B. Kveton
36
1
0
16 Mar 2023
Leveraging Demonstrations to Improve Online Learning: Quality Matters
Botao Hao
Rahul Jain
Tor Lattimore
Benjamin Van Roy
Zheng Wen
26
8
0
07 Feb 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Thompson Sampling with Diffusion Generative Prior
Yu-Guan Hsieh
S. Kasiviswanathan
B. Kveton
Patrick Blobaum
DiffM
27
7
0
12 Jan 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
30
10
0
09 Dec 2022
On the Sample Complexity of Representation Learning in Multi-task Bandits with Global and Local structure
Alessio Russo
Alexandre Proutière
26
2
0
28 Nov 2022
Robust Contextual Linear Bandits
Rong Zhu
B. Kveton
22
3
0
26 Oct 2022
Tractable Optimality in Episodic Latent MABs
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
50
3
0
05 Oct 2022
Hierarchical Conversational Preference Elicitation with Bandit Feedback
Jinhang Zuo
Songwen Hu
Tong Yu
Shuai Li
Handong Zhao
Carlee Joe-Wong
34
10
0
06 Sep 2022
Online Bayesian Meta-Learning for Cognitive Tracking Radar
C. Thornton
R. M. Buehrer
A. Martone
26
5
0
07 Jul 2022
Online Meta-Learning in Adversarial Multi-Armed Bandits
Ilya Osadchiy
Kfir Y. Levy
Ron Meir
26
3
0
31 May 2022
Mixed-Effect Thompson Sampling
Imad Aouali
B. Kveton
S. Katariya
OffRL
45
11
0
30 May 2022
Meta Representation Learning with Contextual Linear Bandits
Leonardo Cella
Karim Lounici
Massimiliano Pontil
39
5
0
30 May 2022
Meta-Learning Adversarial Bandits
Maria-Florina Balcan
Keegan Harris
M. Khodak
Zhiwei Steven Wu
FedML
AAML
35
7
0
27 May 2022
Multi-Environment Meta-Learning in Stochastic Linear Bandits
Ahmadreza Moradipari
Mohammad Ghavamzadeh
Taha Rajabzadeh
Christos Thrampoulidis
M. Alizadeh
19
4
0
12 May 2022
Safe Exploration for Efficient Policy Evaluation and Comparison
Runzhe Wan
B. Kveton
Rui Song
OffRL
20
10
0
26 Feb 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Runzhe Wan
Linjuan Ge
Rui Song
18
13
0
26 Feb 2022
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Javad Azizi
T. Duong
Yasin Abbasi-Yadkori
András Gyorgy
Claire Vernade
Mohammad Ghavamzadeh
34
8
0
25 Feb 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Multi-task Representation Learning with Stochastic Linear Bandits
Leonardo Cella
Karim Lounici
Grégoire Pacreau
Massimiliano Pontil
16
21
0
21 Feb 2022
Synthetically Controlled Bandits
Vivek Farias
C. Moallemi
Tianyi Peng
Andrew Zheng
30
13
0
14 Feb 2022
Deep Hierarchy in Bandits
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
27
20
0
03 Feb 2022
Multitask Learning and Bandits via Robust Statistics
Kan Xu
Hamsa Bastani
35
5
0
28 Dec 2021
Hierarchical Bayesian Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
FedML
47
37
0
12 Nov 2021
Safe Data Collection for Offline and Online Policy Learning
Ruihao Zhu
B. Kveton
OffRL
11
5
0
08 Nov 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Runzhe Wan
Linjuan Ge
Rui Song
30
28
0
13 Aug 2021
No Regrets for Learning the Prior in Bandits
Soumya Basu
B. Kveton
Manzil Zaheer
Csaba Szepesvári
41
33
0
13 Jul 2021
Bayesian decision-making under misspecified priors with applications to meta-learning
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire
17
49
0
03 Jul 2021