Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.01509
Cited By
Bayesian decision-making under misspecified priors with applications to meta-learning
3 July 2021
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Bayesian decision-making under misspecified priors with applications to meta-learning"
35 / 35 papers shown
Title
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
92
1
0
29 Apr 2025
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
23
0
0
06 Apr 2025
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
42
2
0
10 Aug 2024
Test-Time Regret Minimization in Meta Reinforcement Learning
Mirco Mutti
Aviv Tamar
26
4
0
04 Jun 2024
Leveraging (Biased) Information: Multi-armed Bandits with Offline Data
Wang Chi Cheung
Lixing Lyu
OffRL
32
3
0
04 May 2024
A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting
Masaki Adachi
Satoshi Hayakawa
Martin Jørgensen
Saad Hamid
Harald Oberhauser
Michael A. Osborne
GP
32
3
0
18 Apr 2024
Can large language models explore in-context?
Akshay Krishnamurthy
Keegan Harris
Dylan J. Foster
Cyril Zhang
Aleksandrs Slivkins
LM&Ro
LLMAG
LRM
123
23
0
22 Mar 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen A. Walton
Susan Murphy
41
4
0
27 Feb 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
27
4
0
15 Feb 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
36
2
0
08 Feb 2024
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
Bayesian Active Learning in the Presence of Nuisance Parameters
Sabina J. Sloman
Ayush Bharti
Julien Martinelli
Samuel Kaski
26
3
0
23 Oct 2023
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Mirco Mutti
Ric De Santi
Marcello Restelli
Alexander Marx
Giorgia Ramponi
CML
30
4
0
11 Oct 2023
Interactive Graph Convolutional Filtering
Jin Zhang
Defu Lian
Hong Xie
Yawen Li
Enhong Chen
27
0
0
04 Sep 2023
Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay
Thomas M. McDonald
Lucas Maystre
M. Lalmas
Daniel Russo
K. Ciosek
OffRL
27
15
0
19 Jul 2023
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
28
2
0
05 Jul 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
35
74
0
26 Jun 2023
Finite-Time Logarithmic Bayes Regret Upper Bounds
Alexia Atsidakou
B. Kveton
S. Katariya
C. Caramanis
Sujay Sanghavi
29
0
0
15 Jun 2023
Sequential Best-Arm Identification with Application to Brain-Computer Interface
Xiaoping Zhou
Botao Hao
Jian Kang
Tor Lattimore
Lexin Li
32
2
0
17 May 2023
Leveraging Demonstrations to Improve Online Learning: Quality Matters
Botao Hao
Rahul Jain
Tor Lattimore
Benjamin Van Roy
Zheng Wen
29
8
0
07 Feb 2023
Thompson Sampling with Diffusion Generative Prior
Yu-Guan Hsieh
S. Kasiviswanathan
B. Kveton
Patrick Blobaum
DiffM
27
7
0
12 Jan 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
30
10
0
09 Dec 2022
Lifelong Bandit Optimization: No Prior and No Regret
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
46
3
0
27 Oct 2022
Tractable Optimality in Episodic Latent MABs
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
50
3
0
05 Oct 2022
Online Bayesian Meta-Learning for Cognitive Tracking Radar
C. Thornton
R. M. Buehrer
A. Martone
29
5
0
07 Jul 2022
Meta Reinforcement Learning with Finite Training Tasks -- a Density Estimation Approach
Zohar Rimon
Aviv Tamar
Gilad Adler
OOD
OffRL
34
8
0
21 Jun 2022
Mixed-Effect Thompson Sampling
Imad Aouali
B. Kveton
S. Katariya
OffRL
45
11
0
30 May 2022
Meta-Learning Adversarial Bandits
Maria-Florina Balcan
Keegan Harris
M. Khodak
Zhiwei Steven Wu
FedML
AAML
43
7
0
27 May 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Multi-task Representation Learning with Stochastic Linear Bandits
Leonardo Cella
Karim Lounici
Grégoire Pacreau
Massimiliano Pontil
18
21
0
21 Feb 2022
Meta-Learning Hypothesis Spaces for Sequential Decision-making
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
OffRL
33
6
0
01 Feb 2022
Gaussian Imagination in Bandit Learning
Yueyang Liu
Adithya M. Devraj
Benjamin Van Roy
Kuang Xu
27
7
0
06 Jan 2022
Hierarchical Bayesian Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
FedML
47
37
0
12 Nov 2021
Safe Data Collection for Offline and Online Policy Learning
Ruihao Zhu
B. Kveton
OffRL
11
5
0
08 Nov 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Runzhe Wan
Linjuan Ge
Rui Song
36
28
0
13 Aug 2021
1