Bayesian decision-making under misspecified priors with applications to meta-learning

3 July 2021
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire

Papers citing "Bayesian decision-making under misspecified priors with applications to meta-learning"

35 / 35 papers shown
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
92
1
0
29 Apr 2025
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
23
0
0
06 Apr 2025
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
42
2
0
10 Aug 2024
Test-Time Regret Minimization in Meta Reinforcement Learning
Mirco Mutti
Aviv Tamar
26
4
0
04 Jun 2024
Leveraging (Biased) Information: Multi-armed Bandits with Offline Data
Wang Chi Cheung
Lixing Lyu
OffRL
32
3
0
04 May 2024
A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting
Masaki Adachi
Satoshi Hayakawa
Martin Jørgensen
Saad Hamid
Harald Oberhauser
Michael A. Osborne
GP
32
3
0
18 Apr 2024
Can large language models explore in-context?
Akshay Krishnamurthy
Keegan Harris
Dylan J. Foster
Cyril Zhang
Aleksandrs Slivkins
LM&Ro
LLMAG
LRM
123
23
0
22 Mar 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen A. Walton
Susan Murphy
41
4
0
27 Feb 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
27
4
0
15 Feb 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
36
2
0
08 Feb 2024
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
Bayesian Active Learning in the Presence of Nuisance Parameters
Sabina J. Sloman
Ayush Bharti
Julien Martinelli
Samuel Kaski
26
3
0
23 Oct 2023
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Mirco Mutti
Ric De Santi
Marcello Restelli
Alexander Marx
Giorgia Ramponi
CML
30
4
0
11 Oct 2023
Interactive Graph Convolutional Filtering
Jin Zhang
Defu Lian
Hong Xie
Yawen Li
Enhong Chen
27
0
0
04 Sep 2023
Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay
Thomas M. McDonald
Lucas Maystre
M. Lalmas
Daniel Russo
K. Ciosek
OffRL
27
15
0
19 Jul 2023
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
28
2
0
05 Jul 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
35
74
0
26 Jun 2023
Finite-Time Logarithmic Bayes Regret Upper Bounds
Alexia Atsidakou
B. Kveton
S. Katariya
C. Caramanis
Sujay Sanghavi
29
0
0
15 Jun 2023
Sequential Best-Arm Identification with Application to Brain-Computer Interface
Xiaoping Zhou
Botao Hao
Jian Kang
Tor Lattimore
Lexin Li
32
2
0
17 May 2023
Leveraging Demonstrations to Improve Online Learning: Quality Matters
Botao Hao
Rahul Jain
Tor Lattimore
Benjamin Van Roy
Zheng Wen
29
8
0
07 Feb 2023
Thompson Sampling with Diffusion Generative Prior
Yu-Guan Hsieh
S. Kasiviswanathan
B. Kveton
Patrick Blobaum
DiffM
27
7
0
12 Jan 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
30
10
0
09 Dec 2022
Lifelong Bandit Optimization: No Prior and No Regret
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
46
3
0
27 Oct 2022
Tractable Optimality in Episodic Latent MABs
Jeongyeol Kwon
Yonathan Efroni
C. Caramanis
Shie Mannor
50
3
0
05 Oct 2022
Online Bayesian Meta-Learning for Cognitive Tracking Radar
C. Thornton
R. M. Buehrer
A. Martone
29
5
0
07 Jul 2022
Meta Reinforcement Learning with Finite Training Tasks -- a Density Estimation Approach
Zohar Rimon
Aviv Tamar
Gilad Adler
OOD
OffRL
34
8
0
21 Jun 2022
Mixed-Effect Thompson Sampling
Imad Aouali
B. Kveton
S. Katariya
OffRL
45
11
0
30 May 2022
Meta-Learning Adversarial Bandits
Maria-Florina Balcan
Keegan Harris
M. Khodak
Zhiwei Steven Wu
FedML
AAML
43
7
0
27 May 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Multi-task Representation Learning with Stochastic Linear Bandits
Leonardo Cella
Karim Lounici
Grégoire Pacreau
Massimiliano Pontil
18
21
0
21 Feb 2022
Meta-Learning Hypothesis Spaces for Sequential Decision-making
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
OffRL
33
6
0
01 Feb 2022
Gaussian Imagination in Bandit Learning
Yueyang Liu
Adithya M. Devraj
Benjamin Van Roy
Kuang Xu
27
7
0
06 Jan 2022
Hierarchical Bayesian Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
FedML
47
37
0
12 Nov 2021
Safe Data Collection for Offline and Online Policy Learning
Ruihao Zhu
B. Kveton
OffRL
11
5
0
08 Nov 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Runzhe Wan
Linjuan Ge
Rui Song
36
28
0
13 Aug 2021