Bayesian decision-making under misspecified priors with applications to meta-learning

3 July 2021

Max Simchowitz

Papers citing "Bayesian decision-making under misspecified priors with applications to meta-learning"

35 / 35 papers shown

Title
Toward Efficient Exploration by Large Language Model Agents Dilip Arumugam Thomas L. Griffiths LLMAG 92 1 0 29 Apr 2025
A Classification View on Meta Learning Bandits Mirco Mutti Jeongyeol Kwon Shie Mannor Aviv Tamar 23 0 0 06 Apr 2025
Meta Clustering of Neural Bandits Yikun Ban Yunzhe Qi Tianxin Wei Lihui Liu Jingrui He 42 2 0 10 Aug 2024
Test-Time Regret Minimization in Meta Reinforcement Learning Mirco Mutti Aviv Tamar 26 4 0 04 Jun 2024
Leveraging (Biased) Information: Multi-armed Bandits with Offline Data Wang Chi Cheung Lixing Lyu OffRL 32 3 0 04 May 2024
A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting Masaki Adachi Satoshi Hayakawa Martin Jørgensen Saad Hamid Harald Oberhauser Michael A. Osborne GP 32 3 0 18 Apr 2024
Can large language models explore in-context? Akshay Krishnamurthy Keegan Harris Dylan J. Foster Cyril Zhang Aleksandrs Slivkins LM&Ro LLMAG LRM 123 23 0 22 Mar 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use Susobhan Ghosh Yongyi Guo Pei-Yao Hung Lara N. Coughlin Erin Bonar Inbal Nahum-Shani Maureen A. Walton Susan Murphy 41 4 0 27 Feb 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces Imad Aouali DiffM 27 4 0 15 Feb 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits Nicolas Nguyen Imad Aouali András Gyorgy Claire Vernade 36 2 0 08 Feb 2024
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning Hongming Zhang Tongzheng Ren Chenjun Xiao Dale Schuurmans Bo Dai 45 3 0 20 Nov 2023
Bayesian Active Learning in the Presence of Nuisance Parameters Sabina J. Sloman Ayush Bharti Julien Martinelli Samuel Kaski 26 3 0 23 Oct 2023
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning Mirco Mutti Ric De Santi Marcello Restelli Alexander Marx Giorgia Ramponi CML 30 4 0 11 Oct 2023
Interactive Graph Convolutional Filtering Jin Zhang Defu Lian Hong Xie Yawen Li Enhong Chen 27 0 0 04 Sep 2023
Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay Thomas M. McDonald Lucas Maystre M. Lalmas Daniel Russo K. Ciosek OffRL 27 15 0 19 Jul 2023
Meta-Learning Adversarial Bandit Algorithms M. Khodak Ilya Osadchiy Keegan Harris Maria-Florina Balcan Kfir Y. Levy Ron Meir Zhiwei Steven Wu FedML 28 2 0 05 Jul 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning Jonathan Lee Annie Xie Aldo Pacchiano Yash Chandak Chelsea Finn Ofir Nachum Emma Brunskill OffRL 35 74 0 26 Jun 2023
Finite-Time Logarithmic Bayes Regret Upper Bounds Alexia Atsidakou B. Kveton S. Katariya C. Caramanis Sujay Sanghavi 29 0 0 15 Jun 2023
Sequential Best-Arm Identification with Application to Brain-Computer Interface Xiaoping Zhou Botao Hao Jian Kang Tor Lattimore Lexin Li 32 2 0 17 May 2023
Leveraging Demonstrations to Improve Online Learning: Quality Matters Botao Hao Rahul Jain Tor Lattimore Benjamin Van Roy Zheng Wen 29 8 0 07 Feb 2023
Thompson Sampling with Diffusion Generative Prior Yu-Guan Hsieh S. Kasiviswanathan B. Kveton Patrick Blobaum DiffM 27 7 0 12 Jan 2023
Multi-Task Off-Policy Learning from Bandit Feedback Joey Hong B. Kveton S. Katariya Manzil Zaheer Mohammad Ghavamzadeh OffRL 30 10 0 09 Dec 2022
Lifelong Bandit Optimization: No Prior and No Regret Felix Schur Parnian Kassraie Jonas Rothfuss Andreas Krause 46 3 0 27 Oct 2022
Tractable Optimality in Episodic Latent MABs Jeongyeol Kwon Yonathan Efroni C. Caramanis Shie Mannor 50 3 0 05 Oct 2022
Online Bayesian Meta-Learning for Cognitive Tracking Radar C. Thornton R. M. Buehrer A. Martone 29 5 0 07 Jul 2022
Meta Reinforcement Learning with Finite Training Tasks -- a Density Estimation Approach Zohar Rimon Aviv Tamar Gilad Adler OOD OffRL 34 8 0 21 Jun 2022
Mixed-Effect Thompson Sampling Imad Aouali B. Kveton S. Katariya OffRL 45 11 0 30 May 2022
Meta-Learning Adversarial Bandits Maria-Florina Balcan Keegan Harris M. Khodak Zhiwei Steven Wu FedML AAML 43 7 0 27 May 2022
Meta-Learning for Simple Regret Minimization Javad Azizi B. Kveton Mohammad Ghavamzadeh S. Katariya 22 10 0 25 Feb 2022
Multi-task Representation Learning with Stochastic Linear Bandits Leonardo Cella Karim Lounici Grégoire Pacreau Massimiliano Pontil 18 21 0 21 Feb 2022
Meta-Learning Hypothesis Spaces for Sequential Decision-making Parnian Kassraie Jonas Rothfuss Andreas Krause OffRL 33 6 0 01 Feb 2022
Gaussian Imagination in Bandit Learning Yueyang Liu Adithya M. Devraj Benjamin Van Roy Kuang Xu 27 7 0 06 Jan 2022
Hierarchical Bayesian Bandits Joey Hong B. Kveton Manzil Zaheer Mohammad Ghavamzadeh FedML 47 37 0 12 Nov 2021
Safe Data Collection for Offline and Online Policy Learning Ruihao Zhu B. Kveton OffRL 11 5 0 08 Nov 2021
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models Runzhe Wan Linjuan Ge Rui Song 36 28 0 13 Aug 2021