Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.06422
Cited By
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
13 August 2021
Runzhe Wan
Linjuan Ge
Rui Song
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models"
21 / 21 papers shown
Title
Multi-Task Combinatorial Bandits for Budget Allocation
Lin Ge
Yang Xu
Jianing Chu
David Cramer
Fuhong Li
Kelly Paulson
Rui Song
30
0
0
31 Aug 2024
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
42
2
0
10 Aug 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen A. Walton
Susan Murphy
41
4
0
27 Feb 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
27
4
0
15 Feb 2024
Zero-Inflated Bandits
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
42
0
0
25 Dec 2023
Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches
Yu Liu
Runzhe Wan
James McQueen
Doug Hains
Jinxiang Gu
Rui Song
20
0
0
20 Dec 2023
Online Clustering of Bandits with Misspecified User Models
Zhiyong Wang
Jize Xie
Xutong Liu
Shuai Li
J. C. Lui
47
10
0
04 Oct 2023
Concurrent Constrained Optimization of Unknown Rewards for Multi-Robot Task Allocation
Sukriti Singh
Anusha Srikanthan
Vivek Mallampati
Harish Ravichandar
21
3
0
24 May 2023
Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling
Aadirupa Saha
B. Kveton
36
1
0
16 Mar 2023
Multiplier Bootstrap-based Exploration
Runzhe Wan
Haoyu Wei
B. Kveton
R. Song
16
2
0
03 Feb 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
30
10
0
09 Dec 2022
Robust Contextual Linear Bandits
Rong Zhu
B. Kveton
22
3
0
26 Oct 2022
Hierarchical Conversational Preference Elicitation with Bandit Feedback
Jinhang Zuo
Songwen Hu
Tong Yu
Shuai Li
Handong Zhao
Carlee Joe-Wong
34
10
0
06 Sep 2022
Thompson Sampling for Robust Transfer in Multi-Task Bandits
Zhi Wang
Chicheng Zhang
Kamalika Chaudhuri
AAML
39
5
0
17 Jun 2022
Mixed-Effect Thompson Sampling
Imad Aouali
B. Kveton
S. Katariya
OffRL
45
11
0
30 May 2022
Safe Exploration for Efficient Policy Evaluation and Comparison
Runzhe Wan
B. Kveton
Rui Song
OffRL
23
10
0
26 Feb 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Runzhe Wan
Linjuan Ge
Rui Song
18
13
0
26 Feb 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Deep Hierarchy in Bandits
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
30
20
0
03 Feb 2022
Hierarchical Bayesian Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
FedML
47
37
0
12 Nov 2021
Metalearning Linear Bandits by Prior Update
Amit Peleg
Naama Pearl
Ron Meir
32
18
0
12 Jul 2021
1