Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models

13 August 2021
Runzhe Wan
Linjuan Ge
Rui Song

Papers citing "Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models"

21 / 21 papers shown
Multi-Task Combinatorial Bandits for Budget Allocation
Lin Ge
Yang Xu
Jianing Chu
David Cramer
Fuhong Li
Kelly Paulson
Rui Song
30
0
0
31 Aug 2024
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
42
2
0
10 Aug 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen A. Walton
Susan Murphy
41
4
0
27 Feb 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
27
4
0
15 Feb 2024
Zero-Inflated Bandits
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
42
0
0
25 Dec 2023
Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches
Yu Liu
Runzhe Wan
James McQueen
Doug Hains
Jinxiang Gu
Rui Song
20
0
0
20 Dec 2023
Online Clustering of Bandits with Misspecified User Models
Zhiyong Wang
Jize Xie
Xutong Liu
Shuai Li
J. C. Lui
47
10
0
04 Oct 2023
Concurrent Constrained Optimization of Unknown Rewards for Multi-Robot Task Allocation
Sukriti Singh
Anusha Srikanthan
Vivek Mallampati
Harish Ravichandar
21
3
0
24 May 2023
Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling
Aadirupa Saha
B. Kveton
36
1
0
16 Mar 2023
Multiplier Bootstrap-based Exploration
Runzhe Wan
Haoyu Wei
B. Kveton
R. Song
16
2
0
03 Feb 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
30
10
0
09 Dec 2022
Robust Contextual Linear Bandits
Rong Zhu
B. Kveton
22
3
0
26 Oct 2022
Hierarchical Conversational Preference Elicitation with Bandit Feedback
Jinhang Zuo
Songwen Hu
Tong Yu
Shuai Li
Handong Zhao
Carlee Joe-Wong
34
10
0
06 Sep 2022
Thompson Sampling for Robust Transfer in Multi-Task Bandits
Zhi Wang
Chicheng Zhang
Kamalika Chaudhuri
AAML
39
5
0
17 Jun 2022
Mixed-Effect Thompson Sampling
Imad Aouali
B. Kveton
S. Katariya
OffRL
45
11
0
30 May 2022
Safe Exploration for Efficient Policy Evaluation and Comparison
Runzhe Wan
B. Kveton
Rui Song
OffRL
31
10
0
26 Feb 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Runzhe Wan
Linjuan Ge
Rui Song
18
13
0
26 Feb 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
22
10
0
25 Feb 2022
Deep Hierarchy in Bandits
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
30
20
0
03 Feb 2022
Hierarchical Bayesian Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
FedML
47
37
0
12 Nov 2021
Metalearning Linear Bandits by Prior Update
Amit Peleg
Naama Pearl
Ron Meir
34
18
0
12 Jul 2021