2202.01454
Cited By
Deep Hierarchy in Bandits
3 February 2022
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
Papers citing "Deep Hierarchy in Bandits"
15 / 15 papers shown
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
23
0
0
06 Apr 2025
Online Posterior Sampling with a Diffusion Prior
B. Kveton
Boris Oreshkin
Youngsuk Park
Aniket Deshmukh
Rui Song
DiffM
40
0
0
04 Oct 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
72
2
0
07 Jun 2024
Unified PAC-Bayesian Study of Pessimism for Offline Policy Learning with Regularized Importance Sampling
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
49
1
0
05 Jun 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
27
4
0
15 Feb 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András György
Claire Vernade
34
2
0
08 Feb 2024
Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis
Sharu Theresa Jose
Shana Moothedath
30
2
0
21 Jan 2024
Exponential Smoothing for Off-Policy Learning
Imad Aouali
Victor-Emmanuel Brunel
D. Rohde
Anna Korba
OffRL
30
11
0
25 May 2023
Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling
Aadirupa Saha
B. Kveton
36
1
0
16 Mar 2023
Overcoming Prior Misspecification in Online Learning to Rank
Javad Azizi
Ofer Meshi
M. Zoghi
Maryam Karimzadehgan
20
1
0
25 Jan 2023
Thompson Sampling with Diffusion Generative Prior
Yu-Guan Hsieh
S. Kasiviswanathan
B. Kveton
Patrick Blöbaum
DiffM
27
7
0
12 Jan 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
30
10
0
09 Dec 2022
Hierarchical Conversational Preference Elicitation with Bandit Feedback
Jinhang Zuo
Songwen Hu
Tong Yu
Shuai Li
Handong Zhao
Carlee Joe-Wong
34
10
0
06 Sep 2022
Mixed-Effect Thompson Sampling
Imad Aouali
B. Kveton
S. Katariya
OffRL
45
11
0
30 May 2022
No Regrets for Learning the Prior in Bandits
Soumya Basu
B. Kveton
Manzil Zaheer
Csaba Szepesvári
41
33
0
13 Jul 2021