Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1210.4839
Cited By
Leveraging Side Observations in Stochastic Bandits
16 October 2012
S. Caron
Branislav Kveton
Marc Lelarge
Smriti Bhagat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Leveraging Side Observations in Stochastic Bandits"
30 / 30 papers shown
Title
Asymptotically-Optimal Gaussian Bandits with Side Observations
Alexia Atsidakou
Orestis Papadigenopoulos
Constantine Caramanis
Sujay Sanghavi
Sanjay Shakkottai
16
4
0
15 May 2025
Graph Feedback Bandits on Similar Arms: With and Without Graph Structures
Han Qi
Fei-Yu Guo
Li Zhu
Qiaosheng Zhang
Xiaochen Li
43
0
0
24 Jan 2025
Graph Feedback Bandits with Similar Arms
Han Qi
Guo Fei
Li Zhu
27
0
0
18 May 2024
Causally Abstracted Multi-armed Bandits
Fabio Massimo Zennaro
Nicholas Bishop
Joel Dyer
Yorgos Felekis
Anisoara Calinescu
Michael Wooldridge
Theodoros Damoulas
38
2
0
26 Apr 2024
Stochastic Graph Bandit Learning with Side-Observations
Xueping Gong
Jiheng Zhang
34
1
0
29 Aug 2023
Online Network Source Optimization with Graph-Kernel MAB
Laura Toni
P. Frossard
34
1
0
07 Jul 2023
Invariant Lipschitz Bandits: A Side Observation Approach
Nam-Phuong Tran
Long Tran-Thanh
51
1
0
14 Dec 2022
Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs
Haipeng Luo
Hanghang Tong
Mengxiao Zhang
Yuheng Zhang
18
5
0
04 Oct 2022
Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality
T. V. Marinov
M. Mohri
Julian Zimmert
21
6
0
20 Jun 2022
Nearly Optimal Best-of-Both-Worlds Algorithms for Online Learning with Feedback Graphs
Shinji Ito
Taira Tsuchiya
Junya Honda
35
24
0
02 Jun 2022
A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs
Chloé Rouyer
Dirk van der Hoeven
Nicolò Cesa-Bianchi
Yevgeny Seldin
28
15
0
01 Jun 2022
Signal Transformer: Complex-valued Attention and Meta-Learning for Signal Recognition
Yihong Dong
Ying Peng
Muqiao Yang
Songtao Lu
Qingjiang Shi
49
9
0
05 Jun 2021
Bandit based centralized matching in two-sided markets for peer to peer lending
Soumajyoti Sarkar
11
0
0
06 May 2021
Policy Optimization as Online Learning with Mediator Feedback
Alberto Maria Metelli
Matteo Papini
P. DÓro
Marcello Restelli
OffRL
27
10
0
15 Dec 2020
Multi-Armed Bandits with Dependent Arms
Rahul Singh
Fang Liu
Yin Sun
Ness B. Shroff
24
11
0
13 Oct 2020
Bounded Regret for Finitely Parameterized Multi-Armed Bandits
Kishan Panaganti
D. Kalathil
18
1
0
03 Mar 2020
Adaptive Gradient Sparsification for Efficient Federated Learning: An Online Learning Approach
Pengchao Han
Shiqiang Wang
K. Leung
FedML
35
175
0
14 Jan 2020
Waiting but not Aging: Optimizing Information Freshness Under the Pull Model
Fengjiao Li
Yu Sang
Zhongdong Liu
Bin Li
Huasen Wu
Bo Ji
14
32
0
17 Dec 2019
Feedback graph regret bounds for Thompson Sampling and UCB
Thodoris Lykouris
Éva Tardos
Drishti Wali
11
29
0
23 May 2019
Almost Boltzmann Exploration
Harsh Gupta
Seo Taek Kong
R. Srikant
Weina Wang
14
1
0
25 Jan 2019
Multi-Armed Bandits on Partially Revealed Unit Interval Graphs
Xiao Xu
Sattar Vakili
Qing Zhao
A. Swami
18
5
0
12 Feb 2018
Reward Maximization Under Uncertainty: Leveraging Side-Observations on Networks
Swapna Buccapatnam
Fang Liu
A. Eryilmaz
Ness B. Shroff
21
28
0
26 Apr 2017
Online Learning with Abstention
Corinna Cortes
Giulia DeSalvo
Claudio Gentile
M. Mohri
Scott Yang
11
47
0
09 Mar 2017
Horde of Bandits using Gaussian Markov Random Fields
Sharan Vaswani
Mark Schmidt
L. Lakshmanan
26
14
0
07 Mar 2017
Thompson Sampling For Stochastic Bandits with Graph Feedback
Aristide C. Y. Tossou
Christos Dimitrakakis
Devdatt Dubhashi
11
28
0
16 Jan 2017
When to Reset Your Keys: Optimal Timing of Security Updates via Learning
Zizhan Zheng
Ness B. Shroff
P. Mohapatra
AAML
14
7
0
01 Dec 2016
Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs
R. Meshram
Aditya Gopalan
D. Manjunath
OffRL
19
24
0
30 Mar 2016
Online Learning with Gaussian Payoffs and Side Observations
Yifan Wu
András Gyorgy
Csaba Szepesvári
10
44
0
27 Oct 2015
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback
N. Alon
Nicolò Cesa-Bianchi
Claudio Gentile
Shie Mannor
Yishay Mansour
Ohad Shamir
OffRL
38
130
0
30 Sep 2014
Online Clustering of Bandits
Claudio Gentile
Shuai Li
Giovanni Zappella
36
263
0
31 Jan 2014
1