Leveraging Side Observations in Stochastic Bandits

16 October 2012

Papers citing "Leveraging Side Observations in Stochastic Bandits"

30 / 30 papers shown

Title
Asymptotically-Optimal Gaussian Bandits with Side Observations Alexia Atsidakou Orestis Papadigenopoulos Constantine Caramanis Sujay Sanghavi Sanjay Shakkottai 16 4 0 15 May 2025
Graph Feedback Bandits on Similar Arms: With and Without Graph Structures Han Qi Fei-Yu Guo Li Zhu Qiaosheng Zhang Xiaochen Li 43 0 0 24 Jan 2025
Graph Feedback Bandits with Similar Arms Han Qi Guo Fei Li Zhu 27 0 0 18 May 2024
Causally Abstracted Multi-armed Bandits Fabio Massimo Zennaro Nicholas Bishop Joel Dyer Yorgos Felekis Anisoara Calinescu Michael Wooldridge Theodoros Damoulas 38 2 0 26 Apr 2024
Stochastic Graph Bandit Learning with Side-Observations Xueping Gong Jiheng Zhang 34 1 0 29 Aug 2023
Online Network Source Optimization with Graph-Kernel MAB Laura Toni P. Frossard 34 1 0 07 Jul 2023
Invariant Lipschitz Bandits: A Side Observation Approach Nam-Phuong Tran Long Tran-Thanh 51 1 0 14 Dec 2022
Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs Haipeng Luo Hanghang Tong Mengxiao Zhang Yuheng Zhang 18 5 0 04 Oct 2022
Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality T. V. Marinov M. Mohri Julian Zimmert 21 6 0 20 Jun 2022
Nearly Optimal Best-of-Both-Worlds Algorithms for Online Learning with Feedback Graphs Shinji Ito Taira Tsuchiya Junya Honda 35 24 0 02 Jun 2022
A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs Chloé Rouyer Dirk van der Hoeven Nicolò Cesa-Bianchi Yevgeny Seldin 28 15 0 01 Jun 2022
Signal Transformer: Complex-valued Attention and Meta-Learning for Signal Recognition Yihong Dong Ying Peng Muqiao Yang Songtao Lu Qingjiang Shi 49 9 0 05 Jun 2021
Bandit based centralized matching in two-sided markets for peer to peer lending Soumajyoti Sarkar 11 0 0 06 May 2021
Policy Optimization as Online Learning with Mediator Feedback Alberto Maria Metelli Matteo Papini P. DÓro Marcello Restelli OffRL 27 10 0 15 Dec 2020
Multi-Armed Bandits with Dependent Arms Rahul Singh Fang Liu Yin Sun Ness B. Shroff 24 11 0 13 Oct 2020
Bounded Regret for Finitely Parameterized Multi-Armed Bandits Kishan Panaganti D. Kalathil 18 1 0 03 Mar 2020
Adaptive Gradient Sparsification for Efficient Federated Learning: An Online Learning Approach Pengchao Han Shiqiang Wang K. Leung FedML 35 175 0 14 Jan 2020
Waiting but not Aging: Optimizing Information Freshness Under the Pull Model Fengjiao Li Yu Sang Zhongdong Liu Bin Li Huasen Wu Bo Ji 14 32 0 17 Dec 2019
Feedback graph regret bounds for Thompson Sampling and UCB Thodoris Lykouris Éva Tardos Drishti Wali 11 29 0 23 May 2019
Almost Boltzmann Exploration Harsh Gupta Seo Taek Kong R. Srikant Weina Wang 14 1 0 25 Jan 2019
Multi-Armed Bandits on Partially Revealed Unit Interval Graphs Xiao Xu Sattar Vakili Qing Zhao A. Swami 18 5 0 12 Feb 2018
Reward Maximization Under Uncertainty: Leveraging Side-Observations on Networks Swapna Buccapatnam Fang Liu A. Eryilmaz Ness B. Shroff 21 28 0 26 Apr 2017
Online Learning with Abstention Corinna Cortes Giulia DeSalvo Claudio Gentile M. Mohri Scott Yang 11 47 0 09 Mar 2017
Horde of Bandits using Gaussian Markov Random Fields Sharan Vaswani Mark Schmidt L. Lakshmanan 26 14 0 07 Mar 2017
Thompson Sampling For Stochastic Bandits with Graph Feedback Aristide C. Y. Tossou Christos Dimitrakakis Devdatt Dubhashi 11 28 0 16 Jan 2017
When to Reset Your Keys: Optimal Timing of Security Updates via Learning Zizhan Zheng Ness B. Shroff P. Mohapatra AAML 14 7 0 01 Dec 2016
Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs R. Meshram Aditya Gopalan D. Manjunath OffRL 19 24 0 30 Mar 2016
Online Learning with Gaussian Payoffs and Side Observations Yifan Wu András Gyorgy Csaba Szepesvári 10 44 0 27 Oct 2015
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback N. Alon Nicolò Cesa-Bianchi Claudio Gentile Shie Mannor Yishay Mansour Ohad Shamir OffRL 38 130 0 30 Sep 2014
Online Clustering of Bandits Claudio Gentile Shuai Li Giovanni Zappella 36 263 0 31 Jan 2014