Online Learning with Gaussian Payoffs and Side Observations

27 October 2015

Papers citing "Online Learning with Gaussian Payoffs and Side Observations"

11 / 11 papers shown

Title
Asymptotically-Optimal Gaussian Bandits with Side Observations Alexia Atsidakou Orestis Papadigenopoulos Constantine Caramanis Sujay Sanghavi Sanjay Shakkottai 19 4 0 15 May 2025
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs Yu Chen Jiatai Huang Yan Dai Longbo Huang 34 0 0 04 Oct 2024
Allocating Divisible Resources on Arms with Unknown and Random Rewards Ningyuan Chen Wenhao Li 24 0 0 28 Jun 2023
Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality T. V. Marinov M. Mohri Julian Zimmert 24 6 0 20 Jun 2022
Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback Fang-yuan Kong Yichi Zhou Shuai Li 19 8 0 16 Jun 2022
Exploiting Correlation to Achieve Faster Learning Rates in Low-Rank Preference Bandits Suprovat Ghoshal Aadirupa Saha 25 11 0 23 Feb 2022
Regret Minimization with Performative Feedback Meena Jagadeesan Tijana Zrnic Celestine Mendler-Dünner 40 33 0 01 Feb 2022
Thompson Sampling for Unsupervised Sequential Selection Arun Verma M. Hanawal N. Hemachandra 17 5 0 16 Sep 2020
The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear Bandits Tor Lattimore Csaba Szepesvári 22 103 0 14 Oct 2016
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems Aurélien Garivier Pierre Ménard Gilles Stoltz 25 210 0 23 Feb 2016
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback N. Alon Nicolò Cesa-Bianchi Claudio Gentile Shie Mannor Yishay Mansour Ohad Shamir OffRL 38 130 0 30 Sep 2014