ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.08108
  4. Cited By
Online Learning with Gaussian Payoffs and Side Observations

Online Learning with Gaussian Payoffs and Side Observations

27 October 2015
Yifan Wu
András Gyorgy
Csaba Szepesvári
ArXivPDFHTML

Papers citing "Online Learning with Gaussian Payoffs and Side Observations"

11 / 11 papers shown
Title
Asymptotically-Optimal Gaussian Bandits with Side Observations
Asymptotically-Optimal Gaussian Bandits with Side Observations
Alexia Atsidakou
Orestis Papadigenopoulos
Constantine Caramanis
Sujay Sanghavi
Sanjay Shakkottai
19
4
0
15 May 2025
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs
Yu Chen
Jiatai Huang
Yan Dai
Longbo Huang
34
0
0
04 Oct 2024
Allocating Divisible Resources on Arms with Unknown and Random Rewards
Allocating Divisible Resources on Arms with Unknown and Random Rewards
Ningyuan Chen
Wenhao Li
24
0
0
28 Jun 2023
Stochastic Online Learning with Feedback Graphs: Finite-Time and
  Asymptotic Optimality
Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality
T. V. Marinov
M. Mohri
Julian Zimmert
24
6
0
20 Jun 2022
Simultaneously Learning Stochastic and Adversarial Bandits with General
  Graph Feedback
Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback
Fang-yuan Kong
Yichi Zhou
Shuai Li
19
8
0
16 Jun 2022
Exploiting Correlation to Achieve Faster Learning Rates in Low-Rank
  Preference Bandits
Exploiting Correlation to Achieve Faster Learning Rates in Low-Rank Preference Bandits
Suprovat Ghoshal
Aadirupa Saha
25
11
0
23 Feb 2022
Regret Minimization with Performative Feedback
Regret Minimization with Performative Feedback
Meena Jagadeesan
Tijana Zrnic
Celestine Mendler-Dünner
40
33
0
01 Feb 2022
Thompson Sampling for Unsupervised Sequential Selection
Thompson Sampling for Unsupervised Sequential Selection
Arun Verma
M. Hanawal
N. Hemachandra
17
5
0
16 Sep 2020
The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear
  Bandits
The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear Bandits
Tor Lattimore
Csaba Szepesvári
22
103
0
14 Oct 2016
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems
Aurélien Garivier
Pierre Ménard
Gilles Stoltz
25
210
0
23 Feb 2016
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback
N. Alon
Nicolò Cesa-Bianchi
Claudio Gentile
Shie Mannor
Yishay Mansour
Ohad Shamir
OffRL
38
130
0
30 Sep 2014
1