ResearchTrend.AI
Linear Bandits with Stochastic Delayed Feedback

5 July 2018
Claire Vernade
Alexandra Carpentier
Tor Lattimore
Giovanni Zappella
Beyza Ermis
M. Brueckner

Papers citing "Linear Bandits with Stochastic Delayed Feedback"

17 / 17 papers shown
Contextual Linear Bandits with Delay as Payoff
Mengxiao Zhang
Yingfei Wang
Haipeng Luo
193
2
0
18 Feb 2025
Biased Dueling Bandits with Stochastic Delayed Feedback
Bongsoo Yi
Yue Kang
Yao Li
87
1
0
26 Aug 2024
Reinforcement Learning with Delayed, Composite, and Partially Anonymous Reward
Washim Uddin Mondal
Vaneet Aggarwal
79
2
0
04 May 2023
Multi-Armed Bandits with Generalized Temporally-Partitioned Rewards
Ronald C. van den Broek
Rik Litjens
Tobias Sagis
Luc Siecker
Nina Verbeeke
Pratik Gajane
89
0
0
01 Mar 2023
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback
Yunchang Yang
Hangshi Zhong
Tianhao Wu
B. Liu
Liwei Wang
S. Du
OffRL
119
8
0
03 Feb 2023
Dynamical Linear Bandits
Marco Mussi
Alberto Maria Metelli
Marcello Restelli
71
2
0
16 Nov 2022
Delayed Feedback in Generalised Linear Bandits Revisited
Benjamin Howson
Ciara Pike-Burke
Sarah Filippi
61
16
0
21 Jul 2022
Bayesian Optimization under Stochastic Delayed Feedback
Arun Verma
Zhongxiang Dai
Bryan Kian Hsiang Low
80
12
0
19 Jun 2022
Multi-Armed Bandit Problem with Temporally-Partitioned Rewards: When Partial Feedback Counts
Giulia Romano
Andrea Agostini
F. Trovò
N. Gatti
Marcello Restelli
34
2
0
01 Jun 2022
Thompson Sampling with Unrestricted Delays
Hang Wu
Stefan Wager
73
8
0
24 Feb 2022
Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences
Aadirupa Saha
Pierre Gaillard
71
7
0
14 Feb 2022
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
Tiancheng Jin
Tal Lancewicki
Haipeng Luo
Yishay Mansour
Aviv A. Rosenberg
131
22
0
31 Jan 2022
Optimism and Delays in Episodic Reinforcement Learning
Benjamin Howson
Ciara Pike-Burke
Sarah Filippi
56
6
0
15 Nov 2021
Smooth Sequential Optimisation with Delayed Feedback
S. Chennu
Jamie Martin
P. Liyanagama
Phil Mohr
29
2
0
21 Jun 2021
Multi-armed Bandit Algorithms on System-on-Chip: Go Frequentist or Bayesian?
S. Santosh
S. Darak
53
0
0
05 Jun 2021
Leveraging Post Hoc Context for Faster Learning in Bandit Settings with Applications in Robot-Assisted Feeding
E. Gordon
Sumegh Roychowdhury
Tapomayukh Bhattacharjee
Kevin Jamieson
S. Srinivasa
145
20
0
05 Nov 2020
Cooperative Multi-Agent Bandits with Heavy Tails
Abhimanyu Dubey
Alex Pentland
54
50
0
14 Aug 2020