ResearchTrend.AI

Learning from Delayed Outcomes via Proxies with Applications to Recommender Systems

24 July 2018
Timothy A. Mann
Sven Gowal
András György
Ray Jiang
Huiyi Hu
Balaji Lakshminarayanan
Prav Srinivasan
    AI4TS
arXiv:1807.09387

Papers citing "Learning from Delayed Outcomes via Proxies with Applications to Recommender Systems"

3 / 3 papers shown
Stochastic bandits with arm-dependent delays
Anne Gael Manegueu
Claire Vernade
Alexandra Carpentier
Michal Valko
19
44
0
18 Jun 2020
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
120
0
24 Mar 2020
Nonstochastic Multiarmed Bandits with Unrestricted Delays
Tobias Sommer Thune
Nicolò Cesa-Bianchi
Yevgeny Seldin
15
52
0
03 Jun 2019