ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.10799
  4. Cited By
Three Methods for Training on Bandit Feedback
v1v2 (latest)

Three Methods for Training on Bandit Feedback

24 April 2019
Dmytro Mykhaylov
D. Rohde
Flavian Vasile
Martin Bompaire
Olivier Jeunen
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Three Methods for Training on Bandit Feedback"

2 / 2 papers shown
Title
RecoGym: A Reinforcement Learning Environment for the problem of Product
  Recommendation in Online Advertising
RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising
D. Rohde
Stephen Bonner
Travis Dunlop
Flavian Vasile
Alexandros Karatzoglou
OffRL
57
150
0
02 Aug 2018
The Offset Tree for Learning with Partial Labels
The Offset Tree for Learning with Partial Labels
A. Beygelzimer
John Langford
317
185
0
21 Dec 2008
1