Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.04875
Cited By
Aligning GPTRec with Beyond-Accuracy Goals with Reinforcement Learning
7 March 2024
Aleksandr V. Petrov
Craig MacDonald
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Aligning GPTRec with Beyond-Accuracy Goals with Reinforcement Learning"
11 / 11 papers shown
Title
Generative Slate Recommendation with Reinforcement Learning
Romain Deffayet
Thibaut Thonet
Jean-Michel Render
Maarten de Rijke
58
24
0
20 Jan 2023
Multi-Task Fusion via Reinforcement Learning for Long-Term User Satisfaction in Recommender Systems
Qihua Zhang
Junning Liu
Yuzhuo Dai
Yiyan Qi
Yifan Yuan
Kunlun Zheng
Fan Huang
Xianfeng Tan
OffRL
60
51
0
09 Aug 2022
A Systematic Review and Replicability Study of BERT4Rec for Sequential Recommendation
Aleksandr V. Petrov
Craig Macdonald
72
47
0
15 Jul 2022
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSL
OffRL
139
202
0
10 Jun 2020
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation
Xueying Bai
Jian Guan
Hongning Wang
OffRL
60
75
0
10 Nov 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
459
20,317
0
23 Oct 2019
BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer
Fei Sun
Jun Liu
Jian Wu
Changhua Pei
Xiao Lin
Wenwu Ou
Peng Jiang
BDL
HAI
186
2,177
0
14 Apr 2019
Self-Attentive Sequential Recommendation
Wang-Cheng Kang
Julian McAuley
HAI
BDL
175
2,442
0
20 Aug 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
529
19,237
0
20 Jul 2017
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
107
3,434
0
08 Jun 2015
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
129
12,265
0
19 Dec 2013
1