Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.01680
Cited By
Two-Stage Constrained Actor-Critic for Short Video Recommendation
3 February 2023
Qingpeng Cai
Zhenghai Xue
Chi Zhang
Wanqi Xue
Shuchang Liu
Ruohan Zhan
Xueliang Wang
Tianyou Zuo
Wentao Xie
Dong Zheng
Peng Jiang
Kun Gai
OffRL
CML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Two-Stage Constrained Actor-Critic for Short Video Recommendation"
14 / 14 papers shown
Title
xMTF: A Formula-Free Model for Reinforcement-Learning-Based Multi-Task Fusion in Recommender Systems
Yang Cao
Changhao Zhang
Xiaoshuang Chen
Kaiqiao Zhan
Ben Wang
28
0
0
08 Apr 2025
Retrieval-Augmented Purifier for Robust LLM-Empowered Recommendation
Liangbo Ning
Wenqi Fan
Qing Li
AAML
41
1
0
03 Apr 2025
Short-video Propagation Influence Rating: A New Real-world Dataset and A New Large Graph Model
Dizhan Xue
Jing Cui
Shengsheng Qian
Chuanrui Hu
Changsheng Xu
39
0
0
31 Mar 2025
Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer
Chongming Gao
Kexin Huang
Ziang Fei
Jiaju Chen
Jianfei Chen
Jianshan Sun
Shuchang Liu
Qingpeng Cai
Peng Jiang
OffRL
34
0
0
13 Jan 2025
How to Find the Exact Pareto Front for Multi-Objective MDPs?
Yining Li
Peizhong Ju
Ness B. Shroff
160
0
0
21 Oct 2024
Incorporating Group Prior into Variational Inference for Tail-User Behavior Modeling in CTR Prediction
Han Xu
Taoxing Pan
Zhiqiang Liu
Xiaoxiao Xu
Lantao Hu
23
0
0
19 Oct 2024
RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender Systems
Shuo Su
Xiaoshuang Chen
Yao Wang
Yulin Wu
Ziqiang Zhang
Kaiqiao Zhan
Ben Wang
Kun Gai
AI4TS
26
1
0
20 Sep 2024
Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
Tianchen Zhou
Fnu Hairi
Haibo Yang
Jia-Wei Liu
Tian Tong
Fan Yang
Michinari Momma
Yan Gao
43
1
0
05 May 2024
A Model-based Multi-Agent Personalized Short-Video Recommender System
Peilun Zhou
Xiaoxiao Xu
Lantao Hu
Han Li
Peng Jiang
OffRL
26
1
0
03 May 2024
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems
Yuanqing Yu
Chongming Gao
Jiawei Chen
Heng Tang
Yuefeng Sun
Qian Chen
Weizhi Ma
Min Zhang
OffRL
42
2
0
23 Feb 2024
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
Zhenghai Xue
Qingpeng Cai
Tianyou Zuo
Bin Yang
Lantao Hu
Peng Jiang
Kun Gai
33
2
0
06 Oct 2023
Reinforcing User Retention in a Billion Scale Short Video Recommender System
Qingpeng Cai
Shuchang Liu
Xueliang Wang
Tianyou Zuo
Wentao Xie
Bin Yang
Dong Zheng
Peng Jiang
Kun Gai
OffRL
27
41
0
03 Feb 2023
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement
Wanqi Xue
Qingpeng Cai
Zhenghai Xue
Shuo Sun
Shuchang Liu
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
36
25
0
06 Dec 2022
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Wanqi Xue
Qingpeng Cai
Ruohan Zhan
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
30
24
0
01 Jun 2022
1