ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.01680
  4. Cited By
Two-Stage Constrained Actor-Critic for Short Video Recommendation

Two-Stage Constrained Actor-Critic for Short Video Recommendation

3 February 2023
Qingpeng Cai
Zhenghai Xue
Chi Zhang
Wanqi Xue
Shuchang Liu
Ruohan Zhan
Xueliang Wang
Tianyou Zuo
Wentao Xie
Dong Zheng
Peng Jiang
Kun Gai
    OffRL
    CML
ArXivPDFHTML

Papers citing "Two-Stage Constrained Actor-Critic for Short Video Recommendation"

14 / 14 papers shown
Title
xMTF: A Formula-Free Model for Reinforcement-Learning-Based Multi-Task Fusion in Recommender Systems
xMTF: A Formula-Free Model for Reinforcement-Learning-Based Multi-Task Fusion in Recommender Systems
Yang Cao
Changhao Zhang
Xiaoshuang Chen
Kaiqiao Zhan
Ben Wang
28
0
0
08 Apr 2025
Retrieval-Augmented Purifier for Robust LLM-Empowered Recommendation
Retrieval-Augmented Purifier for Robust LLM-Empowered Recommendation
Liangbo Ning
Wenqi Fan
Qing Li
AAML
41
1
0
03 Apr 2025
Short-video Propagation Influence Rating: A New Real-world Dataset and A New Large Graph Model
Short-video Propagation Influence Rating: A New Real-world Dataset and A New Large Graph Model
Dizhan Xue
Jing Cui
Shengsheng Qian
Chuanrui Hu
Changsheng Xu
39
0
0
31 Mar 2025
Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer
Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer
Chongming Gao
Kexin Huang
Ziang Fei
Jiaju Chen
Jianfei Chen
Jianshan Sun
Shuchang Liu
Qingpeng Cai
Peng Jiang
OffRL
34
0
0
13 Jan 2025
How to Find the Exact Pareto Front for Multi-Objective MDPs?
How to Find the Exact Pareto Front for Multi-Objective MDPs?
Yining Li
Peizhong Ju
Ness B. Shroff
160
0
0
21 Oct 2024
Incorporating Group Prior into Variational Inference for Tail-User
  Behavior Modeling in CTR Prediction
Incorporating Group Prior into Variational Inference for Tail-User Behavior Modeling in CTR Prediction
Han Xu
Taoxing Pan
Zhiqiang Liu
Xiaoxiao Xu
Lantao Hu
23
0
0
19 Oct 2024
RPAF: A Reinforcement Prediction-Allocation Framework for Cache
  Allocation in Large-Scale Recommender Systems
RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender Systems
Shuo Su
Xiaoshuang Chen
Yao Wang
Yulin Wu
Ziqiang Zhang
Kaiqiao Zhan
Ben Wang
Kun Gai
AI4TS
26
1
0
20 Sep 2024
Finite-Time Convergence and Sample Complexity of Actor-Critic
  Multi-Objective Reinforcement Learning
Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
Tianchen Zhou
Fnu Hairi
Haibo Yang
Jia-Wei Liu
Tian Tong
Fan Yang
Michinari Momma
Yan Gao
43
1
0
05 May 2024
A Model-based Multi-Agent Personalized Short-Video Recommender System
A Model-based Multi-Agent Personalized Short-Video Recommender System
Peilun Zhou
Xiaoxiao Xu
Lantao Hu
Han Li
Peng Jiang
OffRL
26
1
0
03 May 2024
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based
  Recommender Systems
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems
Yuanqing Yu
Chongming Gao
Jiawei Chen
Heng Tang
Yuefeng Sun
Qian Chen
Weizhi Ma
Min Zhang
OffRL
42
2
0
23 Feb 2024
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
Zhenghai Xue
Qingpeng Cai
Tianyou Zuo
Bin Yang
Lantao Hu
Peng Jiang
Kun Gai
33
2
0
06 Oct 2023
Reinforcing User Retention in a Billion Scale Short Video Recommender
  System
Reinforcing User Retention in a Billion Scale Short Video Recommender System
Qingpeng Cai
Shuchang Liu
Xueliang Wang
Tianyou Zuo
Wentao Xie
Bin Yang
Dong Zheng
Peng Jiang
Kun Gai
OffRL
27
41
0
03 Feb 2023
PrefRec: Recommender Systems with Human Preferences for Reinforcing
  Long-term User Engagement
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement
Wanqi Xue
Qingpeng Cai
Zhenghai Xue
Shuo Sun
Shuchang Liu
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
36
25
0
06 Dec 2022
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation
  with Residual Actor
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
Wanqi Xue
Qingpeng Cai
Ruohan Zhan
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
30
24
0
01 Jun 2022
1