ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.06260
  4. Cited By
Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent
  Reinforcement Learning

Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning

17 September 2018
Jun Feng
Heng Li
Minlie Huang
Shichen Liu
Wenwu Ou
Zhirong Wang
Xiaoyan Zhu
ArXivPDFHTML

Papers citing "Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning"

9 / 9 papers shown
Title
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
Yi Zhang
Ruihong Qiu
Xuwei Xu
Jiajun Liu
Sen Wang
OffRL
34
0
0
12 May 2025
Learning List-wise Representation in Reinforcement Learning for Ads
  Allocation with Multiple Auxiliary Tasks
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks
Zehua Wang
Guogang Liao
Xiaowen Shi
Xiaoxu Wu
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
OffRL
19
4
0
02 Apr 2022
Deep Page-Level Interest Network in Reinforcement Learning for Ads
  Allocation
Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation
Guogang Liao
Xiaowen Shi
Zehua Wang
Xiaoxu Wu
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
27
10
0
01 Apr 2022
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed
Guogang Liao
Zewen Wang
Xiaoxu Wu
Xiaowen Shi
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
35
36
0
09 Sep 2021
A Survey on Deep Reinforcement Learning for Data Processing and
  Analytics
A Survey on Deep Reinforcement Learning for Data Processing and Analytics
Qingpeng Cai
Can Cui
Yiyuan Xiong
Wei Wang
Zhongle Xie
Meihui Zhang
OffRL
21
29
0
10 Aug 2021
Productivity, Portability, Performance: Data-Centric Python
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
54
94
0
01 Jul 2021
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward
  Decomposition
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition
Ryuichi Takanobu
Runze Liang
Minlie Huang
LLMAG
19
54
0
08 Apr 2020
Aggregating E-commerce Search Results from Heterogeneous Sources via
  Hierarchical Reinforcement Learning
Aggregating E-commerce Search Results from Heterogeneous Sources via Hierarchical Reinforcement Learning
Ryuichi Takanobu
Tao Zhuang
Minlie Huang
Jun Feng
Haihong Tang
Bo Zheng
22
17
0
24 Feb 2019
Deep reinforcement learning for search, recommendation, and online
  advertising: a survey
Deep reinforcement learning for search, recommendation, and online advertising: a survey
Xiangyu Zhao
Long Xia
Jiliang Tang
Dawei Yin
OffRL
11
82
0
18 Dec 2018
1