ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.10000
  4. Cited By
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for
  Reinforcement Learning

Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning

25 May 2018
Jing-Cheng Shi
Yang Yu
Qing Da
Shi-Yong Chen
Anxiang Zeng
    OffRL
ArXivPDFHTML

Papers citing "Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning"

50 / 82 papers shown
Title
Beyond Static Testbeds: An Interaction-Centric Agent Simulation Platform for Dynamic Recommender Systems
Beyond Static Testbeds: An Interaction-Centric Agent Simulation Platform for Dynamic Recommender Systems
Song Jin
Junxuan Zhang
Yuhan Liu
Xun Zhang
Yufei Zhang
Guojun Yin
Fei Jiang
Wei Lin
Rui Yan
7
0
0
22 May 2025
AdvKT: An Adversarial Multi-Step Training Framework for Knowledge Tracing
AdvKT: An Adversarial Multi-Step Training Framework for Knowledge Tracing
Lingyue Fu
Ting Long
Jianghao Lin
Wei Xia
Xinyi Dai
Ruiming Tang
Yuran Wang
Wenbo Zhang
Yong Yu
OffRL
42
0
0
07 Apr 2025
NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios
NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios
Songyi Gao
Zuolin Tu
Rong-Jun Qin
Yi-Hao Sun
Xiong-Hui Chen
Yang Yu
OffRL
45
0
0
25 Mar 2025
CreAgent: Towards Long-Term Evaluation of Recommender System under Platform-Creator Information Asymmetry
CreAgent: Towards Long-Term Evaluation of Recommender System under Platform-Creator Information Asymmetry
Xiaopeng Ye
Chen Xu
Zhongxiang Sun
Jun Xu
Gang Wang
Zhenhua Dong
Ji-Rong Wen
96
0
0
11 Feb 2025
Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation
Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation
Siyu Wang
Xiaocong Chen
Lina Yao
CML
OffRL
95
0
0
04 Feb 2025
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration
Yirui Zhou
Xiaowei Liu
Xiaofeng Zhang
Yangchun Zhang
44
0
0
22 Jan 2025
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory
Yangchun Zhang
Wang Zhou
Yirui Zhou
55
0
0
31 Dec 2024
Provably and Practically Efficient Adversarial Imitation Learning with
  General Function Approximation
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
Tian Xu
Zhilong Zhang
Ruishuo Chen
Yihao Sun
Yang Yu
37
1
0
01 Nov 2024
Neural Click Models for Recommender Systems
Neural Click Models for Recommender Systems
Mikhail Shirokikh
Ilya Shenbin
Anton M. Alekseev
Anna Volodkevich
Alexey Vasilev
Andrey Savchenko
Sergey I. Nikolenko
LRM
3DV
14
1
0
30 Sep 2024
ARTAI: An Evaluation Platform to Assess Societal Risk of Recommender
  Algorithms
ARTAI: An Evaluation Platform to Assess Societal Risk of Recommender Algorithms
Qin Ruan
Jin Xu
Ruihai Dong
Arjumand Younus
Tai Tan Mai
Barry O'Sullivan
Susan Leavy
36
0
0
19 Sep 2024
An Extremely Data-efficient and Generative LLM-based Reinforcement
  Learning Agent for Recommenders
An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders
Shuang Feng
Grace Feng
OffRL
29
1
0
28 Aug 2024
On Causally Disentangled State Representation Learning for Reinforcement
  Learning based Recommender Systems
On Causally Disentangled State Representation Learning for Reinforcement Learning based Recommender Systems
Siyu Wang
Xiaocong Chen
Lina Yao
CML
39
0
0
18 Jul 2024
SUBER: An RL Environment with Simulated Human Behavior for Recommender
  Systems
SUBER: An RL Environment with Simulated Human Behavior for Recommender Systems
Nathan Corecco
Giorgio Piatti
Luca A. Lanzendörfer
Flint Xiaofeng Fan
Roger Wattenhofer
OffRL
29
2
0
01 Jun 2024
Lusifer: LLM-based User SImulated Feedback Environment for online Recommender systems
Lusifer: LLM-based User SImulated Feedback Environment for online Recommender systems
Danial Ebrat
Luis Rueda
Luis Rueda
65
3
0
22 May 2024
Retentive Decision Transformer with Adaptive Masking for Reinforcement
  Learning based Recommendation Systems
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems
Siyu Wang
Xiaocong Chen
Lina Yao
OffRL
38
1
0
26 Mar 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation,
  Transferable Reward Recovery and Algebraic Equilibrium Proof
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
45
0
0
21 Mar 2024
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based
  Recommender Systems
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems
Yuanqing Yu
Chongming Gao
Jiawei Chen
Heng Tang
Yuefeng Sun
Qian Chen
Weizhi Ma
Min Zhang
OffRL
53
2
0
23 Feb 2024
Computational Experiments Meet Large Language Model Based Agents: A
  Survey and Perspective
Computational Experiments Meet Large Language Model Based Agents: A Survey and Perspective
Qun Ma
Xiao Xue
Deyu Zhou
Xiangning Yu
Donghua Liu
...
Yifan Shen
Peilin Ji
Juanjuan Li
Gang Wang
Wanpeng Ma
AI4CE
LM&Ro
LLMAG
31
7
0
01 Feb 2024
Exploring Gradient Explosion in Generative Adversarial Imitation
  Learning: A Probabilistic Perspective
Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective
Wanying Wang
Yichen Zhu
Yirui Zhou
Yaxin Peng
Jian Tang
Zhiyuan Xu
Chaomin Shen
Yangchun Zhang
39
4
0
18 Dec 2023
Model-enhanced Contrastive Reinforcement Learning for Sequential
  Recommendation
Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation
Chengpeng Li
Zhengyi Yang
Jizhi Zhang
Jiancan Wu
Dingxian Wang
Xiangnan He
Xiang Wang
OffRL
37
1
0
25 Oct 2023
On Generative Agents in Recommendation
On Generative Agents in Recommendation
An Zhang
Yuxin Chen
Leheng Sheng
Xiang Wang
Tat-Seng Chua
43
44
0
16 Oct 2023
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline
  Reinforcement Learning
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
37
7
0
09 Oct 2023
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
AURO: Reinforcement Learning for Adaptive User Retention Optimization in Recommender Systems
Zhenghai Xue
Qingpeng Cai
Tianyou Zuo
Bin Yang
Lantao Hu
Peng Jiang
Kun Gai
33
2
0
06 Oct 2023
Marketing Budget Allocation with Offline Constrained Deep Reinforcement
  Learning
Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning
Tianchi Cai
Jiyan Jiang
Wenpeng Zhang
Shiji Zhou
Xierui Song
Li Yu
Lihong Gu
Xiaodong Zeng
Jinjie Gu
Guannan Zhang
OffRL
33
2
0
06 Sep 2023
INTAGS: Interactive Agent-Guided Simulation
INTAGS: Interactive Agent-Guided Simulation
Song Wei
Andrea Coletta
Svitlana Vyetrenko
T. Balch
24
1
0
04 Sep 2023
Model-free Reinforcement Learning with Stochastic Reward Stabilization
  for Recommender Systems
Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Tianchi Cai
Shenliao Bao
Jiyan Jiang
Shiji Zhou
Wenpeng Zhang
Lihong Gu
Jinjie Gu
Guannan Zhang
OffRL
34
2
0
25 Aug 2023
Diverse Policies Converge in Reward-free Markov Decision Processe
Diverse Policies Converge in Reward-free Markov Decision Processe
Fanqing Lin
Shiyu Huang
Weiwei Tu
30
0
0
23 Aug 2023
On the Opportunities and Challenges of Offline Reinforcement Learning
  for Recommender Systems
On the Opportunities and Challenges of Offline Reinforcement Learning for Recommender Systems
Xiaocong Chen
Siyu Wang
Julian McAuley
Dietmar Jannach
Lina Yao
OffRL
30
5
0
22 Aug 2023
Provably Efficient Adversarial Imitation Learning with Unknown
  Transitions
Provably Efficient Adversarial Imitation Learning with Unknown Transitions
Tian Xu
Ziniu Li
Yang Yu
Zhimin Luo
18
8
0
11 Jun 2023
User Behavior Simulation with Large Language Model based Agents
User Behavior Simulation with Large Language Model based Agents
Lei Wang
Jingsen Zhang
Hao-ran Yang
Zhiyuan Chen
Jiakai Tang
...
Wayne Xin Zhao
Jun Xu
Zhicheng Dou
Jun Wang
Ji-Rong Wen
LM&Ro
LLMAG
34
40
0
05 Jun 2023
Simulating News Recommendation Ecosystem for Fun and Profit
Simulating News Recommendation Ecosystem for Fun and Profit
Guangping Zhang
Dongsheng Li
Hansu Gu
Tun Lu
Li Shang
Ning Gu
16
0
0
23 May 2023
Sim2Rec: A Simulator-based Decision-making Approach to Optimize
  Real-World Long-term User Engagement in Sequential Recommender Systems
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems
Xiong-Hui Chen
Bowei He
Yangze Yu
Qingyang Li
Zhiwei Qin
Wenjie Shang
Jieping Ye
Chen Ma
OffRL
33
11
0
03 May 2023
Interactive System-wise Anomaly Detection
Interactive System-wise Anomaly Detection
Guanchu Wang
Ninghao Liu
Daochen Zha
Xia Hu
AAML
22
1
0
21 Apr 2023
Causal Decision Transformer for Recommender Systems via Offline
  Reinforcement Learning
Causal Decision Transformer for Recommender Systems via Offline Reinforcement Learning
Siyu Wang
Xiaocong Chen
Dietmar Jannach
Lina Yao
CML
OffRL
24
27
0
17 Apr 2023
How To Guide Your Learner: Imitation Learning with Active Adaptive
  Expert Involvement
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement
Xu-Hui Liu
Feng Xu
Xinyu Zhang
Tianyuan Liu
Shengyi Jiang
Rui Chen
Zongzhang Zhang
Yang Yu
34
12
0
03 Mar 2023
Creating Synthetic Datasets for Collaborative Filtering Recommender
  Systems using Generative Adversarial Networks
Creating Synthetic Datasets for Collaborative Filtering Recommender Systems using Generative Adversarial Networks
Jesús Bobadilla
Abraham Gutiérrez
Raciel Yera
L. Martínez
23
12
0
02 Mar 2023
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement Learning
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zhifeng Hao
CML
31
27
0
10 Feb 2023
Learning to Simulate Daily Activities via Modeling Dynamic Human Needs
Learning to Simulate Daily Activities via Modeling Dynamic Human Needs
Yuan Yuan
Huandong Wang
Jingtao Ding
Depeng Jin
Yong Li
AI4TS
AI4CE
22
30
0
09 Feb 2023
PrefRec: Recommender Systems with Human Preferences for Reinforcing
  Long-term User Engagement
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement
Wanqi Xue
Qingpeng Cai
Zhenghai Xue
Shuo Sun
Shuchang Liu
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
36
25
0
06 Dec 2022
RITA: Boost Driving Simulators with Realistic Interactive Traffic Flow
RITA: Boost Driving Simulators with Realistic Interactive Traffic Flow
Zhengbang Zhu
Shenyu Zhang
Yuzheng Zhuang
Yuecheng Liu
Minghuan Liu
...
Bin Wang
Siqi Cheng
Xinyu Wang
Jianye Hao
Yong Yu
19
8
0
07 Nov 2022
Understanding or Manipulation: Rethinking Online Performance Gains of
  Modern Recommender Systems
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems
Zhengbang Zhu
Rongjun Qin
Junjie Huang
Xinyi Dai
Yang Yu
Yong Yu
Weinan Zhang
46
2
0
11 Oct 2022
Model-based Reinforcement Learning with Multi-step Plan Value Estimation
Model-based Reinforcement Learning with Multi-step Plan Value Estimation
Hao-Chu Lin
Yihao Sun
Jiajin Zhang
Yang Yu
OffRL
42
7
0
12 Sep 2022
Extending Open Bandit Pipeline to Simulate Industry Challenges
Extending Open Bandit Pipeline to Simulate Industry Challenges
Bram van den Akker
N. Weber
Felipe Moraes
Dmitri Goldenberg
OffRL
21
1
0
09 Sep 2022
Dynamic Regret of Online Markov Decision Processes
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi-Hua Zhou
OffRL
44
17
0
26 Aug 2022
Synthetic Data-Based Simulators for Recommender Systems: A Survey
Synthetic Data-Based Simulators for Recommender Systems: A Survey
Elizaveta Stavinova
A. Grigorievskiy
A. Volodkevich
P. Chunaev
Klavdiya Olegovna Bochenina
D. Bugaychenko
SyDa
40
7
0
22 Jun 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
53
101
0
19 Jun 2022
Adversarial Counterfactual Environment Model Learning
Adversarial Counterfactual Environment Model Learning
Xiong-Hui Chen
Yang Yu
Zhenghong Zhu
Zhihua Yu
Zhen-Yu Chen
...
Yinan Wu
Hongqiu Wu
Rongjun Qin
Rui Ding
Fangsheng Huang
CML
OffRL
29
12
0
10 Jun 2022
Hybrid Value Estimation for Off-policy Evaluation and Offline
  Reinforcement Learning
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning
Xuefeng Jin
Xu-Hui Liu
Shengyi Jiang
Yang Yu
OffRL
33
4
0
04 Jun 2022
Offline Reinforcement Learning with Causal Structured World Models
Offline Reinforcement Learning with Causal Structured World Models
Zhengbang Zhu
Xiong-Hui Chen
Hong Tian
Kun Zhang
Yang Yu
CML
OffRL
19
16
0
03 Jun 2022
Estimating and Penalizing Induced Preference Shifts in Recommender
  Systems
Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Micah Carroll
Anca Dragan
Stuart J. Russell
Dylan Hadfield-Menell
OffRL
38
41
0
25 Apr 2022
12
Next