ResearchTrend.AI
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning

25 May 2018
Jing-Cheng Shi
Yang Yu
Qing Da
Shi-Yong Chen
Anxiang Zeng
    OffRL

Papers citing "Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning"

32 / 82 papers shown
Computational Experiments: Past, Present and Future
Xiao Xue
Xiangning Yu
Deyu Zhou
Tianlin Li
Zhang-Bin Zhou
Fei-Yue Wang
13
5
0
28 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
38
9
0
23 Feb 2022
Rethinking ValueDice: Does It Really Improve Performance?
Ziniu Li
Tian Xu
Yang Yu
Zhimin Luo
OffRL
28
17
0
05 Feb 2022
Offline Reinforcement Learning for Mobile Notifications
Yiping Yuan
A. Muralidharan
Preetam Nandy
Miao Cheng
Prakruthi Prabhakar
OffRL
36
9
0
04 Feb 2022
Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Hao Wang
Yifei Ma
Hao Ding
Yuyang Wang
46
6
0
01 Feb 2022
Fair ranking: a critical review, challenges, and future directions
Gourab K. Patro
Lorenzo Porcaro
Laura Mitchell
Qiuyue Zhang
Meike Zehlike
Nikhil Garg
28
51
0
29 Jan 2022
Multiscale Generative Models: Improving Performance of a Generative Model Using Feedback from Other Dependent Generative Models
Changyu Chen
Avinandan Bose
Shih-Fen Cheng
Arunesh Sinha
SyDa
AI4CE
24
0
0
24 Jan 2022
Supervised Advantage Actor-Critic for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
OffRL
32
30
0
05 Nov 2021
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Kai Wang
Zhene Zou
Minghao Zhao
Qilin Deng
Yue Shang
Yile Liang
Runze Wu
Xudong Shen
Tangjie Lyu
Changjie Fan
OffRL
31
9
0
18 Oct 2021
Deep Exploration for Recommendation Systems
Zheqing Zhu
Benjamin Van Roy
37
11
0
26 Sep 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
28
61
0
08 Sep 2021
Imitate TheWorld: A Search Engine Simulation Platform
Yongqing Gao
Guangda Huzhang
Weijie Shen
Yawen Liu
Wen-Ji Zhou
Qing Da
Yang Yu
26
3
0
16 Jul 2021
We Know What You Want: An Advertising Strategy Recommender System for Online Advertising
Liyi Guo
Junqi Jin
Haoqi Zhang
Zhenzhe Zheng
Zhiye Yang
...
Fan Wu
Haiyang Xu
Chuan Yu
Yuning Jiang
Xiaoqiang Zhu
14
11
0
25 May 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
22
8
0
03 May 2021
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Martin Mladenov
Chih-Wei Hsu
Vihan Jain
Eugene Ie
Christopher Colby
Nicolas Mayoraz
H. Pham
Dustin Tran
Ivan Vendrov
Craig Boutilier
BDL
15
32
0
14 Mar 2021
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
Xianyuan Zhan
Haoran Xu
Yueying Zhang
Xiangyu Zhu
Honglei Yin
Yu Zheng
OffRL
AI4CE
45
68
0
23 Feb 2021
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
26
42
0
10 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
140
80
0
01 Feb 2021
Interactive Search Based on Deep Reinforcement Learning
Yang Yu
Zhenhao Gu
Rongqi Tao
Jingtian Ge
Kenglun Chang
OffRL
15
0
0
09 Dec 2020
Generative Inverse Deep Reinforcement Learning for Online Recommendation
Xiaocong Chen
Lina Yao
Aixin Sun
Xianzhi Wang
Xiwei Xu
Liming Zhu
OffRL
12
27
0
04 Nov 2020
Error Bounds of Imitating Policies and Environments
Tian Xu
Ziniu Li
Yang Yu
38
118
0
22 Oct 2020
Learning to Infer User Hidden States for Online Sequential Advertising
Zhaoqing Peng
Junqi Jin
Lan Luo
Yaodong Yang
Rui Luo
...
Chuan Yu
Tiejian Luo
Han Li
Jian Xu
Kun Gai
OffRL
39
4
0
03 Sep 2020
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSL
OffRL
32
198
0
10 Jun 2020
Generating Realistic Stock Market Order Streams
Junyi Li
Xintong Wang
Yaoyang Lin
Arunesh Sinha
Michael P. Wellman
GAN
AIFin
34
81
0
07 Jun 2020
Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce
Jianxiong Wei
Anxiang Zeng
Yueqiu Wu
P. Guo
Q. Hua
Qingpeng Cai
OffRL
35
9
0
25 May 2020
AliExpress Learning-To-Rank: Maximizing Online Model Performance without Going Online
Guangda Huzhang
Zhen-Jia Pang
Yongqing Gao
Yawen Liu
Weijie Shen
...
Qing Da
Anxiang Zeng
Han Yu
Yang Yu
Zhi-Hua Zhou
22
4
0
25 Mar 2020
RLCard: A Toolkit for Reinforcement Learning in Card Games
Daochen Zha
Kwei-Herng Lai
Yuanpu Cao
Songyi Huang
Ruzhe Wei
Junyu Guo
Xia Hu
OffRL
18
58
0
10 Oct 2019
RecSim: A Configurable Simulation Platform for Recommender Systems
Eugene Ie
Chih-Wei Hsu
Martin Mladenov
Vihan Jain
Sanmit Narvekar
Jing Wang
Rui Wu
Craig Boutilier
30
179
0
11 Sep 2019
Imitation Learning from Pixel-Level Demonstrations by HashReward
Xin-Qiang Cai
Yao-Xiang Ding
Yuan Jiang
Zhi-Hua Zhou
6
10
0
09 Sep 2019
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation
Wenjie Shang
Yang Yu
Qingyang Li
Zhiwei Qin
Yiping Meng
Jieping Ye
CML
30
51
0
12 Jul 2019
Generative Adversarial User Model for Reinforcement Learning Based Recommendation System
Xinshi Chen
Shuang Li
Hui Li
Shaohua Jiang
Yuan Qi
Le Song
22
207
0
27 Dec 2018
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
28
144
0
15 Oct 2018