UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
arXiv:2401.09034, v2 (latest)

17 January 2024
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
    OffRL

Papers citing "UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems"

35 / 35 papers shown
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
Jiakai Tang
Sunhao Dai
Teng Shi
Jun Xu
X. Chen
Wen Chen
Wu Jian
Yuning Jiang
LRM
137
10
0
28 Mar 2025
Cold & Warm Net: Addressing Cold-Start Users in Recommender Systems
Xinming Zhang
Zongqiang Kuang
Zehao Zhang
Fan Huang
Xianfeng Tan
OffRL
52
4
0
27 Sep 2023
Safe Collaborative Filtering
Riku Togashi
Tatsushi Oka
Naoto Ohsaka
Tetsuro Morimura
39
1
0
08 Jun 2023
Multi-objective Optimization of Notifications Using Offline Reinforcement Learning
Prakruthi Prabhakar
Yiping Yuan
Guangyu Yang
Wensheng Sun
A. Muralidharan
OffRL
60
6
0
07 Jul 2022
Efficient Risk-Averse Reinforcement Learning
Ido Greenberg
Yinlam Chow
Mohammad Ghavamzadeh
Shie Mannor
78
42
0
10 May 2022
Deep Exploration for Recommendation Systems
Zheqing Zhu
Benjamin Van Roy
88
11
0
26 Sep 2021
A Semi-Personalized System for User Cold Start Recommendation on Music Streaming Apps
Léa Briand
Guillaume Salha-Galvan
Walid Bendada
M. Morlon
Viet-Anh Tran
61
36
0
07 Jun 2021
Risk-Averse Offline Reinforcement Learning
Núria Armengol Urpí
Sebastian Curi
Andreas Krause
OffRL
42
70
0
10 Feb 2021
Fairness-Aware Explainable Recommendation over Knowledge Graphs
Zuohui Fu
Yikun Xian
Ruoyuan Gao
Jieyu Zhao
Qiaoying Huang
...
Shuyuan Xu
Shijie Geng
C. Shah
Yongfeng Zhang
Gerard de Melo
FaML
106
208
0
03 Jun 2020
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
232
195
0
08 May 2020
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
104
164
0
03 Feb 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
46
182
0
09 Jan 2020
Worst Cases Policy Gradients
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
56
75
0
09 Nov 2019
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery
Kristian Hartikainen
Xinyang Geng
Tuomas Haarnoja
Sergey Levine
SSL
70
81
0
18 Jul 2019
Dual Graph Attention Networks for Deep Latent Representation of Multifaceted Social Effects in Recommender Systems
Qitian Wu
Hengrui Zhang
Xiaofeng Gao
Peng He
Paul Weng
Han Gao
Guihai Chen
OffRL, CML
86
321
0
25 Mar 2019
Novelty Search for Deep Reinforcement Learning Policy Network Weights by Action Sequence Edit Metric Distance
Ethan C. Jackson
Mark Daley
34
20
0
08 Feb 2019
Top-K Off-Policy Correction for a REINFORCE Recommender System
Minmin Chen
Alex Beutel
Paul Covington
Sagar Jain
Francois Belletti
Ed H. Chi
CML, OffRL
117
482
0
06 Dec 2018
Active Learning in Recommendation Systems with Multi-level User Preferences
Yuheng Bu
Kevin Small
68
5
0
30 Nov 2018
Model-Based Active Exploration
Pranav Shyam
Wojciech Jaśkowski
Faustino J. Gomez
86
179
0
29 Oct 2018
Self-Attentive Sequential Recommendation
Wang-Cheng Kang
Julian McAuley
HAI, BDL
175
2,442
0
20 Aug 2018
Large-Scale Study of Curiosity-Driven Learning
Yuri Burda
Harrison Edwards
Deepak Pathak
Amos Storkey
Trevor Darrell
Alexei A. Efros
LRM
72
707
0
13 Aug 2018
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
OffRL
139
532
0
14 Jun 2018
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application
Yujing Hu
Qing Da
Anxiang Zeng
Yang Yu
Yinghui Xu
71
180
0
02 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
182
5,212
0
26 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
109
1,088
0
16 Feb 2018
Distributional Reinforcement Learning with Quantile Regression
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
92
762
0
27 Oct 2017
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
98
1,506
0
21 Jul 2017
Parameter Space Noise for Exploration
Matthias Plappert
Rein Houthooft
Prafulla Dhariwal
Szymon Sidor
Richard Y. Chen
Xi Chen
Tamim Asfour
Pieter Abbeel
Marcin Andrychowicz
73
597
0
06 Jun 2017
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
179
1,483
0
06 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
204
8,875
0
04 Feb 2016
Deep Reinforcement Learning in Large Discrete Action Spaces
Gabriel Dulac-Arnold
Richard Evans
H. V. Hasselt
P. Sunehag
Timothy Lillicrap
Jonathan J. Hunt
Timothy A. Mann
T. Weber
T. Degris
Ben Coppin
OffRL
71
575
0
24 Dec 2015
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning
S. Mohamed
Danilo Jimenez Rezende
DRL, SSL
99
402
0
29 Sep 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
170
7,662
0
22 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
325
13,272
0
09 Sep 2015
An MDP-based Recommender System
Guy Shani
Ronen I. Brafman
David Heckerman
LRM
114
973
0
12 Dec 2012