ResearchTrend.AI
A General Offline Reinforcement Learning Framework for Interactive Recommendation

1 October 2023
Teng Xiao
Donglin Wang
    OffRL

Papers citing "A General Offline Reinforcement Learning Framework for Interactive Recommendation"

27 / 27 papers shown
InfoPO: On Mutual Information Maximization for Large Language Model Alignment
Teng Xiao
Zhen Ge
Sujay Sanghavi
Tian Wang
Julian Katz-Samuels
Marc Versage
Qingjun Cui
Trishul Chilimbi
31
0
0
13 May 2025
Preserving Cultural Identity with Context-Aware Translation Through Multi-Agent AI Systems
Mahfuz Ahmed Anik
Abdur Rahman
Azmine Toushik Wasi
Md Manjurul Ahsan
52
0
0
05 Mar 2025
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
Teng Xiao
Yige Yuan
Ziyang Chen
Mingxiao Li
Shangsong Liang
Zhaochun Ren
V. Honavar
110
6
0
21 Feb 2025
GeomCLIP: Contrastive Geometry-Text Pre-training for Molecules
Teng Xiao
Chao Cui
Huaisheng Zhu
V. Honavar
AI4CE
CLIP
42
0
0
16 Nov 2024
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective
Teng Xiao
Mingxiao Li
Yige Yuan
Huaisheng Zhu
Chao Cui
V. Honavar
ALM
39
7
0
14 Oct 2024
RePlay: a Recommendation Framework for Experimentation and Production Use
Alexey Vasilev
Anna Volodkevich
Denis Kulandin
Tatiana Bysheva
Anton Klenitskiy
31
0
0
11 Sep 2024
3M-Diffusion: Latent Multi-Modal Diffusion for Text-Guided Generation of Molecular Graphs
Huaisheng Zhu
Teng Xiao
V. Honavar
DiffM
45
1
0
11 Mar 2024
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Yixiang Shan
Zhengbang Zhu
Ting Long
Qifan Liang
Yi-Ju Chang
Weinan Zhang
Liang Yin
OffRL
51
1
0
05 Feb 2024
Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback
Teng Xiao
Suhang Wang
OffRL
41
8
0
17 Jan 2024
Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation
Jialin Liu
Xinyan Su
Zeyu He
Xiangyu Zhao
Jun Li
OffRL
26
0
0
30 Oct 2023
A General Neural Causal Model for Interactive Recommendation
Jialin Liu
Xinyan Su
Peng Zhou
Xiangyu Zhao
Jun Li
CML
20
0
0
30 Oct 2023
Simple and Asymmetric Graph Contrastive Learning without Augmentations
Teng Xiao
Huaisheng Zhu
Zhengyu Chen
Suhang Wang
35
31
0
29 Oct 2023
Learning How to Propagate Messages in Graph Neural Networks
Teng Xiao
Zhengyu Chen
Donglin Wang
Suhang Wang
GNN
34
76
0
01 Oct 2023
Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Tianchi Cai
Shenliao Bao
Jiyan Jiang
Shiji Zhou
Wenpeng Zhang
Lihong Gu
Jinjie Gu
Guannan Zhang
OffRL
34
2
0
25 Aug 2023
On the Opportunities and Challenges of Offline Reinforcement Learning for Recommender Systems
Xiaocong Chen
Siyu Wang
Julian McAuley
Dietmar Jannach
Lina Yao
OffRL
32
5
0
22 Aug 2023
Towards Fair Graph Neural Networks via Graph Counterfactual
Zhimeng Guo
Jialiang Li
Teng Xiao
Yao Ma
Suhang Wang
53
21
0
10 Jul 2023
Elastic Decision Transformer
Yueh-hua Wu
Xiaolong Wang
Masashi Hamaya
OffRL
34
39
0
05 Jul 2023
Hierarchical Reinforcement Learning for Modeling User Novelty-Seeking Intent in Recommender Systems
Pan Li
Yuyan Wang
Ed H. Chi
Minmin Chen
21
2
0
02 Jun 2023
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems
Zhengbang Zhu
Rongjun Qin
Junjie Huang
Xinyi Dai
Yang Yu
Yong Yu
Weinan Zhang
46
2
0
11 Oct 2022
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
Haoyi Niu
Shubham Sharma
Yiwen Qiu
Ming Li
Guyue Zhou
Jianming Hu
Xianyuan Zhan
OffRL
OnRL
40
48
0
27 Jun 2022
Decoupled Self-supervised Learning for Non-Homophilous Graphs
Teng Xiao
Zhengyu Chen
Zhimeng Guo
Zeyang Zhuang
Suhang Wang
BDL
SSL
36
18
0
07 Jun 2022
Reconsidering Learning Objectives in Unbiased Recommendation with Unobserved Confounders
Teng Xiao
Zhengyu Chen
Suhang Wang
OOD
CML
OffRL
32
0
0
07 Jun 2022
DraftRec: Personalized Draft Recommendation for Winning in Multi-Player Online Battle Arena Games
Hojoon Lee
Dongyoon Hwang
Hyunseung Kim
ByungKun Lee
Jaegul Choo
14
11
0
27 Apr 2022
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation
Haruka Kiyohara
K. Kawakami
Yuta Saito
OffRL
32
12
0
17 Sep 2021
Recommendation Fairness: From Static to Dynamic
De-Fu Zhang
Jun Wang
OffRL
24
15
0
05 Sep 2021
Personalization for Web-based Services using Offline Reinforcement Learning
P. Apostolopoulos
Zehui Wang
Hanson Wang
Chad Zhou
Kittipat Virochsiri
Norm Zhou
Igor L. Markov
OffRL
OnRL
27
7
0
10 Feb 2021
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
104
292
0
16 Oct 2019