Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.00678
Cited By
A General Offline Reinforcement Learning Framework for Interactive Recommendation
1 October 2023
Teng Xiao
Donglin Wang
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A General Offline Reinforcement Learning Framework for Interactive Recommendation"
27 / 27 papers shown
Title
InfoPO: On Mutual Information Maximization for Large Language Model Alignment
Teng Xiao
Zhen Ge
Sujay Sanghavi
Tian Wang
Julian Katz-Samuels
Marc Versage
Qingjun Cui
Trishul Chilimbi
31
0
0
13 May 2025
Preserving Cultural Identity with Context-Aware Translation Through Multi-Agent AI Systems
Mahfuz Ahmed Anik
Abdur Rahman
Azmine Toushik Wasi
Md Manjurul Ahsan
49
0
0
05 Mar 2025
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
Teng Xiao
Yige Yuan
Ziyang Chen
Mingxiao Li
Shangsong Liang
Zhaochun Ren
V. Honavar
110
6
0
21 Feb 2025
GeomCLIP: Contrastive Geometry-Text Pre-training for Molecules
Teng Xiao
Chao Cui
Huaisheng Zhu
V. Honavar
AI4CE
CLIP
42
0
0
16 Nov 2024
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective
Teng Xiao
Mingxiao Li
Yige Yuan
Huaisheng Zhu
Chao Cui
V. Honavar
ALM
36
7
0
14 Oct 2024
RePlay: a Recommendation Framework for Experimentation and Production Use
Alexey Vasilev
Anna Volodkevich
Denis Kulandin
Tatiana Bysheva
Anton Klenitskiy
31
0
0
11 Sep 2024
3M-Diffusion: Latent Multi-Modal Diffusion for Text-Guided Generation of Molecular Graphs
Huaisheng Zhu
Teng Xiao
V. Honavar
DiffM
45
1
0
11 Mar 2024
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Yixiang Shan
Zhengbang Zhu
Ting Long
Qifan Liang
Yi-Ju Chang
Weinan Zhang
Liang Yin
OffRL
51
1
0
05 Feb 2024
Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback
Teng Xiao
Suhang Wang
OffRL
41
8
0
17 Jan 2024
Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation
Jialin Liu
Xinyan Su
Zeyu He
Xiangyu Zhao
Jun Li
OffRL
26
0
0
30 Oct 2023
A General Neural Causal Model for Interactive Recommendation
Jialin Liu
Xinyan Su
Peng Zhou
Xiangyu Zhao
Jun Li
CML
20
0
0
30 Oct 2023
Simple and Asymmetric Graph Contrastive Learning without Augmentations
Teng Xiao
Huaisheng Zhu
Zhengyu Chen
Suhang Wang
35
31
0
29 Oct 2023
Learning How to Propagate Messages in Graph Neural Networks
Teng Xiao
Zhengyu Chen
Donglin Wang
Suhang Wang
GNN
34
76
0
01 Oct 2023
Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Tianchi Cai
Shenliao Bao
Jiyan Jiang
Shiji Zhou
Wenpeng Zhang
Lihong Gu
Jinjie Gu
Guannan Zhang
OffRL
34
2
0
25 Aug 2023
On the Opportunities and Challenges of Offline Reinforcement Learning for Recommender Systems
Xiaocong Chen
Siyu Wang
Julian McAuley
Dietmar Jannach
Lina Yao
OffRL
32
5
0
22 Aug 2023
Towards Fair Graph Neural Networks via Graph Counterfactual
Zhimeng Guo
Jialiang Li
Teng Xiao
Yao Ma
Suhang Wang
53
21
0
10 Jul 2023
Elastic Decision Transformer
Yueh-hua Wu
Xiaolong Wang
Masashi Hamaya
OffRL
34
39
0
05 Jul 2023
Hierarchical Reinforcement Learning for Modeling User Novelty-Seeking Intent in Recommender Systems
Pan Li
Yuyan Wang
Ed H. Chi
Minmin Chen
21
2
0
02 Jun 2023
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems
Zhengbang Zhu
Rongjun Qin
Junjie Huang
Xinyi Dai
Yang Yu
Yong Yu
Weinan Zhang
46
2
0
11 Oct 2022
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
Haoyi Niu
Shubham Sharma
Yiwen Qiu
Ming Li
Guyue Zhou
Jianming Hu
Xianyuan Zhan
OffRL
OnRL
40
48
0
27 Jun 2022
Decoupled Self-supervised Learning for Non-Homophilous Graphs
Teng Xiao
Zhengyu Chen
Zhimeng Guo
Zeyang Zhuang
Suhang Wang
BDL
SSL
36
18
0
07 Jun 2022
Reconsidering Learning Objectives in Unbiased Recommendation with Unobserved Confounders
Teng Xiao
Zhengyu Chen
Suhang Wang
OOD
CML
OffRL
32
0
0
07 Jun 2022
DraftRec: Personalized Draft Recommendation for Winning in Multi-Player Online Battle Arena Games
Hojoon Lee
Dongyoon Hwang
Hyunseung Kim
ByungKun Lee
Jaegul Choo
14
11
0
27 Apr 2022
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation
Haruka Kiyohara
K. Kawakami
Yuta Saito
OffRL
32
12
0
17 Sep 2021
Recommendation Fairness: From Static to Dynamic
De-Fu Zhang
Jun Wang
OffRL
24
15
0
05 Sep 2021
Personalization for Web-based Services using Offline Reinforcement Learning
P. Apostolopoulos
Zehui Wang
Hanson Wang
Chad Zhou
Kittipat Virochsiri
Norm Zhou
Igor L. Markov
OffRL
OnRL
27
7
0
10 Feb 2021
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
104
292
0
16 Oct 2019
1