ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.07377
  4. Cited By
Process-Supervised LLM Recommenders via Flow-guided Tuning
v1v2v3 (latest)

Process-Supervised LLM Recommenders via Flow-guided Tuning

10 March 2025
Chongming Gao
Mengyao Gao
Chenxiao Fan
Shuai Yuan
Wentao Shi
Xiangnan He
ArXiv (abs)PDFHTML

Papers citing "Process-Supervised LLM Recommenders via Flow-guided Tuning"

36 / 36 papers shown
Title
Fine-grained List-wise Alignment for Generative Medication Recommendation
Fine-grained List-wise Alignment for Generative Medication Recommendation
Chenxiao Fan
Chongming Gao
Wentao Shi
Yaxin Gong
Zihao Zhao
Fuli Feng
LM&MA
42
0
0
26 May 2025
In-context Ranking Preference Optimization
In-context Ranking Preference Optimization
Junda Wu
Rohan Surana
Zhouhang Xie
Yiran Shen
Yu Xia
Tong Yu
Ryan Rossi
Prithviraj Ammanabrolu
Julian McAuley
83
0
0
21 Apr 2025
CFaiRLLM: Consumer Fairness Evaluation in Large-Language Model Recommender System
CFaiRLLM: Consumer Fairness Evaluation in Large-Language Model Recommender System
Yashar Deldjoo
Tommaso Di Noia
177
23
0
24 Feb 2025
A Survey of Controllable Learning: Methods and Applications in Information Retrieval
A Survey of Controllable Learning: Methods and Applications in Information Retrieval
Chenglei Shen
Xiao Zhang
Teng Shi
Changshuo Zhang
Guofu Xie
Jun Xu
125
6
0
03 Jan 2025
Causality-Enhanced Behavior Sequence Modeling in LLMs for Personalized
  Recommendation
Causality-Enhanced Behavior Sequence Modeling in LLMs for Personalized Recommendation
Yang Zhang
Juntao You
Yimeng Bai
Jizhi Zhang
Keqin Bao
Wenjie Wang
Tat-Seng Chua
CML
60
5
0
30 Oct 2024
Process Reward Model with Q-Value Rankings
Process Reward Model with Q-Value Rankings
W. Li
Yixuan Li
LRM
133
25
0
15 Oct 2024
Rewarding Progress: Scaling Automated Process Verifiers for LLM
  Reasoning
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Amrith Rajagopal Setlur
Chirag Nagpal
Adam Fisch
Xinyang Geng
Jacob Eisenstein
Rishabh Agarwal
Alekh Agarwal
Jonathan Berant
Aviral Kumar
OffRLLRM
98
77
0
10 Oct 2024
On Softmax Direct Preference Optimization for Recommendation
On Softmax Direct Preference Optimization for Recommendation
Yuxin Chen
Junfei Tan
An Zhang
Zhengyi Yang
Leheng Sheng
Enzhi Zhang
Xiang Wang
Tat-Seng Chua
79
31
0
13 Jun 2024
Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal Examples
Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal Examples
Fangxu Yu
Lai Jiang
Haoqiang Kang
Shibo Hao
Lianhui Qin
LRMAI4CE
144
10
0
09 Jun 2024
XRec: Large Language Models for Explainable Recommendation
XRec: Large Language Models for Explainable Recommendation
Qiyao Ma
Xubin Ren
Chao Huang
LRM
82
22
0
04 Jun 2024
RGFN: Synthesizable Molecular Generation Using GFlowNets
RGFN: Synthesizable Molecular Generation Using GFlowNets
Michal Koziarski
Andrei Rekesh
Dmytro Shevchuk
A. V. D. Sloot
Piotr Gaiñski
Yoshua Bengio
Cheng-Hao Liu
Mike Tyers
Robert A. Batey
72
16
0
01 Jun 2024
Large Language Models Enhanced Sequential Recommendation for Long-tail
  User and Item
Large Language Models Enhanced Sequential Recommendation for Long-tail User and Item
Qidong Liu
Xian Wu
Xiangyu Zhao
Yejing Wang
Zijian Zhang
Feng Tian
Yefeng Zheng
65
20
0
31 May 2024
Learning diverse attacks on large language models for robust red-teaming and safety tuning
Learning diverse attacks on large language models for robust red-teaming and safety tuning
Seanie Lee
Minsu Kim
Lynn Cherif
David Dobre
Juho Lee
...
Kenji Kawaguchi
Gauthier Gidel
Yoshua Bengio
Nikolay Malkin
Moksh Jain
AAML
136
20
0
28 May 2024
Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual
  User Behaviors
Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual User Behaviors
Binzong Geng
Zhaoxin Huan
Xiaolu Zhang
Yong He
Liang Zhang
Fajie Yuan
Jun Zhou
Linjian Mo
88
25
0
28 Mar 2024
CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve
  Long-tail Recommendation
CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation
Junda Wu
Cheng-Chun Chang
Tong Yu
Zhankui He
Jianing Wang
Yupeng Hou
Julian McAuley
LRMRALM
79
27
0
11 Mar 2024
Aligning Large Language Models for Controllable Recommendations
Aligning Large Language Models for Controllable Recommendations
Wensheng Lu
Jianxun Lian
Wei Zhang
Guanghua Li
Mingyang Zhou
Hao Liao
Xing Xie
ALM
78
16
0
08 Mar 2024
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human
  Annotations
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Peiyi Wang
Lei Li
Zhihong Shao
R. X. Xu
Damai Dai
Yifei Li
Deli Chen
Y.Wu
Zhifang Sui
AIMatLRMALM
143
398
0
14 Dec 2023
CoLLM: Integrating Collaborative Embeddings into Large Language Models for Recommendation
CoLLM: Integrating Collaborative Embeddings into Large Language Models for Recommendation
Yang Zhang
Fuli Feng
Jizhi Zhang
Keqin Bao
Qifan Wang
Xiangnan He
26
87
0
30 Oct 2023
AgentCF: Collaborative Learning with Autonomous Language Agents for
  Recommender Systems
AgentCF: Collaborative Learning with Autonomous Language Agents for Recommender Systems
Junjie Zhang
Yupeng Hou
Ruobing Xie
Wenqi Sun
Julian McAuley
Wayne Xin Zhao
Leyu Lin
Ji-Rong Wen
LLMAG
45
84
0
13 Oct 2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CEALM
182
149
0
10 Oct 2023
RecMind: Large Language Model Powered Agent For Recommendation
RecMind: Large Language Model Powered Agent For Recommendation
Yancheng Wang
Ziyan Jiang
Zheng Chen
Fan Yang
Yingxue Zhou
Eunah Cho
Xing Fan
Xiaojiang Huang
Yanbin Lu
Yingzhen Yang
LLMAGLM&RoLRM
105
103
0
28 Aug 2023
How Can Recommender Systems Benefit from Large Language Models: A Survey
How Can Recommender Systems Benefit from Large Language Models: A Survey
Jianghao Lin
Xinyi Dai
Yunjia Xi
Weiwen Liu
Bo Chen
...
Chenxu Zhu
Huifeng Guo
Yong Yu
Ruiming Tang
Weinan Zhang
LRM
132
219
0
09 Jun 2023
User Behavior Simulation with Large Language Model based Agents
User Behavior Simulation with Large Language Model based Agents
Lei Wang
Jingsen Zhang
Hao-ran Yang
Zhiyuan Chen
Jiakai Tang
...
Wayne Xin Zhao
Jun Xu
Zhicheng Dou
Jun Wang
Ji-Rong Wen
LM&RoLLMAG
84
50
0
05 Jun 2023
Let's Verify Step by Step
Let's Verify Step by Step
Hunter Lightman
V. Kosaraju
Yura Burda
Harrison Edwards
Bowen Baker
Teddy Lee
Jan Leike
John Schulman
Ilya Sutskever
K. Cobbe
ALMOffRLLRM
195
1,233
0
31 May 2023
A Survey on Large Language Models for Recommendation
A Survey on Large Language Models for Recommendation
Likang Wu
Zhilan Zheng
Zhaopeng Qiu
Hao Wang
Hongchao Gu
...
Chen Zhu
Hengshu Zhu
Qi Liu
Hui Xiong
Enhong Chen
128
399
0
31 May 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward
  Model
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
389
4,139
0
29 May 2023
Leveraging Large Language Models in Conversational Recommender Systems
Leveraging Large Language Models in Conversational Recommender Systems
Luke Friedman
Sameer Ahuja
David Allen
Zhenning Tan
Hakim Sidahmed
...
Ajay Patel
Harsh Lara
Brian Chu
Zexiang Chen
Manoj Kumar Tiwari
97
109
0
13 May 2023
Recommendation as Instruction Following: A Large Language Model
  Empowered Recommendation Approach
Recommendation as Instruction Following: A Large Language Model Empowered Recommendation Approach
Junjie Zhang
Ruobing Xie
Yupeng Hou
Wayne Xin Zhao
Leyu Lin
Ji-Rong Wen
83
225
0
11 May 2023
Sample-efficient Multi-objective Molecular Optimization with GFlowNets
Sample-efficient Multi-objective Molecular Optimization with GFlowNets
Yiheng Zhu
Jialun Wu
Chaowen Hu
Jiahuan Yan
Chang-Yu Hsieh
Tingjun Hou
Jian Wu
83
35
0
08 Feb 2023
Learning GFlowNets from partial episodes for improved convergence and
  stability
Learning GFlowNets from partial episodes for improved convergence and stability
Kanika Madan
Jarrid Rector-Brooks
Maksym Korablyov
Emmanuel Bengio
Moksh Jain
A. Nica
Tom Bosc
Yoshua Bengio
Nikolay Malkin
74
101
0
26 Sep 2022
Bayesian Structure Learning with Generative Flow Networks
Bayesian Structure Learning with Generative Flow Networks
T. Deleu
António Góis
Chris C. Emezue
M. Rankawat
Simon Lacoste-Julien
Stefan Bauer
Yoshua Bengio
BDL
98
156
0
28 Feb 2022
Generative Flow Networks for Discrete Probabilistic Modeling
Generative Flow Networks for Discrete Probabilistic Modeling
Dinghuai Zhang
Nikolay Malkin
Ziqiang Liu
Alexandra Volokhova
Aaron Courville
Yoshua Bengio
73
109
0
03 Feb 2022
Trajectory balance: Improved credit assignment in GFlowNets
Trajectory balance: Improved credit assignment in GFlowNets
Nikolay Malkin
Moksh Jain
Emmanuel Bengio
Chen Sun
Yoshua Bengio
239
181
0
31 Jan 2022
GFlowNet Foundations
GFlowNet Foundations
Yoshua Bengio
Salem Lahlou
T. Deleu
J. E. Hu
Mo Tiwari
Emmanuel Bengio
76
238
0
17 Nov 2021
Self-Attentive Sequential Recommendation
Self-Attentive Sequential Recommendation
Wang-Cheng Kang
Julian McAuley
HAIBDL
181
2,442
0
20 Aug 2018
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
541
19,265
0
20 Jul 2017
1