2504.15477
In-context Ranking Preference Optimization
21 April 2025
Junda Wu
Rohan Surana
Zhouhang Xie
Yiran Shen
Yu Xia
Tong Yu
Ryan Rossi
Prithviraj Ammanabrolu
Julian McAuley
Papers citing "In-context Ranking Preference Optimization"
16 / 16 papers shown
Title
Process-Supervised LLM Recommenders via Flow-guided Tuning
Chongming Gao
Mengyao Gao
Chenxiao Fan
Shuai Yuan
Wentao Shi
Xiangnan He
126
6
0
10 Mar 2025
LiPO: Listwise Preference Optimization through Learning-to-Rank
Tianqi Liu
Zhen Qin
Junru Wu
Jiaming Shen
Misha Khalman
...
Mohammad Saleh
Simon Baumgartner
Jialu Liu
Peter J. Liu
Xuanhui Wang
313
59
0
28 Jan 2025
Knowledge-Aware Query Expansion with Large Language Models for Textual and Relational Retrieval
Yu Xia
Junda Wu
Sungchul Kim
Tong Yu
Ryan A. Rossi
Haoliang Wang
Julian McAuley
80
4
0
17 Oct 2024
Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models
Qi Liu
Bo Wang
Nan Wang
Jiaxin Mao
RALM
125
4
0
21 Jun 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
176
58
0
23 Apr 2024
Make Large Language Model a Better Ranker
Wenshuo Chao
Zhi Zheng
Hengshu Zhu
Hao Liu
ALM
90
7
0
28 Mar 2024
Enhancing Recommendation Diversity by Re-ranking with Large Language Models
Diego Carraro
Derek Bridge
LRM
ALM
112
16
0
21 Jan 2024
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models
Ronak Pradeep
Sahel Sharifymoghaddam
Jimmy Lin
ALM
95
43
0
26 Sep 2023
On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-n Recommendation
Olivier Jeunen
Ivan Potapov
Aleksei Ustimenko
ELM
OffRL
107
12
0
27 Jul 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
389
4,163
0
29 May 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
886
13,207
0
04 Mar 2022
Towards Deep Conversational Recommendations
Raymond Li
Samira Ebrahimi Kahou
Hannes Schulz
Vincent Michalski
Laurent Charlin
C. Pal
62
374
0
18 Dec 2018
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
144
1,752
0
02 Nov 2018
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
191
2,700
0
25 Sep 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
541
19,296
0
20 Jul 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
218
3,365
0
12 Jun 2017