Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.14510
Cited By
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization
19 December 2024
Jiayi Wu
Hengyi Cai
Lingyong Yan
Hao Sun
Xiang Li
Shuaiqiang Wang
Dawei Yin
Ming Gao
Re-assign community
ArXiv (abs)
PDF
HTML
Github (18★)
Papers citing
"PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization"
5 / 5 papers shown
Title
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
389
4,139
0
29 May 2023
Enabling Large Language Models to Generate Text with Citations
Tianyu Gao
Howard Yen
Jiatong Yu
Danqi Chen
LM&MA
HILM
103
354
0
24 May 2023
Large Dual Encoders Are Generalizable Retrievers
Jianmo Ni
Chen Qu
Jing Lu
Zhuyun Dai
Gustavo Hernández Ábrego
...
Vincent Zhao
Yi Luan
Keith B. Hall
Ming-Wei Chang
Yinfei Yang
DML
167
459
0
15 Dec 2021
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
218
3,365
0
12 Jun 2017
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
228
2,686
0
09 May 2017
1