Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.01325
Cited By
v1
v2
v3 (latest)
Learning to summarize from human feedback
2 September 2020
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning to summarize from human feedback"
50 / 1,548 papers shown
Title
Leveraging Human Feedback to Scale Educational Datasets: Combining Crowdworkers and Comparative Judgement
Owen Henkel
Libby Hills
66
1
0
22 May 2023
Continually Improving Extractive QA via Human Feedback
Ge Gao
Hung-Ting Chen
Yoav Artzi
Eunsol Choi
89
12
0
21 May 2023
LMs: Understanding Code Syntax and Semantics for Code Analysis
Wei Ma
Shangqing Liu
Zhihao Lin
Wenhan Wang
Q. Hu
Ye Liu
Cen Zhang
Liming Nie
Li Li
Yang Liu
128
16
0
20 May 2023
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Hiroki Furuta
Kuang-Huei Lee
Ofir Nachum
Yutaka Matsuo
Aleksandra Faust
S. Gu
Izzeddin Gur
LM&Ro
181
103
0
19 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
156
399
0
19 May 2023
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models
Wanqiao Xu
Shi Dong
Dilip Arumugam
Benjamin Van Roy
85
8
0
19 May 2023
Language Models Meet World Models: Embodied Experiences Enhance Language Models
Jiannan Xiang
Tianhua Tao
Yi Gu
Tianmin Shu
Zirui Wang
Zichao Yang
Zhiting Hu
ALM
LLMAG
LM&Ro
CLL
132
106
0
18 May 2023
SLiC-HF: Sequence Likelihood Calibration with Human Feedback
Yao-Min Zhao
Rishabh Joshi
Tianqi Liu
Misha Khalman
Mohammad Saleh
Peter J. Liu
100
307
0
17 May 2023
LeTI: Learning to Generate from Textual Interactions
Xingyao Wang
Hao Peng
Reyhaneh Jabbarvand
Heng Ji
120
30
0
17 May 2023
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Linhao Yu
Tianyu Dong
...
Peiyi Zhang
Qingqing Lyu
Xiaowen Su
Qun Liu
Deyi Xiong
ELM
ALM
121
26
0
17 May 2023
Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text
H. Khorashadizadeh
Nandana Mihindukulasooriya
Sanju Tiwari
Jinghua Groppe
Sven Groppe
73
23
0
15 May 2023
Leveraging Large Language Models in Conversational Recommender Systems
Luke Friedman
Sameer Ahuja
David Allen
Zhenning Tan
Hakim Sidahmed
...
Ajay Patel
Harsh Lara
Brian Chu
Zexiang Chen
Manoj Kumar Tiwari
122
110
0
13 May 2023
LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development
Ilias Chalkidis
Nicolas Garneau
Catalina Goanta
Daniel Martin Katz
Anders Søgaard
AILaw
ELM
70
64
0
12 May 2023
Taking Advice from ChatGPT
Peter Zhang
72
5
0
11 May 2023
WebCPM: Interactive Web Search for Chinese Long-form Question Answering
Yujia Qin
Zihan Cai
Di Jin
Lan Yan
Shi Liang
...
Ruobing Xie
Fanchao Qi
Zhiyuan Liu
Maosong Sun
Jie Zhou
RALM
80
93
0
11 May 2023
GFlowNets with Human Feedback
Yinchuan Li
Shuang Luo
Yunfeng Shao
Jianye Hao
AI4CE
68
5
0
11 May 2023
Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction
Wang-Cheng Kang
Jianmo Ni
Nikhil Mehta
M. Sathiamoorthy
Lichan Hong
Ed H. Chi
D. Cheng
68
123
0
10 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps
Yanfang Li
Huan Wang
Muxia Sun
LM&MA
AI4TS
AI4CE
110
59
0
10 May 2023
Towards Building the Federated GPT: Federated Instruction Tuning
Jianyi Zhang
Saeed Vahidian
Martin Kuo
Chunyuan Li
Ruiyi Zhang
Tong Yu
Yufan Zhou
Guoyin Wang
Yiran Chen
ALM
FedML
92
132
0
09 May 2023
Fine-tuning Language Models with Generative Adversarial Reward Modelling
Z. Yu
Lau Jia Jaw
Zhang Hui
Bryan Kian Hsiang Low
ALM
130
4
0
09 May 2023
Putting Natural in Natural Language Processing
Grzegorz Chrupała
104
9
0
08 May 2023
Improving Cross-Task Generalization with Step-by-Step Instructions
Yang Wu
Yanyan Zhao
Zhongyang Li
Bing Qin
Kai Xiong
LRM
ALM
80
9
0
08 May 2023
Large Language Models in Sport Science & Medicine: Opportunities, Risks and Considerations
M. Connor
Michael OÑeill
LM&MA
49
7
0
05 May 2023
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
Patrick Fernandes
Aman Madaan
Emmy Liu
António Farinhas
Pedro Henrique Martins
...
José G. C. de Souza
Shuyan Zhou
Tongshuang Wu
Graham Neubig
André F. T. Martins
ALM
192
59
0
01 May 2023
ChatGPT vs State-of-the-Art Models: A Benchmarking Study in Keyphrase Generation Task
Roberto Martínez-Cruz
Alvaro J. López-López
J. Portela
106
23
0
27 Apr 2023
SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
Bin Wang
Xinnian Liang
Jian Yang
Huijia Huang
Shuangzhi Wu
Peihao Wu
Lu Lu
Zejun Ma
Zhoujun Li
LLMAG
KELM
RALM
150
29
0
26 Apr 2023
Boosting Theory-of-Mind Performance in Large Language Models via Prompting
Shima Rahimi Moghaddam
C. Honey
LLMAG
LRM
AI4CE
101
83
0
22 Apr 2023
Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers
Felipe Urrutia
R. Araya
61
3
0
21 Apr 2023
In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
Xinyue Shen
Zhenpeng Chen
Michael Backes
Yang Zhang
106
59
0
18 Apr 2023
A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models
Qianqian Xie
Zheheng Luo
Benyou Wang
Sophia Ananiadou
LM&MA
VLM
72
11
0
18 Apr 2023
Tool Learning with Foundation Models
Yujia Qin
Shengding Hu
Yankai Lin
Weize Chen
Ning Ding
...
Cheng Yang
Tongshuang Wu
Heng Ji
Zhiyuan Liu
Maosong Sun
150
222
0
17 Apr 2023
A Comparative Study between Full-Parameter and LoRA-based Fine-Tuning on Chinese Instruction Data for Instruction Following Large Language Model
Xianghui Sun
Yunjie Ji
Baochang Ma
Xiangang Li
ALM
85
19
0
17 Apr 2023
Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation
Yunjie Ji
Yan Gong
Yong Deng
Yiping Peng
Qiang Niu
Baochang Ma
Xiangang Li
ALM
ELM
107
25
0
16 Apr 2023
OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Andreas Kopf
Yannic Kilcher
Dimitri von Rutte
Sotiris Anagnostidis
Zhi Rui Tam
...
Arnav Dantuluri
Andrew Maguire
Christoph Schuhmann
Huu Nguyen
A. Mattick
ALM
LM&MA
167
641
0
14 Apr 2023
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
Hanze Dong
Wei Xiong
Deepanshu Goyal
Yihan Zhang
Winnie Chow
Boyao Wang
Shizhe Diao
Jipeng Zhang
Kashun Shum
Tong Zhang
ALM
162
470
0
13 Apr 2023
ChatGPT Needs SPADE (Sustainability, PrivAcy, Digital divide, and Ethics) Evaluation: A Review
Sunder Ali Khowaja
P. Khuwaja
Kapal Dev
Weizheng Wang
Lewis Nkenyereye
136
88
0
13 Apr 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
185
413
0
12 Apr 2023
RRHF: Rank Responses to Align Language Models with Human Feedback without tears
Zheng Yuan
Hongyi Yuan
Chuanqi Tan
Wei Wang
Songfang Huang
Feiran Huang
ALM
215
385
0
11 Apr 2023
Learning a Universal Human Prior for Dexterous Manipulation from Human Preference
Zihan Ding
Yuanpei Chen
Allen Z. Ren
S. Gu
Qianxu Wang
Hao Dong
Chi Jin
86
10
0
10 Apr 2023
VOICE: Visual Oracle for Interaction, Conversation, and Explanation
Donggang Jia
Alexandra Irger
Lonni Besancon
Ondrej Strnad
Deng Luo
Johanna Björklund
Anders Ynnerman
I. Viola
108
2
0
08 Apr 2023
Towards Interpretable Mental Health Analysis with Large Language Models
Kailai Yang
Shaoxiong Ji
Tianlin Zhang
Qianqian Xie
Zi-Zhou Kuang
Sophia Ananiadou
ELM
AI4MH
LRM
121
61
0
06 Apr 2023
Human-like Summarization Evaluation with ChatGPT
Mingqi Gao
Jie Ruan
Renliang Sun
Xunjian Yin
Shiping Yang
Xiaojun Wan
ALM
AI4MH
78
135
0
05 Apr 2023
REFINER: Reasoning Feedback on Intermediate Representations
Debjit Paul
Mete Ismayilzada
Maxime Peyrard
Beatriz Borges
Antoine Bosselut
Robert West
Boi Faltings
ReLM
LRM
134
182
0
04 Apr 2023
Cross-Domain Image Captioning with Discriminative Finetuning
Roberto Dessì
Michele Bevilacqua
Eleonora Gualdoni
Nathanaël Carraz Rakotonirina
Francesca Franzon
Marco Baroni
CLIP
101
19
0
04 Apr 2023
Eight Things to Know about Large Language Models
Sam Bowman
ALM
103
117
0
02 Apr 2023
Towards Healthy AI: Large Language Models Need Therapists Too
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
Kush R. Varshney
AI4MH
96
19
0
02 Apr 2023
Pair Programming with Large Language Models for Sampling and Estimation of Copulas
Jan Górecki
LLMAG
39
1
0
31 Mar 2023
Self-Refine: Iterative Refinement with Self-Feedback
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
...
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM
LRM
DiffM
256
1,690
0
30 Mar 2023
Training Language Models with Language Feedback at Scale
Jérémy Scheurer
Jon Ander Campos
Tomasz Korbak
Jun Shern Chan
Angelica Chen
Kyunghyun Cho
Ethan Perez
ALM
116
107
0
28 Mar 2023
Improving Code Generation by Training with Natural Language Feedback
Angelica Chen
Jérémy Scheurer
Tomasz Korbak
Jon Ander Campos
Jun Shern Chan
Samuel R. Bowman
Kyunghyun Cho
Ethan Perez
SyDa
ALM
AI4CE
109
78
0
28 Mar 2023
Previous
1
2
3
...
26
27
28
29
30
31
Next