Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.01325
Cited By
Learning to summarize from human feedback
2 September 2020
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning to summarize from human feedback"
50 / 1,442 papers shown
Title
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Linhao Yu
Tianyu Dong
...
Peiyi Zhang
Qingqing Lyu
Xiaowen Su
Qun Liu
Deyi Xiong
ELM
ALM
16
24
0
17 May 2023
Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text
H. Khorashadizadeh
Nandana Mihindukulasooriya
Sanju Tiwari
Jinghua Groppe
Sven Groppe
27
22
0
15 May 2023
Leveraging Large Language Models in Conversational Recommender Systems
Luke Friedman
Sameer Ahuja
David Allen
Zhenning Tan
Hakim Sidahmed
...
Ajay Patel
Harsh Lara
Brian Chu
Zexiang Chen
Manoj Kumar Tiwari
32
104
0
13 May 2023
LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development
Ilias Chalkidis
Nicolas Garneau
Catalina Goanta
Daniel Martin Katz
Anders Søgaard
AILaw
ELM
32
56
0
12 May 2023
Taking Advice from ChatGPT
Peter Zhang
45
5
0
11 May 2023
WebCPM: Interactive Web Search for Chinese Long-form Question Answering
Yujia Qin
Zihan Cai
Di Jin
Lan Yan
Shi Liang
...
Ruobing Xie
Fanchao Qi
Zhiyuan Liu
Maosong Sun
Jie Zhou
RALM
25
87
0
11 May 2023
GFlowNets with Human Feedback
Yinchuan Li
Shuang Luo
Yunfeng Shao
Jianye Hao
AI4CE
21
5
0
11 May 2023
Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction
Wang-Cheng Kang
Jianmo Ni
Nikhil Mehta
M. Sathiamoorthy
Lichan Hong
Ed H. Chi
D. Cheng
23
116
0
10 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps
Yanfang Li
Huan Wang
Muxia Sun
LM&MA
AI4TS
AI4CE
29
46
0
10 May 2023
Towards Building the Federated GPT: Federated Instruction Tuning
Jianyi Zhang
Saeed Vahidian
Martin Kuo
Chunyuan Li
Ruiyi Zhang
Tong Yu
Yufan Zhou
Guoyin Wang
Yiran Chen
ALM
FedML
42
113
0
09 May 2023
Fine-tuning Language Models with Generative Adversarial Reward Modelling
Z. Yu
Lau Jia Jaw
Zhang Hui
Bryan Kian Hsiang Low
ALM
18
4
0
09 May 2023
Putting Natural in Natural Language Processing
Grzegorz Chrupała
30
9
0
08 May 2023
Improving Cross-Task Generalization with Step-by-Step Instructions
Yang Wu
Yanyan Zhao
Zhongyang Li
Bing Qin
Kai Xiong
LRM
ALM
24
9
0
08 May 2023
Large Language Models in Sport Science & Medicine: Opportunities, Risks and Considerations
M. Connor
Michael OÑeill
LM&MA
16
6
0
05 May 2023
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
Patrick Fernandes
Aman Madaan
Emmy Liu
António Farinhas
Pedro Henrique Martins
...
José G. C. de Souza
Shuyan Zhou
Tongshuang Wu
Graham Neubig
André F. T. Martins
ALM
117
56
0
01 May 2023
ChatGPT vs State-of-the-Art Models: A Benchmarking Study in Keyphrase Generation Task
Roberto Martínez-Cruz
Alvaro J. López-López
J. Portela
42
18
0
27 Apr 2023
SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
Bin Wang
Xinnian Liang
Jian Yang
Huijia Huang
Shuangzhi Wu
Peihao Wu
Lu Lu
Zejun Ma
Zhoujun Li
LLMAG
KELM
RALM
98
26
0
26 Apr 2023
Boosting Theory-of-Mind Performance in Large Language Models via Prompting
Shima Rahimi Moghaddam
C. Honey
LLMAG
LRM
AI4CE
24
79
0
22 Apr 2023
Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers
Felipe Urrutia
R. Araya
34
3
0
21 Apr 2023
In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
Xinyue Shen
Zhenpeng Chen
Michael Backes
Yang Zhang
27
57
0
18 Apr 2023
A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models
Qianqian Xie
Zheheng Luo
Benyou Wang
Sophia Ananiadou
LM&MA
VLM
34
8
0
18 Apr 2023
Tool Learning with Foundation Models
Yujia Qin
Shengding Hu
Yankai Lin
Weize Chen
Ning Ding
...
Cheng Yang
Tongshuang Wu
Heng Ji
Zhiyuan Liu
Maosong Sun
42
200
0
17 Apr 2023
A Comparative Study between Full-Parameter and LoRA-based Fine-Tuning on Chinese Instruction Data for Instruction Following Large Language Model
Xianghui Sun
Yunjie Ji
Baochang Ma
Xiangang Li
ALM
13
18
0
17 Apr 2023
Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation
Yunjie Ji
Yan Gong
Yong Deng
Yiping Peng
Qiang Niu
Baochang Ma
Xiangang Li
ALM
ELM
30
22
0
16 Apr 2023
OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Andreas Kopf
Yannic Kilcher
Dimitri von Rutte
Sotiris Anagnostidis
Zhi Rui Tam
...
Arnav Dantuluri
Andrew Maguire
Christoph Schuhmann
Huu Nguyen
A. Mattick
ALM
LM&MA
65
591
0
14 Apr 2023
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
Hanze Dong
Wei Xiong
Deepanshu Goyal
Yihan Zhang
Winnie Chow
Rui Pan
Shizhe Diao
Jipeng Zhang
Kashun Shum
Tong Zhang
ALM
18
410
0
13 Apr 2023
ChatGPT Needs SPADE (Sustainability, PrivAcy, Digital divide, and Ethics) Evaluation: A Review
Sunder Ali Khowaja
P. Khuwaja
K. Dev
Weizheng Wang
Lewis Nkenyereye
29
76
0
13 Apr 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
63
325
0
12 Apr 2023
RRHF: Rank Responses to Align Language Models with Human Feedback without tears
Zheng Yuan
Hongyi Yuan
Chuanqi Tan
Wei Wang
Songfang Huang
Feiran Huang
ALM
45
348
0
11 Apr 2023
Learning a Universal Human Prior for Dexterous Manipulation from Human Preference
Zihan Ding
Yuanpei Chen
Allen Z. Ren
S. Gu
Qianxu Wang
Hao Dong
Chi Jin
39
8
0
10 Apr 2023
VOICE: Visual Oracle for Interaction, Conversation, and Explanation
Donggang Jia
Alexandra Irger
Lonni Besancon
Ondrej Strnad
Deng Luo
Johanna Björklund
Anders Ynnerman
I. Viola
35
2
0
08 Apr 2023
Towards Interpretable Mental Health Analysis with Large Language Models
Kailai Yang
Shaoxiong Ji
Tianlin Zhang
Qianqian Xie
Zi-Zhou Kuang
Sophia Ananiadou
ELM
AI4MH
LRM
35
59
0
06 Apr 2023
Human-like Summarization Evaluation with ChatGPT
Mingqi Gao
Jie Ruan
Renliang Sun
Xunjian Yin
Shiping Yang
Xiaojun Wan
ALM
AI4MH
29
125
0
05 Apr 2023
REFINER: Reasoning Feedback on Intermediate Representations
Debjit Paul
Mete Ismayilzada
Maxime Peyrard
Beatriz Borges
Antoine Bosselut
Robert West
Boi Faltings
ReLM
LRM
41
171
0
04 Apr 2023
Cross-Domain Image Captioning with Discriminative Finetuning
Roberto Dessì
Michele Bevilacqua
Eleonora Gualdoni
Nathanaël Carraz Rakotonirina
Francesca Franzon
Marco Baroni
CLIP
27
19
0
04 Apr 2023
Eight Things to Know about Large Language Models
Sam Bowman
ALM
27
113
0
02 Apr 2023
Towards Healthy AI: Large Language Models Need Therapists Too
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
Kush R. Varshney
AI4MH
37
19
0
02 Apr 2023
Pair Programming with Large Language Models for Sampling and Estimation of Copulas
Jan Górecki
LLMAG
28
1
0
31 Mar 2023
Self-Refine: Iterative Refinement with Self-Feedback
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
...
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM
LRM
DiffM
56
1,459
0
30 Mar 2023
Training Language Models with Language Feedback at Scale
Jérémy Scheurer
Jon Ander Campos
Tomasz Korbak
Jun Shern Chan
Angelica Chen
Kyunghyun Cho
Ethan Perez
ALM
50
103
0
28 Mar 2023
Improving Code Generation by Training with Natural Language Feedback
Angelica Chen
Jérémy Scheurer
Tomasz Korbak
Jon Ander Campos
Jun Shern Chan
Samuel R. Bowman
Kyunghyun Cho
Ethan Perez
SyDa
ALM
AI4CE
39
76
0
28 Mar 2023
On the Creativity of Large Language Models
Giorgio Franceschelli
Mirco Musolesi
74
54
0
27 Mar 2023
MGTBench: Benchmarking Machine-Generated Text Detection
Xinlei He
Xinyue Shen
Zhenpeng Chen
Michael Backes
Yang Zhang
DeLMO
63
101
0
26 Mar 2023
Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases
Yunjie Ji
Yong Deng
Yan Gong
Yiping Peng
Qiang Niu
Lefei Zhang
Baochang Ma
Xiangang Li
ALM
27
93
0
26 Mar 2023
SmartBook: AI-Assisted Situation Report Generation for Intelligence Analysts
R. Reddy
Daniel Lee
Yi R. Fung
Khanh Duy Nguyen
Qi Zeng
Manling Li
Ziqi Wang
Clare R. Voss
Heng Ji
20
6
0
25 Mar 2023
SPEC: Summary Preference Decomposition for Low-Resource Abstractive Summarization
Yi-Syuan Chen
Yun-Zhu Song
Hong-Han Shuai
33
6
0
24 Mar 2023
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Renze Lou
Kai Zhang
Wenpeng Yin
ALM
LRM
35
20
0
18 Mar 2023
Blind Multimodal Quality Assessment of Low-light Images
Miaohui Wang
Zhuowei Xu
Mai Xu
Weisi Lin
41
2
0
18 Mar 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
Shu Zhen Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
...
Haiquan Wang
Silvio Savarese
Stefano Ermon
Caiming Xiong
Ran Xu
20
105
0
16 Mar 2023
Robot Navigation in Risky, Crowded Environments: Understanding Human Preferences
A. Suresh
Angelique Taylor
L. Riek
Sonia Martínez
23
8
0
15 Mar 2023
Previous
1
2
3
...
24
25
26
27
28
29
Next