Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,395 papers shown
Title
Navigating the OverKill in Large Language Models
Chenyu Shi
Xiao Wang
Qiming Ge
Songyang Gao
Xianjun Yang
Tao Gui
Qi Zhang
Xuanjing Huang
Xun Zhao
Dahua Lin
101
13
0
31 Jan 2024
Neighboring Perturbations of Knowledge Editing on Large Language Models
Jun-Yu Ma
Zhen-Hua Ling
Ningyu Zhang
Jia-Chen Gu
KELM
82
6
0
31 Jan 2024
SPECTRUM: Speaker-Enhanced Pre-Training for Long Dialogue Summarization
Sangwoo Cho
Kaiqiang Song
Chao Zhao
Xiaoyang Wang
Dong Yu
79
0
0
31 Jan 2024
How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?
Rheeya Uppaal
Yixuan Li
Junjie Hu
159
6
0
31 Jan 2024
Weaver: Foundation Models for Creative Writing
Tiannan Wang
Jiamin Chen
Qingrui Jia
Shuai Wang
Ruoyu Fang
...
Xiaohua Xu
Ningyu Zhang
Huajun Chen
Yuchen Eleanor Jiang
Wangchunshu Zhou
99
20
0
30 Jan 2024
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks
Andy Zhou
Bo Li
Haohan Wang
AAML
143
88
0
30 Jan 2024
Rethinking Interpretability in the Era of Large Language Models
Chandan Singh
J. Inala
Michel Galley
Rich Caruana
Jianfeng Gao
LRM
AI4CE
133
72
0
30 Jan 2024
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Ansar Aynetdinov
Alan Akbik
ALM
87
12
0
30 Jan 2024
Can Large Language Models be Trusted for Evaluation? Scalable Meta-Evaluation of LLMs as Evaluators via Agent Debate
Steffi Chern
Ethan Chern
Graham Neubig
Pengfei Liu
LLMAG
ALM
ELM
46
30
0
30 Jan 2024
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models
Wai-Chung Kwan
Xingshan Zeng
Yuxin Jiang
Yufei Wang
Liangyou Li
Lifeng Shang
Xin Jiang
Qun Liu
Kam-Fai Wong
LRM
ELM
51
22
0
30 Jan 2024
Security and Privacy Challenges of Large Language Models: A Survey
B. Das
M. H. Amini
Yanzhao Wu
PILM
ELM
138
145
0
30 Jan 2024
Gradient-Based Language Model Red Teaming
Nevan Wichers
Carson E. Denison
Ahmad Beirami
78
33
0
30 Jan 2024
TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese
N. Corrêa
Sophia Falk
Shiza Fatimah
Aniket Sen
N. D. Oliveira
93
9
0
30 Jan 2024
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Shun Zhang
Zhenfang Chen
Sunli Chen
Yikang Shen
Zhiqing Sun
Chuang Gan
82
27
0
30 Jan 2024
Weak-to-Strong Jailbreaking on Large Language Models
Xuandong Zhao
Xianjun Yang
Tianyu Pang
Chao Du
Lei Li
Yu-Xiang Wang
William Y. Wang
145
62
0
30 Jan 2024
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks
Bolei Ma
Ercong Nie
Shuzhou Yuan
Helmut Schmid
Michael Farber
Frauke Kreuter
Hinrich Schütze
VLM
154
6
0
29 Jan 2024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Bin Wang
...
Conghui He
Xingcheng Zhang
Yu Qiao
Dahua Lin
Jiaqi Wang
VLM
MLLM
175
268
0
29 Jan 2024
Zero-shot Imitation Policy via Search in Demonstration Dataset
Federico Malato
Florian Leopold
Andrew Melnik
Ville Hautamaki
LM&Ro
OffRL
52
7
0
29 Jan 2024
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Pratyush Maini
Skyler Seto
Richard He Bai
David Grangier
Yizhe Zhang
Navdeep Jaitly
SyDa
92
67
0
29 Jan 2024
Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
Banghua Zhu
Michael I. Jordan
Jiantao Jiao
84
33
0
29 Jan 2024
VIALM: A Survey and Benchmark of Visually Impaired Assistance with Large Models
Yi Zhao
Yilin Zhang
Rong Xiang
Jing Li
Hillming Li
77
16
0
29 Jan 2024
KAUCUS: Knowledge Augmented User Simulators for Training Language Model Assistants
Kaustubh D. Dhole
84
3
0
29 Jan 2024
Corrective Retrieval Augmented Generation
Shi-Qi Yan
Jia-Chen Gu
Yun Zhu
Zhen-Hua Ling
RALM
250
89
0
29 Jan 2024
LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs' Vulnerability Reasoning
Yuqiang Sun
Daoyuan Wu
Yue Xue
Han Liu
Wei Ma
Lyuye Zhang
Miaolei Shi
Yingjiu Li
ELM
201
55
0
29 Jan 2024
PILOT: Legal Case Outcome Prediction with Case Law
Lang Cao
Zifeng Wang
Cao Xiao
Jimeng Sun
AILaw
ELM
72
8
0
28 Jan 2024
YODA: Teacher-Student Progressive Learning for Language Models
Jianqiao Lu
Wanjun Zhong
Yufei Wang
Zhijiang Guo
Qi Zhu
...
Baojun Wang
Yasheng Wang
Lifeng Shang
Xin Jiang
Qun Liu
LRM
106
7
0
28 Jan 2024
PRE: A Peer Review Based Large Language Model Evaluator
Zhumin Chu
Qingyao Ai
Yiteng Tu
Haitao Li
Yiqun Liu
LRM
ALM
112
21
0
28 Jan 2024
Diffusion-based Graph Generative Methods
Hongyang Chen
Can Xu
Lingyu Zheng
Qiang Zhang
Xuemin Lin
DiffM
MedIm
100
0
0
28 Jan 2024
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
Timothy Baldwin
LRM
81
33
0
28 Jan 2024
Harnessing Network Effect for Fake News Mitigation: Selecting Debunkers via Self-Imitation Learning
Xiaofei Xu
Ke Deng
Michael Dann
Xiuzhen Zhang
42
6
0
28 Jan 2024
Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain
Yiming Gao
Feiyu Liu
Liang Wang
Zhenjie Lian
Dehua Zheng
...
Jing Dai
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
82
1
0
28 Jan 2024
Quantifying Stereotypes in Language
Yang Liu
75
1
0
28 Jan 2024
DataFrame QA: A Universal LLM Framework on DataFrame Question Answering Without Data Exposure
J. Ye
Mengnan Du
Guiling Wang
LMTD
51
8
0
27 Jan 2024
Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation
Yuxin Liang
Zhuoyang Song
Hao Wang
Jiaxing Zhang
HILM
102
36
0
27 Jan 2024
An Empirical Study on Large Language Models in Accuracy and Robustness under Chinese Industrial Scenarios
Zongjie Li
Wenying Qiu
Pingchuan Ma
Yichen Li
You Li
Sijia He
Baozheng Jiang
Shuai Wang
Weixi Gu
122
2
0
27 Jan 2024
Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models
Minbyul Jeong
Jiwoong Sohn
Mujeen Sung
Jaewoo Kang
120
34
0
27 Jan 2024
Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement Learning
Md Mushfiqur Rahman
Mohammad Sabik Irbaz
Kai North
Michelle S. Williams
Marcos Zampieri
Kevin Lybarger
86
1
0
26 Jan 2024
GeoDecoder: Empowering Multimodal Map Understanding
Feng Qi
Mian Dai
Zixian Zheng
Chao Wang
90
2
0
26 Jan 2024
Design Principles for Generative AI Applications
Justin D. Weisz
Jessica He
Michael J. Muller
Gabriela Hoefer
Rachel Miles
Werner Geyer
AI4CE
92
143
0
25 Jan 2024
Wordflow: Social Prompt Engineering for Large Language Models
Zijie J. Wang
Aishwarya Chakravarthy
David Munechika
Duen Horng Chau
80
14
0
25 Jan 2024
Genie: Achieving Human Parity in Content-Grounded Datasets Generation
Asaf Yehudai
Boaz Carmeli
Y. Mass
Ofir Arviv
Nathaniel Mills
Assaf Toledo
Eyal Shnarch
Leshem Choshen
67
25
0
25 Jan 2024
Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement
Hana Kim
Kai Tzu-iunn Ong
Seoyeon Kim
Dongha Lee
Jinyoung Yeo
78
9
0
25 Jan 2024
Parameter-Efficient Conversational Recommender System as a Language Processing Task
Mathieu Ravaut
Hao Zhang
Lu Xu
Aixin Sun
Yong Liu
113
11
0
25 Jan 2024
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning
Weihao Tan
Wentao Zhang
Shanqi Liu
Longtao Zheng
Xinrun Wang
Bo An
OffRL
109
22
0
25 Jan 2024
Mapping the Design Space of Teachable Social Media Feed Experiences
K. J. Kevin Feng
Xander Koo
Lawrence Tan
Amy Bruckman
David W. McDonald
Amy X. Zhang
110
15
0
25 Jan 2024
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
Hongliang He
Wenlin Yao
Kaixin Ma
Wenhao Yu
Yong Dai
Hongming Zhang
Zhenzhong Lan
Dong Yu
LLMAG
191
151
0
25 Jan 2024
MULTIVERSE: Exposing Large Language Model Alignment Problems in Diverse Worlds
Xiaolong Jin
Zhuo Zhang
Xiangyu Zhang
39
4
0
25 Jan 2024
The Typing Cure: Experiences with Large Language Model Chatbots for Mental Health Support
Inhwa Song
Sachin R. Pendse
Neha Kumar
Munmun De Choudhury
AI4MH
79
18
0
25 Jan 2024
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
Xinghao Wang
Junliang He
Pengyu Wang
Yunhua Zhou
Tianxiang Sun
Xipeng Qiu
AI4TS
51
4
0
24 Jan 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang
Yahan Yu
Jiahua Dong
Chenxing Li
Dan Su
Chenhui Chu
Dong Yu
OffRL
LRM
174
217
0
24 Jan 2024
Previous
1
2
3
...
102
103
104
...
126
127
128
Next