Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 7,311 papers shown
Title
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
Rui Pan
Xiang Liu
Shizhe Diao
Renjie Pi
Jipeng Zhang
Chi Han
Tong Zhang
48
38
0
26 Mar 2024
Assessment of Multimodal Large Language Models in Alignment with Human Values
Zhelun Shi
Zhipin Wang
Hongxing Fan
Zaibin Zhang
Lijun Li
Yongting Zhang
Zhen-fei Yin
Lu Sheng
Yu Qiao
Jing Shao
47
16
0
26 Mar 2024
Continual Few-shot Event Detection via Hierarchical Augmentation Networks
Chenlong Zhang
Pengfei Cao
Yubo Chen
Kang Liu
Qing Cui
Mengshu Sun
Jun Zhao
48
3
0
26 Mar 2024
Language Models for Text Classification: Is In-Context Learning Enough?
A. Edwards
Jose Camacho-Collados
LRM
49
18
0
26 Mar 2024
RuBia: A Russian Language Bias Detection Dataset
Veronika Grigoreva
Anastasiia Ivanova
I. Alimova
Ekaterina Artemova
45
1
0
26 Mar 2024
Large Language Models Are State-of-the-Art Evaluator for Grammatical Error Correction
Masamune Kobayashi
Masato Mita
Mamoru Komachi
ELM
47
3
0
26 Mar 2024
ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler
Paramita Mirza
Viju Sudhi
S. Sahoo
Sinchana Ramakanth Bhat
40
4
0
26 Mar 2024
KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion
Yilin Wang
Minghao Hu
Zhen Huang
Dongsheng Li
Dong Yang
Xicheng Lu
29
2
0
26 Mar 2024
Robust and Scalable Model Editing for Large Language Models
Yingfa Chen
Zhengyan Zhang
Xu Han
Chaojun Xiao
Zhiyuan Liu
Chen Chen
Kuai Li
Tao Yang
Maosong Sun
KELM
42
2
0
26 Mar 2024
LM-Combiner: A Contextual Rewriting Model for Chinese Grammatical Error Correction
Yixuan Wang
Baoxin Wang
Yijun Liu
Dayong Wu
Wanxiang Che
KELM
54
1
0
26 Mar 2024
ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales?
Fan Huang
Haewoon Kwak
Kunwoo Park
Jisun An
ALM
ELM
AI4MH
45
12
0
26 Mar 2024
Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models
Zhiyuan Yu
Xiaogeng Liu
Shunning Liang
Zach Cameron
Chaowei Xiao
Ning Zhang
35
43
0
26 Mar 2024
The Pursuit of Fairness in Artificial Intelligence Models: A Survey
Tahsin Alamgir Kheya
Mohamed Reda Bouadjenek
Sunil Aryal
41
8
0
26 Mar 2024
DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion
Yuanze Lin
Ronald Clark
Philip Torr
3DGS
39
10
0
25 Mar 2024
Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large Language Model
Braja Gopal Patra
L. Lepow
Praneet Kasi Reddy Jagadeesh Kumar
V. Vekaria
M. M. Sharma
...
Myrna Weissman
M. Olfson
J. Mann
Alexander W. Charney
Jyoti Pathak
AI4MH
21
3
0
25 Mar 2024
Reflecting the Male Gaze: Quantifying Female Objectification in 19th and 20th Century Novels
Kexin Luo
Yue Mao
Bei Zhang
Sophie Hao
40
1
0
25 Mar 2024
MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
Kailai Yang
Zhiwei Liu
Qianqian Xie
Jimin Huang
Tianlin Zhang
Sophia Ananiadou
37
15
0
25 Mar 2024
The Strong Pull of Prior Knowledge in Large Language Models and Its Impact on Emotion Recognition
Georgios Chochlakis
Alexandros Potamianos
Kristina Lerman
Shrikanth Narayanan
42
5
0
25 Mar 2024
GOLF: Goal-Oriented Long-term liFe tasks supported by human-AI collaboration
Ben Wang
30
0
0
25 Mar 2024
Investigation of the effectiveness of applying ChatGPT in Dialogic Teaching Using Electroencephalography
Jiayue Zhang
Yi-Hsueh Liu
Wenqi Cai
Lanlan Wu
Yali Peng
Jingjing Yu
Senqing Qi
Taotao Long
Bao Ge
21
3
0
25 Mar 2024
CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment
Feiteng Fang
Liang Zhu
Min Yang
Xi Feng
Jinchang Hou
Qixuan Zhao
Chengming Li
Xiping Hu
Ruifeng Xu
32
0
0
25 Mar 2024
Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization
Xiangxin Zhou
Dongyu Xue
Ruizhe Chen
Zaixiang Zheng
Liang Wang
Quanquan Gu
DiffM
65
20
0
25 Mar 2024
Learning To Guide Human Decision Makers With Vision-Language Models
Debodeep Banerjee
Stefano Teso
Burcu Sayin
Andrea Passerini
40
1
0
25 Mar 2024
KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models
Dongjun Jang
Sungjoo Byun
Hyemi Jo
Hyopil Shin
ALM
23
0
0
25 Mar 2024
CodeS: Natural Language to Code Repository via Multi-Layer Sketch
Daoguang Zan
Ailun Yu
Wei Liu
Dong Chen
Bo Shen
...
Bei Guan
Zhiguang Yang
Yongji Wang
Qianxiang Wang
Li-zhen Cui
35
14
0
25 Mar 2024
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
Reza Esfandiarpoor
Cristina Menghini
Stephen H. Bach
CoGe
VLM
45
8
0
25 Mar 2024
Is There a One-Model-Fits-All Approach to Information Extraction? Revisiting Task Definition Biases
Wenhao Huang
Qi He
Zhixu Li
Jiaqing Liang
Yanghua Xiao
34
2
0
25 Mar 2024
ChatGPT Incorrectness Detection in Software Reviews
M. Tanzil
Junaed Younus Khan
Gias Uddin
24
4
0
25 Mar 2024
Enhanced Facet Generation with LLM Editing
Joosung Lee
Jinhong Kim
29
2
0
25 Mar 2024
AIOS: LLM Agent Operating System
Kai Mei
Zelong Li
Wujiang Xu
Wenyue Hua
Mingyu Jin
Yongfeng Zhang
Shuyuan Xu
Ruosong Ye
Yingqiang Ge
Yongfeng Zhang
LLMAG
30
17
0
25 Mar 2024
Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling
Yida Mu
Chun Dong
Kalina Bontcheva
Xingyi Song
31
19
0
24 Mar 2024
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models
Zequan Liu
Jiawen Lyn
Wei-wei Zhu
Xing Tian
Yvette Graham
OffRL
45
10
0
24 Mar 2024
Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models
Minchan Kim
Minyeong Kim
Junik Bae
Suhwan Choi
Sungkyung Kim
Buru Chang
VLM
32
4
0
24 Mar 2024
A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish
Masahiro Kaneko
Timothy Baldwin
PILM
36
3
0
24 Mar 2024
WangchanLion and WangchanX MRC Eval
Wannaphong Phatthiyaphaibun
Surapon Nonesung
Patomporn Payoungkhamdee
Peerat Limkonchotiwat
Can Udomcharoenchaikit
Jitkapat Sawatphol
Chompakorn Chaksangchaichot
Ekapol Chuangsuwanich
Sarana Nutanong
62
0
0
24 Mar 2024
Opportunities and challenges in the application of large artificial intelligence models in radiology
Liangrui Pan
Zhenyu Zhao
Ying Lu
Kewei Tang
Liyong Fu
Qingchun Liang
Shaoliang Peng
LM&MA
MedIm
AI4CE
47
5
0
24 Mar 2024
Argument Quality Assessment in the Age of Instruction-Following Large Language Models
Henning Wachsmuth
Gabriella Lapesa
Elena Cabrio
Anne Lauscher
Joonsuk Park
Eva Maria Vecchi
S. Villata
Timon Ziegenbein
39
0
0
24 Mar 2024
Monotonic Paraphrasing Improves Generalization of Language Model Prompting
Qin Liu
Fei Wang
Nan Xu
Tianyi Yan
Tao Meng
Muhao Chen
LRM
45
7
0
24 Mar 2024
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
Shengyi Huang
Michael Noukhovitch
Arian Hosseini
Kashif Rasul
Weixun Wang
Lewis Tunstall
VLM
30
31
0
24 Mar 2024
LlamBERT: Large-scale low-cost data annotation in NLP
Bálint Csanády
Lajos Muzsai
Péter Vedres
Zoltán Nádasdy
András Lukács
56
6
0
23 Mar 2024
Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition
Shijian Deng
Erin E. Kosloski
Siddhi Patel
Zeke A. Barnett
Yiyang Nan
...
William T. Doan
Matthew Wang
Harsh Singh
P. Rollins
Yapeng Tian
39
4
0
22 Mar 2024
SensoryT5: Infusing Sensorimotor Norms into T5 for Enhanced Fine-grained Emotion Classification
Yuhan Xia
Qingqing Zhao
Yunfei Long
Ge Xu
Jia Wang
28
0
0
22 Mar 2024
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
Orion Weller
Benjamin Chang
Sean MacAvaney
Kyle Lo
Arman Cohan
Benjamin Van Durme
Dawn J Lawrie
Luca Soldaini
63
30
0
22 Mar 2024
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim
Sangyun Chung
Damin Yeom
Youngjoon Yu
Hak Gu Kim
Y. Ro
43
2
0
22 Mar 2024
Language Models in Dialogue: Conversational Maxims for Human-AI Interactions
Erik Miehling
Manish Nagireddy
P. Sattigeri
Elizabeth M. Daly
David Piorkowski
John T. Richards
ALM
44
11
0
22 Mar 2024
Argument-Aware Approach To Event Linking
I-Hung Hsu
Zihan Xue
Nilay Pochh
Sahil Bansal
Premkumar Natarajan
Jayanth Srinivasa
Nanyun Peng
39
0
0
22 Mar 2024
DP-Dueling: Learning from Preference Feedback without Compromising User Privacy
Aadirupa Saha
Hilal Asi
38
1
0
22 Mar 2024
Risk and Response in Large Language Models: Evaluating Key Threat Categories
Bahareh Harandizadeh
A. Salinas
Fred Morstatter
32
3
0
22 Mar 2024
Generative Active Learning for Image Synthesis Personalization
Xu-Lu Zhang
Wengyu Zhang
Xiao Wei
Jinlin Wu
Zhaoxiang Zhang
Zhen Lei
Qing Li
113
2
0
22 Mar 2024
Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation
Zhenrui Yue
Huimin Zeng
Yimeng Lu
Lanyu Shang
Yang Zhang
Dong Wang
RALM
OffRL
38
19
0
22 Mar 2024
Previous
1
2
3
...
76
77
78
...
145
146
147
Next