Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,392 papers shown
Title
Rethinking Image Editing Detection in the Era of Generative AI Revolution
Zhihao Sun
Haipeng Fang
Xinying Zhao
Danding Wang
Juan Cao
99
10
0
29 Nov 2023
StructRe: Rewriting for Structured Shape Modeling
Jiepeng Wang
Hao Pan
Yang Liu
Xin Tong
Taku Komura
Wenping Wang
133
1
0
29 Nov 2023
Critical Influence of Overparameterization on Sharpness-aware Minimization
Sungbin Shin
Dongyeop Lee
Maksym Andriushchenko
Namhoon Lee
AAML
164
2
0
29 Nov 2023
Digital Twin-Enhanced Deep Reinforcement Learning for Resource Management in Networks Slicing
Zhengming Zhang
Yongming Huang
Cheng Zhang
Qingbi Zheng
Luxi Yang
Xiaohu You
65
15
0
28 Nov 2023
CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models
Yuhang Wang
Yanxu Zhu
Chao Kong
Shuyu Wei
Xiaoyuan Yi
Xing Xie
Jitao Sang
ALM
VLM
ELM
66
8
0
28 Nov 2023
End-to-End Breast Cancer Radiotherapy Planning via LMMs with Consistency Embedding
Kwanyoung Kim
Y. Oh
S. Park
H. Byun
Joongyo Lee
Jin Sung Kim
Yong Bae Kim
Jong Chul Ye
126
0
0
27 Nov 2023
Potential Societal Biases of ChatGPT in Higher Education: A Scoping Review
Ming Li
Ariunaa Enkhtur
B. Yamamoto
Fei Cheng
Lilan Chen
AI4CE
141
7
0
24 Nov 2023
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
Weijia Wu
Zhuang Li
Yefei He
Mike Zheng Shou
Chunhua Shen
Lele Cheng
Yan Li
Yan Li
Di Zhang
VLM
237
25
0
24 Nov 2023
PrivateLoRA For Efficient Privacy Preserving LLM
Yiming Wang
Yu Lin
Xiaodong Zeng
Guannan Zhang
109
14
0
23 Nov 2023
Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions
S. Cao
Jiajie Zhang
Jiaxin Shi
Xin Lv
Zijun Yao
Qingwen Tian
Juanzi Li
Lei Hou
LRM
91
19
0
23 Nov 2023
RLIF: Interactive Imitation Learning as Reinforcement Learning
Jianlan Luo
Perry Dong
Yuexiang Zhai
Yi-An Ma
Sergey Levine
OffRL
128
18
0
21 Nov 2023
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Jiaxi Lv
Yi Huang
Mingfu Yan
Jiancheng Huang
Jianzhuang Liu
Yifan Liu
Yafei Wen
Xiaoxin Chen
Shifeng Chen
VGen
DiffM
122
25
0
21 Nov 2023
LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
Han Guo
P. Greengard
Eric P. Xing
Yoon Kim
MQ
146
57
0
20 Nov 2023
LLM aided semi-supervision for Extractive Dialog Summarization
Nishant Mishra
Gaurav Sahu
Iacer Calixto
Ameen Abu-Hanna
The Netherlands
95
3
0
19 Nov 2023
Graph Elicitation for Guiding Multi-Step Reasoning in Large Language Models
Jinyoung Park
Ameen Patel
Omar Zia Khan
Hyunwoo J. Kim
Jooyeon Kim
KELM
LRM
ReLM
78
4
0
16 Nov 2023
RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models
Jiong Wang
Junlin Wu
Muhao Chen
Yevgeniy Vorobeychik
Chaowei Xiao
AAML
104
15
0
16 Nov 2023
MacGyver: Are Large Language Models Creative Problem Solvers?
Yufei Tian
Abhilasha Ravichander
Lianhui Qin
Ronan Le Bras
Raja Marjieh
Nanyun Peng
Yejin Choi
Thomas Griffiths
Faeze Brahman
AI4CE
LLMAG
123
14
0
16 Nov 2023
Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification
Haoqiang Kang
Juntong Ni
Huaxiu Yao
HILM
LRM
113
37
0
15 Nov 2023
Speculative Contrastive Decoding
Hongyi Yuan
Keming Lu
Fei Huang
Zheng Yuan
Chang Zhou
76
6
0
15 Nov 2023
Towards Long-term Annotators: A Supervised Label Aggregation Baseline
Haoyu Liu
Fei-Yue Wang
Minmin Lin
Runze Wu
Renyu Zhu
Shiwei Zhao
Kai Wang
Tangjie Lv
Changjie Fan
76
1
0
15 Nov 2023
When Large Language Models contradict humans? Large Language Models' Sycophantic Behaviour
Leonardo Ranaldi
Giulia Pucci
82
34
0
15 Nov 2023
Selecting Shots for Demographic Fairness in Few-Shot Learning with Large Language Models
Carlos Alejandro Aguirre
Kuleen Sasse
Isabel Cachola
Mark Dredze
107
2
0
14 Nov 2023
A Material Lens on Coloniality in NLP
William B. Held
Camille Harris
Michael Best
Diyi Yang
98
14
0
14 Nov 2023
Predicting Text Preference Via Structured Comparative Reasoning
Jing Nathan Yan
Tianqi Liu
Justin T Chiu
Jiaming Shen
Zhen Qin
...
Charumathi Lakshmanan
Y. Kurzion
Alexander M. Rush
Jialu Liu
Michael Bendersky
LRM
98
7
0
14 Nov 2023
Fair Abstractive Summarization of Diverse Perspectives
Yusen Zhang
Nan Zhang
Yixin Liu
Alexander R. Fabbri
Junru Liu
...
Caiming Xiong
Jieyu Zhao
Dragomir R. Radev
Kathleen McKeown
Rui Zhang
82
11
0
14 Nov 2023
In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax
Aaron Mueller
Albert Webson
Jackson Petty
Tal Linzen
ReLM
LRM
103
16
0
13 Nov 2023
LM-Polygraph: Uncertainty Estimation for Language Models
Ekaterina Fadeeva
Roman Vashurin
Akim Tsvigun
Artem Vazhentsev
Sergey Petrakov
...
Elizaveta Goncharova
Alexander Panchenko
Maxim Panov
Timothy Baldwin
Artem Shelmanov
62
69
0
13 Nov 2023
Language Models can be Logical Solvers
Jiazhan Feng
Ruochen Xu
Junheng Hao
Hiteshi Sharma
Yelong Shen
Dongyan Zhao
Weizhu Chen
ReLM
LRM
ELM
107
27
0
10 Nov 2023
Making LLMs Worth Every Penny: Resource-Limited Text Classification in Banking
Lefteris Loukas
Ilias Stogiannidis
Odysseas Diamantopoulos
Prodromos Malakasiotis
Stavros Vassos
110
51
0
10 Nov 2023
ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences
Yuanhe Tian
Ruyi Gan
Yan Song
Jiaxing Zhang
Yongdong Zhang
AI4MH
AI4CE
LM&MA
129
41
0
10 Nov 2023
Deep Natural Language Feature Learning for Interpretable Prediction
Felipe Urrutia
Cristian Buc
Valentin Barriere
80
2
0
09 Nov 2023
FigStep: Jailbreaking Large Vision-Language Models via Typographic Visual Prompts
Yichen Gong
Delong Ran
Jinyuan Liu
Conglei Wang
Tianshuo Cong
Anyu Wang
Sisi Duan
Xiaoyun Wang
MLLM
240
161
0
09 Nov 2023
Challenging the Validity of Personality Tests for Large Language Models
Tom Sühr
Florian E. Dorner
Samira Samadi
Augustin Kelava
34
10
0
09 Nov 2023
Chain of Images for Intuitively Reasoning
Fanxu Meng
Haotong Yang
Yiding Wang
Muhan Zhang
LRM
78
10
0
09 Nov 2023
GRASP: A Disagreement Analysis Framework to Assess Group Associations in Perspectives
Vinodkumar Prabhakaran
Christopher Homan
Lora Aroyo
Aida Mostafazadeh Davani
Alicia Parrish
Alex S. Taylor
Mark Díaz
Ding Wang
Greg Serapio-García
106
9
0
09 Nov 2023
LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language Models
Jianxin Yang
43
6
0
08 Nov 2023
ADaPT: As-Needed Decomposition and Planning with Language Models
Archiki Prasad
Alexander Koller
Mareike Hartmann
Peter Clark
Ashish Sabharwal
Mohit Bansal
Tushar Khot
LM&Ro
99
93
0
08 Nov 2023
Successor Features for Efficient Multisubject Controlled Text Generation
Mengyao Cao
Mehdi Fatemi
Jackie Chi Kit Cheung
Samira Shabanian
BDL
89
0
0
03 Nov 2023
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection
Indira Sen
Dennis Assenmacher
Mattia Samory
Isabelle Augenstein
Wil M.P. van der Aalst
Claudia Wagner
91
21
0
02 Nov 2023
The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization
Sian Gooding
Hassan Mansoor
42
2
0
02 Nov 2023
Multimodal Foundation Models for Zero-shot Animal Species Recognition in Camera Trap Images
Zalan Fabian
Zhongqi Miao
Chunyuan Li
Yuanhan Zhang
Ziwei Liu
...
Laura Siabatto
Andrés Link
Pablo Arbelaez
Rahul Dodhia
J. L. Ferres
111
11
0
02 Nov 2023
LLM4Drive: A Survey of Large Language Models for Autonomous Driving
Zhenjie Yang
Xiaosong Jia
Hongyang Li
Junchi Yan
ELM
154
122
0
02 Nov 2023
Emotion Detection for Misinformation: A Review
Zhiwei Liu
Tianlin Zhang
Kailai Yang
Paul Thompson
Zeping Yu
Sophia Ananiadou
110
35
0
01 Nov 2023
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models
Xinwei Wu
Junzhuo Li
Minghui Xu
Weilong Dong
Shuangzhi Wu
Chao Bian
Deyi Xiong
MU
KELM
108
55
0
31 Oct 2023
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Allen Nie
Yuhui Zhang
Atharva Amdekar
Chris Piech
Tatsunori Hashimoto
Tobias Gerstenberg
84
40
0
30 Oct 2023
CoLLM: Integrating Collaborative Embeddings into Large Language Models for Recommendation
Yang Zhang
Fuli Feng
Jizhi Zhang
Keqin Bao
Qifan Wang
Xiangnan He
97
88
0
30 Oct 2023
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
Zelai Xu
Chao Yu
Fei Fang
Yu Wang
Yi Wu
LLMAG
133
95
0
29 Oct 2023
Are NLP Models Good at Tracing Thoughts: An Overview of Narrative Understanding
Lixing Zhu
Runcong Zhao
Lin Gui
Yulan He
82
5
0
28 Oct 2023
Expanding the Set of Pragmatic Considerations in Conversational AI
S. M. Seals
V. Shalin
74
2
0
27 Oct 2023
Fine-Tuning Language Models Using Formal Methods Feedback
Yunhao Yang
N. Bhatt
Tyler Ingebrand
William Ward
Steven Carr
Zhangyang Wang
Ufuk Topcu
69
9
0
27 Oct 2023
Previous
1
2
3
...
110
111
112
...
126
127
128
Next