Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,388 papers shown
Title
GPT Can Solve Mathematical Problems Without a Calculator
Zhiyong Yang
Ming Ding
Qingsong Lv
Zhihuan Jiang
Zehai He
Yuyi Guo
Jinfeng Bai
Jie Tang
RALM
LRM
116
56
0
06 Sep 2023
Certifying LLM Safety against Adversarial Prompting
Aounon Kumar
Chirag Agarwal
Suraj Srinivas
Aaron Jiaxun Li
Soheil Feizi
Himabindu Lakkaraju
AAML
157
197
0
06 Sep 2023
Enhance Multi-domain Sentiment Analysis of Review Texts through Prompting Strategies
Yajing Wang
Zongwei Luo
LRM
40
5
0
05 Sep 2023
Bias Testing and Mitigation in LLM-based Code Generation
Dong Huang
Qingwen Bu
Jie M. Zhang
Xiaofei Xie
Junjie Chen
Heming Cui
128
27
0
03 Sep 2023
Generative Social Choice
Sara Fish
Paul Gölz
David C. Parkes
Ariel D. Procaccia
Gili Rusak
Itai Shapira
Manuel Wüthrich
116
38
0
03 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks
Sarthak Anand
60
0
0
02 Sep 2023
Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior
Ashmit Khandelwal
Aditya Agrawal
Aanisha Bhattacharyya
Yaman Kumar Singla
Somesh Singh
...
Ishita Dasgupta
Stefano Petrangeli
R. Shah
Changyou Chen
Balaji Krishnamurthy
81
8
0
01 Sep 2023
FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking
Tsun-hin Cheung
K. Lam
KELM
HILM
LRM
77
34
0
01 Sep 2023
Is the U.S. Legal System Ready for AI's Challenges to Human Values?
Inyoung Cheong
Aylin Caliskan
Tadayoshi Kohno
SILM
ELM
AILaw
70
1
0
30 Aug 2023
Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models
Qingyue Wang
Y. Fu
Yanan Cao
Zhiliang Tian
Shi Wang
Dacheng Tao
LLMAG
KELM
RALM
182
29
0
29 Aug 2023
On Reward Structures of Markov Decision Processes
Falcon Z. Dai
34
1
0
28 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
205
13
0
28 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
282
31
0
27 Aug 2023
Rethinking Language Models as Symbolic Knowledge Graphs
Vishwas Mruthyunjaya
Pouya Pezeshkpour
Estevam R. Hruschka
Nikita Bhutani
ELM
ALM
39
12
0
25 Aug 2023
EntropyRank: Unsupervised Keyphrase Extraction via Side-Information Optimization for Language Model-based Text Compression
Alexander Tsvetkov
A. Kipnis
33
4
0
25 Aug 2023
DARWIN Series: Domain Specific Large Language Models for Natural Science
Tong Xie
Yuwei Wan
Wei-Ping Huang
Zhenyu Yin
Yixuan Liu
...
Chunyu Kit
Clara Grazian
Wenjie Zhang
Imran Razzak
B. Hoex
ELM
ALM
AI4CE
52
30
0
25 Aug 2023
Large Language Models in Analyzing Crash Narratives -- A Comparative Study of ChatGPT, BARD and GPT-4
M. Mumtarin
Md. Samiullah Chowdhury
Jonathan S. Wood
59
11
0
25 Aug 2023
Harnessing the Power of David against Goliath: Exploring Instruction Data Generation without Using Closed-Source Models
Yue Wang
Xinrui Wang
Juntao Li
Jinxiong Chang
Qishen Zhang
Zhongyi Liu
Guannan Zhang
Min Zhang
ALM
39
6
0
24 Aug 2023
Mind vs. Mouth: On Measuring Re-judge Inconsistency of Social Bias in Large Language Models
Yachao Zhao
Bo Wang
Dongming Zhao
Kun Huang
Yan Wang
Ruifang He
Yuexian Hou
97
4
0
24 Aug 2023
Aligning Language Models with Offline Learning from Human Feedback
Jian Hu
Li Tao
J. Yang
Chandler Zhou
ALM
OffRL
90
7
0
23 Aug 2023
Prompt-Based Length Controlled Generation with Reinforcement Learning
Renlong Jie
Xiaojun Meng
Lifeng Shang
Xin Jiang
Qun Liu
89
11
0
23 Aug 2023
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Junyi Chen
Longteng Guo
Jianxiang Sun
Shuai Shao
Zehuan Yuan
Liang Lin
Dongyu Zhang
MLLM
VLM
MoE
77
10
0
23 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
195
19
0
23 Aug 2023
Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models
Mohamed S. Elaraby
Mengyin Lu
Jacob Dunn
Xueying Zhang
Yu Wang
Shizhu Liu
Pingchuan Tian
Yuping Wang
Yuxuan Wang
HILM
109
27
0
22 Aug 2023
ViCo: Engaging Video Comment Generation with Human Preference Rewards
Yuchong Sun
Bei Liu
Xu Chen
Ruihua Song
Jianlong Fu
VGen
61
2
0
22 Aug 2023
Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models
Bilgehan Sel
Ahmad S. Al-Tawaha
Vanshaj Khattar
R. Jia
Ming Jin
LM&Ro
LRM
98
70
0
20 Aug 2023
Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?
Kai Sun
Yongjun Xu
Hanwen Zha
Yue Liu
Xinhsuai Dong
AI4MH
140
148
0
20 Aug 2023
ExpeL: LLM Agents Are Experiential Learners
Andrew Zhao
Daniel Huang
Quentin Xu
Matthieu Lin
Yang Liu
Gao Huang
LLMAG
123
227
0
20 Aug 2023
PUMGPT: A Large Vision-Language Model for Product Understanding
Wei Xue
Zongyi Guo
Baoliang Cui
Zengming Tang
Weiwei Zhang
Haihong Tang
Shuhui Wu
Weiming Lu
VLM
72
2
0
18 Aug 2023
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Haipeng Luo
Qingfeng Sun
Can Xu
Pu Zhao
Jian-Guang Lou
...
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
LRM
OSLM
303
468
0
18 Aug 2023
Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language Tasks
Fawaz Sammani
Nikos Deligiannis
53
5
0
17 Aug 2023
An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning
Yun Luo
Zhen Yang
Fandong Meng
Yafu Li
Jie Zhou
Yue Zhang
CLL
KELM
213
319
0
17 Aug 2023
Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation
Xinshuo Hu
Dongfang Li
Baotian Hu
Zihao Zheng
Zhenyu Liu
Hao Fei
KELM
MU
96
30
0
16 Aug 2023
Better Zero-Shot Reasoning with Role-Play Prompting
Aobo Kong
Shiwan Zhao
Hao Chen
Qicheng Li
Yong Qin
Ruiqi Sun
Xiaoxia Zhou
Enzhi Wang
Xiaohang Dong
ReLM
LLMAG
LRM
101
179
0
15 Aug 2023
AIGC In China: Current Developments And Future Outlook
Xiangyu Li
Yuqing Fan
S. Cheng
43
11
0
14 Aug 2023
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
132
3
0
13 Aug 2023
A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control
Marshall Wang
John Willes
Thomas Jiralerspong
Matin Moezzi
OffRL
AI4CE
54
2
0
10 Aug 2023
CLEVA: Chinese Language Models EVAluation Platform
Yanyang Li
Jianqiao Zhao
Duo Zheng
Zi-Yuan Hu
Zhi Chen
...
Yongfeng Huang
Shijia Huang
Dahua Lin
Michael R. Lyu
Liwei Wang
ALM
ELM
103
11
0
09 Aug 2023
Learning Evaluation Models from Large Language Models for Sequence Generation
Chenglong Wang
Hang Zhou
Kai-Chun Chang
Tongran Liu
Chunliang Zhang
Quan Du
Tong Xiao
Yue Zhang
Jingbo Zhu
ELM
163
4
0
08 Aug 2023
Revisiting Prompt Engineering via Declarative Crowdsourcing
Aditya G. Parameswaran
Shreya Shankar
Parth Asawa
Naman Jain
Yujie Wang
86
21
0
07 Aug 2023
GPTScan: Detecting Logic Vulnerabilities in Smart Contracts by Combining GPT with Program Analysis
Yuqiang Sun
Daoyuan Wu
Yue Xue
Hangbo Liu
Haijun Wang
Yulong Shen
Xiaofei Xie
Yang Liu
79
96
0
07 Aug 2023
SAPIEN: Affective Virtual Agents Powered by Large Language Models
Masum Hasan
Cengiz Ozel
Sammy Potter
E. Hoque
VLM
LLMAG
74
8
0
06 Aug 2023
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from Text
Nandana Mihindukulasooriya
Sanju Tiwari
Carlos F. Enguix
K. Lata
91
62
0
04 Aug 2023
ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation
Chenglong Wang
Hang Zhou
Yimin Hu
Yi Huo
Bei Li
Tongran Liu
Tong Xiao
Jingbo Zhu
81
9
0
04 Aug 2023
From Prompt Injections to SQL Injection Attacks: How Protected is Your LLM-Integrated Web Application?
Rodrigo Pedro
Daniel Castro
Paulo Carreira
Nuno Santos
SILM
AAML
136
57
0
03 Aug 2023
Curricular Transfer Learning for Sentence Encoded Tasks
Jader Martins Camboim de Sá
Matheus Ferraroni Sanches
R. R. Souza
Júlio Cesar dos Reis
Leandro A. Villas
78
0
0
03 Aug 2023
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales
Z. Yao
Reza Yazdani Aminabadi
Olatunji Ruwase
Samyam Rajbhandari
Xiaoxia Wu
...
Heyang Qin
Masahiro Tanaka
Shuai Che
Shuaiwen Leon Song
Yuxiong He
ALM
OffRL
114
74
0
02 Aug 2023
Teaching Smaller Language Models To Generalise To Unseen Compositional Questions
Tim Hartill
N. Tan
Michael Witbrock
Patricia J. Riddle
ReLM
KELM
LRM
86
2
0
02 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
87
39
0
02 Aug 2023
FinVis-GPT: A Multimodal Large Language Model for Financial Chart Analysis
Ziao Wang
Yuhang Li
Junda Wu
Jaehyeon Soon
Xiaofeng Zhang
MLLM
59
18
0
31 Jul 2023
Previous
1
2
3
...
114
115
116
...
126
127
128
Next