Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,392 papers shown
Title
Enhancing Quantitative Reasoning Skills of Large Language Models through Dimension Perception
Yuncheng Huang
Qi He
Jiaqing Liang
Sihang Jiang
Yanghua Xiao
Yunwen Chen
LRM
95
3
0
29 Dec 2023
Overview of the PromptCBLUE Shared Task in CHIP2023
Wei-wei Zhu
Xiaoling Wang
Mosha Chen
Buzhou Tang
LM&MA
ELM
98
7
0
29 Dec 2023
Differentially Private Low-Rank Adaptation of Large Language Model Using Federated Learning
Xiao-Yang Liu
Rongyi Zhu
Daochen Zha
Jiechao Gao
Shan Zhong
Matt White
Meikang Qiu
80
26
0
29 Dec 2023
Tracking with Human-Intent Reasoning
Jiawen Zhu
Zhi-Qi Cheng
Jun-Yan He
Chenyang Li
Bin Luo
Huchuan Lu
Yifeng Geng
Xuansong Xie
LRM
VOS
87
11
0
29 Dec 2023
State Machine of Thoughts: Leveraging Past Reasoning Trajectories for Enhancing Problem Solving
Jia Liu
Jie Shuai
Xiyao Li
LRM
93
2
0
29 Dec 2023
Learning to Generate Text in Arbitrary Writing Styles
Aleem Khan
Andrew Wang
Sophia Hager
Nicholas Andrews
101
6
0
28 Dec 2023
A Simple LLM Framework for Long-Range Video Question-Answering
Ce Zhang
Taixi Lu
Md. Mohaiminul Islam
Ziyang Wang
Shoubin Yu
Mohit Bansal
Gedas Bertasius
192
92
0
28 Dec 2023
Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources
Rohan Deb
Aadirupa Saha
69
0
0
28 Dec 2023
Structured Packing in LLM Training Improves Long Context Utilization
Konrad Staniszewski
Szymon Tworkowski
Sebastian Jaszczur
Yu Zhao
Henryk Michalewski
Lukasz Kuciñski
Piotr Milo's
142
13
0
28 Dec 2023
LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation
Ruizhe Zhong
Xingbo Du
Shixiong Kai
Zhentao Tang
Siyuan Xu
Hui-Ling Zhen
Jianye Hao
Qiang Xu
Mingxuan Yuan
Junchi Yan
73
40
0
28 Dec 2023
Experiential Co-Learning of Software-Developing Agents
Cheng Qian
Yufan Dang
Jiahao Li
Wei Liu
Zihao Xie
...
Cheng Yang
Xin Cong
Xiaoyin Che
Zhiyuan Liu
Maosong Sun
LLMAG
114
47
0
28 Dec 2023
DrugAssist: A Large Language Model for Molecule Optimization
Geyan Ye
Xibao Cai
Houtim Lai
Xing Wang
Junhong Huang
Longyue Wang
Wei Liu
Xian Zeng
135
33
0
28 Dec 2023
ZONE: Zero-Shot Instruction-Guided Local Editing
Shanglin Li
Bo-Wen Zeng
Yutang Feng
Sicheng Gao
Xuhui Liu
...
Li Lin
Xu Tang
Yao Hu
Jianzhuang Liu
Baochang Zhang
DiffM
103
35
0
28 Dec 2023
Beyond Output Matching: Bidirectional Alignment for Enhanced In-Context Learning
Chengwei Qin
Wenhan Xia
Fangkai Jiao
Chen Chen
Yuchen Hu
Bosheng Ding
R. Chen
Shafiq Joty
111
7
0
28 Dec 2023
Large Language Models for Conducting Advanced Text Analytics Information Systems Research
Benjamin Ampel
Chi-Heng Yang
Junjie Hu
Hsinchun Chen
120
8
0
27 Dec 2023
Some things are more CRINGE than others: Iterative Preference Optimization with the Pairwise Cringe Loss
Jing Xu
Andrew Lee
Sainbayar Sukhbaatar
Jason Weston
80
97
0
27 Dec 2023
Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges
Qingyao Li
Lingyue Fu
Weiming Zhang
Xianyu Chen
Jingwei Yu
Wei Xia
Weinan Zhang
Ruiming Tang
Yong Yu
AI4Ed
ELM
146
24
0
27 Dec 2023
Preference as Reward, Maximum Preference Optimization with Importance Sampling
Zaifan Jiang
Xing Huang
Chao Wei
105
2
0
27 Dec 2023
Task Contamination: Language Models May Not Be Few-Shot Anymore
Changmao Li
Jeffrey Flanigan
175
104
0
26 Dec 2023
Observable Propagation: Uncovering Feature Vectors in Transformers
Jacob Dunefsky
Arman Cohan
100
2
0
26 Dec 2023
RoleEval: A Bilingual Role Evaluation Benchmark for Large Language Models
Tianhao Shen
Sun Li
Quan Tu
Deyi Xiong
LLMAG
ELM
63
9
0
26 Dec 2023
Can ChatGPT Read Who You Are?
Erik Derner
D. Kučera
Nuria Oliver
Jan Zahálka
78
7
0
26 Dec 2023
AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning from Mobile GUI
Lihang Pan
Bowen Wang
Chun Yu
Yuxuan Chen
Xiangyu Zhang
Yuanchun Shi
84
3
0
26 Dec 2023
Aligning Large Language Models with Human Preferences through Representation Engineering
Tianlong Li
Xiaohua Wang
Muling Wu
Changze Lv
Changze Lv
Zixuan Ling
Jianhao Zhu
Cenyuan Zhang
Xiaoqing Zheng
Xuanjing Huang
69
41
0
26 Dec 2023
ChartBench: A Benchmark for Complex Visual Reasoning in Charts
Zhengzhuo Xu
Sinan Du
Yiyan Qi
Chengjin Xu
Chun Yuan
Jian Guo
162
49
0
26 Dec 2023
Align on the Fly: Adapting Chatbot Behavior to Established Norms
Chunpu Xu
Steffi Chern
Ethan Chern
Ge Zhang
Zekun Wang
Ruibo Liu
Jing Li
Jie Fu
Pengfei Liu
79
20
0
26 Dec 2023
SecQA: A Concise Question-Answering Dataset for Evaluating Large Language Models in Computer Security
Zefang Liu
ELM
48
27
0
26 Dec 2023
Alleviating Hallucinations of Large Language Models through Induced Hallucinations
Yue Zhang
Leyang Cui
Wei Bi
Shuming Shi
HILM
111
57
0
25 Dec 2023
EcomGPT-CT: Continual Pre-training of E-commerce Large Language Models with Semi-structured Data
Shirong Ma
Shen Huang
Shulin Huang
Xiaobin Wang
Yangning Li
Hai-Tao Zheng
Pengjun Xie
Fei Huang
Yong Jiang
118
6
0
25 Dec 2023
Instruction Fusion: Advancing Prompt Evolution through Hybridization
Weidong Guo
Jiuding Yang
Kaitong Yang
Xiangyang Li
Zhuwei Rao
Yu-Syuan Xu
Di Niu
84
6
0
25 Dec 2023
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Wei Liu
Weihao Zeng
Keqing He
Yong Jiang
Junxian He
ALM
145
239
0
25 Dec 2023
IQAGPT: Image Quality Assessment with Vision-language and ChatGPT Models
Zhihao Chen
Bin Hu
Chuang Niu
Tao Chen
Yuxin Li
Hongming Shan
Ge Wang
LM&MA
MLLM
71
4
0
25 Dec 2023
Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation
Jiaxin Bai
Yicheng Wang
Tianshi Zheng
Yue Guo
Xin Liu
Yangqiu Song
132
7
0
25 Dec 2023
Chatbot is Not All You Need: Information-rich Prompting for More Realistic Responses
Seokhoon Jeong
Assentay Makhmud
80
1
0
25 Dec 2023
A Survey on Open-Set Image Recognition
Qiulei Dong
Qiulei Dong
BDL
ObjD
92
7
0
25 Dec 2023
YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction
Xinglin Xiao
Yijie Wang
Nan Xu
Yuqi Wang
Hanxuan Yang
Minzheng Wang
Yin Luo
Lei Wang
Wenji Mao
Daniel Zeng
57
21
0
24 Dec 2023
A Group Fairness Lens for Large Language Models
Guanqun Bi
Lei Shen
Yuqiang Xie
Yanan Cao
Tiangang Zhu
Xiao-feng He
ALM
86
4
0
24 Dec 2023
LARP: Language-Agent Role Play for Open-World Games
Ming Yan
Ruihao Li
Hao Zhang
Hao Wang
Zhilan Yang
Ji Yan
LLMAG
LM&Ro
AI4CE
86
17
0
24 Dec 2023
A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators
Chen Zhang
L. F. D’Haro
Yiming Chen
Malu Zhang
Haizhou Li
ELM
83
31
0
24 Dec 2023
Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought
Limin Zheng
Hao Fei
Fei Li
Bobo Li
Lizi Liao
Donghong Ji
Chong Teng
71
7
0
23 Dec 2023
PokeMQA: Programmable knowledge editing for Multi-hop Question Answering
Hengrui Gu
Kaixiong Zhou
Xiaotian Han
Ninghao Liu
Ruobing Wang
Xin Wang
LRM
KELM
128
27
0
23 Dec 2023
Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical Study
Mingyang Song
Xuelian Geng
Songfang Yao
Shilong Lu
Yi Feng
Liping Jing
110
6
0
23 Dec 2023
Generative AI and the History of Architecture
J. Ploennigs
Markus Berger
83
1
0
22 Dec 2023
SutraNets: Sub-series Autoregressive Networks for Long-Sequence, Probabilistic Forecasting
Shane Bergsma
Timothy J. Zeyl
Lei Guo
AI4TS
100
3
0
22 Dec 2023
YAYI 2: Multilingual Open-Source Large Language Models
Yin Luo
Qingchao Kong
Nan Xu
Jia Cao
Bao Hao
...
Zhaoxin Yu
Zhengda Luo
Wenji Mao
Lei Wang
Dajun Zeng
ALM
OSLM
73
7
0
22 Dec 2023
Plan, Posture and Go: Towards Open-World Text-to-Motion Generation
Jinpeng Liu
Wen-Dao Dai
Chunyu Wang
Yiji Cheng
Yansong Tang
Xin Tong
VGen
DiffM
125
19
0
22 Dec 2023
A Mathematical Guide to Operator Learning
Nicolas Boullé
Alex Townsend
99
47
0
22 Dec 2023
FoodLMM: A Versatile Food Assistant using Large Multi-modal Model
Yuehao Yin
Huiyan Qi
B. Zhu
Jingjing Chen
Yu-Gang Jiang
Chong-Wah Ngo
87
21
0
22 Dec 2023
Reasons to Reject? Aligning Language Models with Judgments
Weiwen Xu
Deng Cai
Zhisong Zhang
Wai Lam
Shuming Shi
ALM
91
15
0
22 Dec 2023
Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise
Boyao Wang
Yuxing Liu
Xiaoyu Wang
Tong Zhang
46
5
0
22 Dec 2023
Previous
1
2
3
...
107
108
109
...
126
127
128
Next