Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,384 papers shown
Title
Investigation of the effectiveness of applying ChatGPT in Dialogic Teaching Using Electroencephalography
Jiayue Zhang
Yi-Hsueh Liu
Wenqi Cai
Lanlan Wu
Yali Peng
Jingjing Yu
Senqing Qi
Taotao Long
Bao Ge
35
3
0
25 Mar 2024
CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment
Feiteng Fang
Liang Zhu
Min Yang
Xi Feng
Jinchang Hou
Qixuan Zhao
Chengming Li
Xiping Hu
Ruifeng Xu
52
0
0
25 Mar 2024
Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization
Xiangxin Zhou
Dongyu Xue
Ruizhe Chen
Zaixiang Zheng
Liang Wang
Quanquan Gu
DiffM
116
24
0
25 Mar 2024
Learning To Guide Human Decision Makers With Vision-Language Models
Debodeep Banerjee
Stefano Teso
Burcu Sayin
Andrea Passerini
108
1
0
25 Mar 2024
KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models
Dongjun Jang
Sungjoo Byun
Hyemi Jo
Hyopil Shin
ALM
74
0
0
25 Mar 2024
CodeS: Natural Language to Code Repository via Multi-Layer Sketch
Daoguang Zan
Ailun Yu
Wei Liu
Dong Chen
Bo Shen
...
Bei Guan
Zhiguang Yang
Yongji Wang
Qianxiang Wang
Li-zhen Cui
96
16
0
25 Mar 2024
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
Reza Esfandiarpoor
Cristina Menghini
Stephen H. Bach
CoGe
VLM
79
12
0
25 Mar 2024
Is There a One-Model-Fits-All Approach to Information Extraction? Revisiting Task Definition Biases
Wenhao Huang
Qi He
Zhixu Li
Jiaqing Liang
Yanghua Xiao
60
2
0
25 Mar 2024
ChatGPT Incorrectness Detection in Software Reviews
M. Tanzil
Junaed Younus Khan
Gias Uddin
89
4
0
25 Mar 2024
Enhanced Facet Generation with LLM Editing
Joosung Lee
Jinhong Kim
38
2
0
25 Mar 2024
AIOS: LLM Agent Operating System
Kai Mei
Zelong Li
Wujiang Xu
Wenyue Hua
Mingyu Jin
Yongfeng Zhang
Shuyuan Xu
Ruosong Ye
Yingqiang Ge
Yongfeng Zhang
LLMAG
158
25
0
25 Mar 2024
Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling
Yida Mu
Chun Dong
Kalina Bontcheva
Xingyi Song
77
25
0
24 Mar 2024
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models
Zequan Liu
Jiawen Lyn
Wei-wei Zhu
Xing Tian
Yvette Graham
OffRL
111
18
0
24 Mar 2024
Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models
Minchan Kim
Minyeong Kim
Junik Bae
Suhwan Choi
Sungkyung Kim
Buru Chang
VLM
45
4
0
24 Mar 2024
A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish
Masahiro Kaneko
Timothy Baldwin
PILM
76
4
0
24 Mar 2024
WangchanLion and WangchanX MRC Eval
Wannaphong Phatthiyaphaibun
Surapon Nonesung
Patomporn Payoungkhamdee
Peerat Limkonchotiwat
Can Udomcharoenchaikit
Jitkapat Sawatphol
Chompakorn Chaksangchaichot
Ekapol Chuangsuwanich
Sarana Nutanong
108
0
0
24 Mar 2024
Opportunities and challenges in the application of large artificial intelligence models in radiology
Liangrui Pan
Zhenyu Zhao
Ying Lu
Kewei Tang
Liyong Fu
Qingchun Liang
Shaoliang Peng
LM&MA
MedIm
AI4CE
83
6
0
24 Mar 2024
Argument Quality Assessment in the Age of Instruction-Following Large Language Models
Henning Wachsmuth
Gabriella Lapesa
Elena Cabrio
Anne Lauscher
Joonsuk Park
Eva Maria Vecchi
S. Villata
Timon Ziegenbein
70
0
0
24 Mar 2024
Monotonic Paraphrasing Improves Generalization of Language Model Prompting
Qin Liu
Fei Wang
Nan Xu
Tianyi Yan
Tao Meng
Muhao Chen
LRM
91
8
0
24 Mar 2024
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
Shengyi Huang
Michael Noukhovitch
Arian Hosseini
Kashif Rasul
Weixun Wang
Lewis Tunstall
VLM
112
38
0
24 Mar 2024
LlamBERT: Large-scale low-cost data annotation in NLP
Bálint Csanády
Lajos Muzsai
Péter Vedres
Zoltán Nádasdy
András Lukács
101
9
0
23 Mar 2024
Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition
Shijian Deng
Erin E. Kosloski
Siddhi Patel
Zeke A. Barnett
Yiyang Nan
...
William T. Doan
Matthew Wang
Harsh Singh
P. Rollins
Yapeng Tian
69
5
0
22 Mar 2024
SensoryT5: Infusing Sensorimotor Norms into T5 for Enhanced Fine-grained Emotion Classification
Yuhan Xia
Qingqing Zhao
Yunfei Long
Ge Xu
Jia Wang
37
0
0
22 Mar 2024
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
Orion Weller
Benjamin Chang
Sean MacAvaney
Kyle Lo
Arman Cohan
Benjamin Van Durme
Dawn J Lawrie
Luca Soldaini
106
39
0
22 Mar 2024
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim
Sangyun Chung
Damin Yeom
Youngjoon Yu
Hak Gu Kim
Y. Ro
99
3
0
22 Mar 2024
Language Models in Dialogue: Conversational Maxims for Human-AI Interactions
Erik Miehling
Manish Nagireddy
P. Sattigeri
Elizabeth M. Daly
David Piorkowski
John T. Richards
ALM
125
15
0
22 Mar 2024
Argument-Aware Approach To Event Linking
I-Hung Hsu
Zihan Xue
Nilay Pochh
Sahil Bansal
Premkumar Natarajan
Jayanth Srinivasa
Nanyun Peng
101
0
0
22 Mar 2024
DP-Dueling: Learning from Preference Feedback without Compromising User Privacy
Aadirupa Saha
Hilal Asi
81
1
0
22 Mar 2024
Risk and Response in Large Language Models: Evaluating Key Threat Categories
Bahareh Harandizadeh
A. Salinas
Fred Morstatter
98
4
0
22 Mar 2024
Generative Active Learning for Image Synthesis Personalization
Xu-Lu Zhang
Wengyu Zhang
Xiao Wei
Jinlin Wu
Zhaoxiang Zhang
Zhen Lei
Qing Li
142
4
0
22 Mar 2024
Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation
Zhenrui Yue
Huimin Zeng
Yimeng Lu
Lanyu Shang
Yang Zhang
Dong Wang
RALM
OffRL
98
22
0
22 Mar 2024
GPT-Connect: Interaction between Text-Driven Human Motion Generator and 3D Scenes in a Training-free Manner
Haoxuan Qu
Ziyan Guo
Jun Liu
VGen
87
3
0
22 Mar 2024
On Zero-Shot Counterspeech Generation by LLMs
Punyajoy Saha
Aalok Agrawal
Abhik Jana
Chris Biemann
Animesh Mukherjee
92
13
0
22 Mar 2024
Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning
Maksym Taranukhin
Vered Shwartz
E. Milios
LRM
79
9
0
22 Mar 2024
Can 3D Vision-Language Models Truly Understand Natural Language?
Weipeng Deng
Jihan Yang
Runyu Ding
Jiahui Liu
Yijiang Li
Xiaojuan Qi
Edith C.H. Ngai
123
6
0
21 Mar 2024
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Renrui Zhang
Dongzhi Jiang
Yichi Zhang
Haokun Lin
Ziyu Guo
...
Aojun Zhou
Pan Lu
Kai-Wei Chang
Peng Gao
Hongsheng Li
107
253
0
21 Mar 2024
DreamReward: Text-to-3D Generation with Human Preference
Junliang Ye
Fangfu Liu
Qixiu Li
Zhengyi Wang
Yikai Wang
Xinzhou Wang
Yueqi Duan
Jun Zhu
109
29
0
21 Mar 2024
MyVLM: Personalizing VLMs for User-Specific Queries
Yuval Alaluf
Elad Richardson
Sergey Tulyakov
Kfir Aberman
Daniel Cohen-Or
MLLM
VLM
110
23
0
21 Mar 2024
ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy
Zonghan Yang
Peng Li
Ming Yan
Ji Zhang
Fei Huang
Yang Liu
LLMAG
LRM
117
9
0
21 Mar 2024
From Large to Tiny: Distilling and Refining Mathematical Expertise for Math Word Problems with Weakly Supervision
Qingwen Lin
Boyan Xu
Zhengting Huang
Ruichu Cai
97
3
0
21 Mar 2024
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
Yuren Mao
Xuemei Dong
Wenyi Xu
Yunjun Gao
Bin Wei
Ying Zhang
62
10
0
21 Mar 2024
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
Xiaoxue Cheng
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
LRM
AI4CE
ReLM
91
9
0
21 Mar 2024
ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion Classification
Sehee Lim
Yejin Kim
Chi-Hyun Choi
Jy-yong Sohn
Byung-Hoon Kim
62
5
0
21 Mar 2024
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
Kyungjae Lee
Dasol Hwang
Sunghyun Park
Youngsoo Jang
Moontae Lee
70
8
0
21 Mar 2024
Improving the Robustness of Large Language Models via Consistency Alignment
Zhao Yukun
Lingyong Yan
Weiwei Sun
Guoliang Xing
Shuaiqiang Wang
Meng Chong
Zhicong Cheng
Zhaochun Ren
Yin Dawei
88
22
0
21 Mar 2024
MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation
Longzheng Wang
Xiaohan Xu
Lei Zhang
Jiarui Lu
Yongxiu Xu
Hongbo Xu
Xuancheng Huang
Chuang Zhang
110
5
0
21 Mar 2024
Policy Mirror Descent with Lookahead
Kimon Protopapas
Anas Barakat
84
2
0
21 Mar 2024
Empowering Segmentation Ability to Multi-modal Large Language Models
Yuqi Yang
Peng-Tao Jiang
Jing Wang
Hao Zhang
Kai Zhao
Jinwei Chen
Yue Liu
LRM
VLM
90
4
0
21 Mar 2024
Protected group bias and stereotypes in Large Language Models
Hadas Kotek
David Q. Sun
Zidi Xiu
Margit Bowler
Christopher Klein
AILaw
ALM
59
3
0
21 Mar 2024
On Prompt Sensitivity of ChatGPT in Affective Computing
Mostafa M. Amin
Björn W. Schuller
49
7
0
20 Mar 2024
Previous
1
2
3
...
86
87
88
...
126
127
128
Next