Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,381 papers shown
Title
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
Bu Jin
Yupeng Zheng
Pengfei Li
Weize Li
Yuhang Zheng
...
Kun Zhan
Peng Jia
Xiaoxiao Long
Yilun Chen
Hao Zhao
3DV
114
20
0
28 Mar 2024
JDocQA: Japanese Document Question Answering Dataset for Generative Language Models
Eri Onami
Shuhei Kurita
Taiki Miyanishi
Taro Watanabe
74
4
0
28 Mar 2024
Mixed Preference Optimization: Reinforcement Learning with Data Selection and Better Reference Model
Qi Gou
Cam-Tu Nguyen
127
14
0
28 Mar 2024
Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual User Behaviors
Binzong Geng
Zhaoxin Huan
Xiaolu Zhang
Yong He
Liang Zhang
Fajie Yuan
Jun Zhou
Linjian Mo
97
25
0
28 Mar 2024
Plug-and-Play Grounding of Reasoning in Multimodal Large Language Models
Jiaxing Chen
Yuxuan Liu
Dehu Li
Xiang An
Weimo Deng
Ziyong Feng
Yongle Zhao
Yin Xie
LRM
102
15
0
28 Mar 2024
Fine-Tuning Language Models with Reward Learning on Policy
Hao Lang
Fei Huang
Yongbin Li
ALM
67
7
0
28 Mar 2024
Text Data-Centric Image Captioning with Interactive Prompts
Yiyu Wang
Hao Luo
Jungang Xu
Yingfei Sun
Fan Wang
VLM
80
0
0
28 Mar 2024
Disentangling Length from Quality in Direct Preference Optimization
Ryan Park
Rafael Rafailov
Stefano Ermon
Chelsea Finn
ALM
98
145
0
28 Mar 2024
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
Patrick Chao
Edoardo Debenedetti
Alexander Robey
Maksym Andriushchenko
Francesco Croce
...
Nicolas Flammarion
George J. Pappas
F. Tramèr
Hamed Hassani
Eric Wong
ALM
ELM
AAML
135
143
0
28 Mar 2024
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner
Yuxuan Yao
Han Wu
Zhijiang Guo
Biyan Zhou
Jiahui Gao
Sichun Luo
Hanxu Hou
Xiaojin Fu
Linqi Song
LLMAG
LRM
130
10
0
28 Mar 2024
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Zeren Chen
Zhelun Shi
Xiaoya Lu
Lehan He
Sucheng Qian
...
Zhen-fei Yin
Jing Shao
Jing Shao
Cewu Lu
Cewu Lu
77
6
0
28 Mar 2024
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
Yanling Wang
Jing Zhang
Zeyao Ma
Yang Li
Bohan Zhang
...
Didong Li
Shu Zhao
Juan-Zi Li
Jie Tang
J. Tang
LMTD
RALM
122
37
0
28 Mar 2024
IDGenRec: LLM-RecSys Alignment with Textual ID Learning
Juntao Tan
Shuyuan Xu
Wenyue Hua
Yingqiang Ge
Zelong Li
Yongfeng Zhang
105
32
0
27 Mar 2024
A Survey on Large Language Models from Concept to Implementation
Chen Wang
Jin Zhao
Jiaqi Gong
LLMAG
LM&MA
104
3
0
27 Mar 2024
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Yanwei Li
Yuechen Zhang
Chengyao Wang
Zhisheng Zhong
Yixin Chen
Ruihang Chu
Shaoteng Liu
Jiaya Jia
VLM
MLLM
MoE
133
238
0
27 Mar 2024
CYCLE: Learning to Self-Refine the Code Generation
Yangruibo Ding
Marcus J. Min
Gail E. Kaiser
Baishakhi Ray
133
37
0
27 Mar 2024
Understanding the Learning Dynamics of Alignment with Human Feedback
Shawn Im
Yixuan Li
ALM
107
14
0
27 Mar 2024
Non-Linear Inference Time Intervention: Improving LLM Truthfulness
Jakub Hoscilowicz
Adam Wiacek
Jan Chojnacki
Adam Cieślak
Leszek Michon
Vitalii Urbanevych
Artur Janicki
KELM
73
4
0
27 Mar 2024
Vulnerability Detection with Code Language Models: How Far Are We?
Yangruibo Ding
Yanjun Fu
Omniyyah Ibrahim
Chawin Sitawarin
Xinyun Chen
Basel Alomair
David Wagner
Baishakhi Ray
Yizheng Chen
AAML
91
58
0
27 Mar 2024
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
Elliot Bolton
Abhinav Venigalla
Michihiro Yasunaga
David Leo Wright Hall
Betty Xiong
...
R. Daneshjou
Jonathan Frankle
Percy Liang
Michael Carbin
Christopher D. Manning
LM&MA
MedIm
101
64
0
27 Mar 2024
Improving Attributed Text Generation of Large Language Models via Preference Learning
Dongfang Li
Zetian Sun
Baotian Hu
Zhenyu Liu
Xinshuo Hu
Xuebo Liu
Min Zhang
90
15
0
27 Mar 2024
BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Haitao Li
Qingyao Ai
Jia Chen
Qian Dong
Zhijing Wu
Yiqun Liu
Chong Chen
Qi Tian
AILaw
102
14
0
27 Mar 2024
Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback
Hongshen Xu
Zichen Zhu
Situo Zhang
Da Ma
Shuai Fan
Lu Chen
Kai Yu
HILM
107
45
0
27 Mar 2024
Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective
Meiqi Chen
Yixin Cao
Yan Zhang
Chaochao Lu
107
16
0
27 Mar 2024
LC-LLM: Explainable Lane-Change Intention and Trajectory Predictions with Large Language Models
Mingxing Peng
Xusen Guo
Xianda Chen
Meixin Zhu
Kehua Chen
Hao
Hao Yang
Xuesong Wang
Yinhai Wang
LRM
107
21
0
27 Mar 2024
Beyond Embeddings: The Promise of Visual Table in Visual Reasoning
Yiwu Zhong
Zi-Yuan Hu
Michael R. Lyu
Liwei Wang
73
1
0
27 Mar 2024
Boosting Conversational Question Answering with Fine-Grained Retrieval-Augmentation and Self-Check
Linhao Ye
Zhikai Lei
Jia-Peng Yin
Qin Chen
Jie Zhou
Liang He
3DV
RALM
75
19
0
27 Mar 2024
Exploring the Privacy Protection Capabilities of Chinese Large Language Models
Yuqi Yang
Xiaowen Huang
Jitao Sang
ELM
PILM
AILaw
107
1
0
27 Mar 2024
FoC: Figure out the Cryptographic Functions in Stripped Binaries with LLMs
Guoqiang Chen
Xiuwei Shang
Shaoyin Cheng
Yanming Zhang
Weiming Zhang
Neng H. Yu
N. Yu
175
2
0
27 Mar 2024
Large Language Models as Financial Data Annotators: A Study on Effectiveness and Efficiency
Toyin Aguda
S. Siddagangappa
Elena Kochkina
Simerjot Kaur
Dongsheng Wang
Charese Smiley
Sameena Shah
89
12
0
26 Mar 2024
ChatGPT Role-play Dataset: Analysis of User Motives and Model Naturalness
Sabrina Bodmer
Ameeta Agrawal
Judit Dombi
Tetyana Sydorenko
Jung In Lee
50
5
0
26 Mar 2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
Boyao Wang
Xiang Liu
Shizhe Diao
Renjie Pi
Jipeng Zhang
Chi Han
Tong Zhang
106
55
0
26 Mar 2024
Assessment of Multimodal Large Language Models in Alignment with Human Values
Zhelun Shi
Zhipin Wang
Hongxing Fan
Zaibin Zhang
Lijun Li
Yongting Zhang
Zhen-fei Yin
Lu Sheng
Yu Qiao
Jing Shao
77
22
0
26 Mar 2024
Continual Few-shot Event Detection via Hierarchical Augmentation Networks
Chenlong Zhang
Pengfei Cao
Yubo Chen
Kang Liu
Qing Cui
Mengshu Sun
Jun Zhao
73
3
0
26 Mar 2024
Language Models for Text Classification: Is In-Context Learning Enough?
A. Edwards
Jose Camacho-Collados
LRM
93
24
0
26 Mar 2024
RuBia: A Russian Language Bias Detection Dataset
Veronika Grigoreva
Anastasiia Ivanova
I. Alimova
Ekaterina Artemova
102
1
0
26 Mar 2024
Large Language Models Are State-of-the-Art Evaluator for Grammatical Error Correction
Masamune Kobayashi
Masato Mita
Mamoru Komachi
ELM
85
3
0
26 Mar 2024
ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler
Paramita Mirza
Viju Sudhi
S. Sahoo
Sinchana Ramakanth Bhat
93
5
0
26 Mar 2024
KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion
Yilin Wang
Minghao Hu
Zhen Huang
Dongsheng Li
Dong Yang
Xicheng Lu
78
2
0
26 Mar 2024
Robust and Scalable Model Editing for Large Language Models
Yingfa Chen
Zhengyan Zhang
Xu Han
Chaojun Xiao
Zhiyuan Liu
Chen Chen
Kuai Li
Tao Yang
Maosong Sun
KELM
52
2
0
26 Mar 2024
LM-Combiner: A Contextual Rewriting Model for Chinese Grammatical Error Correction
Yixuan Wang
Baoxin Wang
Yijun Liu
Dayong Wu
Wanxiang Che
KELM
97
2
0
26 Mar 2024
ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales?
Fan Huang
Haewoon Kwak
Kunwoo Park
Jisun An
ALM
ELM
AI4MH
114
12
0
26 Mar 2024
Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models
Zhiyuan Yu
Xiaogeng Liu
Shunning Liang
Zach Cameron
Chaowei Xiao
Ning Zhang
94
54
0
26 Mar 2024
The Pursuit of Fairness in Artificial Intelligence Models: A Survey
Tahsin Alamgir Kheya
Mohamed Reda Bouadjenek
Sunil Aryal
86
9
0
26 Mar 2024
DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion
Yuanze Lin
Ronald Clark
Philip Torr
3DGS
87
11
0
25 Mar 2024
Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large Language Model
Braja Gopal Patra
L. Lepow
Praneet Kasi Reddy Jagadeesh Kumar
V. Vekaria
M. M. Sharma
...
Myrna Weissman
M. Olfson
J. Mann
Alexander W. Charney
Jyoti Pathak
AI4MH
38
4
0
25 Mar 2024
Reflecting the Male Gaze: Quantifying Female Objectification in 19th and 20th Century Novels
Kexin Luo
Yue Mao
Bei Zhang
Sophie Hao
125
1
0
25 Mar 2024
MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
Kailai Yang
Zhiwei Liu
Qianqian Xie
Jimin Huang
Tianlin Zhang
Sophia Ananiadou
86
18
0
25 Mar 2024
The Strong Pull of Prior Knowledge in Large Language Models and Its Impact on Emotion Recognition
Georgios Chochlakis
Alexandros Potamianos
Kristina Lerman
Shrikanth Narayanan
95
7
0
25 Mar 2024
GOLF: Goal-Oriented Long-term liFe tasks supported by human-AI collaboration
Ben Wang
116
0
0
25 Mar 2024
Previous
1
2
3
...
85
86
87
...
126
127
128
Next