Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 7,311 papers shown
Title
Model Will Tell: Training Membership Inference for Diffusion Models
Xiaomeng Fu
Xi Wang
Qiao Li
Jin Liu
Jiao Dai
Jizhong Han
52
5
0
13 Mar 2024
Specification Overfitting in Artificial Intelligence
Benjamin Roth
Pedro Henrique Luz de Araujo
Yuxi Xia
Saskia Kaltenbrunner
Christoph Korab
58
0
0
13 Mar 2024
Tastle: Distract Large Language Models for Automatic Jailbreak Attack
Zeguan Xiao
Yan Yang
Guanhua Chen
Yun-Nung Chen
AAML
48
18
0
13 Mar 2024
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments
Maonan Wang
Aoyu Pang
Yuheng Kan
Man-On Pun
Chung Shue Chen
Bo Huang
43
18
0
13 Mar 2024
Knowledge Conflicts for LLMs: A Survey
Rongwu Xu
Zehan Qi
Zhijiang Guo
Cunxiang Wang
Hongru Wang
Yue Zhang
Wei Xu
208
96
0
13 Mar 2024
OverleafCopilot: Empowering Academic Writing in Overleaf with Large Language Models
Haomin Wen
Zhenjie Wei
Yan Lin
Jiyuan Wang
Keli Zhang
Huaiyu Wan
27
0
0
13 Mar 2024
HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback
Ang Li
Qiugen Xiao
Peng Cao
Jian Tang
Yi Yuan
...
Weidong Guo
Yukang Gan
Jeffrey Xu Yu
D. Wang
Ying Shan
VLM
ALM
44
10
0
13 Mar 2024
AutoDev: Automated AI-Driven Development
Michele Tufano
Anisha Agarwal
Jinu Jang
Roshanak Zilouchian Moghaddam
Neel Sundaresan
44
15
0
13 Mar 2024
Gemma: Open Models Based on Gemini Research and Technology
Gemma Team
Gemma Team Thomas Mesnard
Cassidy Hardin
Robert Dadashi
Surya Bhupatiraju
...
Armand Joulin
Noah Fiedel
Evan Senter
Alek Andreev
Kathleen Kenealy
VLM
LLMAG
131
441
0
13 Mar 2024
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale
Xiang Hu
Pengyu Ji
Qingyang Zhu
Wei Wu
Kewei Tu
LRM
41
4
0
13 Mar 2024
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Ning Ding
Yulin Chen
Ganqu Cui
Xingtai Lv
Weilin Zhao
Ruobing Xie
Bowen Zhou
Zhiyuan Liu
Maosong Sun
ALM
MoMe
AI4CE
43
7
0
13 Mar 2024
Learning to Watermark LLM-generated Text via Reinforcement Learning
Xiaojun Xu
Yuanshun Yao
Yang Liu
31
10
0
13 Mar 2024
Deep Submodular Peripteral Networks
Gantavya Bhatt
Arnav M. Das
Jeff Bilmes
49
1
0
13 Mar 2024
Large Language Models are Contrastive Reasoners
Liang Yao
ReLM
ELM
LRM
50
2
0
13 Mar 2024
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection
Hanning Chen
Wenjun Huang
Yang Ni
Sanggeon Yun
Fei Wen
Hugo Latapie
Mohsen Imani
ObjD
MLLM
VLM
42
17
0
12 Mar 2024
TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial Creation on Physical Tasks
Yuexi Chen
Vlad I. Morariu
Anh Truong
Zhicheng Liu
DiffM
VGen
45
4
0
12 Mar 2024
Do Agents Dream of Electric Sheep?: Improving Generalization in Reinforcement Learning through Generative Learning
Giorgio Franceschelli
Mirco Musolesi
AI4CE
51
0
0
12 Mar 2024
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Lei Zhu
Fangyun Wei
Yanye Lu
MLLM
VLM
57
18
0
12 Mar 2024
Rethinking Generative Large Language Model Evaluation for Semantic Comprehension
Fangyun Wei
Xi Chen
Linzi Luo
ELM
ALM
LRM
38
7
0
12 Mar 2024
CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion
Qibing Ren
Chang Gao
Jing Shao
Junchi Yan
Xin Tan
Wai Lam
Lizhuang Ma
ALM
ELM
AAML
50
22
0
12 Mar 2024
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Sainbayar Sukhbaatar
O. Yu. Golovneva
Vasu Sharma
Hu Xu
Xi Lin
...
Jacob Kahn
Shang-Wen Li
Wen-tau Yih
Jason Weston
Xian Li
MoMe
OffRL
MoE
45
62
0
12 Mar 2024
Beyond Memorization: The Challenge of Random Memory Access in Language Models
Tongyao Zhu
Qian Liu
Liang Pang
Zhengbao Jiang
Min-Yen Kan
Min Lin
KELM
42
6
0
12 Mar 2024
Fine-tuning Large Language Models with Sequential Instructions
Hanxu Hu
Simon Yu
Pinzhen Chen
Edoardo Ponti
ALM
LRM
84
15
0
12 Mar 2024
FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models
Yan Liu
Renren Jin
Ling Shi
Zheng Yao
Deyi Xiong
LRM
37
4
0
12 Mar 2024
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction
Zixuan Li
Yutao Zeng
Yuxin Zuo
Weicheng Ren
Wenxuan Liu
...
Yidan Liu
Pan Yang
Xiaolong Jin
Jiafeng Guo
Xueqi Cheng
OffRL
38
25
0
12 Mar 2024
ORPO: Monolithic Preference Optimization without Reference Model
Jiwoo Hong
Noah Lee
James Thorne
OSLM
42
213
0
12 Mar 2024
Characterization of Large Language Model Development in the Datacenter
Qi Hu
Zhisheng Ye
Zerui Wang
Guoteng Wang
Mengdie Zhang
...
Dahua Lin
Xiaolin Wang
Yingwei Luo
Yonggang Wen
Tianwei Zhang
56
45
0
12 Mar 2024
generAItor: Tree-in-the-Loop Text Generation for Language Model Explainability and Adaptation
Thilo Spinner
Rebecca Kehlbeck
Rita Sevastjanova
Tobias Stähle
Daniel A. Keim
Oliver Deussen
Mennatallah El-Assady
46
3
0
12 Mar 2024
RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model
Mingze Wang
Lili Su
Cilin Yan
Sheng Xu
Pengcheng Yuan
Xiaolong Jiang
Baochang Zhang
51
11
0
12 Mar 2024
SIFiD: Reassess Summary Factual Inconsistency Detection with LLM
Jiuding Yang
Hui Liu
Weidong Guo
Zhuwei Rao
Yu-Syuan Xu
Di Niu
HILM
29
0
0
12 Mar 2024
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Byung-Kwan Lee
Beomchan Park
Chae Won Kim
Yonghyun Ro
MLLM
VLM
53
20
0
12 Mar 2024
Matrix-Transformation Based Low-Rank Adaptation (MTLoRA): A Brain-Inspired Method for Parameter-Efficient Fine-Tuning
Yao Liang
Yuwei Wang
Yang Li
Yi Zeng
44
0
0
12 Mar 2024
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models
Yu Yang
Siddhartha Mishra
Jeffrey N Chiang
Baharan Mirzasoleiman
42
18
0
12 Mar 2024
Towards Faithful Explanations: Boosting Rationalization with Shortcuts Discovery
Linan Yue
Qi Liu
Yichao Du
Li Wang
Weibo Gao
Yanqing An
39
5
0
12 Mar 2024
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Yang Jiao
Shaoxiang Chen
Zequn Jie
Wenke Huang
Lin Ma
Yueping Jiang
MLLM
50
18
0
12 Mar 2024
Curry-DPO: Enhancing Alignment using Curriculum Learning & Ranked Preferences
Pulkit Pattnaik
Rishabh Maheshwary
Kelechi Ogueji
Vikas Yadav
Sathwik Tejaswi Madhusudhan
42
18
0
12 Mar 2024
(
N
,
K
)
\mathbf{(N,K)}
(
N
,
K
)
-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Yufeng Zhang
Liyu Chen
Boyi Liu
Yingxiang Yang
Qiwen Cui
Yunzhe Tao
Hongxia Yang
119
0
0
11 Mar 2024
Improving deep learning with prior knowledge and cognitive models: A survey on enhancing explainability, adversarial robustness and zero-shot learning
F. Mumuni
A. Mumuni
AAML
42
5
0
11 Mar 2024
The pitfalls of next-token prediction
Gregor Bachmann
Vaishnavh Nagarajan
39
63
0
11 Mar 2024
Materials science in the era of large language models: a perspective
Ge Lei
Ronan Docherty
Samuel J. Cooper
50
18
0
11 Mar 2024
RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback
Yanming Liu
Xinyue Peng
Xuhong Zhang
Weihao Liu
Jianwei Yin
Jiannan Cao
Tianyu Du
RALM
38
37
0
11 Mar 2024
ALaRM: Align Language Models via Hierarchical Rewards Modeling
Yuhang Lai
Siyuan Wang
Shujun Liu
Xuanjing Huang
Zhongyu Wei
37
4
0
11 Mar 2024
Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning
Zijian Zhou
Miaojing Shi
Meng Wei
Oluwatosin O. Alabi
Zijie Yue
Tom Kamiel Magda Vercauteren
LM&MA
43
6
0
11 Mar 2024
Elephants Never Forget: Testing Language Models for Memorization of Tabular Data
Sebastian Bordt
Harsha Nori
Rich Caruana
LMTD
53
14
0
11 Mar 2024
RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models
Liangliang Chen
Yutian Lei
Shiyu Jin
Ying Zhang
Liangjun Zhang
LM&Ro
48
8
0
11 Mar 2024
A Logical Pattern Memory Pre-trained Model for Entailment Tree Generation
Li Yuan
Yi Cai
Haopeng Ren
Jiexin Wang
LRM
30
5
0
11 Mar 2024
A Knowledge-Injected Curriculum Pretraining Framework for Question Answering
Xin Lin
Tianhuang Su
Zhenya Huang
Shangzi Xue
Haifeng Liu
Enhong Chen
33
1
0
11 Mar 2024
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource Languages
Michael Andersland
25
0
0
11 Mar 2024
From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification
Fei Wang
Chao Shang
Sarthak Jain
Shuai Wang
Qiang Ning
Bonan Min
Vittorio Castelli
Yassine Benajiba
Dan Roth
ALM
27
8
0
10 Mar 2024
FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
Hanfang Lyu
Yuanchen Bai
Xin Liang
Ujaan Das
Chuhan Shi
Leiliang Gong
Yingchi Li
Mingfei Sun
Ming Ge
Xiaojuan Ma
45
0
0
10 Mar 2024
Previous
1
2
3
...
79
80
81
...
145
146
147
Next