2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Papers citing "Training language models to follow instructions with human feedback"
50 / 6,392 papers shown
Title
How Well Do LLMs Identify Cultural Unity in Diversity?
Jialin Li
Junli Wang
Junjie Hu
Ming Jiang
78
4
0
09 Aug 2024
Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts
Tingchen Fu
Yupeng Hou
Julian McAuley
Rui Yan
92
4
0
09 Aug 2024
Instruction Tuning-free Visual Token Complement for Multimodal LLMs
Dongsheng Wang
Jiequan Cui
Miaoge Li
Wang Lin
Bo Chen
Hanwang Zhang
MLLM
50
4
0
09 Aug 2024
Towards a Generative Approach for Emotion Detection and Reasoning
Ankita Bhaumik
T. Strzalkowski
ReLM
LRM
84
3
0
09 Aug 2024
SCOI: Syntax-augmented Coverage-based In-context Example Selection for Machine Translation
Chenming Tang
Zhixiang Wang
Hao Sun
96
1
0
09 Aug 2024
Better Alignment with Instruction Back-and-Forth Translation
Thao Nguyen
Jeffrey Li
Sewoong Oh
Ludwig Schmidt
Jason Weston
Luke Zettlemoyer
Xian Li
SyDa
88
7
0
08 Aug 2024
How Transformers Utilize Multi-Head Attention in In-Context Learning? A Case Study on Sparse Linear Regression
Xingwu Chen
Lei Zhao
Difan Zou
77
8
0
08 Aug 2024
Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models
Fabio Pernisi
Dirk Hovy
Paul Röttger
84
1
0
08 Aug 2024
Emergence in Multi-Agent Systems: A Safety Perspective
Philipp Altmann
Julian Schonberger
Steffen Illium
Maximilian Zorn
Fabian Ritz
Tom Haider
Simon Burton
Thomas Gabor
74
1
0
08 Aug 2024
Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for Competitive Debate
Yiqun Zhang
Xiaocui Yang
Shi Feng
Daling Wang
Yifei Zhang
Kaisong Song
LLMAG
116
6
0
08 Aug 2024
Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset
Kentaro Ozeki
Risako Ando
Takanobu Morishita
Hirohiko Abe
K. Mineshima
Mitsuhiro Okada
LRM
61
4
0
08 Aug 2024
Overview of the NLPCC 2024 Shared Task on Chinese Metaphor Generation
Xingwei Qu
Ge Zhang
Siwei Wu
Yizhi Li
Chenghua Lin
100
2
0
08 Aug 2024
EMTeC: A Corpus of Eye Movements on Machine-Generated Texts
Lena S. Bolliger
Patrick Haller
Isabelle Caroline Rose Cretton
D. R. Reich
Tannon Kew
Lena Ann Jäger
76
5
0
08 Aug 2024
Diffusion Guided Language Modeling
Justin Lovelace
Varsha Kishore
Yiwei Chen
Kilian Q. Weinberger
124
8
0
08 Aug 2024
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning
Heewoong Choi
Sangwon Jung
Hongjoon Ahn
Taesup Moon
OffRL
122
4
0
08 Aug 2024
Patchview: LLM-Powered Worldbuilding with Generative Dust and Magnet Visualization
John Joon Young Chung
Max Kreminski
103
14
0
07 Aug 2024
Human Speech Perception in Noise: Can Large Language Models Paraphrase to Improve It?
Anupama Chingacham
Miaoran Zhang
Vera Demberg
Dietrich Klakow
73
0
0
07 Aug 2024
Simplifying Scholarly Abstracts for Accessible Digital Libraries
Haining Wang
Jason Clark
86
1
0
07 Aug 2024
Generative Language Models with Retrieval Augmented Generation for Automated Short Answer Scoring
Zifan Wang
Christopher Ormerod
ELM
65
1
0
07 Aug 2024
Prompt and Prejudice
Lorenzo Berlincioni
Luca Cultrera
Federico Becattini
Marco Bertini
A. Bimbo
73
0
0
07 Aug 2024
CARE: A Clue-guided Assistant for CSRs to Read User Manuals
Weihong Du
Jia-Wei Liu
Zujie Wen
Dingnan Jin
Hongru Liang
Wenqiang Lei
91
0
0
07 Aug 2024
PAGED: A Benchmark for Procedural Graphs Extraction from Documents
Weihong Du
Wenrui Liao
Hongru Liang
Wenqiang Lei
93
1
0
07 Aug 2024
EnJa: Ensemble Jailbreak on Large Language Models
Jiahao Zhang
Zilong Wang
Ruofan Wang
Xingjun Ma
Yu-Gang Jiang
AAML
46
2
0
07 Aug 2024
MoExtend: Tuning New Experts for Modality and Task Extension
Shanshan Zhong
Shanghua Gao
Zhongzhan Huang
Wushao Wen
Marinka Zitnik
Pan Zhou
VLM
MLLM
MoE
116
7
0
07 Aug 2024
On the Generalization of Preference Learning with DPO
Shawn Im
Yixuan Li
85
2
0
06 Aug 2024
StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation
Boxi Cao
Mengjie Ren
Hongyu Lin
Xianpei Han
Feng Zhang
Junfeng Zhan
Le Sun
ELM
78
3
0
06 Aug 2024
Conditioning LLMs with Emotion in Neural Machine Translation
Charles Brazier
Jean-Luc Rouas
CVBM
68
2
0
06 Aug 2024
Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations
Leo Donisch
Sigurd Schacht
Carsten Lanquillon
81
2
0
06 Aug 2024
Compromising Embodied Agents with Contextual Backdoor Attacks
Aishan Liu
Yuguang Zhou
Xianglong Liu
Tianyuan Zhang
Siyuan Liang
...
Tianlin Li
Junqi Zhang
Wenbo Zhou
Qing Guo
Dacheng Tao
LLMAG
AAML
120
13
0
06 Aug 2024
Body of Her: A Preliminary Study on End-to-End Humanoid Agent
Tenglong Ao
LM&Ro
55
3
0
06 Aug 2024
A Framework for Fine-Tuning LLMs using Heterogeneous Feedback
Ryan Aponte
Ryan Rossi
Shunan Guo
Franck Dernoncourt
Tong Yu
Xiang Chen
Subrata Mitra
Nedim Lipka
OffRL
43
0
0
05 Aug 2024
XMainframe: A Large Language Model for Mainframe Modernization
Anh T. V. Dau
Hieu Trung Dao
Anh Tuan Nguyen
Hieu Trung Tran
Phong X. Nguyen
Nghi D. Q. Bui
90
2
0
05 Aug 2024
Self-Taught Evaluators
Tianlu Wang
Ilia Kulikov
O. Yu. Golovneva
Ping Yu
Weizhe Yuan
Jane Dwivedi-Yu
Richard Yuanzhe Pang
Maryam Fazel-Zarandi
Jason Weston
Xian Li
ALM
LRM
84
27
0
05 Aug 2024
Can Reinforcement Learning Unlock the Hidden Dangers in Aligned Large Language Models?
Mohammad Bahrami Karkevandi
Nishant Vishwamitra
Peyman Najafirad
AAML
87
1
0
05 Aug 2024
SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models
Muxi Diao
Rumei Li
Shiyang Liu
Guogang Liao
Jingang Wang
Xunliang Cai
Weiran Xu
AAML
111
2
0
05 Aug 2024
Language Model Can Listen While Speaking
Ziyang Ma
Yakun Song
Chenpeng Du
Jian Cong
Zhuo Chen
Yuping Wang
Yansen Wang
Xie Chen
AuLLM
103
28
0
05 Aug 2024
Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information
Yauwai Yim
Chunkit Chan
Tianyu Shi
Zheye Deng
Wei Fan
Tianshi Zheng
Yangqiu Song
LLMAG
107
13
0
05 Aug 2024
SNFinLLM: Systematic and Nuanced Financial Domain Adaptation of Chinese Large Language Models
Shujuan Zhao
Lingfeng Qiao
Kangyang Luo
Qian-Wen Zhang
Junru Lu
Di Yin
AIFin
80
3
0
05 Aug 2024
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
Zi Liang
Haibo Hu
Qingqing Ye
Yaxin Xiao
Haoyang Li
AAML
ELM
SILM
148
9
0
05 Aug 2024
Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding
Renato Vukovic
David Arps
Carel van Niekerk
Benjamin Matthias Ruppik
Hsien-chin Lin
Michael Heck
Milica Gašić
105
1
0
05 Aug 2024
Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
Xinbei Ma
Yiting Wang
Yao Yao
Tongxin Yuan
Aston Zhang
Zhuosheng Zhang
Hai Zhao
LLMAG
AAML
122
26
0
05 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
94
0
0
04 Aug 2024
Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process
Peng Wang
Xiaobin Wang
Chao Lou
Shengyu Mao
Pengjun Xie
Yong Jiang
97
4
0
04 Aug 2024
Representation Bias of Adolescents in AI: A Bilingual, Bicultural Study
Robert Wolfe
Aayushi Dangol
Bill Howe
Alexis Hiniker
102
6
0
04 Aug 2024
Sólo Escúchame: Spanish Emotional Accompaniment Chatbot
Bruno Gil Ramírez
Jessica Nayeli López Espejel
María del Carmen Santiago Díaz
Gustavo Trinidad Rubín Linares
AI4MH
51
0
0
03 Aug 2024
Evaluating the Impact of Advanced LLM Techniques on AI-Lecture Tutors for a Robotics Course
Sebastian Kahl
Felix Löffler
Martin Maciol
Fabian Ridder
Marius Schmitz
Jennifer Spanagel
Jens Wienkamp
Christopher Burgahn
M. Schilling
66
4
0
02 Aug 2024
Mission Impossible: A Statistical Perspective on Jailbreaking LLMs
Jingtong Su
Mingyu Lee
SangKeun Lee
93
12
0
02 Aug 2024
Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs
Yilun Hua
Yoav Artzi
98
4
0
02 Aug 2024
Conditional LoRA Parameter Generation
Aaron Mueller
Millicent Li
Koyena Pal
Wangbo Zhao
Yukun Zhou
Jiuding Sun
Yonatan Belinkov
DiffM
91
6
0
02 Aug 2024
NOLO: Navigate Only Look Once
Mengyu Bu
Shuhao Gu
Yang Feng
EgoV
108
1
0
02 Aug 2024