2203.02155
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,380 papers shown
Title
FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models
Zhikai Zhang
Yitang Li
Haofeng Huang
Mingxian Lin
Li Yi
140
3
0
15 Jun 2024
Improving Large Models with Small models: Lower Costs and Better Performance
Dong Chen
Shuo Zhang
Yueting Zhuang
Siliang Tang
Qidong Liu
Hua Wang
Mingliang Xu
96
6
0
15 Jun 2024
Emerging Safety Attack and Defense in Federated Instruction Tuning of Large Language Models
Rui Ye
Jingyi Chai
Xiangrui Liu
Yaodong Yang
Yanfeng Wang
Siheng Chen
AAML
156
10
0
15 Jun 2024
LLM-Mediated Domain-Specific Voice Agents: The Case of TextileBot
Shu Zhong
Elia Gatti
James Hardwick
Miriam Ribul
Youngjun Cho
Marianna Obrist
74
4
0
15 Jun 2024
Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning
Jifan Zhang
Lalit P. Jain
Yang Guo
Jiayi Chen
Kuan Lok Zhou
...
Scott Sievert
Timothy T. Rogers
Kevin Jamieson
Robert Mankoff
Robert Nowak
107
6
0
15 Jun 2024
Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Haozhe An
Christabel Acquaye
Colin Wang
Zongxia Li
Rachel Rudinger
99
12
0
15 Jun 2024
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
Rui Yang
Ruomeng Ding
Yong Lin
Huan Zhang
Tong Zhang
122
62
0
14 Jun 2024
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
Hao Bai
Yifei Zhou
Mert Cemri
Jiayi Pan
Alane Suhr
Sergey Levine
Aviral Kumar
OffRL
114
65
0
14 Jun 2024
GenQA: Generating Millions of Instructions from a Handful of Prompts
Jiuhai Chen
Rifaa Qadri
Yuxin Wen
Neel Jain
John Kirchenbauer
Dinesh Manocha
Tom Goldstein
ALM
156
24
0
14 Jun 2024
IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce
Wenxuan Ding
Weiqi Wang
Sze Heng Douglas Kwok
Minghao Liu
Tianqing Fang
Jiaxin Bai
Junxian He
Yangqiu Song
RALM
88
8
0
14 Jun 2024
Off-Policy Evaluation from Logged Human Feedback
Aniruddha Bhargava
Lalit P. Jain
Branislav Kveton
Ge Liu
Subhojyoti Mukherjee
OffRL
68
2
0
14 Jun 2024
Deep Bayesian Active Learning for Preference Modeling in Large Language Models
Luckeciano C. Melo
P. Tigas
Alessandro Abate
Yarin Gal
122
11
0
14 Jun 2024
GEB-1.3B: Open Lightweight Large Language Model
Jie Wu
Yufeng Zhu
Lei Shen
Xuqing Lu
ALM
44
0
0
14 Jun 2024
A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation
Yongkang Liu
Ercong Nie
Shi Feng
Zheng Hua
Zifeng Ding
Daling Wang
Yifei Zhang
Hinrich Schütze
81
2
0
14 Jun 2024
Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments
Zhenrui Yue
Huimin Zeng
Lanyu Shang
Yifan Liu
Yang Zhang
Dong Wang
RALM
81
9
0
14 Jun 2024
From Text to Life: On the Reciprocal Relationship between Artificial Life and Large Language Models
Eleni Nisioti
Claire Glanois
Elias Najarro
Andrew Dai
Elliot Meyerson
J. Pedersen
Laetitia Teodorescu
Conor F. Hayes
Shyam Sudhakaran
Sebastian Risi
AI4CE
LM&Ro
103
4
0
14 Jun 2024
Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models
Jie Chen
Xintian Han
Yu Ma
Xun Zhou
Liang Xiang
ALM
LRM
76
2
0
14 Jun 2024
A Survey on Large Language Models from General Purpose to Medical Applications: Datasets, Methodologies, and Evaluations
Jinqiang Wang
Huansheng Ning
Yi Peng
Qikai Wei
Daniel Tesfai
Wenwei Mao
Tao Zhu
Runhe Huang
LM&MA
AI4MH
ELM
147
8
0
14 Jun 2024
Bootstrapping Language Models with DPO Implicit Rewards
Changyu Chen
Zichen Liu
Chao Du
Tianyu Pang
Qian Liu
Arunesh Sinha
Pradeep Varakantham
Min Lin
SyDa
ALM
129
27
0
14 Jun 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Holy Lovenia
Rahmad Mahendra
Salsabil Maulana Akbar
Lester James V. Miranda
Jennifer Santoso
...
Genta Indra Winata
Ruochen Zhang
Fajri Koto
Zheng-Xin Yong
Samuel Cahyawijaya
233
14
0
14 Jun 2024
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Miaosen Zhang
Yixuan Wei
Zhen Xing
Yifei Ma
Zuxuan Wu
...
Zheng Zhang
Qi Dai
Chong Luo
Xin Geng
Baining Guo
VLM
86
1
0
13 Jun 2024
Towards Vision-Language Geo-Foundation Model: A Survey
Yue Zhou
Xue Jiang
Yiping Ke
Junchi Yan
Xue Yang
Wayne Zhang
89
19
0
13 Jun 2024
ElicitationGPT: Text Elicitation Mechanisms via Language Models
Yifan Wu
Jason D. Hartline
67
5
0
13 Jun 2024
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
Hamish Ivison
Yizhong Wang
Jiacheng Liu
Zeqiu Wu
Valentina Pyatkin
Nathan Lambert
Noah A. Smith
Yejin Choi
Hannaneh Hajishirzi
108
64
0
13 Jun 2024
On Softmax Direct Preference Optimization for Recommendation
Yuxin Chen
Junfei Tan
An Zhang
Zhengyi Yang
Leheng Sheng
Enzhi Zhang
Xiang Wang
Tat-Seng Chua
115
35
0
13 Jun 2024
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs
Xuan Zhang
Chao Du
Tianyu Pang
Qian Liu
Wei Gao
Min Lin
LRM
AI4CE
103
64
0
13 Jun 2024
SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models
Kehua Feng
Keyan Ding
Weijie Wang
Xiang Zhuang
Zeyuan Wang
Ming Qin
Yu Zhao
Jianhua Yao
Qiang Zhang
H. Chen
ELM
93
9
0
13 Jun 2024
Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?
Zhaochen Su
Juntao Li
Jun Zhang
Tong Zhu
Xiaoye Qu
Pan Zhou
Yan Bowen
Yu Cheng
Min Zhang
LRM
133
25
0
13 Jun 2024
Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors
Ying Zhou
Ben He
Le Sun
DeLMO
71
1
0
13 Jun 2024
ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions
Xu Zhang
Xunjian Yin
Xiaojun Wan
79
3
0
13 Jun 2024
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
Minghao Wu
Thuy-Trang Vu
Zhuang Li
Gholamreza Haffari
75
6
0
13 Jun 2024
Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning
Janghoon Han
Changho Lee
Joongbo Shin
Stanley Jungkyu Choi
Honglak Lee
Kynghoon Bae
ALM
68
1
0
13 Jun 2024
RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack against LLMs
Xuan Chen
Yuzhou Nie
Lu Yan
Yunshu Mao
Wenbo Guo
Xiangyu Zhang
63
7
0
13 Jun 2024
Multi-Agent Collaboration via Cross-Team Orchestration
Zhuoyun Du
Chen Qian
Wei Liu
Zihao Xie
Yifei Wang
...
Weize Chen
Cheng Yang
Ye Tian
Xuantang Xiong
Lei Han
LLMAG
103
21
0
13 Jun 2024
MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning
Hanqing Wang
Zeguan Xiao
Shuo Wang
Guanhua Chen
112
27
0
13 Jun 2024
REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
Tomer Ashuach
Martin Tutek
Yonatan Belinkov
MU
KELM
178
7
0
13 Jun 2024
Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA
Jongwoo Park
Kanchana Ranasinghe
Kumara Kahatapitiya
Wonjeong Ryoo
Donghyun Kim
Michael S. Ryoo
162
25
0
13 Jun 2024
Security of AI Agents
Yifeng He
Ethan Wang
Yuyang Rong
Zifei Cheng
Hao Chen
LLMAG
106
10
0
12 Jun 2024
Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification
Martin Juan José Bucher
Marco Martini
ALM
AI4MH
135
36
0
12 Jun 2024
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs
Chen Zheng
Ke Sun
Xun Zhou
MoE
85
0
0
12 Jun 2024
PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences
Daiwei Chen
Yi Chen
Aniket Rege
Ramya Korlakai Vinayak
114
23
0
12 Jun 2024
TasTe: Teaching Large Language Models to Translate through Self-Reflection
Yutong Wang
Jiali Zeng
Xuebo Liu
Fandong Meng
Jie Zhou
Min Zhang
106
19
0
12 Jun 2024
Research Trends for the Interplay between Large Language Models and Knowledge Graphs
H. Khorashadizadeh
Fatima Zahra Amara
Morteza Ezzabady
Frédéric Ieng
Sanju Tiwari
Nandana Mihindukulasooriya
Jinghua Groppe
S. Sahri
Farah Benamara
Sven Groppe
110
10
0
12 Jun 2024
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams
Haoji Zhang
Yiqin Wang
Yansong Tang
Yong-Jin Liu
Jiashi Feng
Jifeng Dai
Xiaojie Jin
109
45
0
12 Jun 2024
AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection
Pia Pachinger
Janis Goldzycher
A. Planitzer
Wojciech Kusa
Allan Hanbury
Julia Neidhardt
80
3
0
12 Jun 2024
Large Language Models Meet Text-Centric Multimodal Sentiment Analysis: A Survey
Hao Yang
Yanyan Zhao
Yang Wu
Shilong Wang
Tian Zheng
Hongbo Zhang
Zongyang Ma
Wanxiang Che
Bing Qin
133
14
0
12 Jun 2024
Fewer Tokens and Fewer Videos: Extending Video Understanding Abilities in Large Vision-Language Models
Shimin Chen
Yitian Yuan
Shaoxiang Chen
Zequn Jie
Lin Ma
VLM
84
4
0
12 Jun 2024
It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF
Taiming Lu
Lingfeng Shen
Xinyu Yang
Weiting Tan
Beidi Chen
Huaxiu Yao
108
3
0
12 Jun 2024
Can Large Language Models Understand Spatial Audio?
Changli Tang
Wenyi Yu
Guangzhi Sun
Xianzhao Chen
Tian Tan
...
Jun Zhang
Lu Lu
Zejun Ma
Yuxuan Wang
Chao Zhang
125
7
0
12 Jun 2024
When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions
Zhening Li
Gabriel Poesia
Armando Solar-Lezama
OffRL
75
1
0
12 Jun 2024