Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,398 papers shown
Title
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILaw
LM&MA
LRM
166
30
0
31 Dec 2024
From Generalist to Specialist: A Survey of Large Language Models for Chemistry
Yang Han
Ziping Wan
Lu Chen
Kai Yu
Xin Chen
LM&MA
111
3
0
31 Dec 2024
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Weiwei Sun
Lingyong Yan
Xinyu Ma
Shuaiqiang Wang
Fajie Yuan
Zhumin Chen
D. Yin
Zhaochun Ren
RALM
ALM
ELM
LRM
LM&MA
236
315
0
31 Dec 2024
ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation
Weilong Dong
Xinwei Wu
Renren Jin
Shaoyang Xu
Deyi Xiong
148
9
0
31 Dec 2024
LLM-based Translation Inference with Iterative Bilingual Understanding
Andong Chen
Kehai Chen
Yang Xiang
Xuefeng Bai
Muyun Yang
Yang Feng
Tiejun Zhao
Min Zhang
LRM
156
5
0
31 Dec 2024
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Jiajun Song
Zhuoyan Xu
Yiqiao Zhong
162
10
0
31 Dec 2024
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
220
13
0
31 Dec 2024
Geometric-Averaged Preference Optimization for Soft Preference Labels
Hiroki Furuta
Kuang-Huei Lee
Shixiang Shane Gu
Y. Matsuo
Aleksandra Faust
Heiga Zen
Izzeddin Gur
155
13
0
31 Dec 2024
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
LLM-jp
Akiko Aizawa
Eiji Aramaki
Bowen Chen
Fei Cheng
...
Yuya Yamamoto
Yusuke Yamauchi
Hitomi Yanaka
Rio Yokota
Koichiro Yoshino
111
17
0
31 Dec 2024
Altogether: Image Captioning via Re-aligning Alt-text
Hu Xu
Po-Yao (Bernie) Huang
Xiaoqing Ellen Tan
Ching-Feng Yeh
Jacob Kahn
...
Luke Zettlemoyer
Wen-tau Yih
Shang-Wen Li
Saining Xie
Christoph Feichtenhofer
DiffM
102
9
0
31 Dec 2024
LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots
Dongge Han
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Peter Bell
Amos Storkey
119
12
0
31 Dec 2024
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment
Jianfei Zhang
Jun Bai
Yangqiu Song
Yanmeng Wang
Rumei Li
Chenghua Lin
Wenge Rong
154
0
0
31 Dec 2024
AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies
Yibo Wen
Chenwei Xu
Jerry Yao-Chieh Hu
Han Liu
DiffM
117
5
0
31 Dec 2024
GaLore
+
+
+
: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection
Xutao Liao
Shaohui Li
Yuhui Xu
Zhi Li
Yebin Liu
You He
VLM
126
6
0
31 Dec 2024
Nash CoT: Multi-Path Inference with Preference Equilibrium
Ziqi Zhang
Cunxiang Wang
Xiong Xiao
Yue Zhang
Donglin Wang
LRM
108
2
0
31 Dec 2024
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
Chia-Yu Hung
Navonil Majumder
Zhifeng Kong
Ambuj Mehrish
Rafael Valle
Bryan Catanzaro
Soujanya Poria
Bryan Catanzaro
Soujanya Poria
162
10
0
30 Dec 2024
Training Software Engineering Agents and Verifiers with SWE-Gym
Jiayi Pan
Xingyao Wang
Graham Neubig
Navdeep Jaitly
Chenhui Xu
Alane Suhr
Yizhe Zhang
156
36
0
30 Dec 2024
Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey
Junqiao Wang
Zeng Zhang
Yangfan He
Yuyang Song
Tianyu Shi
...
Tang Jingqun
Guangwu Qian
Keqin Li
Qiuwu Chen
Lewei He
151
22
0
29 Dec 2024
FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration
Jia Liu
Min Chen
LM&Ro
AI4CE
87
3
0
28 Dec 2024
Using Large Language Models for Automated Grading of Student Writing about Science
Chris Impey
Matthew Wenger
Nikhil Garuda
Shahriar Golchin
Sarah Stamer
ELM
AI4Ed
72
4
0
25 Dec 2024
Retention Score: Quantifying Jailbreak Risks for Vision Language Models
Zaitang Li
Pin-Yu Chen
Tsung-Yi Ho
AAML
63
0
0
23 Dec 2024
DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak
Hao Wang
Hao Li
Junda Zhu
Xinyuan Wang
Changzai Pan
Minlie Huang
Lei Sha
362
0
0
23 Dec 2024
Multimodal Preference Data Synthetic Alignment with Reward Model
Robert Wijaya
Ngoc-Bao Nguyen
Ngai-Man Cheung
MLLM
SyDa
135
4
0
23 Dec 2024
Boosting LLM via Learning from Data Iteratively and Selectively
Qi Jia
Siyu Ren
Ziheng Qin
Fuzhao Xue
Jinjie Ni
Yang You
57
0
0
23 Dec 2024
WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models
Huawen Feng
Pu Zhao
Qingfeng Sun
Can Xu
Fangkai Yang
...
Qianli Ma
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
Qi Zhang
AAML
ALM
178
0
0
23 Dec 2024
Understanding the Logic of Direct Preference Alignment through Logic
Kyle Richardson
Vivek Srikumar
Ashish Sabharwal
233
2
0
23 Dec 2024
Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models
Cameron R. Jones
Benjamin Bergen
158
7
0
22 Dec 2024
Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs
Alexander von Recum
Christoph Schnabl
Gabor Hollbeck
Silas Alberti
Philip Blinde
Marvin von Hagen
145
2
0
22 Dec 2024
System-2 Mathematical Reasoning via Enriched Instruction Tuning
Huanqia Cai
Yijun Yang
Zhifeng Li
LRM
135
1
0
22 Dec 2024
An Interaction Design Toolkit for Physical Task Guidance with Artificial Intelligence and Mixed Reality
Arthur Caetano
Alejandro Aponte
Misha Sra
111
0
0
22 Dec 2024
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
Yuxiang Zhang
Yuqi Yang
Jiangming Shu
Yuhang Wang
Jinlin Xiao
Jitao Sang
ALM
VLM
OffRL
LRM
136
5
0
22 Dec 2024
Online Learning from Strategic Human Feedback in LLM Fine-Tuning
Shugang Hao
Lingjie Duan
166
5
0
22 Dec 2024
MVREC: A General Few-shot Defect Classification Model Using Multi-View Region-Context
Shuai Lyu
Fangjian Liao
Zeqi Ma
Rongchen Zhang
Dongmei Mo
W. Wong
158
1
0
22 Dec 2024
From Correlation to Causation: Understanding Climate Change through Causal Analysis and LLM Interpretations
Shan Shan
AI4CE
113
2
0
21 Dec 2024
The Task Shield: Enforcing Task Alignment to Defend Against Indirect Prompt Injection in LLM Agents
Feiran Jia
Tong Wu
Xin Qin
Anna Squicciarini
LLMAG
AAML
152
7
0
21 Dec 2024
REO-VLM: Transforming VLM to Meet Regression Challenges in Earth Observation
Xizhe Xue
Guoting Wei
Hao Chen
Han Zhang
Feng Lin
Chunhua Shen
Xiao Xiang Zhu
191
4
0
21 Dec 2024
LearnLM: Improving Gemini for Learning
LearnLM Team
Abhinit Modi
Aditya Srikanth Veerubhotla
Aliya Rysbek
Andrea Huber
...
Shaojian Zhu
Stephanie Chan
Steve Yadlowsky
Viknesh Sounderajah
Yannis Assael
148
8
0
21 Dec 2024
Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models
Zhisong Zhang
Yan Wang
Xinting Huang
Tianqing Fang
Han Zhang
Chenlong Deng
Shuaiyi Li
Dong Yu
154
6
0
21 Dec 2024
Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval
Luo Ji
Feixiang Guo
Teng Chen
Qingqing Gu
Xiaoyu Wang
...
Peng Yu
Yue Zhao
Hongyang Lei
Zhonglin Jiang
Yong Chen
RALM
LRM
178
0
0
21 Dec 2024
A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation
Shijie Zhou
Ruiyi Zhang
Yufan Zhou
Changyou Chen
VLM
126
1
0
20 Dec 2024
JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs
Haoyang Li
Jiawei Ye
Jie Wu
Tianjie Yan
Chu Wang
Zhixin Li
AAML
91
1
0
20 Dec 2024
PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time
Alireza Pourali
Arian Boukani
Hamzeh Khazaei
115
0
0
20 Dec 2024
Lexicography Saves Lives (LSL): Automatically Translating Suicide-Related Language
Annika Marie Schoene
J. Ortega
Rodolfo Zevallos
Laura Haaber Ihle
86
2
0
20 Dec 2024
REFA: Reference Free Alignment for multi-preference optimization
Taneesh Gupta
Rahul Madhavan
Xuchao Zhang
Chetan Bansal
Saravan Rajmohan
191
1
0
20 Dec 2024
Social Science Is Necessary for Operationalizing Socially Responsible Foundation Models
Adam Davies
Elisa Nguyen
Michael Simeone
Erik Johnston
Martin Gubri
215
0
0
20 Dec 2024
FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF
Flint Xiaofeng Fan
Cheston Tan
Yew-Soon Ong
Roger Wattenhofer
Wei Tsang Ooi
177
1
0
20 Dec 2024
Human-Readable Adversarial Prompts: An Investigation into LLM Vulnerabilities Using Situational Context
Nilanjana Das
Edward Raff
Aman Chadha
Manas Gaur
AAML
241
1
0
20 Dec 2024
Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization
Sahil Wadhwa
Chengtian Xu
Haoming Chen
Aakash Mahalingam
Akankshya Kar
Divya Chaudhary
100
1
0
19 Dec 2024
Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation
Joanne Boisson
Zara Siddique
Hsuvas Borkakoty
Dimosthenis Antypas
Luis Espinosa-Anke
Jose Camacho-Collados
115
0
0
19 Dec 2024
Learning to Generate Research Idea with Dynamic Control
Ruochen Li
Liqiang Jing
Chi Han
Jiawei Zhou
Xinya Du
LRM
124
6
0
19 Dec 2024
Previous
1
2
3
...
34
35
36
...
126
127
128
Next