Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,392 papers shown
Title
Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance
Bryan Etzine
Masoud Hashemi
Nishanth Madhusudhan
Sagar Davasam
Roshnee Sharma
Sathwik Tejaswi Madhusudhan
Vikas Yadav
72
0
0
07 Mar 2025
Conformal Prediction for Image Segmentation Using Morphological Prediction Sets
Luca Mossina
Corentin Friedrich
MedIm
106
1
0
07 Mar 2025
Soft Policy Optimization: Online Off-Policy RL for Sequence Models
Taco Cohen
David W. Zhang
Kunhao Zheng
Yunhao Tang
Rémi Munos
Gabriel Synnaeve
OffRL
117
1
0
07 Mar 2025
Superintelligence Strategy: Expert Version
Dan Hendrycks
Eric Schmidt
Alexandr Wang
118
3
0
07 Mar 2025
SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in Context ad Hoc
Daniel Guzman-Olivares
Lara Quijano-Sanchez
Federico Liberatore
65
0
0
07 Mar 2025
Similarity-Based Domain Adaptation with LLMs
Jie He
Wendi Zhou
Xiang Li
Jeff Z. Pan
90
0
0
07 Mar 2025
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Hyungkyu Kang
Min-hwan Oh
OffRL
121
0
0
07 Mar 2025
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Ling Team
B. Zeng
Chenyu Huang
Chao Zhang
Changxin Tian
...
Zhaoxin Huan
Zujie Wen
Zhenhang Sun
Zhuoxuan Du
Z. He
MoE
ALM
198
5
0
07 Mar 2025
Dynamic Knowledge Integration for Evidence-Driven Counter-Argument Generation with Large Language Models
Anar Yeginbergen
Maite Oronoz
Rodrigo Agerri
138
0
0
07 Mar 2025
DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models
Ruizhe Chen
Wenhao Chai
Zhifei Yang
Xiaotian Zhang
Qiufeng Wang
Tony Q.S. Quek
Soujanya Poria
Zuozhu Liu
144
1
0
06 Mar 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
91
1
0
06 Mar 2025
Mixed Likelihood Variational Gaussian Processes
Kaiwen Wu
Craig Sanders
Benjamin Letham
Phillip Guan
116
0
0
06 Mar 2025
TIMER: Temporal Instruction Modeling and Evaluation for Longitudinal Clinical Records
Hejie Cui
Alyssa Unell
Bowen Chen
Jason Alan Fries
Emily Alsentzer
Sanmi Koyejo
N. Shah
137
3
0
06 Mar 2025
Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation
Aishik Konwer
Zhijian Yang
Erhan Bas
Cao Xiao
Prateek Prasanna
Parminder Bhatia
Taha A. Kass-Hout
MedIm
VLM
116
1
0
06 Mar 2025
Robust Data Watermarking in Language Models by Injecting Fictitious Knowledge
Xinyue Cui
Johnny Tian-Zheng Wei
Swabha Swayamdipta
Robin Jia
WaLM
151
2
0
06 Mar 2025
IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval
Tingyu Song
Guo Gan
Mingsheng Shang
Yilun Zhao
VLM
101
2
0
06 Mar 2025
Adding Alignment Control to Language Models
Wenhong Zhu
Weinan Zhang
Rui Wang
127
0
0
06 Mar 2025
Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment
Wen Yang
Junhong Wu
Chen Wang
Chengqing Zong
J.N. Zhang
166
1
0
06 Mar 2025
Lost in Literalism: How Supervised Training Shapes Translationese in LLMs
Yafu Li
Ronghao Zhang
Zhilin Wang
Huajian Zhang
Leyang Cui
Yongjing Yin
Tong Xiao
Yue Zhang
112
0
0
06 Mar 2025
Talking Back -- human input and explanations to interactive AI systems
Alan Dix
Tommaso Turchi
Ben Wilson
Anna Monreale
Matt Roach
77
1
0
06 Mar 2025
Underlying Semantic Diffusion for Effective and Efficient In-Context Learning
Zhong Ji
Weilong Cao
Yan Zhang
Yanwei Pang
Jungong Han
Xuelong Li
DiffM
VLM
92
0
0
06 Mar 2025
An Empirical Study on Eliciting and Improving R1-like Reasoning Models
Zhongfu Chen
Yingqian Min
Beichen Zhang
Jie Chen
Jinhao Jiang
...
Xu Miao
Yaojie Lu
Lei Fang
Zhongyuan Wang
Ji-Rong Wen
ReLM
OffRL
LRM
156
37
0
06 Mar 2025
Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
Erik Jones
Arjun Patrawala
Jacob Steinhardt
78
1
0
06 Mar 2025
Towards Autonomous Reinforcement Learning for Real-World Robotic Manipulation with Large Language Models
Niccolò Turcato
Matteo Iovino
Aris Synodinos
Alberto Dalla Libera
R. Carli
Pietro Falco
LM&Ro
124
0
0
06 Mar 2025
FANS -- Formal Answer Selection for Natural Language Math Reasoning Using Lean4
Jiarui Yao
Ruida Wang
Tong Zhang
LRM
119
2
0
05 Mar 2025
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
Guangyi Liu
Shuo Tang
Rui Ge
Yaxin Du
Zhenfei Yin
Tian Jin
Jing Shao
LLMAG
151
7
0
05 Mar 2025
Generative Artificial Intelligence in Robotic Manipulation: A Survey
Kun Zhang
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
...
Qifeng Chen
Jia Pan
Wei Zhang
Bo Yang
Hua Chen
185
1
0
05 Mar 2025
Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization
Jiajun Yu
Y. Zheng
Huan Yee Koh
Xiaojun Jia
Tianyue Wang
Haishuai Wang
119
2
0
05 Mar 2025
Rebalanced Multimodal Learning with Data-aware Unimodal Sampling
Qingyuan Jiang
Zhouyang Chi
Xiao Ma
Qirong Mao
Yang Yang
Jinhui Tang
99
0
0
05 Mar 2025
Adversarial Training for Multimodal Large Language Models against Jailbreak Attacks
Liming Lu
Shuchao Pang
Siyuan Liang
Haotian Zhu
Xiyu Zeng
Aishan Liu
Yunhuai Liu
Yongbin Zhou
AAML
174
5
0
05 Mar 2025
Token-Level Privacy in Large Language Models
Reém Harel
Niv Gilboa
Yuval Pinter
89
0
0
05 Mar 2025
CodeIF-Bench: Evaluating Instruction-Following Capabilities of Large Language Models in Interactive Code Generation
Peiding Wang
Lulu Zhang
Fang Liu
Lin Shi
Minxiao Li
Bo Shen
An Fu
ELM
LRM
430
2
0
05 Mar 2025
Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks
Zihao Zhao
Chenxiao Fan
Chongming Gao
Fuli Feng
Xiangnan He
LM&MA
AI4MH
116
1
0
05 Mar 2025
Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm
Haksub Kim
Kanghoon Lee
J. Park
Jiachen Li
Jinkyoo Park
126
1
0
05 Mar 2025
Deep Causal Behavioral Policy Learning: Applications to Healthcare
Jonas Knecht
Anna Zink
Jonathan Kolstad
Maya Petersen
CML
123
0
0
05 Mar 2025
Extrapolation Merging: Keep Improving With Extrapolation and Merging
Yiguan Lin
Bin Xu
Yinghao Li
Yang Gao
MoMe
97
1
0
05 Mar 2025
Preserving Cultural Identity with Context-Aware Translation Through Multi-Agent AI Systems
Mahfuz Ahmed Anik
Abdur Rahman
Azmine Toushik Wasi
Md Manjurul Ahsan
96
5
0
05 Mar 2025
Visualising Policy-Reward Interplay to Inform Zeroth-Order Preference Optimisation of Large Language Models
Alessio Galatolo
Zhenbang Dai
Katie Winkle
Meriem Beloucif
91
0
0
05 Mar 2025
Improving LLM Safety Alignment with Dual-Objective Optimization
Xuandong Zhao
Will Cai
Tianneng Shi
David Huang
Licong Lin
Song Mei
Dawn Song
AAML
MU
209
5
0
05 Mar 2025
Unified Mind Model: Reimagining Autonomous Agents in the LLM Era
Pengbo Hu
Xiang Ying
LLMAG
LM&Ro
AI4CE
157
1
0
05 Mar 2025
Can Frontier LLMs Replace Annotators in Biomedical Text Mining? Analyzing Challenges and Exploring Solutions
Yichong Zhao
Susumu Goto
122
0
0
05 Mar 2025
SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning
Borong Zhang
Yuhao Zhang
Yalan Qin
Yingshan Lei
Josef Dai
Yuanpei Chen
Yaodong Yang
130
0
0
05 Mar 2025
Enhancing LLM Knowledge Learning through Generalization
Mingkang Zhu
Xi Chen
Ziyi Wang
Bei Yu
Hengshuang Zhao
Jiaya Jia
115
0
0
05 Mar 2025
Target Return Optimizer for Multi-Game Decision Transformer
Kensuke Tatematsu
Akifumi Wachi
OffRL
125
0
0
04 Mar 2025
Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment
Matthew DosSantos DiSorbo
Harang Ju
Sinan Aral
ELM
LRM
83
1
0
04 Mar 2025
MPO: Boosting LLM Agents with Meta Plan Optimization
Weimin Xiong
Yifan Song
Qingxiu Dong
Bingchan Zhao
Feifan Song
Xun Wang
Sujian Li
LLMAG
150
3
0
04 Mar 2025
Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm
Hui Yuan
Yuhao Du
Xiaoqi Jiao
Yiwen Guo
Yuege Feng
Xiang Wan
Anningzhe Gao
Jinpeng Hu
98
0
0
04 Mar 2025
LoRA-Null: Low-Rank Adaptation via Null Space for Large Language Models
Pengwei Tang
Yebin Liu
Dongjie Zhang
Xing Wu
Debing Zhang
114
2
0
04 Mar 2025
EchoQA: A Large Collection of Instruction Tuning Data for Echocardiogram Reports
L. Moukheiber
Mira Moukheiber
Dana Moukheiiber
Jae-Woo Ju
Hyung-Chul Lee
LM&MA
141
0
0
04 Mar 2025
Effectively Steer LLM To Follow Preference via Building Confident Directions
Bingqing Song
Boran Han
Shuai Zhang
Hao Wang
Haoyang Fang
Bonan Min
Yuyang Wang
Mingyi Hong
LLMSV
95
4
0
04 Mar 2025
Previous
1
2
3
...
24
25
26
...
126
127
128
Next