Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,399 papers shown
Title
Teaching Models to Improve on Tape
L. Bezalel
Eyal Orgad
Amir Globerson
82
0
0
03 Nov 2024
Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models
Aliyah R. Hsu
James Zhu
Zhichao Wang
Bin Bi
Shubham Mehrotra
...
Sougata Chaudhuri
Regunathan Radhakrishnan
S. Asur
Claire Na Cheng
Bin Yu
ALM
LRM
190
0
0
03 Nov 2024
Regret of exploratory policy improvement and
q
q
q
-learning
Wenpin Tang
X. Zhou
94
2
0
02 Nov 2024
PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment
Dongxu Liu
Bing Xu
Yinzhuo Chen
Bufan Xu
Wenpeng Lu
Muyun Yang
Tiejun Zhao
MoE
65
1
0
02 Nov 2024
PRIMO: Progressive Induction for Multi-hop Open Rule Generation
Jianyu Liu
Sheng Bi
Guilin Qi
62
0
0
02 Nov 2024
Do LLMs Know to Respect Copyright Notice?
Jialiang Xu
Shenglan Li
Zhaozhuo Xu
Denghui Zhang
92
5
0
02 Nov 2024
Rule Based Rewards for Language Model Safety
Tong Mu
Alec Helyar
Johannes Heidecke
Joshua Achiam
Andrea Vallone
Ian Kivlichan
Molly Lin
Alex Beutel
John Schulman
Lilian Weng
ALM
135
50
0
02 Nov 2024
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models
Jianyi Zhang
Da-Cheng Juan
Cyrus Rashtchian
Chun-Sung Ferng
Heinrich Jiang
Yiran Chen
94
4
0
01 Nov 2024
LLMs: A Game-Changer for Software Engineers?
Md Asraful Haque
LLMAG
SyDa
82
2
0
01 Nov 2024
Token-level Proximal Policy Optimization for Query Generation
Yichen Ouyang
Lu Wang
Fangkai Yang
Pu Zhao
Chenghua Huang
...
Saravan Rajmohan
Weiwei Deng
Dongmei Zhang
Feng Sun
Qi Zhang
OffRL
432
5
0
01 Nov 2024
Can LLMs make trade-offs involving stipulated pain and pleasure states?
Geoff Keeling
Winnie Street
Martyna Stachaczyk
Daria Zakharova
Iulia M. Comsa
Anastasiya Sakovych
Isabella Logothesis
Zejia Zhang
Blaise Agüera y Arcas
Jonathan Birch
87
5
0
01 Nov 2024
On the Opportunities of Large Language Models for Programming Process Data
John Edwards
Arto Hellas
Juho Leinonen
82
1
0
01 Nov 2024
Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
Song Yu
Xiaofei Xu
Fangfei Xu
Li Li
LM&MA
67
1
0
01 Nov 2024
Evaluating the Impact of Lab Test Results on Large Language Models Generated Differential Diagnoses from Clinical Case Vignettes
Balu Bhasuran
Qiao Jin
Yuzhang Xie
Carl Yang
Karim Hanna
Jennifer Costa
Cindy Shavor
Zhiyong Lu
Zhe He
LM&MA
ELM
49
0
0
01 Nov 2024
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation
Bohan Lyu
Yadi Cao
Duncan Watson-Parris
Leon Bergen
Taylor Berg-Kirkpatrick
Rose Yu
137
5
0
01 Nov 2024
Comparison-based Active Preference Learning for Multi-dimensional Personalization
Minhyeon Oh
Seungjoon Lee
Jungseul Ok
72
1
0
01 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
43
0
0
31 Oct 2024
Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters
Yujin Potter
Shiyang Lai
Junsol Kim
James Evans
Basel Alomair
103
20
0
31 Oct 2024
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
Kai Yan
Alex Schwing
Yu-Xiong Wang
OffRL
OnRL
86
0
0
31 Oct 2024
Desert Camels and Oil Sheikhs: Arab-Centric Red Teaming of Frontier LLMs
Muhammed Saeed
Elgizouli Mohamed
Mukhtar Mohamed
Shaina Raza
Muhammad Abdul-Mageed
Shady Shehata
91
0
0
31 Oct 2024
Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks
Yingzhe Peng
Xiaoting Qin
Zhiyang Zhang
Jue Zhang
Qingwei Lin
Xu Yang
Dongmei Zhang
Saravan Rajmohan
Qi Zhang
60
3
0
31 Oct 2024
Exploring the Knowledge Mismatch Hypothesis: Hallucination Propensity in Small Models Fine-tuned on Data from Larger Models
Phil Wee
Riyadh Baghdadi
HILM
73
1
0
31 Oct 2024
OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
Junda Wu
Xintong Li
Ruoyu Wang
Yu Xia
Yuxin Xiong
...
Xiang Chen
Branislav Kveton
Lina Yao
Jingbo Shang
Julian McAuley
OffRL
LRM
80
1
0
31 Oct 2024
Adaptive Alignment: Dynamic Preference Adjustments via Multi-Objective Reinforcement Learning for Pluralistic AI
Hadassah Harland
Richard Dazeley
Peter Vamplew
Hashini Senaratne
Bahareh Nakisa
Francisco Cruz
136
2
0
31 Oct 2024
LLaMo: Large Language Model-based Molecular Graph Assistant
Jinyoung Park
Minseong Bae
Dohwan Ko
Hyunwoo J. Kim
116
3
0
31 Oct 2024
Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization
Xiao Guo
Xiaohong Liu
I. Masi
Xiaoming Liu
185
11
0
31 Oct 2024
ALISE: Accelerating Large Language Model Serving with Speculative Scheduling
Youpeng Zhao
Jun Wang
69
0
0
31 Oct 2024
Constraint Back-translation Improves Complex Instruction Following of Large Language Models
Yunjia Qi
Hao Peng
Xinyu Wang
Bin Xu
Lei Hou
Juanzi Li
114
4
0
31 Oct 2024
Leveraging Language Models and Bandit Algorithms to Drive Adoption of Battery-Electric Vehicles
Keiichi Namikoshi
David A. Shamma
Rumen Iliev
Jingchao Fang
Alexandre L. S. Filipowicz
Candice L Hogan
Charlene C. Wu
Nikos Aréchiga
59
0
0
30 Oct 2024
COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences
Yongxu Liu
Argyris Oikonomou
Weiqiang Zheng
Yang Cai
Arman Cohan
99
1
0
30 Oct 2024
Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
Sheryl Hsu
Omar Khattab
Chelsea Finn
Archit Sharma
KELM
RALM
88
6
0
30 Oct 2024
Controlling Language and Diffusion Models by Transporting Activations
P. Rodríguez
Arno Blaas
Michal Klein
Luca Zappella
N. Apostoloff
Marco Cuturi
Xavier Suau
LLMSV
130
6
0
30 Oct 2024
Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval
Le Huang
Hengzhi Lan
Zijun Sun
Chuan Shi
Ting Bai
441
1
0
30 Oct 2024
Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration
Yanchu Guan
Dong Wang
Yanjie Wang
Haiqing Wang
Renen Sun
Chenyi Zhuang
Jinjie Gu
Zhixuan Chu
LM&Ro
LLMAG
81
0
0
30 Oct 2024
VPO: Leveraging the Number of Votes in Preference Optimization
Jae Hyeon Cho
Minkyung Park
Byung-Jun Lee
30
2
0
30 Oct 2024
Effective and Efficient Adversarial Detection for Vision-Language Models via A Single Vector
Youcheng Huang
Fengbin Zhu
Jingkun Tang
Pan Zhou
Wenqiang Lei
Jiancheng Lv
Tat-Seng Chua
AAML
74
4
0
30 Oct 2024
How Well Do Large Language Models Disambiguate Swedish Words?
Richard Johansson
16
0
0
30 Oct 2024
Improving Uncertainty Quantification in Large Language Models via Semantic Embeddings
Yashvir S. Grewal
Edwin V. Bonilla
Thang D. Bui
UQCV
89
9
0
30 Oct 2024
SciPIP: An LLM-based Scientific Paper Idea Proposer
Wenxiao Wang
Lihui Gu
Liye Zhang
Yunxiang Luo
Yi Dai
Chen Shen
Liang Xie
Binbin Lin
Xiaofei He
Jieping Ye
121
6
0
30 Oct 2024
Focus On This, Not That! Steering LLMs with Adaptive Feature Specification
Tom A. Lamb
Adam Davies
Alasdair Paren
Philip Torr
Francesco Pinto
131
0
0
30 Oct 2024
Power side-channel leakage localization through adversarial training of deep neural networks
Jimmy Gammell
A. Raghunathan
Kaushik Roy
AAML
151
0
0
29 Oct 2024
AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts
Vishal Kumar
Zeyi Liao
Jaylen Jones
Huan Sun
AAML
126
3
0
29 Oct 2024
Topic-Conversation Relevance (TCR) Dataset and Benchmarks
Yaran Fan
Jamie Pool
Senja Filipi
Ross Cutler
88
0
0
29 Oct 2024
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types
Yutao Mou
Shikun Zhang
Wei Ye
ELM
92
16
0
29 Oct 2024
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications
Monica Riedler
Stefan Langer
VLM
90
18
0
29 Oct 2024
Preserving Pre-trained Representation Space: On Effectiveness of Prefix-tuning for Large Multi-modal Models
Donghoon Kim
Gusang Lee
Kyuhong Shim
B. Shim
104
1
0
29 Oct 2024
Robot Policy Learning with Temporal Optimal Transport Reward
Yuwei Fu
Haichao Zhang
Di Wu
Wei Xu
Benoit Boulet
OffRL
93
2
0
29 Oct 2024
MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Yuan Wang
Di Huang
Yaqi Zhang
Wanli Ouyang
J. Jiao
Xuetao Feng
Yan Zhou
Pengfei Wan
Shixiang Tang
Dan Xu
VGen
125
16
0
29 Oct 2024
Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate
Zhiqi Bu
Xiaomeng Jin
Bhanukiran Vinzamuri
Anil Ramakrishna
Kai-Wei Chang
Volkan Cevher
Mingyi Hong
MU
183
14
0
29 Oct 2024
f
f
f
-PO: Generalizing Preference Optimization with
f
f
f
-divergence Minimization
Jiaqi Han
Mingjian Jiang
Yuxuan Song
J. Leskovec
Stefano Ermon
144
6
0
29 Oct 2024
Previous
1
2
3
...
39
40
41
...
126
127
128
Next