ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,374 papers shown
Title
From Tarzan to Tolkien: Controlling the Language Proficiency Level of
  LLMs for Content Generation
From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation
Ali Malik
Stephen Mayhew
Chris Piech
K. Bicknell
65
4
0
05 Jun 2024
BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents
BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents
Yifei Wang
Dizhan Xue
Shengjie Zhang
Shengsheng Qian
AAMLLLMAG
102
38
0
05 Jun 2024
MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering
  Medical Knowledge
MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge
Yuxuan Zhou
Xien Liu
Chen Ning
Ji Wu
ELM
83
3
0
05 Jun 2024
Improving In-Context Learning with Prediction Feedback for Sentiment
  Analysis
Improving In-Context Learning with Prediction Feedback for Sentiment Analysis
Hongling Xu
Qianlong Wang
Yice Zhang
Min Yang
Xi Zeng
Bing Qin
Ruifeng Xu
59
6
0
05 Jun 2024
Scaling Laws for Reward Model Overoptimization in Direct Alignment
  Algorithms
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms
Rafael Rafailov
Yaswanth Chittepu
Ryan Park
Harshit S. Sikchi
Joey Hejna
Bradley Knox
Chelsea Finn
S. Niekum
127
69
0
05 Jun 2024
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
Yuchen Zhuang
Haotian Sun
Yue Yu
Rushi Qiang
Qifan Wang
Chao Zhang
Bo Dai
AAML
126
26
0
05 Jun 2024
PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with
  LLM
PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM
Tao Yang
Yingmin Luo
Zhongang Qi
Yang Wu
Ying Shan
Chang Wen Chen
3DVMLLM
78
13
0
05 Jun 2024
Exact Conversion of In-Context Learning to Model Weights in
  Linearized-Attention Transformers
Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers
Brian K Chen
Tianyang Hu
Hui Jin
Hwee Kuan Lee
Kenji Kawaguchi
88
2
0
05 Jun 2024
Adaptive Preference Scaling for Reinforcement Learning with Human
  Feedback
Adaptive Preference Scaling for Reinforcement Learning with Human Feedback
Ilgee Hong
Zichong Li
Alexander Bukharin
Yixiao Li
Haoming Jiang
Tianbao Yang
Tuo Zhao
82
6
0
04 Jun 2024
Aligning Large Language Models via Fine-grained Supervision
Aligning Large Language Models via Fine-grained Supervision
Dehong Xu
Liang Qiu
Minseok Kim
Faisal Ladhak
Jaeyoung Do
79
3
0
04 Jun 2024
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix
  Controller
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
Min Cai
Yuchen Zhang
Shichang Zhang
Fan Yin
Difan Zou
Yisong Yue
Ziniu Hu
87
1
0
04 Jun 2024
CoNav: A Benchmark for Human-Centered Collaborative Navigation
CoNav: A Benchmark for Human-Centered Collaborative Navigation
Changhao Li
Xinyu Sun
Peihao Chen
Jugang Fan
Zixu Wang
Yanxia Liu
Jinhui Zhu
Chuang Gan
Mingkui Tan
122
1
0
04 Jun 2024
On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and
  Latent Concept
On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept
Guangliang Liu
Haitao Mao
Bochuan Cao
Zhiyu Xue
K. Johnson
Jiliang Tang
Rongrong Wang
LRM
105
10
0
04 Jun 2024
Retaining Key Information under High Compression Ratios: Query-Guided
  Compressor for LLMs
Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs
Zhiwei Cao
Qian Cao
Yu Lu
Ningxin Peng
Luyang Huang
Shanbo Cheng
Jinsong Su
112
14
0
04 Jun 2024
Break the Chain: Large Language Models Can be Shortcut Reasoners
Break the Chain: Large Language Models Can be Shortcut Reasoners
Mengru Ding
Hanmeng Liu
Zhizhang Fu
Jian Song
Wenbo Xie
Yue Zhang
KELMLRM
75
12
0
04 Jun 2024
MidiCaps: A large-scale MIDI dataset with text captions
MidiCaps: A large-scale MIDI dataset with text captions
J. Melechovský
Abhinaba Roy
Dorien Herremans
94
13
0
04 Jun 2024
The current status of large language models in summarizing radiology
  report impressions
The current status of large language models in summarizing radiology report impressions
Danqing Hu
Shanyuan Zhang
Qing Liu
Xiaofeng Zhu
Bing Liu
LM&MA
39
2
0
04 Jun 2024
Diver: Large Language Model Decoding with Span-Level Mutual Information
  Verification
Diver: Large Language Model Decoding with Span-Level Mutual Information Verification
Jinliang Lu
Chen Wang
Jiajun Zhang
121
3
0
04 Jun 2024
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement
  Learning
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Wenzhe Li
Zihan Ding
Seth Karten
Chi Jin
109
2
0
04 Jun 2024
Conditional Language Learning with Context
Conditional Language Learning with Context
X. Zhang
Miao Li
Ji Wu
96
4
0
04 Jun 2024
DrEureka: Language Model Guided Sim-To-Real Transfer
DrEureka: Language Model Guided Sim-To-Real Transfer
Yecheng Jason Ma
William Liang
Hung-Ju Wang
Sam Wang
Yuke Zhu
Linxi Fan
Osbert Bastani
Dinesh Jayaraman
132
45
0
04 Jun 2024
Process-Driven Autoformalization in Lean 4
Process-Driven Autoformalization in Lean 4
Jianqiao Lu
Zhengying Liu
Yingjia Wan
Yinya Huang
Haiming Wang
Zhicheng YANG
Jing Tang
Zhijiang Guo
AI4CE
133
19
0
04 Jun 2024
Dishonesty in Helpful and Harmless Alignment
Dishonesty in Helpful and Harmless Alignment
Youcheng Huang
Jingkun Tang
Duanyu Feng
Zheng Zhang
Wenqiang Lei
Jiancheng Lv
Anthony G. Cohn
LLMSV
93
4
0
04 Jun 2024
AI Agents Under Threat: A Survey of Key Security Challenges and Future
  Pathways
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways
Zehang Deng
Yongjian Guo
Changzhou Han
Wanlun Ma
Junwu Xiong
Sheng Wen
Yang Xiang
157
49
0
04 Jun 2024
GRAM: Generative Retrieval Augmented Matching of Data Schemas in the
  Context of Data Security
GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security
Xuanqing Liu
Luyang Kong
Runhui Wang
Patrick Song
Austin Nevins
Henrik Johnson
Nimish Amlathe
Davor Golac
71
3
0
04 Jun 2024
TabMDA: Tabular Manifold Data Augmentation for Any Classifier using
  Transformers with In-context Subsetting
TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context Subsetting
Andrei Margeloiu
A. Bazaga
Nikola Simidjievski
Pietro Lio
M. Jamnik
LMTD
138
6
0
03 Jun 2024
Multi-agent assignment via state augmented reinforcement learning
Multi-agent assignment via state augmented reinforcement learning
Leopoldo Agorio
Sean Van Alen
Miguel Calvo-Fullana
Santiago Paternain
J. Bazerque
26
2
0
03 Jun 2024
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with
  Cross-Lingual Feedback
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback
Wen Lai
Mohsen Mesgar
Alexander Fraser
LRMALM
123
26
0
03 Jun 2024
Ask-EDA: A Design Assistant Empowered by LLM, Hybrid RAG and
  Abbreviation De-hallucination
Ask-EDA: A Design Assistant Empowered by LLM, Hybrid RAG and Abbreviation De-hallucination
Luyao Shi
Michael A. Kazda
Bradley Sears
Nick Shropshire
Ruchir Puri
52
8
0
03 Jun 2024
Safeguarding Large Language Models: A Survey
Safeguarding Large Language Models: A Survey
Yi Dong
Ronghui Mu
Yanghao Zhang
Siqi Sun
Tianle Zhang
...
Yi Qi
Jinwei Hu
Jie Meng
Saddek Bensalem
Xiaowei Huang
OffRLKELMAILaw
99
26
0
03 Jun 2024
The Life Cycle of Large Language Models: A Review of Biases in Education
The Life Cycle of Large Language Models: A Review of Biases in Education
Jinsook Lee
Yann Hicke
Renzhe Yu
Christopher A. Brooks
René F. Kizilcec
AI4Ed
103
2
0
03 Jun 2024
MMLU-Pro: A More Robust and Challenging Multi-Task Language
  Understanding Benchmark
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
Yubo Wang
Xueguang Ma
Ge Zhang
Yuansheng Ni
Abhranil Chandra
...
Kai Wang
Alex Zhuang
Rongqi Fan
Xiang Yue
Wenhu Chen
LRMELM
169
465
0
03 Jun 2024
An Information Bottleneck Perspective for Effective Noise Filtering on
  Retrieval-Augmented Generation
An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation
Kun Zhu
Xiaocheng Feng
Xiyuan Du
Yuxuan Gu
Weijiang Yu
Haotian Wang
Qianglong Chen
Zheng Chu
Jingchang Chen
Bing Qin
84
5
0
03 Jun 2024
Decoupled Alignment for Robust Plug-and-Play Adaptation
Decoupled Alignment for Robust Plug-and-Play Adaptation
Haozheng Luo
Jiahao Yu
Wenxin Zhang
Jialong Li
Jerry Yao-Chieh Hu
Xingyu Xing
Han Liu
106
11
0
03 Jun 2024
Sparsity-Accelerated Training for Large Language Models
Sparsity-Accelerated Training for Large Language Models
Da Ma
Lu Chen
Pengyu Wang
Hongshen Xu
Hanqi Li
Liangtai Sun
Su Zhu
Shuai Fan
Kai Yu
LRM
60
1
0
03 Jun 2024
DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via
  Adaptive Heads Fusion
DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion
Yilong Chen
Linhao Zhang
Junyuan Shang
Zhenyu Zhang
Tingwen Liu
Shuohuan Wang
Yu Sun
69
1
0
03 Jun 2024
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models
  and Their Defenses
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses
Xiaosen Zheng
Tianyu Pang
Chao Du
Qian Liu
Jing Jiang
Min Lin
AAML
142
42
0
03 Jun 2024
EffiQA: Efficient Question-Answering with Strategic Multi-Model
  Collaboration on Knowledge Graphs
EffiQA: Efficient Question-Answering with Strategic Multi-Model Collaboration on Knowledge Graphs
Zixuan Dong
Baoyun Peng
Yufei Wang
Jia Fu
Xiaodong Wang
Yongxue Shan
Xin Zhou
89
4
0
03 Jun 2024
Advancing DRL Agents in Commercial Fighting Games: Training,
  Integration, and Agent-Human Alignment
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
Chen Zhang
Qiang He
Zhou Yuan
Elvis S. Liu
Hong Wang
Jian Zhao
Yang-Feng Wang
118
2
0
03 Jun 2024
Guiding ChatGPT to Generate Salient Domain Summaries
Guiding ChatGPT to Generate Salient Domain Summaries
Jun Gao
Ziqiang Cao
Shaoyao Huang
Luozheng Qin
Chunhui Ai
109
1
0
03 Jun 2024
Strengthened Symbol Binding Makes Large Language Models Reliable
  Multiple-Choice Selectors
Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors
Mengge Xue
Zhenyu Hu
Liqun Liu
Kuo Liao
Shuang Li
Honglin Han
Meng Zhao
Chengguo Yin
83
8
0
03 Jun 2024
Scalable Ensembling For Mitigating Reward Overoptimisation
Scalable Ensembling For Mitigating Reward Overoptimisation
Ahmed M. Ahmed
Rafael Rafailov
Stepan Sharkov
Xuechen Li
Oluwasanmi Koyejo
159
5
0
03 Jun 2024
Luna: An Evaluation Foundation Model to Catch Language Model
  Hallucinations with High Accuracy and Low Cost
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Masha Belyi
Robert Friel
Shuai Shao
Atindriyo Sanyal
HILMRALM
117
7
0
03 Jun 2024
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts
  Language Models
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
Tianwen Wei
Bo Zhu
Liang Zhao
Cheng Cheng
Biye Li
...
Yutuan Ma
Rui Hu
Shuicheng Yan
Han Fang
Yahui Zhou
MoE
155
32
0
03 Jun 2024
Annotation Guidelines-Based Knowledge Augmentation: Towards Enhancing
  Large Language Models for Educational Text Classification
Annotation Guidelines-Based Knowledge Augmentation: Towards Enhancing Large Language Models for Educational Text Classification
Shiqi Liu
Sannyuya Liu
Lele Sha
Zijie Zeng
D. Gašević
Zhi Liu
110
0
0
03 Jun 2024
Re-ReST: Reflection-Reinforced Self-Training for Language Agents
Re-ReST: Reflection-Reinforced Self-Training for Language Agents
Zi-Yi Dou
Cheng-Fu Yang
Xueqing Wu
Kai-Wei Chang
Nanyun Peng
LRM
166
10
0
03 Jun 2024
Predicting drug-gene relations via analogy tasks with word embeddings
Predicting drug-gene relations via analogy tasks with word embeddings
Hiroaki Yamagiwa
Ryoma Hashimoto
Kiwamu Arakane
Ken Murakami
Shou Soeda
Momose Oyama
Yihua Zhu
Mariko Okada
Hidetoshi Shimodaira
205
0
0
03 Jun 2024
PrivacyRestore: Privacy-Preserving Inference in Large Language Models via Privacy Removal and Restoration
PrivacyRestore: Privacy-Preserving Inference in Large Language Models via Privacy Removal and Restoration
Huiping Zhuang
Jianwei Wang
Zhengdong Lu
Huiping Zhuang
Haoran Li
Huiping Zhuang
Cen Chen
RALMKELM
129
8
0
03 Jun 2024
Self-Improving Robust Preference Optimization
Self-Improving Robust Preference Optimization
Eugene Choi
Arash Ahmadian
Matthieu Geist
Oilvier Pietquin
M. G. Azar
122
9
0
03 Jun 2024
FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism
  for Language Models
FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models
Kaixin Lan
Tao Fang
Derek F. Wong
Yabo Xu
Lidia S. Chao
Cecilia G. Zhao
116
4
0
02 Jun 2024
Previous
123...707172...126127128
Next