ResearchTrend.AI
Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,381 papers shown
Title
GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models
Tao Zhang
Huiping Zhuang
Yuxiang Xiao
Huiping Zhuang
Cen Chen
James R. Foulds
Shimei Pan
CVBM
80
5
0
20 Jun 2024
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
Yusuke Hirota
Ryo Hachiuma
Chao-Han Huck Yang
Yuta Nakashima
VLM
89
4
0
20 Jun 2024
What Teaches Robots to Walk, Teaches Them to Trade too -- Regime Adaptive Execution using Informed Data and LLMs
Raeid Saqur
76
3
0
20 Jun 2024
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal
Tinghao Xie
Xiangyu Qi
Yi Zeng
Yangsibo Huang
Udari Madhushani Sehwag
...
Bo Li
Kai Li
Danqi Chen
Peter Henderson
Prateek Mittal
ALMELM
191
79
0
20 Jun 2024
Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing
Han Jiang
Xiaoyuan Yi
Zhihua Wei
Ziang Xiao
Shu Wang
Xing Xie
ELMALM
164
8
0
20 Jun 2024
APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking
Can Jin
Hongwu Peng
Shiyu Zhao
Zhenting Wang
Wujiang Xu
Ligong Han
Jiahui Zhao
Kai Zhong
Sanguthevar Rajasekaran
Dimitris N. Metaxas
KELM
153
33
0
20 Jun 2024
Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics
Seungbeen Lee
Seungwon Lim
Seungju Han
Giyeong Oh
Hyungjoo Chae
...
Beong-woo Kwak
Yeonsoo Lee
Dongha Lee
Jinyoung Yeo
Youngjae Yu
101
16
0
20 Jun 2024
Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective
Yuchen Wen
Keping Bi
Wei Chen
Jiafeng Guo
Xueqi Cheng
210
2
0
20 Jun 2024
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering
Tianchi Cai
Zhiwen Tan
Xierui Song
Tao Sun
Jiyan Jiang
Yunqi Xu
Yinger Zhang
Jinjie Gu
82
7
0
19 Jun 2024
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images
Rushikesh Zawar
Shaurya Dewan
Andrew F. Luo
Margaret M. Henderson
Michael J. Tarr
Leila Wehbe
VGenCoGe
78
1
0
19 Jun 2024
Leveraging Large Language Models for Patient Engagement: The Power of Conversational AI in Digital Health
Bo Wen
R. Norel
Julia Liu
Thaddeus Stappenbeck
F. Zulkernine
Huamin Chen
AI4MHLM&MA
76
4
0
19 Jun 2024
Improving Visual Commonsense in Language Models via Multiple Image Generation
Guy Yariv
Idan Schwartz
Yossi Adi
Sagie Benaim
VLMLRM
48
0
0
19 Jun 2024
Optimizing Psychological Counseling with Instruction-Tuned Large Language Models
Wenjie Li
Tianyu Sun
Kun Qian
Wenhong Wang
LM&MA
82
2
0
19 Jun 2024
Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators
Matéo Mahaut
Laura Aina
Paula Czarnowska
Momchil Hardalov
Thomas Müller
Lluís Marquez
HILM
99
24
0
19 Jun 2024
AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents
Edoardo Debenedetti
Jie Zhang
Mislav Balunović
Luca Beurer-Kellner
Marc Fischer
Florian Tramèr
LLMAGAAML
130
45
1
19 Jun 2024
BeHonest: Benchmarking Honesty in Large Language Models
Steffi Chern
Zhulin Hu
Yuqing Yang
Ethan Chern
Yuan Guo
Jiahe Jin
Binjie Wang
Pengfei Liu
HILMALM
143
6
0
19 Jun 2024
R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation
Fuda Ye
Shuangyin Li
Yongqi Zhang
Lei Chen
74
0
0
19 Jun 2024
Enhancing Collaborative Semantics of Language Model-Driven Recommendations via Graph-Aware Learning
Zhong Guan
Likang Wu
Hongke Zhao
Ming He
Jianpin Fan
90
3
0
19 Jun 2024
Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata
Mykhailo Poliakov
N. Shvai
84
9
0
19 Jun 2024
Synthetic Context Generation for Question Generation
Naiming Liu
Zichao Wang
Richard Baraniuk
LRM
97
3
0
19 Jun 2024
When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models
Ting-Yun Chang
Jesse Thomason
Robin Jia
112
5
0
19 Jun 2024
ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models
Hwiyeol Jo
Hyunwoo Lee
Kang Min Yoo
Taiwoo Park
52
0
0
19 Jun 2024
Exploring and Benchmarking the Planning Capabilities of Large Language Models
Bernd Bohnet
Azade Nova
Aaron T Parisi
Kevin Swersky
Katayoon Goshvadi
Hanjun Dai
Dale Schuurmans
Noah Fiedel
Hanie Sedghi
83
10
0
18 Jun 2024
Think-then-Act: A Dual-Angle Evaluated Retrieval-Augmented Generation
Yige Shen
Hao Jiang
Hua Qu
Jihong Zhao
RALMLRM
82
1
0
18 Jun 2024
Bayesian-LoRA: LoRA based Parameter Efficient Fine-Tuning using Optimal Quantization levels and Rank Values trough Differentiable Bayesian Gates
Cristian Meo
Ksenia Sycheva
Anirudh Goyal
Justin Dauwels
MQ
75
5
0
18 Jun 2024
DrVideo: Document Retrieval Based Long Video Understanding
Ziyu Ma
Chenhui Gou
Hengcan Shi
Bin Sun
Shutao Li
Hamid Rezatofighi
Jianfei Cai
VLM
82
15
0
18 Jun 2024
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
Haoxiang Wang
Wei Xiong
Tengyang Xie
Han Zhao
Tong Zhang
141
180
0
18 Jun 2024
Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
Pinzhen Chen
Simon Yu
Zhicheng Guo
Barry Haddow
ELM
114
3
0
18 Jun 2024
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Team GLM
Aohan Zeng
Bin Xu
Bowen Wang
...
Zhaoyu Wang
Zhen Yang
Zhengxiao Du
Zhenyu Hou
Zihan Wang
ALM
162
650
0
18 Jun 2024
Can Large Language Models Code Like a Linguist?: A Case Study in Low Resource Sound Law Induction
Atharva Naik
Kexun Zhang
Nathaniel R. Robinson
Aravind Mysore
Clayton Marr
Hong Sng Rebecca Byrnes
Anna Cai
Kalvin Chang
David R. Mortensen
102
1
0
18 Jun 2024
Abstraction-of-Thought Makes Language Models Better Reasoners
Ruixin Hong
Hongming Zhang
Xiaoman Pan
Dong Yu
Changshui Zhang
LRM
110
8
0
18 Jun 2024
TourLLM: Enhancing LLMs with Tourism Knowledge
Qikai Wei
Mingzhi Yang
Jinqiang Wang
Wenwei Mao
Jiabo Xu
Huansheng Ning
92
6
0
18 Jun 2024
Beyond Under-Alignment: Atomic Preference Enhanced Factuality Tuning for Large Language Models
Hongbang Yuan
Yubo Chen
Pengfei Cao
Zhuoran Jin
Kang Liu
Jun Zhao
55
0
0
18 Jun 2024
EMO-KNOW: A Large Scale Dataset on Emotion and Emotion-cause
M. Nguyen
Yasith Samaradivakara
P. Sasikumar
Chitralekha Gupta
Suranga Nanayakkara
68
1
0
18 Jun 2024
WebCanvas: Benchmarking Web Agents in Online Environments
Yichen Pan
Dehan Kong
Sida Zhou
Cheng Cui
Yifei Leng
...
Hangyu Liu
Yanyi Shang
Shuyan Zhou
Tongshuang Wu
Zhengyang Wu
152
43
0
18 Jun 2024
SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models
Somnath Banerjee
Soham Tripathy
Sayan Layek
Shanu Kumar
Animesh Mukherjee
Rima Hazra
95
7
0
18 Jun 2024
PFID: Privacy First Inference Delegation Framework for LLMs
Haoyan Yang
Zhitao Li
Yong Zhang
Jianzong Wang
Ning Cheng
Ming Li
Jing Xiao
119
2
0
18 Jun 2024
Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback
Zhirui Chen
Vincent Y. F. Tan
OffRL
101
1
0
18 Jun 2024
Quantum Compiling with Reinforcement Learning on a Superconducting Processor
Z. T. Wang
Qiuhao Chen
Yuxuan Du
Z. H. Yang
Xiaoxia Cai
...
Huikai Xu
Yirong Jin
Ruixia Wang
Haifeng Yu
S. P. Zhao
71
3
0
18 Jun 2024
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models
Yuetai Li
Zhangchen Xu
Fengqing Jiang
Luyao Niu
D. Sahabandu
Bhaskar Ramasubramanian
Radha Poovendran
SILMAAML
124
10
0
18 Jun 2024
Who's asking? User personas and the mechanics of latent misalignment
Asma Ghandeharioun
Ann Yuan
Marius Guerard
Emily Reif
Michael A. Lepori
Lucas Dixon
LLMSV
100
8
0
17 Jun 2024
COMMUNITY-CROSS-INSTRUCT: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities
Zihao He
Rebecca Dorn
Siyi Guo
Minh Duc Hoang Chu
Kristina Lerman
94
8
0
17 Jun 2024
MedCalc-Bench: Evaluating Large Language Models for Medical Calculations
Nikhil Khandekar
Qiao Jin
Guangzhi Xiong
Soren Dunn
Serina S Applebaum
...
Amisha D. Dave
Andrew Taylor
Aidong Zhang
Qingyu Chen
Zhiyong Lu
LM&MAELM
129
14
0
17 Jun 2024
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts
Junmo Kang
Leonid Karlinsky
Hongyin Luo
Zhen Wang
Jacob A. Hansen
James Glass
David D. Cox
Yikang Shen
Rogerio Feris
Alan Ritter
MoMeMoE
93
11
0
17 Jun 2024
Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
Kenneth Li
Yiming Wang
Fernanda Viégas
Martin Wattenberg
77
7
0
17 Jun 2024
LLaNA: Large Language and NeRF Assistant
Andrea Amaduzzi
Pierluigi Zama Ramirez
Giuseppe Lisanti
Samuele Salti
Luigi Di Stefano
106
4
0
17 Jun 2024
mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Fei Wang
Wenxuan Zhou
James Y. Huang
Nan Xu
Sheng Zhang
Hoifung Poon
Muhao Chen
118
28
0
17 Jun 2024
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models
Bingqi Ma
Zhuofan Zong
Guanglu Song
Hongsheng Li
Yu Liu
88
23
0
17 Jun 2024
WPO: Enhancing RLHF with Weighted Preference Optimization
Wenxuan Zhou
Ravi Agrawal
Shujian Zhang
Sathish Indurthi
Sanqiang Zhao
Kaiqiang Song
Silei Xu
Chenguang Zhu
105
20
0
17 Jun 2024
Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level
Jie Liu
Zhanhui Zhou
Jiaheng Liu
Xingyuan Bu
Chao Yang
Han-Sen Zhong
Wanli Ouyang
71
21
0
17 Jun 2024