Training language models to follow instructions with human feedback
arXiv:2203.02155

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM, ALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,390 papers shown
The Oscars of AI Theater: A Survey on Role-Playing with Language Models
Nuo Chen
Yan Wang
Yang Deng
Jia Li
124
21
0
16 Jul 2024
Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
Jinrui Zhang
Teng Wang
Haigang Zhang
Ping Lu
Feng Zheng
MLLM, LRM, VLM
92
4
0
16 Jul 2024
Thorns and Algorithms: Navigating Generative AI Challenges Inspired by Giraffes and Acacias
Waqar Hussain
102
1
0
16 Jul 2024
COMET: "Cone of experience" enhanced large multimodal model for mathematical problem generation
Sannyuya Liu
Jintian Feng
Zongkai Yang
Yawei Luo
Qian Wan
Xiaoxuan Shen
Jianwen Sun
90
7
0
16 Jul 2024
A Comprehensive Evaluation of Large Language Models on Temporal Event Forecasting
He Chang
Chenchen Ye
Zhulin Tao
Jie Wu
Zhengmao Yang
Yunshan Ma
Xianglin Huang
Tat-Seng Chua
AI4TS
91
2
0
16 Jul 2024
Does Refusal Training in LLMs Generalize to the Past Tense?
Maksym Andriushchenko
Nicolas Flammarion
147
36
0
16 Jul 2024
AIGC for Industrial Time Series: From Deep Generative Models to Large Generative Models
Lei Ren
Haiteng Wang
Yang Tang
Chunhua Yang
AI4TS, AI4CE
152
5
0
16 Jul 2024
Situated Instruction Following
So Yeon Min
Xavi Puig
Devendra Singh Chaplot
Tsung-Yen Yang
Akshara Rai
Priyam Parashar
Ruslan Salakhutdinov
Yonatan Bisk
Roozbeh Mottaghi
68
2
0
15 Jul 2024
Social and Ethical Risks Posed by General-Purpose LLMs for Settling Newcomers in Canada
I. Nejadgholi
Maryam Molamohammadi
Samir Bakhtawar
109
0
0
15 Jul 2024
MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models
Chengguang Gan
Qingyu Yin
Xinyang He
Hanjun Wei
Yunhao Liang
...
Shijian Wang
Hexiang Huang
Qinghao Zhang
Shiwen Ni
Tatsunori Mori
72
0
0
15 Jul 2024
Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment
Jinhao Jiang
Junyi Li
Wayne Xin Zhao
Yang Song
Tao Zhang
Ji-Rong Wen
CLL
92
3
0
15 Jul 2024
CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses
Jing Yao
Xiaoyuan Yi
Xing Xie
ELM, ALM
92
11
0
15 Jul 2024
MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs
Quang H. Nguyen
Duy C. Hoang
Juliette Decugis
Saurav Manchanda
Nitesh Chawla
Khoa D. Doan
238
11
0
15 Jul 2024
Towards Adapting Reinforcement Learning Agents to New Tasks: Insights from Q-Values
Ashwin Ramaswamy
Ransalu Senanayake
63
0
0
14 Jul 2024
Multi-Granularity Semantic Revision for Large Language Model Distillation
Xiaoyu Liu
Yun-feng Zhang
Wei Li
Simiao Li
Xu Huang
Hanting Chen
Yehui Tang
Jie Hu
Zhiwei Xiong
Yunhe Wang
73
1
0
14 Jul 2024
Revolutionizing Bridge Operation and maintenance with LLM-based Agents: An Overview of Applications and Insights
Xinyu-Chen
Lianzhen-Zhang
LLMAG, AI4CE
110
4
0
14 Jul 2024
AutoGRAMS: Autonomous Graphical Agent Modeling Software
Ben Krause
Lucia Chen
Emmanuel Kahembwe
72
1
0
14 Jul 2024
Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
Chun-Yi Kuan
Chih-Kai Yang
Wei-Ping Huang
Ke-Han Lu
Hung-yi Lee
113
13
0
13 Jul 2024
Language-Augmented Symbolic Planner for Open-World Task Planning
Guanqi Chen
Lei Yang
Ruixing Jia
Zhe Hu
Yizhou Chen
Wei Zhang
Wenping Wang
Jia Pan
LM&Ro, LLMAG
78
9
0
13 Jul 2024
sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting
Sanchit Ahuja
Kumar Tanmay
Hardik Hansrajbhai Chauhan
Barun Patra
Kriti Aggarwal
...
Tejas I. Dhamecha
Ahmed Awadallah
Monojit Choudhary
Vishrav Chaudhary
Sunayana Sitaram
87
4
0
13 Jul 2024
Bridging the Gap Between Information Seeking and Product Search Systems: Q&A Recommendation for E-commerce
Saar Kuzi
S. Malmasi
67
4
0
12 Jul 2024
Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting
Jinning Li
Jiachen Li
Sangjae Bae
David Isele
94
4
0
12 Jul 2024
Open (Clinical) LLMs are Sensitive to Instruction Phrasings
Alberto Mario Ceballos Arroyo
Monica Munnangi
Jiuding Sun
Karen Y.C. Zhang
Denis Jered McInerney
Byron C. Wallace
Silvio Amir
LM&MA
34
8
0
12 Jul 2024
Scalability of Bayesian Network Structure Elicitation with Large Language Models: a Novel Methodology and Comparative Analysis
Nikolay Babakov
Ehud Reiter
Alberto Bugarin
81
1
0
12 Jul 2024
Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments
Zoya Volovikova
A. Skrynnik
Petr Kuderov
Aleksandr I. Panov
LLMAG, LM&Ro
88
1
0
12 Jul 2024
The Sociolinguistic Foundations of Language Modeling
Jack Grieve
Sara Bartl
Matteo Fuoli
Jason Grafmiller
Weihang Huang
A. Jawerbaum
Akira Murakami
Marcus Perlman
Dana Roemling
Bodo Winter
105
12
0
12 Jul 2024
Evaluating AI Evaluation: Perils and Prospects
John Burden
ELM
106
9
0
12 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
Haoze Song
SyDa
78
7
0
12 Jul 2024
New Desiderata for Direct Preference Optimization
Xiangkun Hu
Tong He
David Wipf
93
3
0
12 Jul 2024
Refusing Safe Prompts for Multi-modal Large Language Models
Zedian Shao
Hongbin Liu
Yuepeng Hu
Neil Zhenqiang Gong
MLLM, LRM
87
1
0
12 Jul 2024
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Huayu Chen
Kaiwen Zheng
Hang Su
Jun Zhu
147
5
0
12 Jul 2024
Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs
Aobo Kong
Shiwan Zhao
Hao Chen
Qicheng Li
Yong Qin
Ruiqi Sun
Xin Zhou
Jiaming Zhou
Haoqin Sun
100
12
0
12 Jul 2024
Detect Llama -- Finding Vulnerabilities in Smart Contracts using Large Language Models
Peter Ince
Xiapu Luo
Jiangshan Yu
Joseph K. Liu
Xiaoning Du
ELM
65
6
0
12 Jul 2024
Empowering Few-Shot Relation Extraction with The Integration of Traditional RE Methods and Large Language Models
Ye Liu
Kai Zhang
Aoran Gan
Linan Yue
Feng Hu
Qi Liu
Enhong Chen
58
1
0
12 Jul 2024
Benchmarking Language Model Creativity: A Case Study on Code Generation
Yining Lu
Dixuan Wang
Tianjian Li
Dongwei Jiang
Meng Jiang
Daniel Khashabi
LRM
138
15
0
12 Jul 2024
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training
Youliang Yuan
Wenxiang Jiao
Wenxuan Wang
Jen-tse Huang
Jiahao Xu
Tian Liang
Pinjia He
Zhaopeng Tu
120
32
0
12 Jul 2024
Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay
Gonçalo Hora de Carvalho
Oscar Knap
R. Pollice
ReLM, ELM, LRM
130
1
0
12 Jul 2024
Operationalizing the Blueprint for an AI Bill of Rights: Recommendations for Practitioners, Researchers, and Policy Makers
Alex Oesterling
Usha Bhalla
Suresh Venkatasubramanian
Himabindu Lakkaraju
85
3
0
11 Jul 2024
Emergent Visual-Semantic Hierarchies in Image-Text Representations
Morris Alper
Hadar Averbuch-Elor
VLM
104
9
0
11 Jul 2024
Converging Paradigms: The Synergy of Symbolic and Connectionist AI in LLM-Empowered Autonomous Agents
Haoyi Xiong
Zhiyuan Wang
Xuhong Li
Jiang Bian
Zeke Xie
Shahid Mumtaz
Laura E. Barnes
LLMAG
134
8
0
11 Jul 2024
15M Multimodal Facial Image-Text Dataset
Dawei Dai
Yutang Li
Yingge Liu
Mingming Jia
Zhang YuanHui
Guoyin Wang
VLM
103
7
0
11 Jul 2024
SoupLM: Model Integration in Large Language and Multi-Modal Models
Yue Bai
Zichen Zhang
Jiasen Lu
Yun Fu
MoMe
62
1
0
11 Jul 2024
Foundation Model Engineering: Engineering Foundation Models Just as Engineering Software
Dezhi Ran
Mengzhou Wu
Wei Yang
Tao Xie
AI4CE
81
2
0
11 Jul 2024
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding
Minghui Wu
Chenxu Zhao
Anyang Su
Donglin Di
Tianyu Fu
...
Min He
Ya Gao
Meng Ma
Kun Yan
Ping Wang
87
1
0
11 Jul 2024
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Huanqian Wang
Yang Yue
Rui Lu
Jingxin Shi
Andrew Zhao
Shenzhi Wang
Shiji Song
Gao Huang
LM&Ro, KELM
143
0
0
11 Jul 2024
TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data
Siyi Du
Shaoming Zheng
Yinsong Wang
Wenjia Bai
D. O’Regan
Chen Qin
LMTD
97
5
0
10 Jul 2024
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
Chenguo Lin
Yuchen Lin
Panwang Pan
Xuanyang Zhang
Yadong Mu
3DV
116
2
0
10 Jul 2024
CiteME: Can Language Models Accurately Cite Scientific Claims?
Ori Press
Andreas Hochlehnert
Ameya Prabhu
Vishaal Udandarao
Ofir Press
Matthias Bethge
114
14
0
10 Jul 2024
GLBench: A Comprehensive Benchmark for Graph with Large Language Models
Yuhan Li
Peisong Wang
Xiao Zhu
Aochuan Chen
Haiyun Jiang
Deng Cai
Victor Wai Kin Chan
Jia Li
123
18
0
10 Jul 2024
Interpretable Differential Diagnosis with Dual-Inference Large Language Models
Shuang Zhou
Sirui Ding
Jiashuo Wang
Mingquan Lin
Genevieve B. Melton
Rui Zhang
LM&MA
72
2
0
10 Jul 2024