Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLM, ALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,395 papers shown
Title
OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models
Shuai Wang
Liang Ding
Li Shen
Yong Luo
Bo Du
Dacheng Tao
ELM, ALM
83
3
0
12 Jan 2024
Intention Analysis Makes LLMs A Good Jailbreak Defender
Yuqi Zhang
Liang Ding
Lefei Zhang
Dacheng Tao
LLMSV
83
29
0
12 Jan 2024
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning
Yutao Zhu
Peitian Zhang
Chenghao Zhang
Yifei Chen
Binyu Xie
Zheng Liu
Ji-Rong Wen
Zhicheng Dou
62
17
0
12 Jan 2024
Kun: Answer Polishment for Chinese Self-Alignment with Instruction Back-Translation
Tianyu Zheng
Shuyue Guo
Xingwei Qu
Jiawei Guo
Weixu Zhang
...
Chenghua Lin
Wenhao Huang
Wenhu Chen
Jie Fu
Ge Zhang
ALM
104
7
0
12 Jan 2024
Adapting Large Language Models for Document-Level Machine Translation
Minghao Wu
Thuy-Trang Vu
Zhuang Li
George F. Foster
Gholamreza Haffari
158
45
0
12 Jan 2024
Human-AI Collaborative Essay Scoring: A Dual-Process Framework with LLMs
Changrong Xiao
Wenxing Ma
Qingping Song
Sean Xin Xu
Kunpeng Zhang
Yufang Wang
Qi Fu
AI4Ed
54
19
0
12 Jan 2024
Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Planning Case Study
Shangding Gu
LLMAG
112
0
0
12 Jan 2024
TOFU: A Task of Fictitious Unlearning for LLMs
Pratyush Maini
Zhili Feng
Avi Schwarzschild
Zachary Chase Lipton
J. Zico Kolter
MU, CLL
145
193
0
11 Jan 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
Zhipeng Chen
Kun Zhou
Wayne Xin Zhao
Junchen Wan
Fuzheng Zhang
Di Zhang
Ji-Rong Wen
KELM
120
35
0
11 Jan 2024
Secrets of RLHF in Large Language Models Part II: Reward Modeling
Bing Wang
Rui Zheng
Luyao Chen
Yan Liu
Shihan Dou
...
Qi Zhang
Xipeng Qiu
Xuanjing Huang
Zuxuan Wu
Yuanyuan Jiang
ALM
113
110
0
11 Jan 2024
Chain of History: Learning and Forecasting with LLMs for Temporal Knowledge Graph Completion
Ruilin Luo
Tianle Gu
Haoling Li
Junzhe Li
Zicheng Lin
Jiayi Li
Yujiu Yang
AI4CE
140
10
0
11 Jan 2024
EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction
Siyu Yuan
Kaitao Song
Jiangjie Chen
Xu Tan
Yongliang Shen
Ren Kan
Dongsheng Li
Deqing Yang
LLMAG
104
68
0
11 Jan 2024
Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback
Jiashuo Wang
Chunpu Xu
Chak Tou Leong
Wenjie Li
Jing Li
111
2
0
11 Jan 2024
How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes
Sabina Elkins
E. Kochmar
Jackie C.K. Cheung
Iulian Serban
ELM, AI4Ed
70
14
0
11 Jan 2024
EpilepsyLLM: Domain-Specific Large Language Model Fine-tuned with Epilepsy Medical Knowledge
Xuyang Zhao
Qibin Zhao
Toshihisa Tanaka
AI4MH
55
1
0
11 Jan 2024
Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages
Zhuoyuan Mao
Yen Yu
ALM
60
2
0
11 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
ZuJie Wen
Ke Xu
Qi Li
165
64
0
11 Jan 2024
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Seung Hyun Lee
Yinxiao Li
Junjie Ke
Innfarn Yoo
Han Zhang
...
Junfeng He
Gang Li
Sangpil Kim
Irfan Essa
Feng Yang
EGVM
104
24
0
11 Jan 2024
Towards Conversational Diagnostic AI
Tao Tu
Anil Palepu
M. Schaekermann
Khaled Saab
Jan Freyberg
...
Katherine Chou
Greg S. Corrado
Yossi Matias
Alan Karthikesalingam
Vivek Natarajan
AI4MH, LM&MA
110
103
0
11 Jan 2024
Scaling Laws for Forgetting When Fine-Tuning Large Language Models
Damjan Kalajdzievski
CLL
94
12
0
11 Jan 2024
An EcoSage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant
Mohit Tomar
Abhisek Tiwari
Tulika Saha
Prince Jha
Sriparna Saha
42
1
0
10 Jan 2024
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
Yiqi Wang
Wentao Chen
Xiaotian Han
Xudong Lin
Haiteng Zhao
Yongfei Liu
Bohan Zhai
Jianbo Yuan
Quanzeng You
Hongxia Yang
LRM
117
88
0
10 Jan 2024
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Dennis Ulmer
Elman Mansimov
Kaixiang Lin
Justin Sun
Xibin Gao
Yi Zhang
LLMAG
76
30
0
10 Jan 2024
Large Model based Sequential Keyframe Extraction for Video Summarization
Kailong Tan
Yuxiang Zhou
Qianchen Xia
Rui Liu
Yong Chen
67
8
0
10 Jan 2024
Learning Audio Concepts from Counterfactual Natural Language
Ali Vosoughi
Luca Bondi
Ho-Hsiang Wu
Chenliang Xu
CML
93
5
0
10 Jan 2024
The Impact of Reasoning Step Length on Large Language Models
Mingyu Jin
Qinkai Yu
Dong Shu
Haiyan Zhao
Wenyue Hua
Yanda Meng
Yongfeng Zhang
Jundong Li
ReLM, LRM
185
113
0
10 Jan 2024
Multi-User Chat Assistant (MUCA): a Framework Using LLMs to Facilitate Group Conversations
Manqing Mao
Paishun Ting
Yijian Xiang
Mingyang Xu
Julia Chen
Jianzhe Lin
LLMAG
161
6
0
10 Jan 2024
User Embedding Model for Personalized Language Prompting
Sumanth Doddapaneni
Krishna Sayana
Ambarish Jash
Sukhdeep S. Sodhi
Dima Kuzmin
RALM
73
10
0
10 Jan 2024
Are Language Models More Like Libraries or Like Librarians? Bibliotechnism, the Novel Reference Problem, and the Attitudes of LLMs
Harvey Lederman
Kyle Mahowald
104
12
0
10 Jan 2024
Concept Alignment
Sunayana Rane
Polyphony J. Bruna
Ilia Sucholutsky
Christopher Kello
Thomas Griffiths
CVBM
69
8
0
09 Jan 2024
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation
Mahdi Nikdan
Soroush Tabesh
Elvir Crnčević
Dan Alistarh
134
32
0
09 Jan 2024
Agent Alignment in Evolving Social Norms
Shimin Li
Tianxiang Sun
Qinyuan Cheng
Xipeng Qiu
LLMAG
81
8
0
09 Jan 2024
Evaluating Language Model Agency through Negotiations
Tim R. Davidson
V. Veselovsky
Martin Josifoski
Maxime Peyrard
Antoine Bosselut
Michal Kosinski
Robert West
LLMAG
89
29
0
09 Jan 2024
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection
Hongcheng Guo
Jian Yang
Jiaheng Liu
Jiaqi Bai
Boyang Wang
Zhoujun Li
Tieqiao Zheng
Bo Zhang
Junran Peng
Qi Tian
83
20
0
09 Jan 2024
TransportationGames: Benchmarking Transportation Knowledge of (Multimodal) Large Language Models
Xue Zhang
Xiangyu Shi
Xinyue Lou
Rui Qi
Yufeng Chen
Jinan Xu
Wenjuan Han
77
5
0
09 Jan 2024
Probabilistic emotion and sentiment modelling of patient-reported experiences
Curtis Murray
Lewis Mitchell
Simon Jonathan Tuke
Mark Mackay
71
0
0
09 Jan 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy
Christoph Dann
Rahul Kidambi
Zhiwei Steven Wu
Alekh Agarwal
OffRL
134
112
0
08 Jan 2024
TextMachina: Seamless Generation of Machine-Generated Text Datasets
A. Sarvazyan
José Ángel González
Marc Franco-Salvador
DeLMO
99
3
0
08 Jan 2024
TeleChat Technical Report
Zhongjiang He
Zihan Wang
Xinzhan Liu
Shixuan Liu
Yitong Yao
...
Zilu Huang
Sishi Xiong
Yuxiang Zhang
Chao Wang
Shuangyong Song
AI4MH, LRM, ALM
108
4
0
08 Jan 2024
Enhanced Automated Code Vulnerability Repair using Large Language Models
David de-Fitero-Dominguez
Eva García-López
Antonio Garcia-Cabot
J. Martínez-Herráiz
64
16
0
08 Jan 2024
LightHouse: A Survey of AGI Hallucination
Feng Wang
LRM, HILM, VLM
99
3
0
08 Jan 2024
An Exploratory Study on Automatic Identification of Assumptions in the Development of Deep Learning Frameworks
Chen Yang
Peng Liang
Zinan Ma
45
0
0
08 Jan 2024
Using Zero-shot Prompting in the Automatic Creation and Expansion of Topic Taxonomies for Tagging Retail Banking Transactions
Daniel de S. Moraes
Pedro T. C. Santos
P. B. D. Costa
Matheus A. S. Pinto
Ivan de J. P. Pinto
...
Gabriela Tourinho
Marcos Rabaioli
Leandro Santos
Fellipe Marques
David Favaro
55
2
0
08 Jan 2024
InFoBench: Evaluating Instruction Following Ability in Large Language Models
Yiwei Qin
Kaiqiang Song
Yebowen Hu
Wenlin Yao
Sangwoo Cho
Xiaoyang Wang
Xuansheng Wu
Fei Liu
Pengfei Liu
Dong Yu
ELM
104
52
0
07 Jan 2024
Data-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning
Yingqian Min
Kun Zhou
Dawei Gao
Wayne Xin Zhao
He Hu
Yaliang Li
80
1
0
07 Jan 2024
CharPoet: A Chinese Classical Poetry Generation System Based on Token-free LLM
Chengyue Yu
Lei Zang
Jiaotuan Wang
Chenyi Zhuang
Jinjie Gu
65
4
0
07 Jan 2024
Malla: Demystifying Real-world Large Language Model Integrated Malicious Services
Zilong Lin
Jian Cui
Xiaojing Liao
Wenyuan Xu
66
23
0
06 Jan 2024
VLLaVO: Mitigating Visual Gap through LLMs
Shuhao Chen
Yulong Zhang
Weisen Jiang
Jiangang Lu
Yu Zhang
VLM
123
2
0
06 Jan 2024
Artificial Intelligence for Operations Research: Revolutionizing the Operations Research Process
Zhenan Fan
Bissan Ghaddar
Xinglu Wang
Linzi Xing
Yong Zhang
Zirui Zhou
AI4CE
101
13
0
06 Jan 2024
Human-Instruction-Free LLM Self-Alignment with Limited Samples
Hongyi Guo
Yuanshun Yao
Wei Shen
Jiaheng Wei
Xiaoying Zhang
Zhaoran Wang
Yang Liu
159
23
0
06 Jan 2024