arXiv: 2203.02155
Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM, ALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,380 papers shown
ChatGPT vs State-of-the-Art Models: A Benchmarking Study in Keyphrase Generation Task
Roberto Martínez-Cruz
Alvaro J. López-López
J. Portela
106
23
0
27 Apr 2023
Multimodal Grounding for Embodied AI via Augmented Reality Headsets for Natural Language Driven Task Planning
Selma Wanna
Fabian Parra
R. Valner
Karl Kruusamäe
Mitch Pryor
LM&Ro
68
2
0
26 Apr 2023
The Internal State of an LLM Knows When It's Lying
A. Azaria
Tom Michael Mitchell
HILM
374
348
0
26 Apr 2023
SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
Bin Wang
Xinnian Liang
Jian Yang
Huijia Huang
Shuangzhi Wu
Peihao Wu
Lu Lu
Zejun Ma
Zhoujun Li
LLMAG, KELM, RALM
145
29
0
26 Apr 2023
AGI: Artificial General Intelligence for Education
Ehsan Latif
Gengchen Mai
Matthew Nyaaba
Xuansheng Wu
Ninghao Liu
Guoyu Lu
Sheng Li
Tianming Liu
Xiaoming Zhai
ELM, AI4CE
142
24
0
24 Apr 2023
Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM
Ruohong Zhang
Yau-Shian Wang
Yiming Yang
SyDa
54
10
0
24 Apr 2023
SketchXAI: A First Look at Explainability for Human Sketches
Zhiyu Qu
Yulia Gryaditskaya
Ke Li
Kaiyue Pang
Tao Xiang
Yi-Zhe Song
89
8
0
23 Apr 2023
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
Jiashuo Sun
Yi Luo
Yeyun Gong
Chen Lin
Yelong Shen
Jian Guo
Nan Duan
LRM
112
21
0
23 Apr 2023
Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness
Bo Li
Gexiang Fang
Yang Yang
Quansen Wang
Wei Ye
Wen Zhao
Shikun Zhang
ELM, AI4MH
138
167
0
23 Apr 2023
Differentiate ChatGPT-generated and Human-written Medical Texts
Wenxiong Liao
Zheng Liu
Haixing Dai
Shaochen Xu
Zihao Wu
...
Xiaoke Huang
Dajiang Zhu
Hongmin Cai
Tianming Liu
Xiang Li
LM&MA, DeLMO, MedIm, AI4MH
62
60
0
23 Apr 2023
Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers
Felipe Urrutia
R. Araya
61
3
0
21 Apr 2023
Phoenix: Democratizing ChatGPT across Languages
Zhihong Chen
Feng Jiang
Junying Chen
Tiannan Wang
Fei Yu
...
Zhiyi Zhang
Jianquan Li
Xiang Wan
Benyou Wang
Haizhou Li
ALM
87
38
0
20 Apr 2023
Fully Autonomous Programming with Large Language Models
Vadim Liventsev
Anastasiia Grishina
Aki Härmä
Leon Moonen
ELM
91
40
0
20 Apr 2023
Any-to-Any Style Transfer: Making Picasso and Da Vinci Collaborate
Songhua Liu
Jingwen Ye
Xinchao Wang
DiffM
93
18
0
19 Apr 2023
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes
Simran Arora
Brandon Yang
Sabri Eyuboglu
A. Narayan
Andrew Hojel
Immanuel Trummer
Christopher Ré
SyDa
137
85
0
19 Apr 2023
Learning to Compress Prompts with Gist Tokens
Jesse Mu
Xiang Lisa Li
Noah D. Goodman
VLM
148
228
0
17 Apr 2023
Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation
Yunjie Ji
Yan Gong
Yong Deng
Yiping Peng
Qiang Niu
Baochang Ma
Xiangang Li
ALM, ELM
102
25
0
16 Apr 2023
ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models
Yikang Liu
Ziyin Zhang
Wanyang Zhang
Shisen Yue
Xiaojing Zhao
Xinyuan Cheng
Yiwen Zhang
Hai Hu
DeLMO
103
55
0
16 Apr 2023
One Explanation Does Not Fit XIL
Felix Friedrich
David Steinmann
Kristian Kersting
LRM
70
3
0
14 Apr 2023
On the Opportunities and Challenges of Foundation Models for Geospatial Artificial Intelligence
Gengchen Mai
Weiming Huang
Jin Sun
Suhang Song
Deepak Mishra
...
Yingjie Hu
Chris Cundy
Ziyuan Li
Rui Zhu
Ni Lao
AI4CE
122
134
0
13 Apr 2023
Automated Mapping of CVE Vulnerability Records to MITRE CWE Weaknesses
Ashraf Haddad
N. Aaraj
Preslav Nakov
Septimiu Fabian Mare
48
5
0
13 Apr 2023
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
Hanze Dong
Wei Xiong
Deepanshu Goyal
Yihan Zhang
Winnie Chow
Boyao Wang
Shizhe Diao
Jipeng Zhang
Kashun Shum
Tong Zhang
ALM
153
470
0
13 Apr 2023
Learning Personalized Decision Support Policies
Umang Bhatt
Valerie Chen
Katherine M. Collins
Parameswaran Kamalaruban
Emma Kallina
Adrian Weller
Ameet Talwalkar
OffRL
224
11
0
13 Apr 2023
ChatGPT Needs SPADE (Sustainability, PrivAcy, Digital divide, and Ethics) Evaluation: A Review
Sunder Ali Khowaja
P. Khuwaja
Kapal Dev
Weizheng Wang
Lewis Nkenyereye
136
88
0
13 Apr 2023
Are LLMs All You Need for Task-Oriented Dialogue?
Vojtěch Hudeček
Ondrej Dusek
94
62
0
13 Apr 2023
Language Instructed Reinforcement Learning for Human-AI Coordination
Hengyuan Hu
Dorsa Sadigh
LM&Ro
96
64
0
13 Apr 2023
AGI for Agriculture
Guoyu Lu
Sheng Li
Gengchen Mai
Jin Sun
Dajiang Zhu
...
R. Xu
Daniel Petti
Changying Li
Tianming Liu
Changying Li
AI4CE
98
17
0
12 Apr 2023
chatClimate: Grounding Conversational AI in Climate Science
S. Vaghefi
Qian Wang
V. Muccione
Jingwei Ni
Mathias Kraus
...
Tobias Schimanski
Chiara Colesanti-Senni
Nicolas Webersinke
Christian Huggel
Markus Leippold
KELM, AI4MH, HILM
109
74
0
11 Apr 2023
A Survey of Resources and Methods for Natural Language Processing of Serbian Language
U. Marovac
A. Avdić
Nikola Milosevic
55
1
0
11 Apr 2023
Emergent autonomous scientific research capabilities of large language models
Daniil A. Boiko
R. MacKnight
Gabe Gomes
ELM, LM&Ro, AI4CE, LLMAG
164
129
0
11 Apr 2023
RRHF: Rank Responses to Align Language Models with Human Feedback without tears
Zheng Yuan
Hongyi Yuan
Chuanqi Tan
Wei Wang
Songfang Huang
Feiran Huang
ALM
185
385
0
11 Apr 2023
Human-machine cooperation for semantic feature listing
Kushin Mukherjee
Siddharth Suresh
Timothy T. Rogers
VLM
56
2
0
11 Apr 2023
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models
Emilio Ferrara
SILM
121
264
0
07 Apr 2023
Architecture-Preserving Provable Repair of Deep Neural Networks
Zhe Tao
Stephanie Nawas
Jacqueline Mitchell
Aditya V. Thakur
AAML
64
11
0
07 Apr 2023
Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster
Nolan Dey
Gurpreet Gosal
Zhiming Chen
Hemant Khachane
William Marshall
Ribhu Pathria
Marvin Tom
Joel Hestness
MoE, LRM
126
108
0
06 Apr 2023
Evaluation of ChatGPT Family of Models for Biomedical Reasoning and Classification
Shan Chen
Yingya Li
Sheng Lu
Hoang Van
Hugo J. W. L. Aerts
G. Savova
Danielle S. Bitterman
LM&MA, AI4MH, ELM
71
47
0
05 Apr 2023
About optimal loss function for training physics-informed neural networks under respecting causality
V. A. Es'kin
Danil V. Davydov
Ekaterina D. Egorova
Alexey O. Malkhanov
Mikhail A. Akhukov
Mikhail E. Smorkalov
PINN
93
7
0
05 Apr 2023
Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks
Yixuan Weng
Minjun Zhu
Fei Xia
Bin Li
Shizhu He
Kang Liu
Jun Zhao
90
6
0
04 Apr 2023
Cross-Domain Image Captioning with Discriminative Finetuning
Roberto Dessì
Michele Bevilacqua
Eleonora Gualdoni
Nathanaël Carraz Rakotonirina
Francesca Franzon
Marco Baroni
CLIP
101
19
0
04 Apr 2023
The Vector Grounding Problem
Dimitri Coelho Mollo
Raphael Milliere
146
28
0
04 Apr 2023
Classification of integers based on residue classes via modern deep learning algorithms
Dangwei Wu
Jing Yang
Mian Umair Ahsan
Kai Wang
63
1
0
03 Apr 2023
Multi-Modal Perceiver Language Model for Outcome Prediction in Emergency Department
Sabri Boughorbel
Fethi Jarray
Abdulaziz Yousuf Al-Homaid
Rashid Niaz
Khalid Alyafei
113
0
0
03 Apr 2023
Towards Healthy AI: Large Language Models Need Therapists Too
Baihan Lin
Djallel Bouneffouf
Guillermo Cecchi
Kush R. Varshney
AI4MH
91
19
0
02 Apr 2023
Evaluating Large Language Models on a Highly-specialized Topic, Radiation Oncology Physics
J. Holmes
Zheng Liu
Hua Zhou
Yuzhen Ding
Terence T. Sio
...
Jonathan B. Ashman
Xiang Li
Tianming Liu
Jiajian Shen
Wen Liu
LM&MA, AI4CE, ELM
94
124
0
01 Apr 2023
CQSumDP: A ChatGPT-Annotated Resource for Query-Focused Abstractive Summarization Based on Debatepedia
Md Tahmid Rahman Laskar
Mizanur Rahman
Israt Jahan
Enamul Hoque
J. Huang
84
9
0
31 Mar 2023
GPT-4 can pass the Korean National Licensing Examination for Korean Medicine Doctors
Dongyeop Jang
Tae-Rim Yun
Choong-Yeol Lee
Young-Kyu Kwon
Chang-Eop Kim
ELM, LM&MA
67
29
0
31 Mar 2023
Aligning a medium-size GPT model in English to a small closed domain in Spanish
Oscar R. Navarrete-Parra
Víctor Uc Cetina
Jorge Reyes-Magaña
53
0
0
30 Mar 2023
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
155
913
0
30 Mar 2023
Recognition, recall, and retention of few-shot memories in large language models
A. Orhan
LRM, KELM, CLL
69
3
0
30 Mar 2023
Advances in apparent conceptual physics reasoning in GPT-4
Colin G. West
AI4CE
110
32
0
29 Mar 2023