ResearchTrend.AI
Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLM, ALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,380 papers shown
Title
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
Kai Greshake
Sahar Abdelnabi
Shailesh Mishra
C. Endres
Thorsten Holz
Mario Fritz
SILM
186
504
0
23 Feb 2023
Sentence Simplification via Large Language Models
Yutao Feng
Jipeng Qiang
Yun Li
Yunhao Yuan
Yi Zhu
94
19
0
23 Feb 2023
On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective
Jindong Wang
Xixu Hu
Wenxin Hou
Hao Chen
Runkai Zheng
...
Weirong Ye
Xiubo Geng
Binxing Jiao
Yue Zhang
Xingxu Xie
AI4MH
175
241
0
22 Feb 2023
ChatGPT: Jack of all trades, master of none
Jan Kocoń
Igor Cichecki
Oliwier Kaszyca
Mateusz Kochanek
Dominika Szydło
...
Maciej Piasecki
Lukasz Radliński
Konrad Wojtasik
Stanislaw Woźniak
Przemyslaw Kazienko
AI4MH
172
558
0
21 Feb 2023
ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPT
Xiang Wei
Xingyu Cui
Ning Cheng
Xiaobin Wang
Xin Zhang
...
Jinan Xu
Jinan Xu
Meishan Zhang
Yong Jiang
Wenjuan Han
127
344
0
20 Feb 2023
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
AI4MH
129
245
0
19 Feb 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
214
16
0
17 Feb 2023
Tuning computer vision models with task rewards
André Susano Pinto
Alexander Kolesnikov
Yuge Shi
Lucas Beyer
Xiaohua Zhai
VLM
85
41
0
16 Feb 2023
Aligning Language Models with Preferences through f-divergence Minimization
Dongyoung Go
Tomasz Korbak
Germán Kruszewski
Jos Rozen
Nahyeon Ryu
Marc Dymetman
109
76
0
16 Feb 2023
Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models
Shrimai Prabhumoye
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
LM&MA
59
21
0
14 Feb 2023
The Programmer's Assistant: Conversational Interaction with a Large Language Model for Software Development
Steven I. Ross
Fernando Martinez
Stephanie Houde
Michael J. Muller
Justin D. Weisz
106
227
0
14 Feb 2023
Transformer models: an introduction and catalog
X. Amatriain
Ananth Sankar
Jie Bing
Praveen Kumar Bodigutla
Timothy J. Hazen
Michaeel Kazi
138
53
0
12 Feb 2023
Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks
Daniel Kang
Xuechen Li
Ion Stoica
Carlos Guestrin
Matei A. Zaharia
Tatsunori Hashimoto
AAML
105
253
0
11 Feb 2023
Synthesizing Human Gaze Feedback for Improved NLP Performance
Varun Khurana
Yaman Kumar Singla
Nora Hollenstein
R. Kumar
Balaji Krishnamurthy
72
17
0
11 Feb 2023
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
Tianjun Zhang
Fangchen Liu
Justin Wong
Pieter Abbeel
Joseph E. Gonzalez
103
47
0
10 Feb 2023
The Re-Label Method For Data-Centric Machine Learning
Tonglei Guo
NoLa
67
2
0
09 Feb 2023
GPTScore: Evaluate as You Desire
Jinlan Fu
See-Kiong Ng
Zhengbao Jiang
Pengfei Liu
LM&MA, ALM, ELM
194
292
0
08 Feb 2023
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
Yejin Bang
Samuel Cahyawijaya
Nayeon Lee
Wenliang Dai
Jane Polak Scowcroft
...
Tiezheng Yu
Willy Chung
Quyet V. Do
Yan Xu
Pascale Fung
ReLM, LRM
166
1,403
0
08 Feb 2023
ChatGPT and Software Testing Education: Promises & Perils
Sajed Jalil
Suzzana Rafi
Thomas D. Latoza
Kevin Moran
Wing Lam
ELM
106
178
0
07 Feb 2023
Exploring the Benefits of Training Expert Language Models over Instruction Tuning
Joel Jang
Seungone Kim
Seonghyeon Ye
Doyoung Kim
Lajanugen Logeswaran
Moontae Lee
Kyungjae Lee
Minjoon Seo
LRM, ALM
134
83
0
07 Feb 2023
Regulating ChatGPT and other Large Generative AI Models
P. Hacker
A. Engel
M. Mauer
AILaw
166
354
0
05 Feb 2023
Evaluating Large Language Models in Theory of Mind Tasks
Michal Kosinski
LLMAG, LRM
105
141
0
04 Feb 2023
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
Zhixuan Liang
Yao Mu
Mingyu Ding
Fei Ni
Masayoshi Tomizuka
Ping Luo
173
111
0
03 Feb 2023
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment
Hao Liu
Wilson Yan
Pieter Abbeel
99
25
0
02 Feb 2023
Using In-Context Learning to Improve Dialogue Safety
Nicholas Meade
Spandana Gella
Devamanyu Hazarika
Prakhar Gupta
Di Jin
Siva Reddy
Yang Liu
Dilek Z. Hakkani-Tür
125
40
0
02 Feb 2023
Mathematical Capabilities of ChatGPT
Simon Frieder
Luca Pinchetti
Alexis Chevalier
Ryan-Rhys Griffiths
Tommaso Salvatori
Thomas Lukasiewicz
P. Petersen
Julius Berner
ELM, AI4MH
141
434
0
31 Jan 2023
Grounding Language Models to Images for Multimodal Inputs and Outputs
Jing Yu Koh
Ruslan Salakhutdinov
Daniel Fried
MLLM
133
123
0
31 Jan 2023
Execution-based Code Generation using Deep Reinforcement Learning
Parshin Shojaee
Aneesh Jain
Sindhu Tipirneni
Chandan K. Reddy
133
58
0
31 Jan 2023
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
KAI-QING Zhou
Kai Zheng
Connor Pryor
Yilin Shen
Hongxia Jin
Lise Getoor
Xinze Wang
132
118
0
30 Jan 2023
Emerging Synergies in Causality and Deep Generative Models: A Survey
Guanglin Zhou
Shaoan Xie
Guang-Yuan Hao
Shiming Chen
Erdun Gao
Xiwei Xu
Chen Wang
Liming Zhu
Lina Yao
Kun Zhang
AI4CE
147
11
0
29 Jan 2023
Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation
Jessica Huynh
Cathy Jiao
Prakhar Gupta
Shikib Mehri
Payal Bajaj
Vishrav Chaudhary
M. Eskénazi
ELM, LM&MA
73
17
0
27 Jan 2023
AI vs. Human -- Differentiation Analysis of Scientific Content Generation
Yongqiang Ma
Jiawei Liu
Fan Yi
Qikai Cheng
Yong Huang
Wei Lu
Xiaozhong Liu
DeLMO
112
60
0
24 Jan 2023
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
Biyang Guo
Xin Zhang
Ziyuan Wang
Minqi Jiang
Jinran Nie
Yuxuan Ding
Jianwei Yue
Yupeng Wu
DeLMO, ELM
132
622
0
18 Jan 2023
Are Language Models Worse than Humans at Following Prompts? It's Complicated
Albert Webson
A. Loo
Qinan Yu
Ellie Pavlick
LRM
86
17
0
17 Jan 2023
Dissociating language and thought in large language models
Kyle Mahowald
Anna A. Ivanova
I. Blank
Nancy Kanwisher
J. Tenenbaum
Evelina Fedorenko
ELM, ReLM
121
215
0
16 Jan 2023
PromptShots at the FinNLP-2022 ERAI Tasks: Pairwise Comparison and Unsupervised Ranking
Peratham Wiriyathammabhum
46
4
0
16 Jan 2023
Blind Judgement: Agent-Based Supreme Court Modelling With GPT
S. Hamilton
LLMAG, ELM
78
41
0
12 Jan 2023
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
94
617
0
10 Jan 2023
Memory Augmented Large Language Models are Computationally Universal
Dale Schuurmans
84
46
0
10 Jan 2023
On The Fragility of Learned Reward Functions
Lev McKinney
Yawen Duan
David M. Krueger
Adam Gleave
93
20
0
09 Jan 2023
AI2: The next leap toward native language based and explainable machine learning framework
J. Dessureault
Daniel Massicotte
53
1
0
09 Jan 2023
A Survey on Transformers in Reinforcement Learning
Wenzhe Li
Hao Luo
Zichuan Lin
Chongjie Zhang
Zongqing Lu
Deheng Ye
OffRL, MU, AI4CE
130
58
0
08 Jan 2023
Iterated Decomposition: Improving Science Q&A by Supervising Reasoning Processes
Justin Reppert
Ben Rachbach
Charlie George
Luke Stebbing
Ju-Seung Byun
Maggie Appleton
Andreas Stuhlmuller
ReLM, LRM
133
17
0
04 Jan 2023
Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits
Ruibo Liu
Chenyan Jia
Ge Zhang
Ziyu Zhuang
Tony X. Liu
Soroush Vosoughi
189
36
0
01 Jan 2023
Rethinking with Retrieval: Faithful Large Language Model Inference
Hangfeng He
Hongming Zhang
Dan Roth
KELM, LRM
242
169
0
31 Dec 2022
Inconsistencies in Masked Language Models
Tom Young
Yunan Chen
Yang You
74
2
0
30 Dec 2022
ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports
Katharina Jeblick
B. Schachtner
Jakob Dexl
Andreas Mittermeier
Anna Theresa Stüber
...
Tobias Weber
Philipp Wesp
B. Sabel
J. Ricke
Michael Ingrisch
LM&MA, MedIm
176
403
0
30 Dec 2022
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
61
2
0
28 Dec 2022
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Omar Khattab
Keshav Santhanam
Xiang Lisa Li
David Leo Wright Hall
Percy Liang
Christopher Potts
Matei A. Zaharia
RALM, KELM
116
269
0
28 Dec 2022
Large Language Models Encode Clinical Knowledge
K. Singhal
Shekoofeh Azizi
T. Tu
S. S. Mahdavi
Jason W. Wei
...
A. Rajkomar
Joelle Barral
Christopher Semturs
Alan Karthikesalingam
Vivek Natarajan
LM&MA, ELM, AI4MH
316
2,416
0
26 Dec 2022