ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,381 papers shown
Title
System-Level Natural Language Feedback
System-Level Natural Language Feedback
Weizhe Yuan
Kyunghyun Cho
Jason Weston
119
5
0
23 Jun 2023
Product Information Extraction using ChatGPT
Product Information Extraction using ChatGPT
Alexander Brinkmann
Roee Shraga
Reng Chiz Der
Christian Bizer
43
11
0
23 Jun 2023
Visual Adversarial Examples Jailbreak Aligned Large Language Models
Visual Adversarial Examples Jailbreak Aligned Large Language Models
Xiangyu Qi
Kaixuan Huang
Ashwinee Panda
Peter Henderson
Mengdi Wang
Prateek Mittal
AAML
129
173
0
22 Jun 2023
Towards Understanding What Code Language Models Learned
Towards Understanding What Code Language Models Learned
Toufique Ahmed
Dian Yu
Chen Huang
Cathy Wang
Prem Devanbu
Kenji Sagae
ELM
77
5
0
20 Jun 2023
Exploring New Frontiers in Agricultural NLP: Investigating the Potential
  of Large Language Models for Food Applications
Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications
Saed Rezayi
Zheng Liu
Zihao Wu
Chandra Dhakal
Bao Ge
...
Gengchen Mai
Ninghao Liu
Chen Zhen
Tianming Liu
Sheng Li
73
33
0
20 Jun 2023
Pushing the Limits of 3D Shape Generation at Scale
Pushing the Limits of 3D Shape Generation at Scale
Wang Yu
Xuelin Qian
Jingyang Huo
Tiejun Huang
Bo Zhao
Yanwei Fu
115
11
0
20 Jun 2023
Large Language Models are Fixated by Red Herrings: Exploring Creative
  Problem Solving and Einstellung Effect using the Only Connect Wall Dataset
Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset
S. Naeini
Raeid Saqur
M. Saeidi
John Giorgi
Babak Taati
125
11
0
19 Jun 2023
MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators
MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators
Yaqi Zhang
Di Huang
B. Liu
Shixiang Tang
Yan Lu
Lu Chen
Lei Bai
Qi Chu
Nenghai Yu
Wanli Ouyang
171
104
0
19 Jun 2023
Developing Effective Educational Chatbots with ChatGPT prompts: Insights
  from Preliminary Tests in a Case Study on Social Media Literacy (with
  appendix)
Developing Effective Educational Chatbots with ChatGPT prompts: Insights from Preliminary Tests in a Case Study on Social Media Literacy (with appendix)
Cansu Koyuturk
Mona Yavari
Emily Theophilou
Sathya Bursic
Gregor Donabauer
...
Raffaele Boiano
A. Gabbiadini
Davinia Hernández Leo
Martin Ruskov
D. Ognibene
68
18
0
18 Jun 2023
ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data
  and Comprehensive Evaluation
ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data and Comprehensive Evaluation
Guangyu Wang
Guoxing Yang
Zongxin Du
Longjun Fan
Xiaohu Li
LM&MAELMAI4MH
74
88
0
16 Jun 2023
Mimicking Better by Matching the Approximate Action Distribution
Mimicking Better by Matching the Approximate Action Distribution
Joao A. Candido Ramos
Lionel Blondé
Naoya Takeishi
Alexandros Kalousis
71
2
0
16 Jun 2023
CHORUS: Foundation Models for Unified Data Discovery and Exploration
CHORUS: Foundation Models for Unified Data Discovery and Exploration
Moe Kayali
A. Lykov
Ilias Fountalis
N. Vasiloglou
Dan Olteanu
Dan Suciu
99
25
0
16 Jun 2023
Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering
Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering
Rabiul Awal
Le Zhang
Aishwarya Agrawal
LRM
149
13
0
16 Jun 2023
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and
  Text Integration
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration
Chenyang Lyu
Minghao Wu
Longyue Wang
Xinting Huang
Bingshuai Liu
Zefeng Du
Shuming Shi
Zhaopeng Tu
MLLMAuLLM
86
173
0
15 Jun 2023
Bridging the Gap between Decision and Logits in Decision-based Knowledge
  Distillation for Pre-trained Language Models
Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language Models
Qinhong Zhou
Zonghan Yang
Peng Li
Yang Liu
104
3
0
15 Jun 2023
Towards Building Voice-based Conversational Recommender Systems:
  Datasets, Potential Solutions, and Prospects
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions, and Prospects
Xinghua Qu
Hongyang Liu
Zhu Sun
Xiang Yin
Yew-Soon Ong
Lu Lu
Zejun Ma
121
3
0
14 Jun 2023
Language models are not naysayers: An analysis of language models on
  negation benchmarks
Language models are not naysayers: An analysis of language models on negation benchmarks
Thinh Hung Truong
Timothy Baldwin
Karin Verspoor
Trevor Cohn
124
60
0
14 Jun 2023
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Ziyang Luo
Can Xu
Pu Zhao
Qingfeng Sun
Xiubo Geng
Wenxiang Hu
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
ELMSyDaALM
217
698
0
14 Jun 2023
FLamE: Few-shot Learning from Natural Language Explanations
FLamE: Few-shot Learning from Natural Language Explanations
Yangqiaoyu Zhou
Yiming Zhang
Chenhao Tan
LRMFAtt
95
11
0
13 Jun 2023
ReadProbe: A Demo of Retrieval-Enhanced Large Language Models to Support
  Lateral Reading
ReadProbe: A Demo of Retrieval-Enhanced Large Language Models to Support Lateral Reading
Dake Zhang
Ronak Pradeep
RALM
30
2
0
13 Jun 2023
Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions
Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions
Weizhen He
Yihe Deng
Shixiang Tang
Qihao Chen
Qingsong Xie
...
Feng Zhu
Rui Zhao
Wanli Ouyang
Donglian Qi
Yunfeng Yan
129
25
0
13 Jun 2023
Prompt-based Extraction of Social Determinants of Health Using Few-shot
  Learning
Prompt-based Extraction of Social Determinants of Health Using Few-shot Learning
Giridhar Kaushik Ramachandran
Yujuan Fu
Bin Han
K. Lybarger
Nicholas J. Dobbins
Özlem Uzuner
Meliha Yetisgen
FedML
61
16
0
12 Jun 2023
When Do Annotator Demographics Matter? Measuring the Influence of
  Annotator Demographics with the POPQUORN Dataset
When Do Annotator Demographics Matter? Measuring the Influence of Annotator Demographics with the POPQUORN Dataset
Jiaxin Pei
David Jurgens
78
34
0
12 Jun 2023
Valley: Video Assistant with Large Language model Enhanced abilitY
Valley: Video Assistant with Large Language model Enhanced abilitY
Ruipu Luo
Ziwang Zhao
Min Yang
Junwei Dong
Da Li
Pengcheng Lu
Tao Wang
Linmei Hu
Ming-Hui Qiu
MLLM
150
209
0
12 Jun 2023
AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural
  Language Processing
AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing
Asaad Alghamdi
Xinyu Duan
Wei Jiang
Zhenhai Wang
Yimeng Wu
...
Yifei Zheng
Mehdi Rezagholizadeh
Baoxing Huai
Peilun Cheng
Abbas Ghaddar
VLM
57
9
0
11 Jun 2023
Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive
  Question Answering
Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering
Hai Ye
Qizhe Xie
Hwee Tou Ng
90
8
0
11 Jun 2023
GPT-Calls: Enhancing Call Segmentation and Tagging by Generating
  Synthetic Conversations via Large Language Models
GPT-Calls: Enhancing Call Segmentation and Tagging by Generating Synthetic Conversations via Large Language Models
Itzik Malkiel
Uri Alon
Yakir Yehuda
Shahar Keren
Oren Barkan
Royi Ronen
Noam Koenigstein
VLM
87
1
0
09 Jun 2023
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT
  that Easy to Detect?
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?
Wissam Antoun
Virginie Mouilleron
Benoît Sagot
Djamé Seddah
DeLMO
85
33
0
09 Jun 2023
Exploring the Responses of Large Language Models to Beginner
  Programmers' Help Requests
Exploring the Responses of Large Language Models to Beginner Programmers' Help Requests
Arto Hellas
Juho Leinonen
Sami Sarsa
Charles Koutcheme
Lilja Kujanpää
Juha Sorva
AI4Ed
69
114
0
09 Jun 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALMOSLMELM
622
4,460
0
09 Jun 2023
Overview of the Problem List Summarization (ProbSum) 2023 Shared Task on
  Summarizing Patients' Active Diagnoses and Problems from Electronic Health
  Record Progress Notes
Overview of the Problem List Summarization (ProbSum) 2023 Shared Task on Summarizing Patients' Active Diagnoses and Problems from Electronic Health Record Progress Notes
Yanjun Gao
Dmitriy Dligach
Timothy A. Miller
M. Churpek
Majid Afshar
83
17
0
08 Jun 2023
Absformer: Transformer-based Model for Unsupervised Multi-Document
  Abstractive Summarization
Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization
M. Trabelsi
H. Uzunalioglu
76
2
0
07 Jun 2023
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large
  Language Models
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models
Yew Ken Chia
Pengfei Hong
Lidong Bing
Soujanya Poria
ELM
79
65
0
07 Jun 2023
Improving Open Language Models by Learning from Organic Interactions
Improving Open Language Models by Learning from Organic Interactions
Jing Xu
Da Ju
Joshua Lane
M. Komeili
Eric Michael Smith
...
Rashel Moritz
Sainbayar Sukhbaatar
Y-Lan Boureau
Jason Weston
Kurt Shuster
71
9
0
07 Jun 2023
Long-form analogies generated by chatGPT lack human-like
  psycholinguistic properties
Long-form analogies generated by chatGPT lack human-like psycholinguistic properties
S. M. Seals
V. Shalin
50
12
0
07 Jun 2023
World Models for Math Story Problems
World Models for Math Story Problems
Andreas Opedal
Niklas Stoehr
Abulhair Saparov
Mrinmaya Sachan
ReLM
112
13
0
07 Jun 2023
Deductive Verification of Chain-of-Thought Reasoning
Deductive Verification of Chain-of-Thought Reasoning
Z. Ling
Yunhao Fang
Xuanlin Li
Zhiao Huang
Mingu Lee
Roland Memisevic
Hao Su
ReLMLRM
121
136
0
06 Jun 2023
Iterative Translation Refinement with Large Language Models
Iterative Translation Refinement with Large Language Models
Pinzhen Chen
Zhicheng Guo
Barry Haddow
Kenneth Heafield
LRM
73
23
0
06 Jun 2023
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis
Zhe Ye
Ziyue Jiang
Yi Ren
Jinglin Liu
Chen Zhang
Xiang Yin
Zejun Ma
Zhou Zhao
93
5
0
06 Jun 2023
Applying Standards to Advance Upstream & Downstream Ethics in Large
  Language Models
Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models
Jose Berengueres
Marybeth Sandell
70
0
0
06 Jun 2023
InstructZero: Efficient Instruction Optimization for Black-Box Large
  Language Models
InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models
Lichang Chen
Jiuhai Chen
Tom Goldstein
Heng-Chiao Huang
Dinesh Manocha
102
45
0
05 Jun 2023
Analyzing Syntactic Generalization Capacity of Pre-trained Language
  Models on Japanese Honorific Conversion
Analyzing Syntactic Generalization Capacity of Pre-trained Language Models on Japanese Honorific Conversion
Ryo Sekizawa
Hitomi Yanaka
20
0
0
05 Jun 2023
PolyVoice: Language Models for Speech to Speech Translation
PolyVoice: Language Models for Speech to Speech Translation
Qianqian Dong
Zhiying Huang
Qiao Tian
Chen Xu
Tom Ko
...
Lu Lu
Zejun Ma
Yuping Wang
Mingxuan Wang
Yuxuan Wang
109
25
0
05 Jun 2023
Leveraging Large Language Models for Topic Classification in the Domain
  of Public Affairs
Leveraging Large Language Models for Topic Classification in the Domain of Public Affairs
Alejandro Peña
Aythami Morales
Julian Fierrez
Ignacio Serna
J. Ortega-Garcia
Iñigo Puente
Jorge Cordova
Gonzalo Cordova
87
20
0
05 Jun 2023
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and
  Generative Fusion
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion
Dongfu Jiang
Xiang Ren
Bill Yuchen Lin
ELM
162
334
0
05 Jun 2023
Evaluation of AI Chatbots for Patient-Specific EHR Questions
Evaluation of AI Chatbots for Patient-Specific EHR Questions
Alaleh Hamidi
Kirk Roberts
ELMLM&MAAI4MH
65
13
0
05 Jun 2023
Evaluating and Improving Tool-Augmented Computation-Intensive Math
  Reasoning
Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning
Beichen Zhang
Kun Zhou
Xilin Wei
Wayne Xin Zhao
Jing Sha
Shijin Wang
Ji-Rong Wen
LRM
119
39
0
04 Jun 2023
On Computational Mechanisms for Shared Intentionality, and Speculation
  on Rationality and Consciousness
On Computational Mechanisms for Shared Intentionality, and Speculation on Rationality and Consciousness
John Rushby
58
0
0
03 Jun 2023
On Optimal Caching and Model Multiplexing for Large Model Inference
On Optimal Caching and Model Multiplexing for Large Model Inference
Banghua Zhu
Ying Sheng
Lianmin Zheng
Clark W. Barrett
Michael I. Jordan
Jiantao Jiao
99
21
0
03 Jun 2023
Fine-Grained Human Feedback Gives Better Rewards for Language Model
  Training
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Zeqiu Wu
Yushi Hu
Weijia Shi
Nouha Dziri
Alane Suhr
Prithviraj Ammanabrolu
Noah A. Smith
Mari Ostendorf
Hannaneh Hajishirzi
ALM
168
336
0
02 Jun 2023
Previous
123...116117118...126127128
Next