ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLM
    ALM
ArXivPDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 4,678 papers shown
Title
VPGTrans: Transfer Visual Prompt Generator across LLMs
VPGTrans: Transfer Visual Prompt Generator across LLMs
Ao Zhang
Hao Fei
Yuan Yao
Wei Ji
Li Li
Zhiyuan Liu
Tat-Seng Chua
MLLM
VLM
35
85
0
02 May 2023
Semi-Infinitely Constrained Markov Decision Processes and Efficient
  Reinforcement Learning
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning
Liangyu Zhang
Yang Peng
Wenhao Yang
Zhihua Zhang
21
1
0
29 Apr 2023
CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to
  Guardrail Models for Virtual Assistants
CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to Guardrail Models for Virtual Assistants
A. Sun
Varun Nair
Elliot Schumacher
Anitha Kannan
32
3
0
27 Apr 2023
Multimodal Grounding for Embodied AI via Augmented Reality Headsets for
  Natural Language Driven Task Planning
Multimodal Grounding for Embodied AI via Augmented Reality Headsets for Natural Language Driven Task Planning
Selma Wanna
Fabian Parra
R. Valner
Karl Kruusamäe
Mitch Pryor
LM&Ro
26
2
0
26 Apr 2023
Exploring the Curious Case of Code Prompts
Exploring the Curious Case of Code Prompts
Li Zhang
Liam Dugan
Hainiu Xu
Chris Callison-Burch
LRM
45
14
0
26 Apr 2023
SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
Bin Wang
Xinnian Liang
Jian Yang
Huijia Huang
Shuangzhi Wu
Peihao Wu
Lu Lu
Zejun Ma
Zhoujun Li
LLMAG
KELM
RALM
96
26
0
26 Apr 2023
AGI: Artificial General Intelligence for Education
AGI: Artificial General Intelligence for Education
Ehsan Latif
Gengchen Mai
Matthew Nyaaba
Xuansheng Wu
Ninghao Liu
Guoyu Lu
Sheng Li
Tianming Liu
Xiaoming Zhai
ELM
AI4CE
35
22
0
24 Apr 2023
AMR Parsing with Instruction Fine-tuned Pre-trained Language Models
AMR Parsing with Instruction Fine-tuned Pre-trained Language Models
Young-Suk Lee
Ramón Fernández Astudillo
Radu Florian
Tahira Naseem
Salim Roukos
30
4
0
24 Apr 2023
AI, write an essay for me: A large-scale comparison of human-written
  versus ChatGPT-generated essays
AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays
Steffen Herbold
Annette Hautli-Janisz
Ute Heuer
Zlata Kikteva
Alexander Trautsch
DeLMO
85
23
0
24 Apr 2023
Topological properties and organizing principles of semantic networks
Topological properties and organizing principles of semantic networks
Gabriel Budel
Yingzi Jin
P. Mieghem
M. Kitsak
11
5
0
24 Apr 2023
ChatLLM Network: More brains, More intelligence
ChatLLM Network: More brains, More intelligence
Rui Hao
Linmei Hu
Weijian Qi
Qingliu Wu
Yirui Zhang
Liqiang Nie
LLMAG
ALM
LRM
27
35
0
24 Apr 2023
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in
  Large Language Models
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
Jiashuo Sun
Yi Luo
Yeyun Gong
Chen Lin
Yelong Shen
Jian Guo
Nan Duan
LRM
41
19
0
23 Apr 2023
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
Archiki Prasad
Swarnadeep Saha
Xiang Zhou
Joey Tianyi Zhou
LRM
32
45
0
21 Apr 2023
OptoGPT: A Foundation Model for Inverse Design in Optical Multilayer
  Thin Film Structures
OptoGPT: A Foundation Model for Inverse Design in Optical Multilayer Thin Film Structures
Taigao Ma
Haozhu Wang
L. J. Guo
22
18
0
20 Apr 2023
MasakhaNEWS: News Topic Classification for African languages
MasakhaNEWS: News Topic Classification for African languages
David Ifeoluwa Adelani
Marek Masiak
Israel Abebe Azime
Jesujoba Oluwadara Alabi
A. Tonja
...
Moges Ahmed Mehamed
Evrard Ngabire
Jules Jules
Ivan Ssenkungu
Pontus Stenetorp
28
24
0
19 Apr 2023
A Latent Space Theory for Emergent Abilities in Large Language Models
A Latent Space Theory for Emergent Abilities in Large Language Models
Hui Jiang
LRM
25
35
0
19 Apr 2023
Progressive-Hint Prompting Improves Reasoning in Large Language Models
Progressive-Hint Prompting Improves Reasoning in Large Language Models
Chuanyang Zheng
Zhengying Liu
Enze Xie
Zhenguo Li
Yu Li
LLMAG
ReLM
LRM
41
103
0
19 Apr 2023
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes
Simran Arora
Brandon Yang
Sabri Eyuboglu
A. Narayan
Andrew Hojel
Immanuel Trummer
Christopher Ré
SyDa
47
69
0
19 Apr 2023
Learning to Compress Prompts with Gist Tokens
Learning to Compress Prompts with Gist Tokens
Jesse Mu
Xiang Lisa Li
Noah D. Goodman
VLM
53
206
0
17 Apr 2023
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction
  Tuning
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning
Qian Liu
Fan Zhou
Zhengbao Jiang
Longxu Dou
Min-Bin Lin
18
17
0
17 Apr 2023
SikuGPT: A Generative Pre-trained Model for Intelligent Information
  Processing of Ancient Texts from the Perspective of Digital Humanities
SikuGPT: A Generative Pre-trained Model for Intelligent Information Processing of Ancient Texts from the Perspective of Digital Humanities
Chang Liu
Dongbo Wang
Zhixiao Zhao
Die Hu
Mengcheng Wu
...
Si Shen
Bin Li
Jiangfeng Liu
Hai Zhang
Lianzheng Zhao
21
9
0
16 Apr 2023
A Comprehensive Evaluation of Neural SPARQL Query Generation from
  Natural Language Questions
A Comprehensive Evaluation of Neural SPARQL Query Generation from Natural Language Questions
Papa Abdou Karim Karou Diallo
Samuel Reyd
Amal Zouaq
11
6
0
16 Apr 2023
One Explanation Does Not Fit XIL
One Explanation Does Not Fit XIL
Felix Friedrich
David Steinmann
Kristian Kersting
LRM
37
2
0
14 Apr 2023
HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge
HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge
Hao Wang
Chi-Liang Liu
Nuwa Xi
Zewen Qiang
Sendong Zhao
Bing Qin
Ting Liu
LM&MA
ALM
19
198
0
14 Apr 2023
On the Opportunities and Challenges of Foundation Models for Geospatial
  Artificial Intelligence
On the Opportunities and Challenges of Foundation Models for Geospatial Artificial Intelligence
Gengchen Mai
Weiming Huang
Jin Sun
Suhang Song
Deepak Mishra
...
Yingjie Hu
Chris Cundy
Ziyuan Li
Rui Zhu
Ni Lao
AI4CE
32
123
0
13 Apr 2023
Automated Mapping of CVE Vulnerability Records to MITRE CWE Weaknesses
Automated Mapping of CVE Vulnerability Records to MITRE CWE Weaknesses
Ashraf Haddad
N. Aaraj
Preslav Nakov
Septimiu Fabian Mare
16
4
0
13 Apr 2023
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
Hanze Dong
Wei Xiong
Deepanshu Goyal
Yihan Zhang
Winnie Chow
Rui Pan
Shizhe Diao
Jipeng Zhang
Kashun Shum
Tong Zhang
ALM
18
408
0
13 Apr 2023
Learning Personalized Decision Support Policies
Learning Personalized Decision Support Policies
Umang Bhatt
Valerie Chen
Katherine M. Collins
Parameswaran Kamalaruban
Emma Kallina
Adrian Weller
Ameet Talwalkar
OffRL
56
10
0
13 Apr 2023
How Useful are Educational Questions Generated by Large Language Models?
How Useful are Educational Questions Generated by Large Language Models?
Sabina Elkins
E. Kochmar
Jackie C.K. Cheung
Iulian Serban
ELM
AI4Ed
35
31
0
13 Apr 2023
ChatGPT Needs SPADE (Sustainability, PrivAcy, Digital divide, and Ethics) Evaluation: A Review
ChatGPT Needs SPADE (Sustainability, PrivAcy, Digital divide, and Ethics) Evaluation: A Review
Sunder Ali Khowaja
P. Khuwaja
K. Dev
Weizheng Wang
Lewis Nkenyereye
29
76
0
13 Apr 2023
Are LLMs All You Need for Task-Oriented Dialogue?
Are LLMs All You Need for Task-Oriented Dialogue?
Vojtvech Hudevcek
Ondrej Dusek
26
57
0
13 Apr 2023
AGI for Agriculture
AGI for Agriculture
Guoyu Lu
Sheng Li
Gengchen Mai
Jin Sun
Dajiang Zhu
...
R. Xu
Daniel Petti
Changying Li
Tianming Liu
Changying Li
AI4CE
48
17
0
12 Apr 2023
chatClimate: Grounding Conversational AI in Climate Science
chatClimate: Grounding Conversational AI in Climate Science
S. Vaghefi
Qian Wang
V. Muccione
Jingwei Ni
Mathias Kraus
...
Tobias Schimanski
Chiara Colesanti-Senni
Nicolas Webersinke
Christrian Huggel
Markus Leippold
KELM
AI4MH
HILM
27
67
0
11 Apr 2023
A Survey of Resources and Methods for Natural Language Processing of
  Serbian Language
A Survey of Resources and Methods for Natural Language Processing of Serbian Language
U. Marovac
A. Avdić
Nikola Milosevic
36
1
0
11 Apr 2023
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image
  Models
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
Eslam Mohamed Bakr
Pengzhan Sun
Xiaoqian Shen
Faizan Farooq Khan
Li Erran Li
Mohamed Elhoseiny
VLM
24
76
0
11 Apr 2023
Multi-step Jailbreaking Privacy Attacks on ChatGPT
Multi-step Jailbreaking Privacy Attacks on ChatGPT
Haoran Li
Dadi Guo
Wei Fan
Mingshi Xu
Jie Huang
Fanpu Meng
Yangqiu Song
SILM
59
322
0
11 Apr 2023
Human-machine cooperation for semantic feature listing
Human-machine cooperation for semantic feature listing
Kushin Mukherjee
Siddharth Suresh
Timothy T. Rogers
VLM
17
2
0
11 Apr 2023
Automated Reading Passage Generation with OpenAI's Large Language Model
Automated Reading Passage Generation with OpenAI's Large Language Model
Ummugul Bezirhan
M. Davier
AI4Ed
21
23
0
10 Apr 2023
Graph-ToolFormer: To Empower LLMs with Graph Reasoning Ability via
  Prompt Augmented by ChatGPT
Graph-ToolFormer: To Empower LLMs with Graph Reasoning Ability via Prompt Augmented by ChatGPT
Jiawei Zhang
LRM
45
76
0
10 Apr 2023
OpenAGI: When LLM Meets Domain Experts
OpenAGI: When LLM Meets Domain Experts
Yingqiang Ge
Wenyue Hua
Kai Mei
Jianchao Ji
Juntao Tan
Shuyuan Xu
Zelong Li
Yongfeng Zhang
VLM
LRM
43
212
0
10 Apr 2023
VOICE: Visual Oracle for Interaction, Conversation, and Explanation
VOICE: Visual Oracle for Interaction, Conversation, and Explanation
Donggang Jia
Alexandra Irger
Lonni Besancon
Ondrej Strnad
Deng Luo
Johanna Björklund
Anders Ynnerman
I. Viola
27
2
0
08 Apr 2023
Towards Generating Functionally Correct Code Edits from Natural Language
  Issue Descriptions
Towards Generating Functionally Correct Code Edits from Natural Language Issue Descriptions
Sarah Fakhoury
Saikat Chakraborty
Madan Musuvathi
Shuvendu K. Lahiri
38
21
0
07 Apr 2023
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language
  Models
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models
Emilio Ferrara
SILM
36
247
0
07 Apr 2023
Large language models effectively leverage document-level context for
  literary translation, but critical errors persist
Large language models effectively leverage document-level context for literary translation, but critical errors persist
Marzena Karpinska
Mohit Iyyer
31
82
0
06 Apr 2023
Quantifying the Roles of Visual, Linguistic, and Visual-Linguistic
  Complexity in Verb Acquisition
Quantifying the Roles of Visual, Linguistic, and Visual-Linguistic Complexity in Verb Acquisition
Yuchen Zhou
Michael J. Tarr
Daniel Yurovsky
24
2
0
05 Apr 2023
Document-Level Machine Translation with Large Language Models
Document-Level Machine Translation with Large Language Models
Longyue Wang
Chenyang Lyu
Tianbo Ji
Zhirui Zhang
Dian Yu
Shuming Shi
Zhaopeng Tu
ELM
28
116
0
05 Apr 2023
Towards Self-Explainability of Deep Neural Networks with Heatmap
  Captioning and Large-Language Models
Towards Self-Explainability of Deep Neural Networks with Heatmap Captioning and Large-Language Models
Osman Tursun
Simon Denman
Sridha Sridharan
Clinton Fookes
ViT
VLM
16
6
0
05 Apr 2023
The Vector Grounding Problem
The Vector Grounding Problem
Dimitri Coelho Mollo
Raphael Milliere
44
26
0
04 Apr 2023
Multi-Modal Perceiver Language Model for Outcome Prediction in Emergency
  Department
Multi-Modal Perceiver Language Model for Outcome Prediction in Emergency Department
Sabri Boughorbel
Fethi Jarray
Abdulaziz Yousuf Al-Homaid
Rashid Niaz
Khalid Alyafei
32
0
0
03 Apr 2023
Evaluating Large Language Models on a Highly-specialized Topic,
  Radiation Oncology Physics
Evaluating Large Language Models on a Highly-specialized Topic, Radiation Oncology Physics
J. Holmes
Zheng Liu
Lian-Cheng Zhang
Yuzhen Ding
Terence T. Sio
...
Jonathan B. Ashman
Xiang Li
Tianming Liu
Jiajian Shen
Wei Liu
LM&MA
AI4CE
ELM
30
120
0
01 Apr 2023
Previous
123...878889...929394
Next