ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,380 papers shown
Title
InceptionNeXt: When Inception Meets ConvNeXt
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
191
142
0
29 Mar 2023
Hallucinations in Large Multilingual Translation Models
Hallucinations in Large Multilingual Translation Models
Nuno M. Guerreiro
Duarte M. Alves
Jonas Waldendorf
Barry Haddow
Alexandra Birch
Pierre Colombo
André F.T. Martins
VLMHILMLRM
197
154
0
28 Mar 2023
A Framework for Demonstrating Practical Quantum Advantage: Racing
  Quantum against Classical Generative Models
A Framework for Demonstrating Practical Quantum Advantage: Racing Quantum against Classical Generative Models
Mohamed Hibat-Allah
M. Mauri
Juan Carrasquilla
A. Perdomo-Ortiz
62
10
0
27 Mar 2023
Koala: An Index for Quantifying Overlaps with Pre-training Corpora
Koala: An Index for Quantifying Overlaps with Pre-training Corpora
Thuy-Trang Vu
Xuanli He
Gholamreza Haffari
Ehsan Shareghi
CLL
73
15
0
26 Mar 2023
Prompting Multilingual Large Language Models to Generate Code-Mixed
  Texts: The Case of South East Asian Languages
Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
Zheng-Xin Yong
Ruochen Zhang
Jessica Zosa Forde
Skyler Wang
Arjun Subramonian
...
Yinghua Tan
Long Phan
Rowena Garcia
Thamar Solorio
Alham Fikri Aji
LRM
141
54
0
23 Mar 2023
Neuro-Symbolic Execution of Generic Source Code
Neuro-Symbolic Execution of Generic Source Code
Yaojie Hu
Jin Tian
NAI
85
0
0
23 Mar 2023
Paraphrasing evades detectors of AI-generated text, but retrieval is an
  effective defense
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense
Kalpesh Krishna
Yixiao Song
Marzena Karpinska
John Wieting
Mohit Iyyer
DeLMO
116
325
0
23 Mar 2023
MEGA: Multilingual Evaluation of Generative AI
MEGA: Multilingual Evaluation of Generative AI
Kabir Ahuja
Harshita Diddee
Rishav Hada
Millicent Ochieng
Krithika Ramesh
...
T. Ganu
Sameer Segal
Maxamed Axmed
Kalika Bali
Sunayana Sitaram
LM&MALRMELM
119
292
0
22 Mar 2023
Artificial Intelligence and Dual Contract
Artificial Intelligence and Dual Contract
Qian Qi
50
0
0
22 Mar 2023
The Open-domain Paradox for Chatbots: Common Ground as the Basis for
  Human-like Dialogue
The Open-domain Paradox for Chatbots: Common Ground as the Basis for Human-like Dialogue
Gabriel Skantze
A. Seza Doğruöz
LRM
87
7
0
21 Mar 2023
Transformers in Speech Processing: A Survey
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Muhammad Usama
Junaid Qadir
171
48
0
21 Mar 2023
Capabilities of GPT-4 on Medical Challenge Problems
Capabilities of GPT-4 on Medical Challenge Problems
Harsha Nori
Nicholas King
S. McKinney
Dean Carignan
Eric Horvitz
LM&MAELMAI4MH
156
812
0
20 Mar 2023
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
Zheng-Long Liu
Yue Huang
Xiao-Xing Yu
Lu Zhang
Zihao Wu
...
Dinggang Shen
Quanzheng Li
Tianming Liu
Dajiang Zhu
Xiang Li
LM&MAMedIm
129
178
0
20 Mar 2023
Retrieving Multimodal Information for Augmented Generation: A Survey
Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao
Hailin Chen
Weishi Wang
Fangkai Jiao
Do Xuan Long
...
Bosheng Ding
Xiaobao Guo
Minzhi Li
Xingxuan Li
Shafiq Joty
133
89
0
20 Mar 2023
DialogPaint: A Dialog-based Image Editing Model
DialogPaint: A Dialog-based Image Editing Model
Jingxuan Wei
Shiyu Wu
Xin Jiang
Yequan Wang
KELMDiffM
82
5
0
17 Mar 2023
Video Action Recognition with Attentive Semantic Units
Video Action Recognition with Attentive Semantic Units
Yifei Chen
Dapeng Chen
Ruijin Liu
Hao Li
Wei Peng
69
11
0
17 Mar 2023
Towards the Scalable Evaluation of Cooperativeness in Language Models
Towards the Scalable Evaluation of Cooperativeness in Language Models
Alan Chan
Maxime Riché
Jesse Clifton
LLMAG
81
7
0
16 Mar 2023
GLEN: General-Purpose Event Detection for Thousands of Types
GLEN: General-Purpose Event Detection for Thousands of Types
Qiusi Zhan
Sha Li
Kathryn Conger
Martha Palmer
Heng Ji
Jiawei Han
AI4TS
81
18
0
16 Mar 2023
PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical
  Documents
PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents
Weixiong Lin
Ziheng Zhao
Xiaoman Zhang
Chaoyi Wu
Ya Zhang
Yanfeng Wang
Weidi Xie
LM&MAVLMMedIm
77
159
0
13 Mar 2023
Large Language Models in the Workplace: A Case Study on Prompt
  Engineering for Job Type Classification
Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification
Benjamin Clavié
Alexandru Ciceu
Frederick Naylor
Guillaume Soulié
Thomas Brightwell
LLMAG
45
46
0
13 Mar 2023
Parachute: Evaluating Interactive Human-LM Co-writing Systems
Parachute: Evaluating Interactive Human-LM Co-writing Systems
Hua Shen
Tongshuang Wu
KELM
50
16
0
11 Mar 2023
Susceptibility to Influence of Large Language Models
Susceptibility to Influence of Large Language Models
Lewis D. Griffin
Bennett Kleinberg
Maximilian Mozes
Kimberly T. Mai
Maria Vau
M. Caldwell
Augustine N. Mavor-Parker
112
15
0
10 Mar 2023
Tag2Text: Guiding Vision-Language Model via Image Tagging
Tag2Text: Guiding Vision-Language Model via Image Tagging
Xinyu Huang
Youcai Zhang
Jinyu Ma
Weiwei Tian
Rui Feng
Yuejie Zhang
Yaqian Li
Yandong Guo
Lei Zhang
CLIPMLLMVLM3DV
146
77
0
10 Mar 2023
Personalisation within bounds: A risk taxonomy and policy framework for
  the alignment of large language models with personalised feedback
Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback
Hannah Rose Kirk
Bertie Vidgen
Paul Röttger
Scott A. Hale
106
107
0
09 Mar 2023
disco: a toolkit for Distributional Control of Generative Models
disco: a toolkit for Distributional Control of Generative Models
Germán Kruszewski
Jos Rozen
Marc Dymetman
59
4
0
08 Mar 2023
Byzantine-Robust Loopless Stochastic Variance-Reduced Gradient
Byzantine-Robust Loopless Stochastic Variance-Reduced Gradient
Nikita Fedin
Eduard A. Gorbunov
62
2
0
08 Mar 2023
Streaming Kernel PCA Algorithm With Small Space
Streaming Kernel PCA Algorithm With Small Space
Yichuan Deng
Zhao Song
Zifan Wang
Hangke Zhang
114
4
0
08 Mar 2023
MenuCraft: Interactive Menu System Design with Large Language Models
MenuCraft: Interactive Menu System Design with Large Language Models
Amir Hossein Kargaran
Nafiseh Nikeghbal
Abbas Heydarnoori
Hinrich Schütze
LLMAG
82
4
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
105
554
0
07 Mar 2023
Extracting Accurate Materials Data from Research Papers with
  Conversational Language Models and Prompt Engineering
Extracting Accurate Materials Data from Research Papers with Conversational Language Models and Prompt Engineering
Maciej P. Polak
Dane Morgan
139
182
0
07 Mar 2023
Is ChatGPT a Good NLG Evaluator? A Preliminary Study
Is ChatGPT a Good NLG Evaluator? A Preliminary Study
Jiaan Wang
Yunlong Liang
Fandong Meng
Zengkui Sun
Haoxiang Shi
Zhixu Li
Jinan Xu
Jianfeng Qu
Jie Zhou
LM&MAELMALMAI4MH
194
474
0
07 Mar 2023
Making a Computational Attorney
Making a Computational Attorney
Dell Zhang
Frank Schilder
Jack G. Conrad
Masoud Makrehchi
David von Rickenbach
Isabelle Moulinier
69
1
0
07 Mar 2023
Larger language models do in-context learning differently
Larger language models do in-context learning differently
Jerry W. Wei
Jason W. Wei
Yi Tay
Dustin Tran
Albert Webson
...
Xinyun Chen
Hanxiao Liu
Da Huang
Denny Zhou
Tengyu Ma
ReLMLRM
125
374
0
07 Mar 2023
CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation
  Verification
CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification
Seungone Kim
Se June Joo
Yul Jang
Hyungjoo Chae
Jinyoung Yeo
LRM
71
12
0
07 Mar 2023
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code
  Understanding, Generation, Translation and Retrieval
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Mohammad Abdullah Matin Khan
M Saiful Bari
Xuan Long Do
Weishi Wang
Md. Rizwan Parvez
Shafiq Joty
ALMELM
117
23
0
06 Mar 2023
FoundationTTS: Text-to-Speech for ASR Customization with Generative
  Language Model
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model
Rui Xue
Yanqing Liu
Lei He
Xuejiao Tan
Linquan Liu
Ed Lin
Sheng Zhao
118
7
0
06 Mar 2023
Perspectives on the Social Impacts of Reinforcement Learning with Human
  Feedback
Perspectives on the Social Impacts of Reinforcement Learning with Human Feedback
Gabrielle K. Liu
OffRL
116
21
0
06 Mar 2023
Will Affective Computing Emerge from Foundation Models and General AI? A
  First Evaluation on ChatGPT
Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT
Mostafa M. Amin
Min Zhang
Björn W. Schuller
AI4MH
94
74
0
03 Mar 2023
Investigating the Translation Performance of a Large Multilingual
  Language Model: the Case of BLOOM
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM
Rachel Bawden
François Yvon
VLMLRM
90
65
0
03 Mar 2023
Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering
Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering
Zhou Yu
Xuecheng Ouyang
Zhenwei Shao
Mei Wang
Jun Yu
MLLM
190
11
0
03 Mar 2023
AI and the FCI: Can ChatGPT Project an Understanding of Introductory
  Physics?
AI and the FCI: Can ChatGPT Project an Understanding of Introductory Physics?
Colin G. West
51
62
0
02 Mar 2023
EvoPrompting: Language Models for Code-Level Neural Architecture Search
EvoPrompting: Language Models for Code-Level Neural Architecture Search
Angelica Chen
David Dohan
David R. So
VLMLRM
118
91
0
28 Feb 2023
Goal Driven Discovery of Distributional Differences via Language
  Descriptions
Goal Driven Discovery of Distributional Differences via Language Descriptions
Ruiqi Zhong
Peter Zhang
Steve Li
Jinwoo Ahn
Dan Klein
Jacob Steinhardt
118
53
0
28 Feb 2023
A Survey on Long Text Modeling with Transformers
A Survey on Long Text Modeling with Transformers
Zican Dong
Tianyi Tang
Lunyi Li
Wayne Xin Zhao
VLM
140
57
0
28 Feb 2023
TabGenie: A Toolkit for Table-to-Text Generation
TabGenie: A Toolkit for Table-to-Text Generation
Zdeněk Kasner
E. Garanina
Ondvrej Plátek
Ondrej Dusek
LMTD
70
8
0
27 Feb 2023
Safety without alignment
Safety without alignment
András Kornai
M. Bukatin
Zsolt Zombori
LLMSV
55
0
0
27 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation
AugGPT: Leveraging ChatGPT for Text Data Augmentation
Haixing Dai
Zheng Liu
Wenxiong Liao
Xiaoke Huang
Yihan Cao
...
Lichao Sun
Quanzheng Li
Dinggang Shen
Tianming Liu
Xiang Li
139
160
0
25 Feb 2023
Spanish Built Factual Freectianary (Spanish-BFF): the first AI-generated
  free dictionary
Spanish Built Factual Freectianary (Spanish-BFF): the first AI-generated free dictionary
Miguel Ortega-Martín
Óscar García-Sierra
Alfonso Ardoiz
J. C. Armenteros
Jorge Álvarez
Adrián Alonso
59
2
0
24 Feb 2023
CARE: Collaborative AI-Assisted Reading Environment
CARE: Collaborative AI-Assisted Reading Environment
Dennis Zyska
Nils Dycke
Jan Buchmann
Ilia Kuznetsov
Iryna Gurevych
67
6
0
24 Feb 2023
Aligning Text-to-Image Models using Human Feedback
Aligning Text-to-Image Models using Human Feedback
Kimin Lee
Hao Liu
Moonkyung Ryu
Olivia Watkins
Yuqing Du
Craig Boutilier
Pieter Abbeel
Mohammad Ghavamzadeh
S. Gu
EGVM
162
285
0
23 Feb 2023
Previous
123...121122123...126127128
Next