ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,483 papers shown
Title
A Comparative Study of Pretrained Language Models for Long Clinical Text
A Comparative Study of Pretrained Language Models for Long Clinical Text
Yikuan Li
R. M. Wehbe
F. Ahmad
Hanyin Wang
Yuan Luo
LM&MAELMVLMMedIm
93
86
0
27 Jan 2023
Can We Use Probing to Better Understand Fine-tuning and Knowledge
  Distillation of the BERT NLU?
Can We Use Probing to Better Understand Fine-tuning and Knowledge Distillation of the BERT NLU?
Jakub Ho'scilowicz
Marcin Sowanski
Piotr Czubowski
Artur Janicki
61
2
0
27 Jan 2023
Probing Out-of-Distribution Robustness of Language Models with
  Parameter-Efficient Transfer Learning
Probing Out-of-Distribution Robustness of Language Models with Parameter-Efficient Transfer Learning
Hyunsoo Cho
Choonghyun Park
Junyeop Kim
Sungmin Cho
Kang Min Yoo
Sang-goo Lee
OODD
87
3
0
27 Jan 2023
Deep Quantum Error Correction
Deep Quantum Error Correction
Yoni Choukroun
Lior Wolf
70
11
0
27 Jan 2023
Robust Transformer with Locality Inductive Bias and Feature
  Normalization
Robust Transformer with Locality Inductive Bias and Feature Normalization
Omid Nejati Manzari
Hossein Kashiani
Hojat Asgarian Dehkordi
S. B. Shokouhi
ViT
77
15
0
27 Jan 2023
Projected Subnetworks Scale Adaptation
Projected Subnetworks Scale Adaptation
Siddhartha Datta
N. Shadbolt
VLMCLL
90
0
0
27 Jan 2023
MusicLM: Generating Music From Text
MusicLM: Generating Music From Text
A. Agostinelli
Timo I. Denk
Zalan Borsos
Jesse Engel
Mauro Verzetti
...
Adam Roberts
Marco Tagliasacchi
Matthew Sharifi
Neil Zeghidour
Christian Frank
MGen
152
451
0
26 Jan 2023
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability
  Curvature
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
E. Mitchell
Yoonho Lee
Alexander Khazatsky
Christopher D. Manning
Chelsea Finn
132
632
0
26 Jan 2023
ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on Gradients
ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on Gradients
Guihong Li
Yuedong Yang
Kartikeya Bhardwaj
R. Marculescu
121
63
0
26 Jan 2023
Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt
  Tuning
Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning
Mingyu Derek Ma
Jiun-Yu Kao
Shuyang Gao
Arpit Gupta
Di Jin
Tagyoung Chung
Nanyun Peng
73
7
0
26 Jan 2023
Distilling Cognitive Backdoor Patterns within an Image
Distilling Cognitive Backdoor Patterns within an Image
Hanxun Huang
Xingjun Ma
S. Erfani
James Bailey
AAML
119
26
0
26 Jan 2023
Distilling Text into Circuits
Distilling Text into Circuits
Vincent Wang-Ma'scianica
Jonathon Liu
B. Coecke
90
11
0
25 Jan 2023
Explainable AI does not provide the explanations end-users are asking
  for
Explainable AI does not provide the explanations end-users are asking for
Savio Rozario
G. Cevora
XAI
63
1
0
25 Jan 2023
Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual
  Conditional Generation with Interaction
Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction
Jonathan Pilault
Xavier Garcia
Arthur Bravzinskas
Orhan Firat
AI4CELRM
90
17
0
24 Jan 2023
Large language models can segment narrative events similarly to humans
Large language models can segment narrative events similarly to humans
Sebastian Michelmann
Manoj Kumar
K. A. Norman
Mariya Toneva
67
16
0
24 Jan 2023
Multitask Instruction-based Prompting for Fallacy Recognition
Multitask Instruction-based Prompting for Fallacy Recognition
Tariq Alhindi
Tuhin Chakrabarty
Elena Musi
Smaranda Muresan
LRM
72
30
0
24 Jan 2023
Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware
  Communication Compression
Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression
Jaeyong Song
Jinkyu Yim
Jaewon Jung
Hongsun Jang
H. Kim
Youngsok Kim
Jinho Lee
GNN
74
28
0
24 Jan 2023
AI vs. Human -- Differentiation Analysis of Scientific Content
  Generation
AI vs. Human -- Differentiation Analysis of Scientific Content Generation
Yongqiang Ma
Jiawei Liu
Fan Yi
Qikai Cheng
Yong Huang
Wei Lu
Xiaozhong Liu
DeLMO
112
60
0
24 Jan 2023
Transformer-Patcher: One Mistake worth One Neuron
Transformer-Patcher: One Mistake worth One Neuron
Zeyu Huang
Songlin Yang
Xiaofeng Zhang
Jie Zhou
Wenge Rong
Zhang Xiong
KELM
102
179
0
24 Jan 2023
Mathematics, word problems, common sense, and artificial intelligence
Mathematics, word problems, common sense, and artificial intelligence
E. Davis
AIMat
76
25
0
23 Jan 2023
The Backpropagation algorithm for a math student
The Backpropagation algorithm for a math student
S. Damadi
Golnaz Moharrer
Mostafa Cham
60
4
0
22 Jan 2023
ExClaim: Explainable Neural Claim Verification Using Rationalization
ExClaim: Explainable Neural Claim Verification Using Rationalization
Sai Gurrapu
Lifu Huang
Feras A. Batarseh
AAML
94
9
0
21 Jan 2023
AQuaMaM: An Autoregressive, Quaternion Manifold Model for Rapidly
  Estimating Complex SO(3) Distributions
AQuaMaM: An Autoregressive, Quaternion Manifold Model for Rapidly Estimating Complex SO(3) Distributions
Michael A. Alcorn
51
0
0
21 Jan 2023
Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt
  Learning for Automatic Scoring in Science Education
Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science Education
Xuansheng Wu
Xinyu He
Tianming Li
Ninghao Liu
Xiaoming Zhai
102
26
0
20 Jan 2023
Baechi: Fast Device Placement of Machine Learning Graphs
Baechi: Fast Device Placement of Machine Learning Graphs
Beomyeol Jeon
L. Cai
Chirag Shetty
P. Srivastava
Jintao Jiang
Xiaolan Ke
Yitao Meng
Cong Xie
Indranil Gupta
GNN
47
19
0
20 Jan 2023
Multiview Compressive Coding for 3D Reconstruction
Multiview Compressive Coding for 3D Reconstruction
Chaozheng Wu
Justin Johnson
Jitendra Malik
Christoph Feichtenhofer
Georgia Gkioxari
128
75
0
19 Jan 2023
Language Embeddings Sometimes Contain Typological Generalizations
Language Embeddings Sometimes Contain Typological Generalizations
Robert Östling
Murathan Kurfali
NAI
115
11
0
19 Jan 2023
Towards Rigorous Understanding of Neural Networks via
  Semantics-preserving Transformations
Towards Rigorous Understanding of Neural Networks via Semantics-preserving Transformations
Maximilian Schlüter
Gerrit Nolte
Alnis Murtovi
Bernhard Steffen
75
6
0
19 Jan 2023
Universal Neural-Cracking-Machines: Self-Configurable Password Models
  from Auxiliary Data
Universal Neural-Cracking-Machines: Self-Configurable Password Models from Auxiliary Data
Dario Pasquini
G. Ateniese
Carmela Troncoso
FedML
51
4
0
18 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&RoOffRLAI4CELRM
139
119
0
18 Jan 2023
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation,
  and Detection
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
Biyang Guo
Xin Zhang
Ziyuan Wang
Minqi Jiang
Jinran Nie
Yuxuan Ding
Jianwei Yue
Yupeng Wu
DeLMOELM
132
622
0
18 Jan 2023
Vision Learners Meet Web Image-Text Pairs
Vision Learners Meet Web Image-Text Pairs
Bingchen Zhao
Quan Cui
Hao Wu
Osamu Yoshie
Cheng Yang
Oisin Mac Aodha
VLM
86
5
0
17 Jan 2023
Are Language Models Worse than Humans at Following Prompts? It's
  Complicated
Are Language Models Worse than Humans at Following Prompts? It's Complicated
Albert Webson
A. Loo
Qinan Yu
Ellie Pavlick
LRM
86
17
0
17 Jan 2023
Prompting Large Language Model for Machine Translation: A Case Study
Prompting Large Language Model for Machine Translation: A Case Study
Biao Zhang
Barry Haddow
Alexandra Birch
LRM
141
300
0
17 Jan 2023
Dataset Distillation: A Comprehensive Review
Dataset Distillation: A Comprehensive Review
Ruonan Yu
Songhua Liu
Xinchao Wang
DD
163
132
0
17 Jan 2023
Which Model Shall I Choose? Cost/Quality Trade-offs for Text
  Classification Tasks
Which Model Shall I Choose? Cost/Quality Trade-offs for Text Classification Tasks
Shi Zong
Joshua Seltzer
Jia Pan
Pan
Kathy Cheng
Jimmy J. Lin
80
4
0
17 Jan 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
RILS: Masked Visual Reconstruction in Language Semantic Space
Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
CLIP
95
11
0
17 Jan 2023
Deep Conditional Measure Quantization
Deep Conditional Measure Quantization
G. Turinici
43
1
0
17 Jan 2023
Dissociating language and thought in large language models
Dissociating language and thought in large language models
Kyle Mahowald
Anna A. Ivanova
I. Blank
Nancy Kanwisher
J. Tenenbaum
Evelina Fedorenko
ELMReLM
121
215
0
16 Jan 2023
A Transformer-based Diffusion Probabilistic Model for Heart Rate and
  Blood Pressure Forecasting in Intensive Care Unit
A Transformer-based Diffusion Probabilistic Model for Heart Rate and Blood Pressure Forecasting in Intensive Care Unit
Ping Chang
Huayu Li
S. Quan
Shuyang Lu
Shu-Fen Wung
Janet Roveda
Ao Li
DiffM
131
20
0
16 Jan 2023
PromptShots at the FinNLP-2022 ERAI Tasks: Pairwise Comparison and
  Unsupervised Ranking
PromptShots at the FinNLP-2022 ERAI Tasks: Pairwise Comparison and Unsupervised Ranking
Peratham Wiriyathammabhum
46
4
0
16 Jan 2023
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Kinyugo Maina
85
7
0
16 Jan 2023
Deep Learning Models to Study Sentence Comprehension in the Human Brain
Deep Learning Models to Study Sentence Comprehension in the Human Brain
S. Arana
Jacques Pesnot Lerousseau
P. Hagoort
55
14
0
16 Jan 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with
  Multimodal Models
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
Zhiqiu Lin
Samuel Yu
Zhiyi Kuang
Deepak Pathak
Deva Ramana
VLM
152
117
0
16 Jan 2023
Bike Frames: Understanding the Implicit Portrayal of Cyclists in the
  News
Bike Frames: Understanding the Implicit Portrayal of Cyclists in the News
Xingmeng Zhao
Dan Schumacher
Sashank Nalluri
Xavier Walton
Suhana Shrestha
Anthony Rios
55
2
0
15 Jan 2023
Improving Reliability of Fine-tuning with Block-wise Optimisation
Improving Reliability of Fine-tuning with Block-wise Optimisation
Basel Barakat
Qiang Huang
60
1
0
15 Jan 2023
Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Siyuan Huang
Zan Wang
Puhao Li
Baoxiong Jia
Tengyu Liu
Yixin Zhu
Wei Liang
Song-Chun Zhu
DiffM
128
218
0
15 Jan 2023
Rationalizing Predictions by Adversarial Information Calibration
Rationalizing Predictions by Adversarial Information Calibration
Lei Sha
Oana-Maria Camburu
Thomas Lukasiewicz
67
7
0
15 Jan 2023
World Models and Predictive Coding for Cognitive and Developmental
  Robotics: Frontiers and Challenges
World Models and Predictive Coding for Cognitive and Developmental Robotics: Frontiers and Challenges
T. Taniguchi
Shingo Murata
Masahiro Suzuki
D. Ognibene
Pablo Lanillos
...
L. Jamone
Tomoaki Nakamura
Alejandra Ciria
B. Lara
G. Pezzulo
103
57
0
14 Jan 2023
A Comprehensive Survey of Dataset Distillation
A Comprehensive Survey of Dataset Distillation
Shiye Lei
Dacheng Tao
DD
106
93
0
13 Jan 2023
Previous
123...166167168...248249250
Next