Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.14165
Cited By
v1
v2
v3
v4 (latest)
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Language Models are Few-Shot Learners"
50 / 12,483 papers shown
Title
A Comparative Study of Pretrained Language Models for Long Clinical Text
Yikuan Li
R. M. Wehbe
F. Ahmad
Hanyin Wang
Yuan Luo
LM&MA
ELM
VLM
MedIm
93
86
0
27 Jan 2023
Can We Use Probing to Better Understand Fine-tuning and Knowledge Distillation of the BERT NLU?
Jakub Ho'scilowicz
Marcin Sowanski
Piotr Czubowski
Artur Janicki
61
2
0
27 Jan 2023
Probing Out-of-Distribution Robustness of Language Models with Parameter-Efficient Transfer Learning
Hyunsoo Cho
Choonghyun Park
Junyeop Kim
Sungmin Cho
Kang Min Yoo
Sang-goo Lee
OODD
87
3
0
27 Jan 2023
Deep Quantum Error Correction
Yoni Choukroun
Lior Wolf
70
11
0
27 Jan 2023
Robust Transformer with Locality Inductive Bias and Feature Normalization
Omid Nejati Manzari
Hossein Kashiani
Hojat Asgarian Dehkordi
S. B. Shokouhi
ViT
77
15
0
27 Jan 2023
Projected Subnetworks Scale Adaptation
Siddhartha Datta
N. Shadbolt
VLM
CLL
90
0
0
27 Jan 2023
MusicLM: Generating Music From Text
A. Agostinelli
Timo I. Denk
Zalan Borsos
Jesse Engel
Mauro Verzetti
...
Adam Roberts
Marco Tagliasacchi
Matthew Sharifi
Neil Zeghidour
Christian Frank
MGen
152
451
0
26 Jan 2023
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
E. Mitchell
Yoonho Lee
Alexander Khazatsky
Christopher D. Manning
Chelsea Finn
132
632
0
26 Jan 2023
ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on Gradients
Guihong Li
Yuedong Yang
Kartikeya Bhardwaj
R. Marculescu
121
63
0
26 Jan 2023
Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning
Mingyu Derek Ma
Jiun-Yu Kao
Shuyang Gao
Arpit Gupta
Di Jin
Tagyoung Chung
Nanyun Peng
73
7
0
26 Jan 2023
Distilling Cognitive Backdoor Patterns within an Image
Hanxun Huang
Xingjun Ma
S. Erfani
James Bailey
AAML
119
26
0
26 Jan 2023
Distilling Text into Circuits
Vincent Wang-Ma'scianica
Jonathon Liu
B. Coecke
90
11
0
25 Jan 2023
Explainable AI does not provide the explanations end-users are asking for
Savio Rozario
G. Cevora
XAI
63
1
0
25 Jan 2023
Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction
Jonathan Pilault
Xavier Garcia
Arthur Bravzinskas
Orhan Firat
AI4CE
LRM
90
17
0
24 Jan 2023
Large language models can segment narrative events similarly to humans
Sebastian Michelmann
Manoj Kumar
K. A. Norman
Mariya Toneva
67
16
0
24 Jan 2023
Multitask Instruction-based Prompting for Fallacy Recognition
Tariq Alhindi
Tuhin Chakrabarty
Elena Musi
Smaranda Muresan
LRM
72
30
0
24 Jan 2023
Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression
Jaeyong Song
Jinkyu Yim
Jaewon Jung
Hongsun Jang
H. Kim
Youngsok Kim
Jinho Lee
GNN
74
28
0
24 Jan 2023
AI vs. Human -- Differentiation Analysis of Scientific Content Generation
Yongqiang Ma
Jiawei Liu
Fan Yi
Qikai Cheng
Yong Huang
Wei Lu
Xiaozhong Liu
DeLMO
112
60
0
24 Jan 2023
Transformer-Patcher: One Mistake worth One Neuron
Zeyu Huang
Songlin Yang
Xiaofeng Zhang
Jie Zhou
Wenge Rong
Zhang Xiong
KELM
102
179
0
24 Jan 2023
Mathematics, word problems, common sense, and artificial intelligence
E. Davis
AIMat
76
25
0
23 Jan 2023
The Backpropagation algorithm for a math student
S. Damadi
Golnaz Moharrer
Mostafa Cham
60
4
0
22 Jan 2023
ExClaim: Explainable Neural Claim Verification Using Rationalization
Sai Gurrapu
Lifu Huang
Feras A. Batarseh
AAML
94
9
0
21 Jan 2023
AQuaMaM: An Autoregressive, Quaternion Manifold Model for Rapidly Estimating Complex SO(3) Distributions
Michael A. Alcorn
51
0
0
21 Jan 2023
Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science Education
Xuansheng Wu
Xinyu He
Tianming Li
Ninghao Liu
Xiaoming Zhai
102
26
0
20 Jan 2023
Baechi: Fast Device Placement of Machine Learning Graphs
Beomyeol Jeon
L. Cai
Chirag Shetty
P. Srivastava
Jintao Jiang
Xiaolan Ke
Yitao Meng
Cong Xie
Indranil Gupta
GNN
47
19
0
20 Jan 2023
Multiview Compressive Coding for 3D Reconstruction
Chaozheng Wu
Justin Johnson
Jitendra Malik
Christoph Feichtenhofer
Georgia Gkioxari
128
75
0
19 Jan 2023
Language Embeddings Sometimes Contain Typological Generalizations
Robert Östling
Murathan Kurfali
NAI
115
11
0
19 Jan 2023
Towards Rigorous Understanding of Neural Networks via Semantics-preserving Transformations
Maximilian Schlüter
Gerrit Nolte
Alnis Murtovi
Bernhard Steffen
75
6
0
19 Jan 2023
Universal Neural-Cracking-Machines: Self-Configurable Password Models from Auxiliary Data
Dario Pasquini
G. Ateniese
Carmela Troncoso
FedML
51
4
0
18 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
139
119
0
18 Jan 2023
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
Biyang Guo
Xin Zhang
Ziyuan Wang
Minqi Jiang
Jinran Nie
Yuxuan Ding
Jianwei Yue
Yupeng Wu
DeLMO
ELM
132
622
0
18 Jan 2023
Vision Learners Meet Web Image-Text Pairs
Bingchen Zhao
Quan Cui
Hao Wu
Osamu Yoshie
Cheng Yang
Oisin Mac Aodha
VLM
86
5
0
17 Jan 2023
Are Language Models Worse than Humans at Following Prompts? It's Complicated
Albert Webson
A. Loo
Qinan Yu
Ellie Pavlick
LRM
86
17
0
17 Jan 2023
Prompting Large Language Model for Machine Translation: A Case Study
Biao Zhang
Barry Haddow
Alexandra Birch
LRM
141
300
0
17 Jan 2023
Dataset Distillation: A Comprehensive Review
Ruonan Yu
Songhua Liu
Xinchao Wang
DD
163
132
0
17 Jan 2023
Which Model Shall I Choose? Cost/Quality Trade-offs for Text Classification Tasks
Shi Zong
Joshua Seltzer
Jia Pan
Pan
Kathy Cheng
Jimmy J. Lin
80
4
0
17 Jan 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
CLIP
95
11
0
17 Jan 2023
Deep Conditional Measure Quantization
G. Turinici
43
1
0
17 Jan 2023
Dissociating language and thought in large language models
Kyle Mahowald
Anna A. Ivanova
I. Blank
Nancy Kanwisher
J. Tenenbaum
Evelina Fedorenko
ELM
ReLM
121
215
0
16 Jan 2023
A Transformer-based Diffusion Probabilistic Model for Heart Rate and Blood Pressure Forecasting in Intensive Care Unit
Ping Chang
Huayu Li
S. Quan
Shuyang Lu
Shu-Fen Wung
Janet Roveda
Ao Li
DiffM
131
20
0
16 Jan 2023
PromptShots at the FinNLP-2022 ERAI Tasks: Pairwise Comparison and Unsupervised Ranking
Peratham Wiriyathammabhum
46
4
0
16 Jan 2023
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
Kinyugo Maina
85
7
0
16 Jan 2023
Deep Learning Models to Study Sentence Comprehension in the Human Brain
S. Arana
Jacques Pesnot Lerousseau
P. Hagoort
55
14
0
16 Jan 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
Zhiqiu Lin
Samuel Yu
Zhiyi Kuang
Deepak Pathak
Deva Ramana
VLM
152
117
0
16 Jan 2023
Bike Frames: Understanding the Implicit Portrayal of Cyclists in the News
Xingmeng Zhao
Dan Schumacher
Sashank Nalluri
Xavier Walton
Suhana Shrestha
Anthony Rios
55
2
0
15 Jan 2023
Improving Reliability of Fine-tuning with Block-wise Optimisation
Basel Barakat
Qiang Huang
60
1
0
15 Jan 2023
Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Siyuan Huang
Zan Wang
Puhao Li
Baoxiong Jia
Tengyu Liu
Yixin Zhu
Wei Liang
Song-Chun Zhu
DiffM
128
218
0
15 Jan 2023
Rationalizing Predictions by Adversarial Information Calibration
Lei Sha
Oana-Maria Camburu
Thomas Lukasiewicz
67
7
0
15 Jan 2023
World Models and Predictive Coding for Cognitive and Developmental Robotics: Frontiers and Challenges
T. Taniguchi
Shingo Murata
Masahiro Suzuki
D. Ognibene
Pablo Lanillos
...
L. Jamone
Tomoaki Nakamura
Alejandra Ciria
B. Lara
G. Pezzulo
103
57
0
14 Jan 2023
A Comprehensive Survey of Dataset Distillation
Shiye Lei
Dacheng Tao
DD
106
93
0
13 Jan 2023
Previous
1
2
3
...
166
167
168
...
248
249
250
Next