2005.14165
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing
"Language Models are Few-Shot Learners"
50 / 12,243 papers shown
Title
Neural Text Classification by Jointly Learning to Cluster and Align
Yekun Chai
Haidong Zhang
Shuo Jin
44
2
0
24 Nov 2020
Argument from Old Man's View: Assessing Social Bias in Argumentation
Maximilian Spliethöver
Henning Wachsmuth
54
20
0
24 Nov 2020
GLGE: A New General Language Generation Evaluation Benchmark
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
...
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
ELM
109
66
0
24 Nov 2020
Language guided machine action
Feng Qi
LM&Ro
8
0
0
23 Nov 2020
Investigating Emotion-Color Association in Deep Neural Networks
Shivi Gupta
Shashi Kant Gupta
11
2
0
22 Nov 2020
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
R. Child
BDL
VLM
190
353
0
20 Nov 2020
ONION: A Simple and Effective Defense Against Textual Backdoor Attacks
Fanchao Qi
Yangyi Chen
Mukai Li
Yuan Yao
Zhiyuan Liu
Maosong Sun
AAML
109
283
0
20 Nov 2020
ClickTrain: Efficient and Accurate End-to-End Deep Learning Training via Fine-Grained Architecture-Preserving Pruning
Chengming Zhang
Geng Yuan
Wei Niu
Jiannan Tian
Sian Jin
...
Zhe Jiang
Yanzhi Wang
Bin Ren
Shuaiwen Leon Song
Dingwen Tao
3DV
61
1
0
20 Nov 2020
Deep Neural Networks using a Single Neuron: Folded-in-Time Architecture using Feedback-Modulated Delay Loops
Florian Stelzer
André Röhm
Raul Vicente
Ingo Fischer
University of Tartu
AI4CE
73
48
0
19 Nov 2020
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning
Zhenda Xie
Yutong Lin
Zheng Zhang
Yue Cao
Stephen Lin
Han Hu
SSL
120
415
0
19 Nov 2020
A Definition and a Test for Human-Level Artificial Intelligence
Deokgun Park
Md Ashaduzzaman Rubel Mondol
Aishwarya Pothula
Mazharul Islam
VLM
14
4
0
18 Nov 2020
Whale: Efficient Giant Model Training over Heterogeneous GPUs
Xianyan Jia
Le Jiang
Ang Wang
Wencong Xiao
Ziji Shi
...
Lan-yue Chen
Yong Li
Zhen Zheng
Xiaoyong Liu
Wei Lin
78
56
0
18 Nov 2020
Do Fine-tuned Commonsense Language Models Really Generalize?
Mayank Kejriwal
Ke Shen
ELM
LRM
57
10
0
18 Nov 2020
A Novel Memory-Efficient Deep Learning Training Framework via Error-Bounded Lossy Compression
Sian Jin
Guanpeng Li
Shuaiwen Leon Song
Dingwen Tao
AI4CE
65
12
0
18 Nov 2020
A Review of Generalized Zero-Shot Learning Methods
Farhad Pourpanah
Moloud Abdar
Yuxuan Luo
Xinlei Zhou
Ran Wang
C. P. Lim
Xizhao Wang
Q. M. Jonathan Wu
VLM
147
358
0
17 Nov 2020
MGIC: Multigrid-in-Channels Neural Network Architectures
Moshe Eliasof
Jonathan Ephrath
Lars Ruthotto
Eran Treister
94
8
0
17 Nov 2020
Learning from Task Descriptions
Orion Weller
Nicholas Lourie
Matt Gardner
Matthew E. Peters
113
91
0
16 Nov 2020
A partition-based similarity for classification distributions
Hayden S. Helm
Ronak D. Mehta
Brandon Duderstadt
Weiwei Yang
Christopher M. White
Ali Geisa
Joshua T. Vogelstein
Carey E. Priebe
34
6
0
12 Nov 2020
Fairness and Robustness in Invariant Learning: A Case Study in Toxicity Classification
Robert Adragna
Elliot Creager
David Madras
R. Zemel
OOD
FaML
80
43
0
12 Nov 2020
Hurricane Forecasting: A Novel Multimodal Machine Learning Framework
L. Boussioux
C. Zeng
Théo Guénais
Dimitris Bertsimas
53
40
0
11 Nov 2020
When Do You Need Billions of Words of Pretraining Data?
Yian Zhang
Alex Warstadt
Haau-Sing Li
Samuel R. Bowman
62
141
0
10 Nov 2020
Multi-document Summarization via Deep Learning Techniques: A Survey
Congbo Ma
W. Zhang
Mingyu Guo
Hu Wang
Quan Z. Sheng
125
129
0
10 Nov 2020
An Analysis of Dataset Overlap on Winograd-Style Tasks
Ali Emami
Adam Trischler
Kaheer Suleman
Jackie C.K. Cheung
76
22
0
09 Nov 2020
Improving Neural Network Training in Low Dimensional Random Bases
Frithjof Gressmann
Zach Eaton-Rosen
Carlo Luschi
78
28
0
09 Nov 2020
Know What You Don't Need: Single-Shot Meta-Pruning for Attention Heads
Zhengyan Zhang
Fanchao Qi
Zhiyuan Liu
Qun Liu
Maosong Sun
VLM
86
31
0
07 Nov 2020
Exploring the limits of Concurrency in ML Training on Google TPUs
Sameer Kumar
James Bradbury
C. Young
Yu Emma Wang
Anselm Levskaya
...
Tao Wang
Tayo Oguntebi
Yazhou Zu
Yuanzhong Xu
Andy Swing
BDL
AIMat
MoE
LRM
64
27
0
07 Nov 2020
Machine Generation and Detection of Arabic Manipulated and Fake News
El Moatez Billah Nagoudi
AbdelRahim Elmadany
Muhammad Abdul-Mageed
Tariq Alhindi
H. Cavusoglu
DeLMO
89
52
0
05 Nov 2020
Detecting Hallucinated Content in Conditional Neural Sequence Generation
Chunting Zhou
Graham Neubig
Jiatao Gu
Mona T. Diab
P. Guzmán
Luke Zettlemoyer
Marjan Ghazvininejad
HILM
133
200
0
05 Nov 2020
Rearrangement: A Challenge for Embodied AI
Dhruv Batra
Angel X. Chang
Sonia Chernova
Andrew J. Davison
Jia Deng
...
Jitendra Malik
Igor Mordatch
Roozbeh Mottaghi
Manolis Savva
Hao Su
LM&Ro
114
220
0
03 Nov 2020
Automatic Detection of Machine Generated Text: A Critical Survey
Ganesh Jawahar
Muhammad Abdul-Mageed
L. Lakshmanan
DeLMO
81
239
0
02 Nov 2020
Emergent Communication Pretraining for Few-Shot Machine Translation
Yaoyiran Li
Edoardo Ponti
Ivan Vulić
Anna Korhonen
106
19
0
02 Nov 2020
Training EfficientNets at Supercomputer Scale: 83% ImageNet Top-1 Accuracy in One Hour
Arissa Wongpanich
Hieu H. Pham
J. Demmel
Mingxing Tan
Quoc V. Le
Yang You
Sameer Kumar
78
8
0
30 Oct 2020
A New Neural Search and Insights Platform for Navigating and Organizing AI Research
Marzieh Fadaee
Olga Gureenkova
Fernando Rejon Barrera
Carsten Schnober
W. Weerkamp
Jakub Zavrel
43
7
0
30 Oct 2020
Topic-Preserving Synthetic News Generation: An Adversarial Deep Reinforcement Learning Approach
Ahmadreza Mosallanezhad
Kai Shu
Huan Liu
48
10
0
30 Oct 2020
AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts
Taylor Shin
Yasaman Razeghi
Robert L Logan IV
Eric Wallace
Sameer Singh
KELM
108
407
0
29 Oct 2020
Melody-Conditioned Lyrics Generation with SeqGANs
Yihao Chen
Alexander Lerch
GAN
MGen
82
29
0
28 Oct 2020
Scaling Laws for Autoregressive Generative Modeling
T. Henighan
Jared Kaplan
Mor Katz
Mark Chen
Christopher Hesse
...
Nick Ryder
Daniel M. Ziegler
John Schulman
Dario Amodei
Sam McCandlish
121
433
0
28 Oct 2020
Predicting Themes within Complex Unstructured Texts: A Case Study on Safeguarding Reports
A. Edwards
David Rogers
Jose Camacho-Collados
Hélène de Ribaupierre
Alun D. Preece
72
1
0
27 Oct 2020
A Statistical Framework for Low-bitwidth Training of Deep Neural Networks
Jianfei Chen
Yujie Gai
Z. Yao
Michael W. Mahoney
Joseph E. Gonzalez
MQ
68
59
0
27 Oct 2020
Out-of-core Training for Extremely Large-Scale Neural Networks With Adaptive Window-Based Scheduling
Akio Hayakawa
T. Narihira
26
4
0
27 Oct 2020
Dutch Humor Detection by Generating Negative Examples
Thomas Winters
Pieter Delobelle
115
11
0
26 Oct 2020
Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification
Timo Schick
Helmut Schmid
Hinrich Schütze
VLM
92
208
0
26 Oct 2020
Accelerating Training of Transformer-Based Language Models with Progressive Layer Dropping
Minjia Zhang
Yuxiong He
AI4CE
48
104
0
26 Oct 2020
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
Benjamin Muller
Antonis Anastasopoulos
Benoît Sagot
Djamé Seddah
LRM
209
170
0
24 Oct 2020
Text Editing by Command
Felix Faltings
Michel Galley
Gerold Hintz
Chris Brockett
Chris Quirk
Jianfeng Gao
Bill Dolan
KELM
207
38
0
24 Oct 2020
Rethinking embedding coupling in pre-trained language models
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
172
143
0
24 Oct 2020
Pre-training Text-to-Text Transformers for Concept-centric Common Sense
Wangchunshu Zhou
Dong-Ho Lee
Ravi Kiran Selvam
Seyeon Lee
Bill Yuchen Lin
Xiang Ren
LRM
VLM
55
72
0
24 Oct 2020
Improving Multilingual Models with Language-Clustered Vocabularies
Hyung Won Chung
Dan Garrette
Kiat Chuan Tan
Jason Riesa
VLM
129
65
0
24 Oct 2020
Text Style Transfer: A Review and Experimental Evaluation
Zhiqiang Hu
Roy Ka-wei Lee
Charu C. Aggarwal
Aston Zhang
AI4TS
126
27
0
24 Oct 2020
An Evaluation Protocol for Generative Conversational Systems
Seolhwa Lee
Heuiseok Lim
João Sedoc
ELM
80
10
0
24 Oct 2020