Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.14165
Cited By
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Language Models are Few-Shot Learners"
50 / 11,082 papers shown
Title
Responsible Disclosure of Generative Models Using Scalable Fingerprinting
Ning Yu
Vladislav Skripniuk
Dingfan Chen
Larry S. Davis
Mario Fritz
WIGM
46
89
0
16 Dec 2020
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z Leibo
Kate Larson
T. Graepel
37
199
0
15 Dec 2020
Attention over learned object embeddings enables complex visual reasoning
David Ding
Felix Hill
Adam Santoro
Malcolm Reynolds
M. Botvinick
OCL
22
69
0
15 Dec 2020
Nested Named Entity Recognition with Partially-Observed TreeCRFs
Yao Fu
Chuanqi Tan
Mosha Chen
Songfang Huang
Fei Huang
80
48
0
15 Dec 2020
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task
Dmitry Tsarkov
Tibor Tihon
Nathan Scales
Nikola Momchev
Danila Sinopalnikov
Nathanael Scharli
18
17
0
15 Dec 2020
Writing Polishment with Simile: Task, Dataset and A Neural Approach
Jiayi Zhang
Zhi Cui
Xiaoqiang Xia
Yalong Guo
Yanran Li
Chen Wei
Jianwei Cui
20
17
0
15 Dec 2020
Parameter-Efficient Transfer Learning with Diff Pruning
Demi Guo
Alexander M. Rush
Yoon Kim
13
385
0
14 Dec 2020
Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct Feedback Alignment
Julien Launay
Iacopo Poli
Kilian Muller
Gustave Pariente
I. Carron
L. Daudet
Florent Krzakala
S. Gigan
MoE
15
18
0
11 Dec 2020
Towards Neural Programming Interfaces
Zachary Brown
Nathaniel R. Robinson
David Wingate
Nancy Fulda
AI4CE
20
5
0
10 Dec 2020
Know Your Limits: Uncertainty Estimation with ReLU Classifiers Fails at Reliable OOD Detection
Dennis Ulmer
Giovanni Cina
OODD
35
31
0
09 Dec 2020
Topological Planning with Transformers for Vision-and-Language Navigation
Kevin Chen
Junshen K. Chen
Jo Chuang
Marynel Vázquez
Silvio Savarese
LM&Ro
27
99
0
09 Dec 2020
Positional Encoding as Spatial Inductive Bias in GANs
Rui Xu
Xintao Wang
Kai-xiang Chen
Bolei Zhou
Chen Change Loy
GAN
27
89
0
09 Dec 2020
NavRep: Unsupervised Representations for Reinforcement Learning of Robot Navigation in Dynamic Human Environments
Daniel Dugas
Juan I. Nieto
Roland Siegwart
Jen Jen Chung
SSL
24
51
0
08 Dec 2020
CTRLsum: Towards Generic Controllable Text Summarization
Junxian He
Wojciech Kry'sciñski
Bryan McCann
Nazneen Rajani
Caiming Xiong
216
138
0
08 Dec 2020
Unleashing the Tiger: Inference Attacks on Split Learning
Dario Pasquini
G. Ateniese
M. Bernaschi
FedML
34
147
0
04 Dec 2020
From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting
K. Mangalam
Yang An
Harshayu Girase
Jitendra Malik
19
264
0
02 Dec 2020
Exploiting BERT to improve aspect-based sentiment analysis performance on Persian language
H. Jafarian
Amirhosein Taghavi
Alireza Javaheri
Reza Rawassizadeh
23
19
0
02 Dec 2020
Modifying Memories in Transformer Models
Chen Zhu
A. S. Rawat
Manzil Zaheer
Srinadh Bhojanapalli
Daliang Li
Felix X. Yu
Sanjiv Kumar
KELM
32
192
0
01 Dec 2020
Data-Free Model Extraction
Jean-Baptiste Truong
Pratyush Maini
R. Walls
Nicolas Papernot
MIACV
15
181
0
30 Nov 2020
Argument from Old Man's View: Assessing Social Bias in Argumentation
Maximilian Spliethover
Henning Wachsmuth
11
20
0
24 Nov 2020
GLGE: A New General Language Generation Evaluation Benchmark
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
...
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
ELM
40
66
0
24 Nov 2020
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
R. Child
BDL
VLM
56
337
0
20 Nov 2020
ONION: A Simple and Effective Defense Against Textual Backdoor Attacks
Fanchao Qi
Yangyi Chen
Mukai Li
Yuan Yao
Zhiyuan Liu
Maosong Sun
AAML
42
264
0
20 Nov 2020
Deep Neural Networks using a Single Neuron: Folded-in-Time Architecture using Feedback-Modulated Delay Loops
Florian Stelzer
André Röhm
Raul Vicente
Ingo Fischer
University of Tartu
AI4CE
19
46
0
19 Nov 2020
A Novel Memory-Efficient Deep Learning Training Framework via Error-Bounded Lossy Compression
Sian Jin
Guanpeng Li
Shuaiwen Leon Song
Dingwen Tao
AI4CE
29
12
0
18 Nov 2020
Learning from Task Descriptions
Orion Weller
Nicholas Lourie
Matt Gardner
Matthew E. Peters
47
89
0
16 Nov 2020
Hurricane Forecasting: A Novel Multimodal Machine Learning Framework
L. Boussioux
C. Zeng
Théo Guénais
Dimitris Bertsimas
16
38
0
11 Nov 2020
Dirichlet Pruning for Neural Network Compression
Kamil Adamczewski
Mijung Park
27
3
0
10 Nov 2020
When Do You Need Billions of Words of Pretraining Data?
Yian Zhang
Alex Warstadt
Haau-Sing Li
Samuel R. Bowman
29
136
0
10 Nov 2020
Multi-document Summarization via Deep Learning Techniques: A Survey
Congbo Ma
W. Zhang
Mingyu Guo
Hu Wang
Quan Z. Sheng
13
126
0
10 Nov 2020
Improving Neural Network Training in Low Dimensional Random Bases
Frithjof Gressmann
Zach Eaton-Rosen
Carlo Luschi
30
28
0
09 Nov 2020
Know What You Don't Need: Single-Shot Meta-Pruning for Attention Heads
Zhengyan Zhang
Fanchao Qi
Zhiyuan Liu
Qun Liu
Maosong Sun
VLM
41
30
0
07 Nov 2020
Exploring the limits of Concurrency in ML Training on Google TPUs
Sameer Kumar
James Bradbury
C. Young
Yu Emma Wang
Anselm Levskaya
...
Tao Wang
Tayo Oguntebi
Yazhou Zu
Yuanzhong Xu
Andy Swing
BDL
AIMat
MoE
LRM
22
27
0
07 Nov 2020
Machine Generation and Detection of Arabic Manipulated and Fake News
El Moatez Billah Nagoudi
AbdelRahim Elmadany
Muhammad Abdul-Mageed
Tariq Alhindi
H. Cavusoglu
DeLMO
24
50
0
05 Nov 2020
Detecting Hallucinated Content in Conditional Neural Sequence Generation
Chunting Zhou
Graham Neubig
Jiatao Gu
Mona T. Diab
P. Guzmán
Luke Zettlemoyer
Marjan Ghazvininejad
HILM
39
195
0
05 Nov 2020
Rearrangement: A Challenge for Embodied AI
Dhruv Batra
Angel X. Chang
Sonia Chernova
Andrew J. Davison
Jia Deng
...
Jitendra Malik
Igor Mordatch
Roozbeh Mottaghi
Manolis Savva
Hao Su
LM&Ro
38
217
0
03 Nov 2020
Emergent Communication Pretraining for Few-Shot Machine Translation
Yaoyiran Li
E. Ponti
Ivan Vulić
Anna Korhonen
25
19
0
02 Nov 2020
Melody-Conditioned Lyrics Generation with SeqGANs
Yihao Chen
Alexander Lerch
GAN
MGen
32
29
0
28 Oct 2020
Scaling Laws for Autoregressive Generative Modeling
T. Henighan
Jared Kaplan
Mor Katz
Mark Chen
Christopher Hesse
...
Nick Ryder
Daniel M. Ziegler
John Schulman
Dario Amodei
Sam McCandlish
53
405
0
28 Oct 2020
A Statistical Framework for Low-bitwidth Training of Deep Neural Networks
Jianfei Chen
Yujie Gai
Z. Yao
Michael W. Mahoney
Joseph E. Gonzalez
MQ
17
58
0
27 Oct 2020
Dutch Humor Detection by Generating Negative Examples
Thomas Winters
Pieter Delobelle
11
10
0
26 Oct 2020
Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification
Timo Schick
Helmut Schmid
Hinrich Schütze
VLM
19
206
0
26 Oct 2020
Pre-trained Summarization Distillation
Sam Shleifer
Alexander M. Rush
26
98
0
24 Oct 2020
Text Editing by Command
Felix Faltings
Michel Galley
Gerold Hintz
Chris Brockett
Chris Quirk
Jianfeng Gao
Bill Dolan
KELM
147
37
0
24 Oct 2020
Rethinking embedding coupling in pre-trained language models
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
95
142
0
24 Oct 2020
Text Style Transfer: A Review and Experimental Evaluation
Zhiqiang Hu
Roy Ka-Wei Lee
Charu C. Aggarwal
Aston Zhang
AI4TS
42
26
0
24 Oct 2020
An Evaluation Protocol for Generative Conversational Systems
Seolhwa Lee
Heuiseok Lim
Jo˜ao Sedoc
ELM
35
10
0
24 Oct 2020
Long Document Ranking with Query-Directed Sparse Transformer
Jyun-Yu Jiang
Chenyan Xiong
Chia-Jung Lee
Wei Wang
30
25
0
23 Oct 2020
On the Transformer Growth for Progressive BERT Training
Xiaotao Gu
Liyuan Liu
Hongkun Yu
Jing Li
Cheng Chen
Jiawei Han
VLM
69
51
0
23 Oct 2020
Language Models are Open Knowledge Graphs
Chenguang Wang
Xiao Liu
D. Song
SSL
KELM
26
135
0
22 Oct 2020
Previous
1
2
3
...
218
219
220
221
222
Next