2005.14165
Cited By
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing "Language Models are Few-Shot Learners"
50 / 12,183 papers shown
Title
learn2learn: A Library for Meta-Learning Research
Sébastien M. R. Arnold
Praateek Mahajan
Debajyoti Datta
Ian Bunner
Konstantinos Saitas Zarkias
115
96
0
27 Aug 2020
Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning
Aurick Qiao
Sang Keun Choe
Suhas Jayaram Subramanya
Willie Neiswanger
Qirong Ho
Hao Zhang
G. Ganger
Eric Xing
VLM
77
182
0
27 Aug 2020
A Survey of Evaluation Metrics Used for NLG Systems
Ananya B. Sai
Akash Kumar Mohankumar
Mitesh M. Khapra
ELM
99
237
0
27 Aug 2020
What is being transferred in transfer learning?
Behnam Neyshabur
Hanie Sedghi
Chiyuan Zhang
135
529
0
26 Aug 2020
Looking Deeper into Tabular LIME
Damien Garreau
U. V. Luxburg
FAtt
LMTD
171
30
0
25 Aug 2020
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing
Wei Shen
Xiaonan He
Wei Shen
Q. Ni
Wanchun Dou
Yan Wang
56
18
0
25 Aug 2020
ETC-NLG: End-to-end Topic-Conditioned Natural Language Generation
Ginevra Carbone
Gabriele Sarti
92
9
0
25 Aug 2020
NL4DV: A Toolkit for Generating Analytic Specifications for Data Visualization from Natural Language Queries
Arpit Narechania
Arjun Srinivasan
J. Stasko
54
180
0
24 Aug 2020
Example-Based Named Entity Recognition
Morteza Ziyadi
Yuting Sun
Abhishek Goswami
Jade Huang
Weizhu Chen
64
33
0
24 Aug 2020
Periodic Stochastic Gradient Descent with Momentum for Decentralized Training
Hongchang Gao
Heng-Chiao Huang
61
25
0
24 Aug 2020
End to End Dialogue Transformer
Ondrej Mekota
Memduh Gökirmak
Petr Laitoch
23
1
0
24 Aug 2020
Tearing Down the Memory Wall
Zaid Qureshi
Vikram Sharma Mailthody
S. Min
I-Hsin Chung
Jinjun Xiong
Wen-mei W. Hwu
GNN
59
9
0
24 Aug 2020
VisualSem: A High-quality Knowledge Graph for Vision and Language
Houda Alberts
Teresa Huang
Y. Deshpande
Yibo Liu
Kyunghyun Cho
Clara Vania
Iacer Calixto
VLM
58
46
0
20 Aug 2020
Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries
Benjamin Heinzerling
Kentaro Inui
KELM
68
133
0
20 Aug 2020
Very Deep Transformers for Neural Machine Translation
Xiaodong Liu
Kevin Duh
Liyuan Liu
Jianfeng Gao
87
104
0
18 Aug 2020
PopMAG: Pop Music Accompaniment Generation
Yi Ren
Jinzheng He
Xu Tan
Tao Qin
Zhou Zhao
Tie-Yan Liu
91
118
0
18 Aug 2020
Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Dara Bahri
Yi Tay
Che Zheng
Donald Metzler
Clifford Brunk
Andrew Tomkins
35
8
0
17 Aug 2020
Explainability in Deep Reinforcement Learning
Alexandre Heuillet
Fabien Couthouis
Natalia Díaz Rodríguez
XAI
231
283
0
15 Aug 2020
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Andrea Madotto
Zihan Liu
Zhaojiang Lin
Pascale Fung
108
59
0
14 Aug 2020
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
Ming Tao
Hao Tang
Leilei Gan
Xiaoyuan Jing
Bingkun Bao
Changsheng Xu
126
214
0
13 Aug 2020
Navigating Human Language Models with Synthetic Agents
Philip G. Feldman
Antonio Bucchiarone
94
4
0
10 Aug 2020
aschern at SemEval-2020 Task 11: It Takes Three to Tango: RoBERTa, CRF, and Transfer Learning
Anton Chernyavskiy
Dmitry Ilvovsky
Preslav Nakov
44
25
0
06 Aug 2020
Communication-Efficient and Distributed Learning Over Wireless Networks: Principles and Applications
Jihong Park
S. Samarakoon
Anis Elgabli
Joongheon Kim
M. Bennis
Seong-Lyun Kim
Mérouane Debbah
102
164
0
06 Aug 2020
Aligning AI With Shared Human Values
Dan Hendrycks
Collin Burns
Steven Basart
Andrew Critch
Jingkai Li
Basel Alomair
Jacob Steinhardt
149
574
0
05 Aug 2020
Word meaning in minds and machines
Brenden M. Lake
G. Murphy
NAI
112
118
0
04 Aug 2020
PowerGossip: Practical Low-Rank Communication Compression in Decentralized Deep Learning
Thijs Vogels
Sai Praneeth Karimireddy
Martin Jaggi
FedML
74
54
0
04 Aug 2020
DeLighT: Deep and Light-weight Transformer
Sachin Mehta
Marjan Ghazvininejad
Srini Iyer
Luke Zettlemoyer
Hannaneh Hajishirzi
VLM
81
32
0
03 Aug 2020
Multi-node Bert-pretraining: Cost-efficient Approach
Jiahuang Lin
Xuelong Li
Gennady Pekhimenko
50
13
0
01 Aug 2020
Self-supervised learning through the eyes of a child
A. Orhan
Vaibhav Gupta
Brenden M. Lake
SSL
110
100
0
31 Jul 2020
Neural Language Generation: Formulation, Methods, and Evaluation
Cristina Garbacea
Qiaozhu Mei
153
30
0
31 Jul 2020
Rewriting a Deep Generative Model
David Bau
Steven Liu
Tongzhou Wang
Jun-Yan Zhu
Antonio Torralba
GAN
DRL
108
140
0
30 Jul 2020
At-Scale Sparse Deep Neural Network Inference with Efficient GPU Implementation
Mert Hidayetoğlu
Carl Pearson
Vikram Sharma Mailthody
Eiman Ebrahimi
Jinjun Xiong
R. Nagi
Wen-mei W. Hwu
GNN
22
1
0
28 Jul 2020
Representation Learning with Video Deep InfoMax
R. Devon Hjelm
Philip Bachman
SSL
MDE
104
28
0
27 Jul 2020
Exploring Swedish & English fastText Embeddings for NER with the Transformer
Tosin Adewumi
F. Liwicki
Marcus Liwicki
41
4
0
23 Jul 2020
On Controllability of AI
Roman V. Yampolskiy
48
14
0
19 Jul 2020
Contextualizing Enhances Gradient Based Meta Learning
Evan Vogelbaum
Rumen Dangovski
L. Jing
Marin Soljacic
117
3
0
17 Jul 2020
Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data
Ning Yu
Vladislav Skripniuk
Sahar Abdelnabi
Mario Fritz
WIGM
75
219
0
16 Jul 2020
Automated Detection and Forecasting of COVID-19 using Deep Learning Techniques: A Review
A. Shoeibi
Marjane Khodatars
M. Jafari
Navid Ghassemi
Delaram Sadeghi
...
Z. Sani
F. Khozeimeh
S. Nahavandi
U. Acharya
Juan M Gorriz
138
182
0
16 Jul 2020
Deep Learning in Protein Structural Modeling and Design
Wenhao Gao
S. Mahajan
Jeremias Sulam
Jeffrey J. Gray
84
160
0
16 Jul 2020
Add a SideNet to your MainNet
Adrien Morisot
23
0
0
14 Jul 2020
ProtTrans: Towards Cracking the Language of Life's Code Through Self-Supervised Deep Learning and High Performance Computing
Ahmed Elnaggar
M. Heinzinger
Christian Dallago
Ghalia Rehawi
Yu Wang
...
Tamas B. Fehér
Christoph Angerer
Martin Steinegger
D. Bhowmik
B. Rost
DRL
80
960
0
13 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
112
321
0
12 Jul 2020
Bespoke vs. Prêt-à-Porter Lottery Tickets: Exploiting Mask Similarity for Trainable Sub-Network Finding
Michela Paganini
Jessica Zosa Forde
UQCV
48
6
0
06 Jul 2020
Carbontracker: Tracking and Predicting the Carbon Footprint of Training Deep Learning Models
Lasse F. Wolff Anthony
Benjamin Kanding
Raghavendra Selvan
HAI
72
316
0
06 Jul 2020
Abstractive and mixed summarization for long-single documents
Roger Barrull
Jugal Kalita
37
0
0
03 Jul 2020
Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering
Gautier Izacard
Edouard Grave
RALM
159
1,186
0
02 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
110
85
0
02 Jul 2020
Facts as Experts: Adaptable and Interpretable Neural Memory over Symbolic Knowledge
Pat Verga
Haitian Sun
Livio Baldini Soares
William W. Cohen
KELM
95
50
0
02 Jul 2020
Go Wide, Then Narrow: Efficient Training of Deep Thin Networks
Denny Zhou
Mao Ye
Chen Chen
Tianjian Meng
Mingxing Tan
Xiaodan Song
Quoc V. Le
Qiang Liu
Dale Schuurmans
61
20
0
01 Jul 2020
On Linear Identifiability of Learned Representations
Geoffrey Roeder
Luke Metz
Diederik P. Kingma
CML
70
85
0
01 Jul 2020