2005.14165
Cited By
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing
"Language Models are Few-Shot Learners"
50 / 11,078 papers shown
Title
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics
Kunhao Zheng
Jesse Michael Han
Stanislas Polu
AIMat
35
152
0
31 Aug 2021
Quantized Convolutional Neural Networks Through the Lens of Partial Differential Equations
Ido Ben-Yair
Gil Ben Shalom
Moshe Eliasof
Eran Treister
MQ
33
5
0
31 Aug 2021
It's not Rocket Science: Interpreting Figurative Language in Narratives
Tuhin Chakrabarty
Yejin Choi
Vered Shwartz
22
55
0
31 Aug 2021
Sentence Bottleneck Autoencoders from Transformer Language Models
Ivan Montero
Nikolaos Pappas
Noah A. Smith
AI4CE
22
28
0
31 Aug 2021
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
47
119
0
31 Aug 2021
LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting
Xiang Chen
Lei Li
Shumin Deng
Chuanqi Tan
Changliang Xu
Fei Huang
Luo Si
Huajun Chen
Ningyu Zhang
VLM
36
65
0
31 Aug 2021
The five Is: Key principles for interpretable and safe conversational AI
M. Wahde
M. Virgolin
36
5
0
31 Aug 2021
Discretized Integrated Gradients for Explaining Language Models
Soumya Sanyal
Xiang Ren
FAtt
17
53
0
31 Aug 2021
Towards Out-Of-Distribution Generalization: A Survey
Jiashuo Liu
Zheyan Shen
Yue He
Xingxuan Zhang
Renzhe Xu
Han Yu
Peng Cui
CML
OOD
55
517
0
31 Aug 2021
How Does Adversarial Fine-Tuning Benefit BERT?
J. Ebrahimi
Hao Yang
Wei Zhang
AAML
26
4
0
31 Aug 2021
Semi-Supervised Exaggeration Detection of Health Science Press Releases
Dustin Wright
Isabelle Augenstein
33
12
0
30 Aug 2021
Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Bo-wen Li
Xinyang Jiang
Donglin Bai
Yuge Zhang
Ningxin Zheng
Xuanyi Dong
Lu Liu
Yuqing Yang
Dongsheng Li
14
10
0
30 Aug 2021
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Amin Banitalebi-Dehkordi
Naveen Vedula
J. Pei
Fei Xia
Lanjun Wang
Yong Zhang
22
89
0
30 Aug 2021
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation
Jian Guan
Zhuoer Feng
Yamei Chen
Ru He
Xiaoxi Mao
Changjie Fan
Minlie Huang
39
32
0
30 Aug 2021
WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding
Guoqing Zheng
Giannis Karamanolakis
Kai Shu
Ahmed Hassan Awadallah
SSL
21
1
0
28 Aug 2021
Layer-wise Model Pruning based on Mutual Information
Chun Fan
Jiwei Li
Xiang Ao
Fei Wu
Yuxian Meng
Xiaofei Sun
46
19
0
28 Aug 2021
Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems
Fei Mi
Wanhao Zhou
Feng Cai
Lingjing Kong
Minlie Huang
Boi Faltings
27
32
0
28 Aug 2021
LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision
Zhijian Liu
Simon Stent
Jie Li
John Gideon
Song Han
VLM
25
10
0
26 Aug 2021
A Survey on Automated Fact-Checking
Zhijiang Guo
M. Schlichtkrull
Andreas Vlachos
27
459
0
26 Aug 2021
Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts
Ashutosh Baheti
Maarten Sap
Alan Ritter
Mark O. Riedl
21
84
0
26 Aug 2021
Bilateral Denoising Diffusion Models
Max W. Y. Lam
Jun Wang
Rongjie Huang
Dan Su
Dong Yu
DiffM
27
42
0
26 Aug 2021
Few-shot Visual Relationship Co-localization
Revant Teotia
Vaibhav Mishra
Mayank Maheshwari
Anand Mishra
20
1
0
26 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLM
MLLM
51
780
0
24 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
26
12
0
24 Aug 2021
Prompt-Learning for Fine-Grained Entity Typing
Ning Ding
Yulin Chen
Xu Han
Guangwei Xu
Pengjun Xie
Haitao Zheng
Zhiyuan Liu
Juan-Zi Li
Hong-Gee Kim
26
156
0
24 Aug 2021
Explaining Bayesian Neural Networks
Kirill Bykov
Marina M.-C. Höhne
Adelaida Creosteanu
Klaus-Robert Muller
Frederick Klauschen
Shinichi Nakajima
Marius Kloft
BDL
AAML
34
25
0
23 Aug 2021
Decomposition Multi-Objective Evolutionary Optimization: From State-of-the-Art to Future Opportunities
Ke Li
19
9
0
21 Aug 2021
Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions
Hammond Pearce
Baleegh Ahmad
Benjamin Tan
Brendan Dolan-Gavitt
Ramesh Karri
SILM
34
392
0
20 Aug 2021
SplitGuard: Detecting and Mitigating Training-Hijacking Attacks in Split Learning
Ege Erdogan
Alptekin Kupcu
A. E. Cicek
AAML
22
32
0
20 Aug 2021
Knowledge Perceived Multi-modal Pretraining in E-commerce
Yushan Zhu
Huaixiao Tou
Wen Zhang
Ganqiang Ye
Hui Chen
Ningyu Zhang
Huajun Chen
28
32
0
20 Aug 2021
UnSplit: Data-Oblivious Model Inversion, Model Stealing, and Label Inference Attacks Against Split Learning
Ege Erdogan
Alptekin Kupcu
A. E. Cicek
FedML
MIACV
35
77
0
20 Aug 2021
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
38
156
0
19 Aug 2021
Neural Operator: Learning Maps Between Function Spaces
Nikola B. Kovachki
Zong-Yi Li
Burigede Liu
Kamyar Azizzadenesheli
K. Bhattacharya
Andrew M. Stuart
Anima Anandkumar
AI4CE
52
440
0
19 Aug 2021
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation
Pan Xie
Zexian Li
Xiaohui Hu
34
11
0
19 Aug 2021
Moser Flow: Divergence-based Generative Modeling on Manifolds
N. Rozen
Aditya Grover
Maximilian Nickel
Y. Lipman
DRL
AI4CE
27
57
0
18 Aug 2021
Data Pricing in Machine Learning Pipelines
Zicun Cong
Xuan Luo
J. Pei
Feida Zhu
Yong Zhang
28
46
0
18 Aug 2021
Toward a 'Standard Model' of Machine Learning
Zhiting Hu
Eric P. Xing
37
12
0
17 Aug 2021
Continual Backprop: Stochastic Gradient Descent with Persistent Randomness
Shibhansh Dohare
R. Sutton
A. R. Mahmood
CLL
47
80
0
13 Aug 2021
Billion-Scale Pretraining with Vision Transformers for Multi-Task Visual Representations
Josh Beal
Hao Wu
Dong Huk Park
Andrew Zhai
Dmitry Kislyuk
ViT
21
29
0
12 Aug 2021
PatrickStar: Parallel Training of Pre-trained Models via Chunk-based Memory Management
Jiarui Fang
Zilin Zhu
Shenggui Li
Hui Su
Yang Yu
Jie Zhou
Yang You
VLM
37
24
0
12 Aug 2021
Reimagining an autonomous vehicle
Jeffrey Hawke
E. Haibo
Vijay Badrinarayanan
Alex Kendall
43
11
0
12 Aug 2021
Video Transformer for Deepfake Detection with Incremental Learning
Sohail Ahmed Khan
Hang Dai
ViT
24
62
0
11 Aug 2021
Differentiable Subset Pruning of Transformer Heads
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
45
53
0
10 Aug 2021
Privacy-Preserving Machine Learning: Methods, Challenges and Directions
Runhua Xu
Nathalie Baracaldo
J. Joshi
32
100
0
10 Aug 2021
Noisy Channel Language Model Prompting for Few-Shot Text Classification
Sewon Min
Michael Lewis
Hannaneh Hajishirzi
Luke Zettlemoyer
VLM
40
218
0
09 Aug 2021
Token Shift Transformer for Video Classification
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
29
116
0
05 Aug 2021
Deep multi-task mining Calabi-Yau four-folds
Harold Erbin
Riccardo Finotello
Robin Schneider
M. Tamaazousti
35
17
0
04 Aug 2021
How to Query Language Models?
Leonard Adolphs
S. Dhuliawala
Thomas Hofmann
KELM
24
15
0
04 Aug 2021
Controlled Text Generation as Continuous Optimization with Multiple Constraints
Sachin Kumar
Eric Malmi
Aliaksei Severyn
Yulia Tsvetkov
BDL
AI4CE
43
76
0
04 Aug 2021
Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management
Cécile Logé
Emily L. Ross
D. Dadey
Saahil Jain
A. Saporta
A. Ng
Pranav Rajpurkar
24
22
0
03 Aug 2021