Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.14165
Cited By
v1
v2
v3
v4 (latest)
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Language Models are Few-Shot Learners"
50 / 12,277 papers shown
Title
DORA: Exploring Outlier Representations in Deep Neural Networks
Kirill Bykov
Mayukh Deb
Dennis Grinwald
Klaus-Robert Muller
Marina M.-C. Höhne
123
13
0
09 Jun 2022
Meet You Halfway: Explaining Deep Learning Mysteries
Oriel BenShmuel
AAML
FedML
FAtt
OOD
54
0
0
09 Jun 2022
Topic-Controllable Summarization: Topic-Aware Evaluation and Transformer Methods
Tatiana Passali
Grigorios Tsoumakas
58
0
0
09 Jun 2022
Unveiling Transformers with LEGO: a synthetic reasoning task
Yi Zhang
A. Backurs
Sébastien Bubeck
Ronen Eldan
Suriya Gunasekar
Tal Wagner
LRM
136
91
0
09 Jun 2022
It's a super deal -- train recurrent network on noisy data and get smooth prediction free
Boris Rubinstein
AI4TS
39
0
0
09 Jun 2022
Few-shot Question Generation for Personalized Feedback in Intelligent Tutoring Systems
Devang Kulshreshtha
Muhammad Shayan
Robert Belfer
Siva Reddy
Iulian Serban
E. Kochmar
46
11
0
08 Jun 2022
Neural Collapse: A Review on Modelling Principles and Generalization
Vignesh Kothapalli
158
82
0
08 Jun 2022
Patch-based Object-centric Transformers for Efficient Video Generation
Wilson Yan
Ryogo Okumura
Stephen James
Pieter Abbeel
DiffM
ViT
76
6
0
08 Jun 2022
Multi-channel neural networks for predicting influenza A virus hosts and antigenic types
Yanhua Xu
D. Wojtczak
BDL
13
0
0
08 Jun 2022
Metric Based Few-Shot Graph Classification
Donato Crisostomi
Simone Antonelli
Valentino Maiorca
Luca Moschella
R. Marin
Emanuele Rodolà
101
5
0
08 Jun 2022
Delving into the Pre-training Paradigm of Monocular 3D Object Detection
Zhuoling Li
Chuanrui Zhang
En Yu
Haoqian Wang
37
1
0
08 Jun 2022
Can CNNs Be More Robust Than Transformers?
Zeyu Wang
Yutong Bai
Yuyin Zhou
Cihang Xie
UQCV
OOD
115
46
0
07 Jun 2022
Generating Long Videos of Dynamic Scenes
Tim Brooks
Janne Hellsten
M. Aittala
Ting-Chun Wang
Timo Aila
J. Lehtinen
Xuan Li
Alexei A. Efros
Tero Karras
SyDa
84
114
0
07 Jun 2022
DeepTPI: Test Point Insertion with Deep Reinforcement Learning
Zhengyuan Shi
Min Li
Sadaf Khan
Liuzheng Wang
Naixing Wang
Yu Huang
Qiang Xu
97
16
0
07 Jun 2022
Recent Advances for Quantum Neural Networks in Generative Learning
Jinkai Tian
Xiaoyun Sun
Yuxuan Du
Shanshan Zhao
Qing Liu
...
Xingyao Wu
Min-hsiu Hsieh
Tongliang Liu
Wen-Bin Yang
Dacheng Tao
AI4CE
90
85
0
07 Jun 2022
Neuro-Symbolic Procedural Planning with Commonsense Prompting
Yujie Lu
Weixi Feng
Wanrong Zhu
Wenda Xu
Xinze Wang
Miguel P. Eckstein
William Yang Wang
LM&Ro
61
37
0
06 Jun 2022
Training Subset Selection for Weak Supervision
Hunter Lang
Aravindan Vijayaraghavan
David Sontag
NoLa
92
23
0
06 Jun 2022
No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval
G. Rosa
L. Bonifacio
Vitor Jeronymo
Hugo Queiroz Abonizio
Marzieh Fadaee
R. Lotufo
Rodrigo Nogueira
99
26
0
06 Jun 2022
Learning to Ask Like a Physician
Eric P. Lehman
Vladislav Lialin
K. Y. Legaspi
Anne Janelle R. Sy
Patricia Therese S. Pile
...
Anna Rumshisky
Jenifer Liang
Preethi Raghavan
Leo Anthony Celi
Peter Szolovits
OOD
80
20
0
06 Jun 2022
Separable Self-attention for Mobile Vision Transformers
Sachin Mehta
Mohammad Rastegari
ViT
MQ
105
265
0
06 Jun 2022
What do tokens know about their characters and how do they know it?
Ayush Kaushal
Kyle Mahowald
90
31
0
06 Jun 2022
A Simple yet Effective Method for Graph Classification
Junran Wu
Shangzhe Li
Jianhao Li
Yicheng Pan
Keyulu Xu
132
26
0
06 Jun 2022
Pretrained Models for Multilingual Federated Learning
Orion Weller
Marc Marone
Vladimir Braverman
Dawn J Lawrie
Benjamin Van Durme
VLM
FedML
AI4CE
94
42
0
06 Jun 2022
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
221
115
0
05 Jun 2022
Functional Ensemble Distillation
Coby Penso
Idan Achituve
Ethan Fetaya
FedML
89
2
0
05 Jun 2022
ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences
Christos Tzelepis
James Oldfield
Georgios Tzimiropoulos
Ioannis Patras
53
16
0
05 Jun 2022
(Im)possibility of Collective Intelligence
Krikamol Muandet
259
6
0
05 Jun 2022
Fault-Aware Neural Code Rankers
J. Inala
Chenglong Wang
Mei Yang
Andrés Codas
Mark Encarnación
Shuvendu K. Lahiri
Madan Musuvathi
Jianfeng Gao
ALM
100
45
0
04 Jun 2022
Formal Specifications from Natural Language
Christopher Hahn
Frederik Schmitt
Julia J. Tillman
Niklas Metzger
Julian Siber
Bernd Finkbeiner
100
29
0
04 Jun 2022
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Z. Yao
Reza Yazdani Aminabadi
Minjia Zhang
Xiaoxia Wu
Conglong Li
Yuxiong He
VLM
MQ
165
484
0
04 Jun 2022
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
Yujia Xie
Luowei Zhou
Xiyang Dai
Lu Yuan
Nguyen Bach
Ce Liu
Michael Zeng
VLM
MLLM
69
28
0
03 Jun 2022
Differentially Private Model Compression
Fatemehsadat Mireshghallah
A. Backurs
Huseyin A. Inan
Lukas Wutschitz
Janardhan Kulkarni
SyDa
50
14
0
03 Jun 2022
A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge
Dustin Schwenk
Apoorv Khandelwal
Christopher Clark
Kenneth Marino
Roozbeh Mottaghi
74
556
0
03 Jun 2022
A Co-design view of Compute in-Memory with Non-Volatile Elements for Neural Networks
W. Haensch
A. Raghunathan
Kaushik Roy
B. Chakrabarti
C. Phatak
Cheng Wang
Supratik Guha
56
3
0
03 Jun 2022
Findings of the The RuATD Shared Task 2022 on Artificial Text Detection in Russian
T. Shamardina
Vladislav Mikhailov
Daniil Chernianskii
Alena Fenogenova
Marat Saidov
A. Valeeva
Tatiana Shavrina
I. Smurov
E. Tutubalina
Ekaterina Artemova
DeLMO
62
30
0
03 Jun 2022
Acquiring and Modelling Abstract Commonsense Knowledge via Conceptualization
Mutian He
Tianqing Fang
Weiqi Wang
Yangqiu Song
86
30
0
03 Jun 2022
Understanding Deep Learning via Decision Boundary
Shiye Lei
Fengxiang He
Yancheng Yuan
Dacheng Tao
65
14
0
03 Jun 2022
Automatic Generation of Programming Exercises and Code Explanations using Large Language Models
Sami Sarsa
Paul Denny
Arto Hellas
Juho Leinonen
ELM
186
362
0
03 Jun 2022
Code Generation Tools (Almost) for Free? A Study of Few-Shot, Pre-Trained Language Models on Code
Patrick Bareiss
Beatriz Souza
Marcelo d’Amorim
Michael Pradel
ELM
86
81
0
02 Jun 2022
Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees
Jue Wang
Binhang Yuan
Luka Rimanic
Yongjun He
Tri Dao
Beidi Chen
Christopher Ré
Ce Zhang
AI4CE
105
13
0
02 Jun 2022
Decentralized Training of Foundation Models in Heterogeneous Environments
Binhang Yuan
Yongjun He
Jared Davis
Tianyi Zhang
Tri Dao
Beidi Chen
Percy Liang
Christopher Ré
Ce Zhang
123
97
0
02 Jun 2022
Siamese Image Modeling for Self-Supervised Vision Representation Learning
Chenxin Tao
Xizhou Zhu
Weijie Su
Gao Huang
Bin Li
Jie Zhou
Yu Qiao
Xiaogang Wang
Jifeng Dai
SSL
105
96
0
02 Jun 2022
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Yuanze Lin
Yujia Xie
Dongdong Chen
Yichong Xu
Chenguang Zhu
Lu Yuan
86
75
0
02 Jun 2022
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
Ming Tao
Bingkun Bao
Hao Tang
Leilei Gan
Longhui Wei
Qi Tian
DiffM
80
15
0
02 Jun 2022
Language and Culture Internalisation for Human-Like Autotelic AI
Cédric Colas
Tristan Karch
Clément Moulin-Frier
Pierre-Yves Oudeyer
LM&Ro
98
27
0
02 Jun 2022
Prefix Conditioning Unifies Language and Label Supervision
Kuniaki Saito
Kihyuk Sohn
Xinming Zhang
Chun-Liang Li
Chen-Yu Lee
Kate Saenko
Tomas Pfister
VLM
CLIP
97
16
0
02 Jun 2022
Weakly Supervised Representation Learning with Sparse Perturbations
Kartik Ahuja
Jason S. Hartford
Yoshua Bengio
SSL
109
61
0
02 Jun 2022
Learning code summarization from a small and local dataset
Toufique Ahmed
Prem Devanbu
79
10
0
02 Jun 2022
Cascaded Video Generation for Videos In-the-Wild
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
67
0
0
01 Jun 2022
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training
Yan Zeng
Wangchunshu Zhou
Ao Luo
Ziming Cheng
Xinsong Zhang
VLM
95
32
0
01 Jun 2022
Previous
1
2
3
...
194
195
196
...
244
245
246
Next