2005.14165
Cited By
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing
"Language Models are Few-Shot Learners"
50 / 12,362 papers shown
Title
Vision Transformers provably learn spatial structure
Samy Jelassi
Michael E. Sander
Yuanzhi Li
ViT
MLT
100
83
0
13 Oct 2022
Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods
Evan Crothers
Nathalie Japkowicz
H. Viktor
DeLMO
153
113
0
13 Oct 2022
AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness
Dacheng Li
Hongyi Wang
Eric P. Xing
Haotong Zhang
MoE
54
24
0
13 Oct 2022
Joint Reasoning on Hybrid-knowledge sources for Task-Oriented Dialog
Mayank Mishra
Danish Contractor
Dinesh Raghu
RALM
57
0
0
13 Oct 2022
Mass-Editing Memory in a Transformer
Kevin Meng
Arnab Sen Sharma
A. Andonian
Yonatan Belinkov
David Bau
KELM
VLM
159
601
0
13 Oct 2022
Language Model Decoding as Likelihood-Utility Alignment
Martin Josifoski
Maxime Peyrard
Frano Rajic
Jiheng Wei
Debjit Paul
...
Barun Patra
Vishrav Chaudhary
Emre Kıcıman
Boi Faltings
Robert West
82
5
0
13 Oct 2022
Towards a Unified Multi-Dimensional Evaluator for Text Generation
Ming Zhong
Yang Liu
Da Yin
Yuning Mao
Yizhu Jiao
Peng Liu
Chenguang Zhu
Heng Ji
Jiawei Han
ELM
107
276
0
13 Oct 2022
Visual Classification via Description from Large Language Models
Sachit Menon
Carl Vondrick
VLM
103
303
0
13 Oct 2022
Fast Estimation of Bayesian State Space Models Using Amortized Simulation-Based Inference
R. Khabibullin
S. Seleznev
59
1
0
13 Oct 2022
Language Models of Code are Few-Shot Commonsense Learners
Aman Madaan
Shuyan Zhou
Uri Alon
Yiming Yang
Graham Neubig
ReLM
LRM
139
222
0
13 Oct 2022
Dungeons and Dragons as a Dialog Challenge for Artificial Intelligence
Chris Callison-Burch
Gaurav Singh Tomar
Lara J. Martin
Daphne Ippolito
Suma Bailis
David Reitter
77
50
0
13 Oct 2022
CLASP: Few-Shot Cross-Lingual Data Augmentation for Semantic Parsing
Andrew Rosenbaum
Saleh Soltan
Wael Hamza
Amir Saffari
Marco Damonte
Isabel Groves
97
32
0
13 Oct 2022
Spontaneous Emerging Preference in Two-tower Language Model
Zhengqi He
Taro Toyoizumi
LRM
48
1
0
13 Oct 2022
Prompt-based Connective Prediction Method for Fine-grained Implicit Discourse Relation Recognition
Hao Zhou
Man Lan
Yuanbin Wu
YueFeng Chen
Meirong Ma
58
26
0
13 Oct 2022
Retrospectives on the Embodied AI Workshop
Matt Deitke
Dhruv Batra
Yonatan Bisk
Tommaso Campari
Angel X. Chang
...
Jesse Thomason
Alexander Toshev
Joanne Truong
Luca Weihs
Jiajun Wu
LM&Ro
122
51
0
13 Oct 2022
Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features
Changde Du
Kaicheng Fu
Jinpeng Li
Huiguang He
VLM
83
76
0
13 Oct 2022
Explanations from Large Language Models Make Small Reasoners Better
Shiyang Li
Jianshu Chen
Yelong Shen
Zhiyu Zoey Chen
Xinlu Zhang
...
Jingu Qian
Baolin Peng
Yi Mao
Wenhu Chen
Xifeng Yan
ReLM
LRM
107
138
0
13 Oct 2022
Few-shot Relational Reasoning via Connection Subgraph Pretraining
Qian Huang
Hongyu Ren
J. Leskovec
LRM
76
25
0
13 Oct 2022
Large Language Models are few(1)-shot Table Reasoners
Wenhu Chen
LMTD
ReLM
LRM
89
153
0
13 Oct 2022
A Mixture of Surprises for Unsupervised Reinforcement Learning
Andrew Zhao
Matthieu Lin
Yangguang Li
Yang Liu
Gao Huang
68
13
0
13 Oct 2022
Parameter-Efficient Masking Networks
Yue Bai
Huan Wang
Xu Ma
Yitian Zhang
Zhiqiang Tao
Yun Fu
67
10
0
13 Oct 2022
SubeventWriter: Iterative Sub-event Sequence Generation with Coherence Controller
Zhaowei Wang
Hongming Zhang
Tianqing Fang
Yangqiu Song
Ginny Wong
Simon See
116
14
0
13 Oct 2022
Structural Pruning via Latency-Saliency Knapsack
Maying Shen
Hongxu Yin
Pavlo Molchanov
Lei Mao
Jianna Liu
J. Álvarez
100
50
0
13 Oct 2022
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities
Brian Bartoldson
B. Kailkhura
Davis W. Blalock
107
51
0
13 Oct 2022
RankT5: Fine-Tuning T5 for Text Ranking with Ranking Losses
Honglei Zhuang
Zhen Qin
R. Jagerman
Kai Hui
Ji Ma
Jing Lu
Jianmo Ni
Xuanhui Wang
Michael Bendersky
AIMat
101
141
0
12 Oct 2022
Developing a general-purpose clinical language inference model from a large corpus of clinical notes
Madhumita Sushil
Dana Ludwig
A. Butte
V. Rudrapatna
LM&MA
77
12
0
12 Oct 2022
Microscopy is All You Need
Sergei V. Kalinin
Rama K Vasudevan
Yongtao Liu
Ayana Ghosh
Kevin M. Roccapriore
M. Ziatdinov
76
0
0
12 Oct 2022
Are Sample-Efficient NLP Models More Robust?
Nelson F. Liu
Ananya Kumar
Percy Liang
Robin Jia
VLM
OOD
67
6
0
12 Oct 2022
Foundation Transformers
Hongyu Wang
Shuming Ma
Shaohan Huang
Li Dong
Wenhui Wang
...
Barun Patra
Zhun Liu
Vishrav Chaudhary
Xia Song
Furu Wei
AI4CE
91
27
0
12 Oct 2022
Probing Commonsense Knowledge in Pre-trained Language Models with Sense-level Precision and Expanded Vocabulary
Daniel Loureiro
A. Jorge
ReLM
KELM
AI4MH
LRM
50
1
0
12 Oct 2022
CTL++: Evaluating Generalization on Never-Seen Compositional Patterns of Known Functions, and Compatibility of Neural Representations
Róbert Csordás
Kazuki Irie
Jürgen Schmidhuber
NAI
63
13
0
12 Oct 2022
Non-Axiomatic Term Logic: A Computational Theory of Cognitive Symbolic Reasoning
Kotaro Funakoshi
NAI
62
1
0
12 Oct 2022
Visual Prompting for Adversarial Robustness
Aochuan Chen
P. Lorenz
Yuguang Yao
Pin-Yu Chen
Sijia Liu
VLM
VPVLM
118
35
0
12 Oct 2022
Language Models are Realistic Tabular Data Generators
V. Borisov
Kathrin Seßler
Tobias Leemann
Martin Pawelczyk
Gjergji Kasneci
LMTD
117
255
0
12 Oct 2022
Zero-Shot On-the-Fly Event Schema Induction
Rotem Dror
Haoyu Wang
Dan Roth
93
18
0
12 Oct 2022
Zero-Shot Prompting for Implicit Intent Prediction and Recommendation with Commonsense Reasoning
Hui-Chi Kuo
Yun-Nung Chen
79
10
0
12 Oct 2022
Underspecification in Scene Description-to-Depiction Tasks
Ben Hutchinson
Jason Baldridge
Vinodkumar Prabhakaran
DiffM
128
34
0
11 Oct 2022
Designing Robust Transformers using Robust Kernel Density Estimation
Xing Han
Zhaolin Ren
T. Nguyen
Khai Nguyen
Joydeep Ghosh
Nhat Ho
110
6
0
11 Oct 2022
Decoupled Context Processing for Context Augmented Language Modeling
Zonglin Li
Ruiqi Guo
Surinder Kumar
RALM
KELM
82
24
0
11 Oct 2022
Visual Language Maps for Robot Navigation
Chen Huang
Oier Mees
Andy Zeng
Wolfram Burgard
LM&Ro
283
371
0
11 Oct 2022
Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation
Long Phan
Tai Dang
H. Tran
Trieu H. Trinh
Vy Phan
Lam D. Chau
Minh-Thang Luong
56
8
0
11 Oct 2022
Continual Training of Language Models for Few-Shot Learning
Zixuan Ke
Haowei Lin
Yijia Shao
Hu Xu
Lei Shu
Bin Liu
KELM
BDL
CLL
136
35
0
11 Oct 2022
Robust and Controllable Object-Centric Learning through Energy-based Models
Ruixiang Zhang
Tong Che
Boris Ivanovic
Renhao Wang
Marco Pavone
Yoshua Bengio
Liam Paull
OCL
97
8
0
11 Oct 2022
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models
Qihuang Zhong
Liang Ding
Li Shen
Peng Mi
Juhua Liu
Bo Du
Dacheng Tao
AAML
90
51
0
11 Oct 2022
MiDe22: An Annotated Multi-Event Tweet Dataset for Misinformation Detection
Cagri Toraman
Oguzhan Ozcelik
Furkan Şahinuç
Fazli Can
47
13
0
11 Oct 2022
Mind's Eye: Grounded Language Model Reasoning through Simulation
Ruibo Liu
Jason W. Wei
S. Gu
Te-Yen Wu
Soroush Vosoughi
Claire Cui
Denny Zhou
Andrew M. Dai
ReLM
LRM
217
83
0
11 Oct 2022
Transformers generalize differently from information stored in context vs in weights
Stephanie C. Y. Chan
Ishita Dasgupta
Junkyung Kim
D. Kumaran
Andrew Kyle Lampinen
Felix Hill
214
50
0
11 Oct 2022
Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials
Aviral Kumar
Anika Singh
F. Ebert
Mitsuhiko Nakamoto
Yanlai Yang
Chelsea Finn
Sergey Levine
OffRL
OnRL
218
71
0
11 Oct 2022
Reflection of Thought: Inversely Eliciting Numerical Reasoning in Language Models via Solving Linear Systems
Fan Zhou
Haoyu Dong
Qian Liu
Zhoujun Cheng
Shi Han
Dongmei Zhang
ReLM
LRM
85
6
0
11 Oct 2022
Energy-Efficient Deployment of Machine Learning Workloads on Neuromorphic Hardware
Peyton S. Chandarana
Mohammadreza Mohammadi
J. Seekings
Ramtin Zand
73
6
0
10 Oct 2022