2005.14165
Cited By
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing
"Language Models are Few-Shot Learners"
50 / 12,362 papers shown
Title
This joke is [MASK]: Recognizing Humor and Offense with Prompting
Junze Li
Mengjie Zhao
Yubo Xie
Antonis Maronikolakis
Pearl Pu
Hinrich Schütze
AAML
61
1
0
25 Oct 2022
Differentially Private Language Models for Secure Data Sharing
Justus Mattern
Zhijing Jin
Benjamin Weggenmann
Bernhard Schoelkopf
Mrinmaya Sachan
SyDa
104
52
0
25 Oct 2022
SepLL: Separating Latent Class Labels from Weak Supervision Noise
Andreas Stephan
Vasiliki Kougia
Benjamin Roth
46
8
0
25 Oct 2022
Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation
Melanie Sclar
Peter West
Sachin Kumar
Yulia Tsvetkov
Yejin Choi
59
20
0
25 Oct 2022
IDK-MRC: Unanswerable Questions for Indonesian Machine Reading Comprehension
Rifki Afina Putri
Alice Oh
99
10
0
25 Oct 2022
DEMETR: Diagnosing Evaluation Metrics for Translation
Marzena Karpinska
N. Raj
Katherine Thai
Yixiao Song
Ankita Gupta
Mohit Iyyer
87
39
0
25 Oct 2022
Better Few-Shot Relation Extraction with Label Prompt Dropout
Peiyuan Zhang
Wei Lu
VLM
70
28
0
25 Oct 2022
Parameter-Efficient Legal Domain Adaptation
Jonathan Li
R. Bhambhoria
Xiao-Dan Zhu
ELM
AILaw
ALM
80
14
0
25 Oct 2022
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence
Hung-Ting Chen
Michael J.Q. Zhang
Eunsol Choi
RALM
HILM
141
100
0
25 Oct 2022
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing
Peng Shi
Rui Zhang
Richard He Bai
Jimmy J. Lin
RALM
98
45
0
25 Oct 2022
Evaluating Parameter Efficient Learning for Generation
Peng Xu
M. Patwary
Shrimai Prabhumoye
Virginia Adams
R. Prenger
Ming-Yu Liu
Nayeon Lee
Mohammad Shoeybi
Bryan Catanzaro
MoE
67
3
0
25 Oct 2022
Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry Writing
Tuhin Chakrabarty
Vishakh Padmakumar
Hengxing He
82
82
0
25 Oct 2022
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Baihan Lin
OffRL
AI4TS
127
27
0
24 Oct 2022
Adapters for Enhanced Modeling of Multilingual Knowledge and Text
Buse Giledereli
Wenxiang Jiao
Mei-Jun Liu
Carl Allen
Zhaopeng Tu
Mrinmaya Sachan
86
11
0
24 Oct 2022
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
99
169
0
24 Oct 2022
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task
Kenneth Li
Aspen K. Hopkins
David Bau
Fernanda Viégas
Hanspeter Pfister
Martin Wattenberg
MILM
180
297
0
24 Oct 2022
Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs
Maarten Sap
Ronan Le Bras
Daniel Fried
Yejin Choi
101
232
0
24 Oct 2022
Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Delta Tuning
Jing Yi
Weize Chen
Yujia Qin
Yankai Lin
Ning Ding
Xu Han
Zhiyuan Liu
Maosong Sun
Jie Zhou
113
2
0
24 Oct 2022
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
81
17
0
24 Oct 2022
Mutual Information Alleviates Hallucinations in Abstractive Summarization
Liam van der Poel
Ryan Cotterell
Clara Meister
HILM
109
61
0
24 Oct 2022
Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models
Stelios Maroudas
Sotiris Legkas
Prodromos Malakasiotis
Ilias Chalkidis
VLM
AILaw
ALM
ELM
80
4
0
24 Oct 2022
A Unified Framework for Pun Generation with Humor Principles
Yufei Tian
Divyanshu Sheth
Nanyun Peng
87
14
0
24 Oct 2022
Subspace Representations for Soft Set Operations and Sentence Similarities
Yoichi Ishibashi
Sho Yokoi
Katsuhito Sudoh
Satoshi Nakamura
NAI
64
1
0
24 Oct 2022
Investigating the detection of Tortured Phrases in Scientific Literature
Puthineath Lay
M. Lentschat
Cyril Labbe
59
5
0
24 Oct 2022
Exploring Euphemism Detection in Few-Shot and Zero-Shot Settings
Sedrick Scott Keh
50
7
0
24 Oct 2022
TIARA: Multi-grained Retrieval for Robust Question Answering over Large Knowledge Bases
Yiheng Shu
Zhiwei Yu
Yuhan Li
Börje F. Karlsson
Tingting Ma
Yuzhong Qu
Chin-Yew Lin
89
74
0
24 Oct 2022
Retrieval Augmentation for Commonsense Reasoning: A Unified Approach
Wenhao Yu
Chenguang Zhu
Zhihan Zhang
Shuohang Wang
Zhuosheng Zhang
Yuwei Fang
Meng Jiang
LRM
ReLM
64
19
0
23 Oct 2022
Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification
Junfei Xiao
Yutong Bai
Alan Yuille
Zongwei Zhou
MedIm
ViT
82
62
0
23 Oct 2022
Towards Pragmatic Production Strategies for Natural Language Generation Tasks
Mario Giulianelli
32
6
0
23 Oct 2022
Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification
Zhipeng Xie
Yahe Li
49
0
0
23 Oct 2022
Adversarial Pretraining of Self-Supervised Deep Networks: Past, Present and Future
Guo-Jun Qi
M. Shah
SSL
78
8
0
23 Oct 2022
Generative Knowledge Graph Construction: A Review
Hongbin Ye
Ningyu Zhang
Hui Chen
Huajun Chen
125
75
0
23 Oct 2022
Neural Eigenfunctions Are Structured Representation Learners
Zhijie Deng
Jiaxin Shi
Hao Zhang
Peng Cui
Cewu Lu
Jun Zhu
109
14
0
23 Oct 2022
Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning
Xiangyu Peng
Chen Xing
Prafulla Kumar Choubey
Chien-Sheng Wu
Caiming Xiong
VLM
137
12
0
23 Oct 2022
Language Model Pre-Training with Sparse Latent Typing
Liliang Ren
Zixuan Zhang
H. Wang
Clare R. Voss
Chengxiang Zhai
Heng Ji
95
3
0
23 Oct 2022
The Curious Case of Absolute Position Embeddings
Koustuv Sinha
Amirhossein Kazemnejad
Siva Reddy
J. Pineau
Dieuwke Hupkes
Adina Williams
135
15
0
23 Oct 2022
Greedy Modality Selection via Approximate Submodular Maximization
Runxiang Cheng
Gargi Balasubramaniam
Yifei He
Yao-Hung Hubert Tsai
Han Zhao
54
1
0
22 Oct 2022
LMPriors: Pre-Trained Language Models as Task-Specific Priors
Kristy Choi
Chris Cundy
Sanjari Srivastava
Stefano Ermon
BDL
112
43
0
22 Oct 2022
Exploring The Landscape of Distributional Robustness for Question Answering Models
Anas Awadalla
Mitchell Wortsman
Gabriel Ilharco
Sewon Min
Ian H. Magnusson
Hannaneh Hajishirzi
Ludwig Schmidt
ELM
OOD
KELM
116
21
0
22 Oct 2022
ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples
Yilun Zhao
Linyong Nan
Zhenting Qi
Rui Zhang
Dragomir R. Radev
ReLM
LMTD
LRM
109
39
0
22 Oct 2022
Prompt-Tuning Can Be Much Better Than Fine-Tuning on Cross-lingual Understanding With Multilingual Language Models
Lifu Tu
Caiming Xiong
Yingbo Zhou
VLM
AAML
LRM
143
28
0
22 Oct 2022
Meta-learning Pathologies from Radiology Reports using Variance Aware Prototypical Networks
Arijit Sehanobish
Kawshik Kannan
Nabila Abraham
Anasuya Das
Benjamin Odry
VLM
75
0
0
22 Oct 2022
Leveraging Large Language Models for Multiple Choice Question Answering
Joshua Robinson
Christopher Rytting
David Wingate
ELM
244
200
0
22 Oct 2022
P$^3$LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training
Junwei Bao
Yifan Wang
Jiangyong Ying
Yeyun Gong
Jing Zhao
Youzheng Wu
Xiaodong He
70
1
0
22 Oct 2022
ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback
Jiacheng Ye
Jiahui Gao
Jiangtao Feng
Zhiyong Wu
Tao Yu
Lingpeng Kong
SyDa
VLM
166
78
0
22 Oct 2022
BEANS: The Benchmark of Animal Sounds
Masato Hagiwara
Benjamin Hoffman
Jen-Yu Liu
M. Cusimano
Felix Effenberger
Katie Zacarian
97
27
0
21 Oct 2022
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs
Albert Q. Jiang
Sean Welleck
Jin Peng Zhou
Wenda Li
Jiacheng Liu
M. Jamnik
Timothée Lacroix
Yuhuai Wu
Guillaume Lample
AIMat
157
181
0
21 Oct 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Yue Yang
Wenlin Yao
Hongming Zhang
Xiaoyang Wang
Dong Yu
Jianshu Chen
VLM
99
22
0
21 Oct 2022
TCAB: A Large-Scale Text Classification Attack Benchmark
Kalyani Asthana
Zhouhang Xie
Wencong You
Adam Noack
Jonathan Brophy
Sameer Singh
Daniel Lowd
119
3
0
21 Oct 2022
Entailer: Answering Questions with Faithful and Truthful Chains of Reasoning
Oyvind Tafjord
Bhavana Dalvi
Peter Clark
ReLM
KELM
LRM
158
54
0
21 Oct 2022