arXiv:2005.14165
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing
"Language Models are Few-Shot Learners"
50 / 11,082 papers shown
Title
Seeking an Optimal Approach for Computer-Aided Pulmonary Embolism Detection
N. Islam
S. Gehlot
Zongwei Zhou
Michael B. Gotway
Jianming Liang
OOD
88
11
0
15 Sep 2021
Automatically Exposing Problems with Neural Dialog Models
Dian Yu
Kenji Sagae
31
9
0
14 Sep 2021
A Temporal Variational Model for Story Generation
David Wilmot
Frank Keller
DRL
35
8
0
14 Sep 2021
'Just What do You Think You're Doing, Dave?' A Checklist for Responsible Data Use in NLP
Anna Rogers
Timothy Baldwin
Kobi Leins
104
64
0
14 Sep 2021
Sum-Product-Attention Networks: Leveraging Self-Attention in Probabilistic Circuits
Zhongjie Yu
Devendra Singh Dhami
Kristian Kersting
TPM
3DPC
LRM
26
0
0
14 Sep 2021
Exploring Prompt-based Few-shot Learning for Grounded Dialog Generation
Chujie Zheng
Minlie Huang
59
44
0
14 Sep 2021
Task-adaptive Pre-training and Self-training are Complementary for Natural Language Understanding
Shiyang Li
Semih Yavuz
Wenhu Chen
Xifeng Yan
22
12
0
14 Sep 2021
Connecting degree and polarity: An artificial language learning study
Lisa Bylinina
Alexey Tikhonov
Ekaterina Garmash
AI4CE
14
0
0
13 Sep 2021
Packed Levitated Marker for Entity and Relation Extraction
Deming Ye
Yankai Lin
Peng Li
Maosong Sun
141
106
0
13 Sep 2021
Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training
Momchil Hardalov
Arnav Arora
Preslav Nakov
Isabelle Augenstein
30
58
0
13 Sep 2021
How to Select One Among All? An Extensive Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding
Tianda Li
Ahmad Rashid
A. Jafari
Pranav Sharma
A. Ghodsi
Mehdi Rezagholizadeh
AAML
33
5
0
13 Sep 2021
Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
Runxin Xu
Fuli Luo
Zhiyuan Zhang
Chuanqi Tan
Baobao Chang
Songfang Huang
Fei Huang
LRM
151
178
0
13 Sep 2021
TEASEL: A Transformer-Based Speech-Prefixed Language Model
Mehdi Arjmand
M. Dousti
H. Moradi
33
18
0
12 Sep 2021
Compute and Energy Consumption Trends in Deep Learning Inference
Radosvet Desislavov
Fernando Martínez-Plumed
José Hernández-Orallo
35
113
0
12 Sep 2021
Guiding Topic Flows in the Generative Chatbot by Enhancing the ConceptNet with the Conversation Corpora
Pengda Si
Yao Qiu
Jinchao Zhang
Yujiu Yang
26
0
0
12 Sep 2021
Multilingual Translation via Grafting Pre-trained Language Models
Zewei Sun
Mingxuan Wang
Lei Li
AI4CE
191
22
0
11 Sep 2021
Towards Zero-shot Commonsense Reasoning with Self-supervised Refinement of Language Models
T. Klein
Moin Nabi
ReLM
LRM
35
8
0
10 Sep 2021
Entity-Based Knowledge Conflicts in Question Answering
Shayne Longpre
Kartik Perisetla
Anthony Chen
Nikhil Ramesh
Chris DuBois
Sameer Singh
HILM
245
239
0
10 Sep 2021
Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding
Shane Storks
Qiaozi Gao
Yichi Zhang
J. Chai
ReLM
LRM
49
22
0
10 Sep 2021
EfficientCLIP: Efficient Cross-Modal Pre-training by Ensemble Confident Learning and Language Modeling
Jue Wang
Haofan Wang
Jincan Deng
Weijia Wu
Debing Zhang
VLM
CLIP
67
18
0
10 Sep 2021
Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning
Seonghyeon Ye
Jiseon Kim
Alice H. Oh
CLL
VLM
24
21
0
10 Sep 2021
What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers
Boseop Kim
Hyoungseok Kim
Sang-Woo Lee
Gichang Lee
Donghyun Kwak
...
Jaewook Kang
Inho Kang
Jung-Woo Ha
W. Park
Nako Sung
VLM
249
121
0
10 Sep 2021
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation
Haoran Xu
Benjamin Van Durme
Kenton W. Murray
50
57
0
09 Sep 2021
Medically Aware GPT-3 as a Data Generator for Medical Dialogue Summarization
Bharath Chintagunta
Namit Katariya
X. Amatriain
Anitha Kannan
LM&MA
MedIm
131
149
0
09 Sep 2021
TxT: Crossmodal End-to-End Learning with Transformers
Jan-Martin O. Steitz
Jonas Pfeiffer
Iryna Gurevych
Stefan Roth
LRM
21
2
0
09 Sep 2021
ErfAct and Pserf: Non-monotonic Smooth Trainable Activation Functions
Koushik Biswas
Sandeep Kumar
Shilpak Banerjee
A. Pandey
51
13
0
09 Sep 2021
PPT: Pre-trained Prompt Tuning for Few-shot Learning
Yuxian Gu
Xu Han
Zhiyuan Liu
Minlie Huang
VLM
54
403
0
09 Sep 2021
MetaXT: Meta Cross-Task Transfer between Disparate Label Spaces
Srinagesh Sharma
Guoqing Zheng
Ahmed Hassan Awadallah
27
1
0
09 Sep 2021
Efficient Nearest Neighbor Language Models
Junxian He
Graham Neubig
Taylor Berg-Kirkpatrick
RALM
195
103
0
09 Sep 2021
Learning the Physics of Particle Transport via Transformers
O. Pastor-Serrano
Zoltán Perkó
MedIm
21
13
0
08 Sep 2021
What's Hidden in a One-layer Randomly Weighted Transformer?
Sheng Shen
Z. Yao
Douwe Kiela
Kurt Keutzer
Michael W. Mahoney
32
4
0
08 Sep 2021
Sustainable Modular Debiasing of Language Models
Anne Lauscher
Tobias Lüken
Goran Glavas
55
120
0
08 Sep 2021
Discrete and Soft Prompting for Multilingual Models
Mengjie Zhao
Hinrich Schütze
LRM
18
71
0
08 Sep 2021
NSP-BERT: A Prompt-based Few-Shot Learner Through an Original Pre-training Task--Next Sentence Prediction
Yi Sun
Yu Zheng
Chao Hao
Hangping Qiu
VLM
41
37
0
08 Sep 2021
Towards Natural Language Interfaces for Data Visualization: A Survey
Leixian Shen
Enya Shen
Yuyu Luo
Xiaocong Yang
Xuming Hu
Xiongshuai Zhang
Zhiwei Tai
Jianmin Wang
29
137
0
08 Sep 2021
Text-Free Prosody-Aware Generative Spoken Language Modeling
Eugene Kharitonov
Ann Lee
Adam Polyak
Yossi Adi
Jade Copet
...
Tu Nguyen
M. Rivière
Abdel-rahman Mohamed
Emmanuel Dupoux
Wei-Ning Hsu
35
116
0
07 Sep 2021
NumGPT: Improving Numeracy Ability of Generative Pre-trained Models
Zhihua Jin
Xin Jiang
Xingbo Wang
Qun Liu
Yong Wang
Xiaozhe Ren
Huamin Qu
16
19
0
07 Sep 2021
LEAF: Navigating Concept Drift in Cellular Networks
Shinan Liu
F. Bronzino
Paul Schmitt
A. Bhagoji
Nick Feamster
Héctor García Crespo
Timothy T Coyle
Brian Ward
17
12
0
07 Sep 2021
Datasets: A Community Library for Natural Language Processing
Quentin Lhoest
Albert Villanova del Moral
Yacine Jernite
A. Thakur
Patrick von Platen
...
Thibault Goehringer
Victor Mustar
François Lagunas
Alexander M. Rush
Thomas Wolf
30
580
0
07 Sep 2021
General-Purpose Question-Answering with Macaw
Oyvind Tafjord
Peter Clark
SyDa
ELM
MLLM
19
59
0
06 Sep 2021
GPT-3 Models are Poor Few-Shot Learners in the Biomedical Domain
M. Moradi
Kathrin Blagec
F. Haberl
Matthias Samwald
LM&MA
AI4MH
36
65
0
06 Sep 2021
Modular Framework for Visuomotor Language Grounding
Kolby Nottingham
Litian Liang
Daeyun Shin
Charless C. Fowlkes
Roy Fox
Sameer Singh
24
12
0
05 Sep 2021
FewshotQA: A simple framework for few-shot learning of question answering tasks using pre-trained text-to-text models
Rakesh Chada
P. Natarajan
36
45
0
04 Sep 2021
Robust fine-tuning of zero-shot models
Mitchell Wortsman
Gabriel Ilharco
Jong Wook Kim
Mike Li
Simon Kornblith
...
Raphael Gontijo-Lopes
Hannaneh Hajishirzi
Ali Farhadi
Hongseok Namkoong
Ludwig Schmidt
VLM
64
691
0
04 Sep 2021
Error Detection in Large-Scale Natural Language Understanding Systems Using Transformer Models
Rakesh Chada
P. Natarajan
Darshan Fofadiya
Prathap Ramachandra
27
6
0
04 Sep 2021
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALM
UQCV
35
3,576
0
03 Sep 2021
A brief history of AI: how to prevent another winter (a critical review)
Amirhosein Toosi
A. Bottino
Babak Saboury
E. Siegel
Arman Rahmim
24
72
0
03 Sep 2021
Do Prompt-Based Models Really Understand the Meaning of their Prompts?
Albert Webson
Ellie Pavlick
LRM
53
355
0
02 Sep 2021
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
348
2,271
0
02 Sep 2021
How Suitable Are Subword Segmentation Strategies for Translating Non-Concatenative Morphology?
Chantal Amrhein
Rico Sennrich
27
13
0
02 Sep 2021