Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.07118
Cited By
v1
v2 (latest)
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
15 September 2020
Timo Schick
Hinrich Schütze
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"
50 / 613 papers shown
Title
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
189
1,094
0
01 Nov 2021
Deep Transfer Learning & Beyond: Transformer Language Models in Information Systems Research
Ross Gruetzemacher
D. Paradice
78
35
0
18 Oct 2021
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models
Woojeong Jin
Yu Cheng
Yelong Shen
Weizhu Chen
Xiang Ren
VLM
VPVLM
MLLM
117
138
0
16 Oct 2021
Control Prefixes for Parameter-Efficient Text Generation
Jordan Clive
Kris Cao
Marek Rei
120
32
0
15 Oct 2021
Few-Shot Bot: Prompt-Based Learning for Dialogue Systems
Andrea Madotto
Zhaojiang Lin
Genta Indra Winata
Pascale Fung
93
85
0
15 Oct 2021
SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer
Tu Vu
Brian Lester
Noah Constant
Rami Al-Rfou
Daniel Cer
VLM
LRM
214
290
0
15 Oct 2021
Exploring Universal Intrinsic Task Subspace via Prompt Tuning
Yujia Qin
Xiaozhi Wang
Yusheng Su
Yankai Lin
Ning Ding
...
Juanzi Li
Lei Hou
Peng Li
Maosong Sun
Jie Zhou
VLM
VPVLM
188
29
0
15 Oct 2021
Meta-learning via Language Model In-context Tuning
Yanda Chen
Ruiqi Zhong
Sheng Zha
George Karypis
He He
308
162
0
15 Oct 2021
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Xiao Liu
Kaixuan Ji
Yicheng Fu
Weng Lam Tam
Zhengxiao Du
Zhilin Yang
Jie Tang
VLM
297
863
0
14 Oct 2021
Can Machines Learn Morality? The Delphi Experiment
Liwei Jiang
Jena D. Hwang
Chandra Bhagavatula
Ronan Le Bras
Jenny T Liang
...
Yulia Tsvetkov
Oren Etzioni
Maarten Sap
Regina A. Rini
Yejin Choi
FaML
208
122
0
14 Oct 2021
Teaching Models new APIs: Domain-Agnostic Simulators for Task Oriented Dialogue
Moya Chen
Paul A. Crook
Stephen Roller
ALM
73
7
0
13 Oct 2021
LiST: Lite Prompted Self-training Makes Parameter-Efficient Few-shot Learners
Yaqing Wang
Subhabrata Mukherjee
Xiaodong Liu
Jing Gao
Ahmed Hassan Awadallah
Jianfeng Gao
VLM
BDL
106
11
0
12 Oct 2021
Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems
Po-Nien Kung
Chung-Cheng Chang
Tse-Hsuan Yang
H. Hsu
Yu-Jia Liou
Yun-Nung Chen
65
6
0
11 Oct 2021
The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design
Yoav Levine
Noam Wies
Daniel Jannai
D. Navon
Yedid Hoshen
Amnon Shashua
AI4CE
112
37
0
09 Oct 2021
A Few More Examples May Be Worth Billions of Parameters
Yuval Kirstain
Patrick Lewis
Sebastian Riedel
Omer Levy
121
21
0
08 Oct 2021
Revisiting Self-Training for Few-Shot Learning of Language Model
Yiming Chen
Yan Zhang
Chen Zhang
Grandee Lee
Ran Cheng
Haizhou Li
66
42
0
04 Oct 2021
RAFT: A Real-World Few-Shot Text Classification Benchmark
Neel Alex
Eli Lifland
Lewis Tunstall
A. Thakur
Pegah Maham
...
Carolyn Ashurst
Paul Sedille
A. Carlier
M. Noetel
Andreas Stuhlmuller
RALM
213
56
0
28 Sep 2021
Template-free Prompt Tuning for Few-shot NER
Ruotian Ma
Xin Zhou
Tao Gui
Y. Tan
Linyang Li
Qi Zhang
Xuanjing Huang
VLM
224
183
0
28 Sep 2021
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding
Yanan Zheng
Jing Zhou
Yujie Qian
Ming Ding
Chonghua Liao
Jian Li
Ruslan Salakhutdinov
Jie Tang
Sebastian Ruder
Zhilin Yang
ELM
276
29
0
27 Sep 2021
Paradigm Shift in Natural Language Processing
Tianxiang Sun
Xiangyang Liu
Xipeng Qiu
Xuanjing Huang
218
82
0
26 Sep 2021
CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models
Yuan Yao
Ao Zhang
Zhengyan Zhang
Zhiyuan Liu
Tat-Seng Chua
Maosong Sun
MLLM
VPVLM
VLM
294
224
0
24 Sep 2021
Towards Zero-Label Language Learning
Zirui Wang
Adams Wei Yu
Orhan Firat
Yuan Cao
SyDa
244
105
0
19 Sep 2021
Primer: Searching for Efficient Transformers for Language Modeling
David R. So
Wojciech Mañke
Hanxiao Liu
Zihang Dai
Noam M. Shazeer
Quoc V. Le
VLM
277
156
0
17 Sep 2021
Efficient Attribute Injection for Pretrained Language Models
Reinald Kim Amplayo
Kang Min Yoo
Sang-Woo Lee
43
0
0
16 Sep 2021
Language Models are Few-shot Multilingual Learners
Genta Indra Winata
Andrea Madotto
Zhaojiang Lin
Rosanne Liu
J. Yosinski
Pascale Fung
ELM
LRM
110
138
0
16 Sep 2021
On the Universality of Deep Contextual Language Models
Shaily Bhatt
Poonam Goyal
Sandipan Dandapat
Monojit Choudhury
Sunayana Sitaram
ELM
57
5
0
15 Sep 2021
STraTA: Self-Training with Task Augmentation for Better Few-shot Learning
Tu Vu
Minh-Thang Luong
Quoc V. Le
Grady Simon
Mohit Iyyer
172
61
0
13 Sep 2021
Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training
Momchil Hardalov
Arnav Arora
Preslav Nakov
Isabelle Augenstein
88
63
0
13 Sep 2021
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
Yunfan Shao
Zhichao Geng
Yitao Liu
Junqi Dai
Hang Yan
Fei Yang
Li Zhe
Hujun Bao
Xipeng Qiu
MedIm
139
151
0
13 Sep 2021
MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets
Shraman Pramanick
Shivam Sharma
Dimitar Dimitrov
Md. Shad Akhtar
Preslav Nakov
Tanmoy Chakraborty
77
130
0
11 Sep 2021
What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers
Boseop Kim
Hyoungseok Kim
Sang-Woo Lee
Gichang Lee
Donghyun Kwak
...
Jaewook Kang
Inho Kang
Jung-Woo Ha
W. Park
Nako Sung
VLM
290
124
0
10 Sep 2021
CINS: Comprehensive Instruction for Few-shot Learning in Task-oriented Dialog Systems
Fei Mi
Yitong Li
Yasheng Wang
Xin Jiang
Qun Liu
94
43
0
10 Sep 2021
PPT: Pre-trained Prompt Tuning for Few-shot Learning
Yuxian Gu
Xu Han
Zhiyuan Liu
Minlie Huang
VLM
146
419
0
09 Sep 2021
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning
Prasetya Ajie Utama
N. Moosavi
Victor Sanh
Iryna Gurevych
AAML
128
36
0
09 Sep 2021
A Recipe For Arbitrary Text Style Transfer with Large Language Models
Emily Reif
Daphne Ippolito
Ann Yuan
Andy Coenen
Chris Callison-Burch
Jason W. Wei
307
120
0
08 Sep 2021
Continuous Entailment Patterns for Lexical Inference in Context
Martin Schmitt
Hinrich Schütze
78
3
0
08 Sep 2021
Label Verbalization and Entailment for Effective Zero- and Few-Shot Relation Extraction
Oscar Sainz
Oier López de Lacalle
Gorka Labaka
Ander Barrena
Eneko Agirre
49
126
0
08 Sep 2021
Discrete and Soft Prompting for Multilingual Models
Mengjie Zhao
Hinrich Schütze
LRM
87
72
0
08 Sep 2021
NSP-BERT: A Prompt-based Few-Shot Learner Through an Original Pre-training Task--Next Sentence Prediction
Yi Sun
Yu Zheng
Chao Hao
Hangping Qiu
VLM
107
37
0
08 Sep 2021
FewshotQA: A simple framework for few-shot learning of question answering tasks using pre-trained text-to-text models
Rakesh Chada
P. Natarajan
84
46
0
04 Sep 2021
Do Prompt-Based Models Really Understand the Meaning of their Prompts?
Albert Webson
Ellie Pavlick
LRM
132
374
0
02 Sep 2021
ConQX: Semantic Expansion of Spoken Queries for Intent Detection based on Conditioned Text Generation
E. Yilmaz
Cagri Toraman
40
1
0
02 Sep 2021
It's not Rocket Science : Interpreting Figurative Language in Narratives
Tuhin Chakrabarty
Yejin Choi
Vered Shwartz
97
58
0
31 Aug 2021
Semi-Supervised Exaggeration Detection of Health Science Press Releases
Dustin Wright
Isabelle Augenstein
83
13
0
30 Aug 2021
Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
Ningyu Zhang
Luoqiu Li
Xiang Chen
Shumin Deng
Zhen Bi
Chuanqi Tan
Fei Huang
Huajun Chen
VLM
146
180
0
30 Aug 2021
WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding
Guoqing Zheng
Giannis Karamanolakis
Kai Shu
Ahmed Hassan Awadallah
SSL
53
1
0
28 Aug 2021
The SelectGen Challenge: Finding the Best Training Samples for Few-Shot Neural Text Generation
Ernie Chang
Xiaoyu Shen
Alex Marin
Vera Demberg
49
9
0
14 Aug 2021
FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning
Jing Zhou
Yanan Zheng
Jie Tang
Jian Li
Zhilin Yang
VLM
89
80
0
13 Aug 2021
How Optimal is Greedy Decoding for Extractive Question Answering?
Or Castel
Ori Ram
Avia Efrat
Omer Levy
84
4
0
12 Aug 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
VLM
LM&MA
103
270
0
12 Aug 2021
Previous
1
2
3
...
10
11
12
13
Next