2009.07118
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
15 September 2020
Timo Schick
Hinrich Schütze
Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"
50 / 613 papers shown
Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again
Bernal Jiménez Gutiérrez
Nikolas McNeal
Clay Washington
You Chen
Lang Li
Huan Sun
Yu-Chuan Su
105
157
0
16 Mar 2022
Things not Written in Text: Exploring Spatial Commonsense from Visual Signals
Xiao Liu
Da Yin
Yansong Feng
Dongyan Zhao
LRM
80
46
0
15 Mar 2022
Modular and Parameter-Efficient Multimodal Fusion with Prompting
Sheng Liang
Mengjie Zhao
Hinrich Schütze
90
45
0
15 Mar 2022
GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models
Archiki Prasad
Peter Hase
Xiang Zhou
Joey Tianyi Zhou
115
124
0
14 Mar 2022
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment
Haoyu Song
Li Dong
Weinan Zhang
Ting Liu
Furu Wei
VLM
CLIP
89
139
0
14 Mar 2022
Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning
Yuxin Jiang
Linhan Zhang
Wei Wang
SSL
59
47
0
14 Mar 2022
Continual Prompt Tuning for Dialog State Tracking
Qi Zhu
Bing Li
Fei Mi
Xiaoyan Zhu
Minlie Huang
CLL
KELM
90
60
0
13 Mar 2022
HyperMixer: An MLP-based Low Cost Alternative to Transformers
Florian Mai
Arnaud Pannatier
Fabio Fehr
Haolin Chen
François Marelli
François Fleuret
James Henderson
77
11
0
07 Mar 2022
Pre-trained Token-replaced Detection Model as Few-shot Learner
Zicheng Li
Shoushan Li
Guodong Zhou
77
9
0
07 Mar 2022
Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Shengnan An
Yifei Li
Zeqi Lin
Qian Liu
Bei Chen
Qiang Fu
Weizhu Chen
Nanning Zheng
Jian-Guang Lou
VLM
AAML
88
43
0
07 Mar 2022
Dialogue Summaries as Dialogue States (DS2), Template-Guided Summarization for Few-shot Dialogue State Tracking
Jamin Shin
Hangyeol Yu
Hyeongdon Moon
Andrea Madotto
Juneyoung Park
79
29
0
03 Mar 2022
QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition
Andy T. Liu
Wei Xiao
Henghui Zhu
Dejiao Zhang
Shang-Wen Li
Andrew O. Arnold
76
27
0
03 Mar 2022
Do Prompts Solve NLP Tasks Using Natural Language?
Sen Yang
Yunchen Zhang
Leyang Cui
Yue Zhang
LRM
80
4
0
02 Mar 2022
Attend, Memorize and Generate: Towards Faithful Table-to-Text Generation in Few Shots
Wenting Zhao
Ye Liu
Yao Wan
Philip S. Yu
60
11
0
01 Mar 2022
EPPAC: Entity Pre-typing Relation Classification with Prompt AnswerCentralizing
Jiejun Tan
Wenbin Hu
Weiwei Liu
86
1
0
01 Mar 2022
SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization
Austin W. Hanjie
Ameet Deshpande
Karthik Narasimhan
VLM
69
2
0
26 Feb 2022
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
Sewon Min
Xinxi Lyu
Ari Holtzman
Mikel Artetxe
M. Lewis
Hannaneh Hajishirzi
Luke Zettlemoyer
LLMAG
LRM
193
1,502
0
25 Feb 2022
Prompt-Learning for Short Text Classification
Yi Zhu
Xinke Zhou
Jipeng Qiang
Yun Li
Yunhao Yuan
Xindong Wu
VLM
59
36
0
23 Feb 2022
Revisiting Parameter-Efficient Tuning: Are We Really There Yet?
Guanzheng Chen
Fangyu Liu
Zaiqiao Meng
Shangsong Liang
64
95
0
16 Feb 2022
P4E: Few-Shot Event Detection as Prompt-Guided Identification and Localization
Sha Li
Liyuan Liu
Yiqing Xie
Heng Ji
Jiawei Han
98
4
0
15 Feb 2022
Enhancing Cross-lingual Prompting with Dual Prompt Augmentation
Meng Zhou
Xin Li
Yuechun Jiang
Lidong Bing
LRM
69
6
0
15 Feb 2022
Semantic-Oriented Unlabeled Priming for Large-Scale Language Models
Yanchen Liu
Timo Schick
Hinrich Schütze
VLM
66
15
0
12 Feb 2022
InPars: Data Augmentation for Information Retrieval using Large Language Models
L. Bonifacio
Hugo Queiroz Abonizio
Marzieh Fadaee
Rodrigo Nogueira
VLM
RALM
91
62
0
10 Feb 2022
AdaPrompt: Adaptive Model Training for Prompt-based NLP
Yulong Chen
Yang Liu
Li Dong
Shuohang Wang
Chenguang Zhu
Michael Zeng
Yue Zhang
VLM
102
48
0
10 Feb 2022
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Yu Meng
Jiaxin Huang
Yu Zhang
Jiawei Han
SyDa
77
235
0
09 Feb 2022
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models
Wei Ping
Ming-Yu Liu
Chaowei Xiao
Peng Xu
M. Patwary
Mohammad Shoeybi
Yue Liu
Anima Anandkumar
Bryan Catanzaro
100
71
0
08 Feb 2022
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Stephen H. Bach
Victor Sanh
Zheng-Xin Yong
Albert Webson
Colin Raffel
...
Khalid Almubarak
Xiangru Tang
Dragomir R. Radev
Mike Tian-Jian Jiang
Alexander M. Rush
VLM
350
355
0
02 Feb 2022
Retrieve-and-Fill for Scenario-based Task-Oriented Semantic Parsing
Akshat Shrivastava
Shrey Desai
Anchit Gupta
A. Elkahky
Aleksandr Livshits
Alexander Zotov
Ahmed Aly
171
6
0
02 Feb 2022
Protum: A New Method For Prompt Tuning Based on "[MASK]"
Pan He
Yuxi Chen
Yan Wang
Yanru Zhang
AAML
46
3
0
28 Jan 2022
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages
Emanuele Bugliarello
Fangyu Liu
Jonas Pfeiffer
Siva Reddy
Desmond Elliott
Edoardo Ponti
Ivan Vulić
MLLM
VLM
ELM
117
64
0
27 Jan 2022
An Assessment of the Impact of OCR Noise on Language Models
Konstantin Todorov
Giovanni Colavizza
49
7
0
26 Jan 2022
Black-box Prompt Learning for Pre-trained Language Models
Shizhe Diao
Zhichao Huang
Ruijia Xu
Xuechun Li
Yong Lin
Xiao Zhou
Tong Zhang
VLM
AAML
91
71
0
21 Jan 2022
Instance-aware Prompt Learning for Language Understanding and Generation
Feihu Jin
Jinliang Lu
Jiajun Zhang
Chengqing Zong
57
33
0
18 Jan 2022
Black-Box Tuning for Language-Model-as-a-Service
Tianxiang Sun
Yunfan Shao
Hong Qian
Xuanjing Huang
Xipeng Qiu
VLM
181
275
0
10 Jan 2022
ArT: All-round Thinker for Unsupervised Commonsense Question-Answering
Jiawei Wang
Hai Zhao
LLMAG
LRM
79
3
0
26 Dec 2021
Efficient Large Scale Language Modeling with Mixtures of Experts
Mikel Artetxe
Shruti Bhosale
Naman Goyal
Todor Mihaylov
Myle Ott
...
Jeff Wang
Luke Zettlemoyer
Mona T. Diab
Zornitsa Kozareva
Ves Stoyanov
MoE
224
200
0
20 Dec 2021
Reframing Human-AI Collaboration for Generating Free-Text Explanations
Sarah Wiegreffe
Jack Hessel
Swabha Swayamdipta
Mark O. Riedl
Yejin Choi
77
149
0
16 Dec 2021
Analyzing the Limits of Self-Supervision in Handling Bias in Language
Lisa Bauer
Karthik Gopalakrishnan
Spandana Gella
Yang Liu
Joey Tianyi Zhou
Dilek Z. Hakkani-Tür
ELM
32
1
0
16 Dec 2021
NewsClaims: A New Benchmark for Claim Detection from News with Attribute Knowledge
R. Reddy
Sai Chetan Chinthakindi
Zhenhailong Wang
Yi R. Fung
Kathryn Conger
...
Martha Palmer
Preslav Nakov
Eduard H. Hovy
Kevin Small
Heng Ji
VLM
67
29
0
16 Dec 2021
Fine-Tuning Large Neural Language Models for Biomedical Natural Language Processing
Robert Tinn
Hao Cheng
Yu Gu
Naoto Usuyama
Xiaodong Liu
Tristan Naumann
Jianfeng Gao
Hoifung Poon
LM&MA
60
116
0
15 Dec 2021
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework
Mengjie Zhao
Fei Mi
Yasheng Wang
Minglei Li
Xin Jiang
Qun Liu
Hinrich Schütze
RALM
113
11
0
14 Dec 2021
Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer
Yunyun Huang
Xiaoyu Shen
Chuanyi Li
Jidong Ge
B. Luo
AILaw
77
20
0
13 Dec 2021
Prompt-based Zero-shot Relation Extraction with Semantic Knowledge Augmentation
Jiaying Gong
Hoda Eldardiry
58
7
0
08 Dec 2021
True Few-Shot Learning with Prompts -- A Real-World Perspective
Timo Schick
Hinrich Schütze
VLM
110
64
0
26 Nov 2021
Few-shot Named Entity Recognition with Cloze Questions
V. Gatta
V. Moscato
Marco Postiglione
Giancarlo Sperlí
56
4
0
24 Nov 2021
Few-Shot Self-Rationalization with Natural Language Prompts
Ana Marasović
Iz Beltagy
Doug Downey
Matthew E. Peters
LRM
89
110
0
16 Nov 2021
On Transferability of Prompt Tuning for Natural Language Processing
Yusheng Su
Xiaozhi Wang
Yujia Qin
Chi-Min Chan
Yankai Lin
...
Peng Li
Juanzi Li
Lei Hou
Maosong Sun
Jie Zhou
AAML
VLM
81
106
0
12 Nov 2021
A Survey on Green Deep Learning
Jingjing Xu
Wangchunshu Zhou
Zhiyi Fu
Hao Zhou
Lei Li
VLM
201
84
0
08 Nov 2021
CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
Subhabrata Mukherjee
Xiaodong Liu
Guoqing Zheng
Saghar Hosseini
Hao Cheng
Greg Yang
Christopher Meek
Ahmed Hassan Awadallah
Jianfeng Gao
ELM
70
11
0
04 Nov 2021
Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP
Trapit Bansal
K. Gunasekaran
Tong Wang
Tsendsuren Munkhdalai
Andrew McCallum
SSL
OOD
96
20
0
02 Nov 2021