ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
v1v2 (latest)

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXiv (abs)PDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 613 papers shown
Title
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt
  Verbalizer for Text Classification
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification
Shengding Hu
Ning Ding
Huadong Wang
Zhiyuan Liu
Jingang Wang
Juan-Zi Li
Wei Wu
Maosong Sun
VLM
97
372
0
04 Aug 2021
How to Query Language Models?
How to Query Language Models?
Leonard Adolphs
Shehzaad Dhuliawala
Thomas Hofmann
KELM
86
15
0
04 Aug 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods
  in Natural Language Processing
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLMSyDa
271
4,036
0
28 Jul 2021
Bridging the Gap between Language Model and Reading Comprehension:
  Unsupervised MRC via Self-Supervision
Bridging the Gap between Language Model and Reading Comprehension: Unsupervised MRC via Self-Supervision
Ning Bian
Xianpei Han
Bo Chen
Hongyu Lin
Xianpei Han
Le Sun
SSLLRM
96
5
0
19 Jul 2021
FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark
FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark
Liang Xu
Xiaojing Lu
Chenyang Yuan
Xuanwei Zhang
Huilin Xu
...
Guoao Wei
X. Pan
Xin Tian
Libo Qin
Hai Hu
ELM
90
57
0
15 Jul 2021
FLEX: Unifying Evaluation for Few-Shot NLP
FLEX: Unifying Evaluation for Few-Shot NLP
Jonathan Bragg
Arman Cohan
Kyle Lo
Iz Beltagy
270
108
0
15 Jul 2021
HTLM: Hyper-Text Pre-Training and Prompting of Language Models
HTLM: Hyper-Text Pre-Training and Prompting of Language Models
Armen Aghajanyan
Dmytro Okhonko
M. Lewis
Mandar Joshi
Hu Xu
Gargi Ghosh
Luke Zettlemoyer
VLMVPVLMAI4TSAI4CE
73
76
0
14 Jul 2021
Not Quite Ásk a Librarian': AI on the Nature, Value, and Future of LIS
Not Quite Ásk a Librarian': AI on the Nature, Value, and Future of LIS
J. Dinneen
Helen Bubinger
23
3
0
07 Jul 2021
On Training Instance Selection for Few-Shot Neural Text Generation
On Training Instance Selection for Few-Shot Neural Text Generation
Ernie Chang
Xiaoyu Shen
Hui-Syuan Yeh
Vera Demberg
86
42
0
07 Jul 2021
Interviewer-Candidate Role Play: Towards Developing Real-World NLP
  Systems
Interviewer-Candidate Role Play: Towards Developing Real-World NLP Systems
Neeraj Varshney
Swaroop Mishra
Chitta Baral
97
0
0
01 Jul 2021
Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with
  Language Models
Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models
Robert L Logan IV
Ivana Balavzević
Eric Wallace
Fabio Petroni
Sameer Singh
Sebastian Riedel
VPVLM
106
212
0
24 Jun 2021
BARTScore: Evaluating Generated Text as Text Generation
BARTScore: Evaluating Generated Text as Text Generation
Weizhe Yuan
Graham Neubig
Pengfei Liu
173
851
0
22 Jun 2021
CPM-2: Large-scale Cost-effective Pre-trained Language Models
CPM-2: Large-scale Cost-effective Pre-trained Language Models
Zhengyan Zhang
Yuxian Gu
Xu Han
Shengqi Chen
Chaojun Xiao
...
Minlie Huang
Wentao Han
Yang Liu
Xiaoyan Zhu
Maosong Sun
MoE
90
88
0
20 Jun 2021
Label prompt for multi-label text classification
Label prompt for multi-label text classification
Rui Song
Xingbing Chen
Zelong Liu
Haining An
Zhiqi Zhang
Xiaoguang Wang
Hao Xu
VLM
60
4
0
18 Jun 2021
Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge
  Bases
Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
Lingyong Yan
M. Liao
Tong Xue
Jin Xu
71
136
0
17 Jun 2021
Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis
  of Head and Prompt Tuning
Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis of Head and Prompt Tuning
Colin Wei
Sang Michael Xie
Tengyu Ma
148
100
0
17 Jun 2021
Question Answering Infused Pre-training of General-Purpose
  Contextualized Representations
Question Answering Infused Pre-training of General-Purpose Contextualized Representations
Robin Jia
M. Lewis
Luke Zettlemoyer
80
28
0
15 Jun 2021
An Empirical Survey of Data Augmentation for Limited Data Learning in
  NLP
An Empirical Survey of Data Augmentation for Limited Data Learning in NLP
Jiaao Chen
Derek Tam
Colin Raffel
Joey Tianyi Zhou
Diyi Yang
116
178
0
14 Jun 2021
Pre-Trained Models: Past, Present and Future
Pre-Trained Models: Past, Present and Future
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFinMQAI4MH
174
862
0
14 Jun 2021
Zero-Shot Controlled Generation with Encoder-Decoder Transformers
Zero-Shot Controlled Generation with Encoder-Decoder Transformers
Devamanyu Hazarika
Mahdi Namazifar
Dilek Z. Hakkani-Tür
AI4CE
81
6
0
11 Jun 2021
A Semi-supervised Multi-task Learning Approach to Classify Customer
  Contact Intents
A Semi-supervised Multi-task Learning Approach to Classify Customer Contact Intents
Li Dong
Matthew C. Spencer
Amir Biagi
43
3
0
10 Jun 2021
True Few-Shot Learning with Language Models
True Few-Shot Learning with Language Models
Ethan Perez
Douwe Kiela
Kyunghyun Cho
140
440
0
24 May 2021
A brain basis of dynamical intelligence for AI and computational
  neuroscience
A brain basis of dynamical intelligence for AI and computational neuroscience
J. Monaco
Kanaka Rajan
Grace M. Hwang
AI4CE
51
6
0
15 May 2021
Entailment as Few-Shot Learner
Entailment as Few-Shot Learner
Sinong Wang
Han Fang
Madian Khabsa
Hanzi Mao
Hao Ma
100
184
0
29 Apr 2021
Text-to-Text Multi-view Learning for Passage Re-ranking
Text-to-Text Multi-view Learning for Passage Re-ranking
Jia-Huei Ju
Jheng-Hong Yang
Chuan-Ju Wang
AIMat
48
20
0
29 Apr 2021
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in
  NLP
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP
Qinyuan Ye
Bill Yuchen Lin
Xiang Ren
299
185
0
18 Apr 2021
GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation
GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation
Kang Min Yoo
Dongju Park
Jaewook Kang
Sang-Woo Lee
Woomyeong Park
115
243
0
18 Apr 2021
Fantastically Ordered Prompts and Where to Find Them: Overcoming
  Few-Shot Prompt Order Sensitivity
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity
Yao Lu
Max Bartolo
Alastair Moore
Sebastian Riedel
Pontus Stenetorp
AILawLRM
431
1,200
0
18 Apr 2021
Constrained Language Models Yield Few-Shot Semantic Parsers
Constrained Language Models Yield Few-Shot Semantic Parsers
Richard Shin
C. H. Lin
Sam Thomson
Charles C. Chen
Subhro Roy
Emmanouil Antonios Platanios
Adam Pauls
Dan Klein
J. Eisner
Benjamin Van Durme
391
206
0
18 Apr 2021
Moving on from OntoNotes: Coreference Resolution Model Transfer
Moving on from OntoNotes: Coreference Resolution Model Transfer
Patrick Xia
Benjamin Van Durme
86
30
0
17 Apr 2021
Surface Form Competition: Why the Highest Probability Answer Isn't
  Always Right
Surface Form Competition: Why the Highest Probability Answer Isn't Always Right
Ari Holtzman
Peter West
Vered Schwartz
Yejin Choi
Luke Zettlemoyer
LRM
195
239
0
16 Apr 2021
KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization
  for Relation Extraction
KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction
Xiang Chen
Ningyu Zhang
Xin Xie
Shumin Deng
Yunzhi Yao
Chuanqi Tan
Fei Huang
Luo Si
Huajun Chen
185
418
0
15 Apr 2021
Generating Datasets with Pretrained Language Models
Generating Datasets with Pretrained Language Models
Timo Schick
Hinrich Schütze
165
235
0
15 Apr 2021
Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution
Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution
Ryuto Konno
Shun Kiyono
Yuichiroh Matsubayashi
Hiroki Ouchi
Kentaro Inui
17
10
0
15 Apr 2021
Learning How to Ask: Querying LMs with Mixtures of Soft Prompts
Learning How to Ask: Querying LMs with Mixtures of Soft Prompts
Guanghui Qin
J. Eisner
68
551
0
14 Apr 2021
Factual Probing Is [MASK]: Learning vs. Learning to Recall
Factual Probing Is [MASK]: Learning vs. Learning to Recall
Zexuan Zhong
Dan Friedman
Danqi Chen
81
413
0
12 Apr 2021
Adapting Language Models for Zero-shot Learning by Meta-tuning on
  Dataset and Prompt Collections
Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections
Ruiqi Zhong
Kristy Lee
Zheng Zhang
Dan Klein
132
173
0
10 Apr 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding?
What Will it Take to Fix Benchmarking in Natural Language Understanding?
Samuel R. Bowman
George E. Dahl
ELMALM
76
164
0
05 Apr 2021
Thinking Aloud: Dynamic Context Generation Improves Zero-Shot Reasoning
  Performance of GPT-2
Thinking Aloud: Dynamic Context Generation Improves Zero-Shot Reasoning Performance of GPT-2
Gregor Betz
Kyle Richardson
Christian Voigt
ReLMLRM
87
31
0
24 Mar 2021
Improving and Simplifying Pattern Exploiting Training
Improving and Simplifying Pattern Exploiting Training
Derek Tam
Rakesh R Menon
Joey Tianyi Zhou
Shashank Srivastava
Colin Raffel
78
151
0
22 Mar 2021
Using Molecular Embeddings in QSAR Modeling: Does it Make a Difference?
Using Molecular Embeddings in QSAR Modeling: Does it Make a Difference?
María Virginia Sabando
I. Ponzoni
E. Milios
Axel J. Soto
40
27
0
20 Mar 2021
GPT Understands, Too
GPT Understands, Too
Xiao Liu
Yanan Zheng
Zhengxiao Du
Ming Ding
Yujie Qian
Zhilin Yang
Jie Tang
VLM
174
1,184
0
18 Mar 2021
GLM: General Language Model Pretraining with Autoregressive Blank
  Infilling
GLM: General Language Model Pretraining with Autoregressive Blank Infilling
Zhengxiao Du
Yujie Qian
Xiao Liu
Ming Ding
J. Qiu
Zhilin Yang
Jie Tang
BDLAI4CE
156
1,565
0
18 Mar 2021
How Many Data Points is a Prompt Worth?
How Many Data Points is a Prompt Worth?
Teven Le Scao
Alexander M. Rush
VLM
196
303
0
15 Mar 2021
Inductive Relation Prediction by BERT
Inductive Relation Prediction by BERT
H. Zha
Zhiyu Zoey Chen
Xifeng Yan
146
58
0
12 Mar 2021
Attention is Not All You Need: Pure Attention Loses Rank Doubly
  Exponentially with Depth
Attention is Not All You Need: Pure Attention Loses Rank Doubly Exponentially with Depth
Yihe Dong
Jean-Baptiste Cordonnier
Andreas Loukas
161
388
0
05 Mar 2021
Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based
  Bias in NLP
Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP
Timo Schick
Sahana Udupa
Hinrich Schütze
317
388
0
28 Feb 2021
A Survey on Stance Detection for Mis- and Disinformation Identification
A Survey on Stance Detection for Mis- and Disinformation Identification
Momchil Hardalov
Arnav Arora
Preslav Nakov
Isabelle Augenstein
201
136
0
27 Feb 2021
COCO-LM: Correcting and Contrasting Text Sequences for Language Model
  Pretraining
COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
182
205
0
16 Feb 2021
Structured Prediction as Translation between Augmented Natural Languages
Structured Prediction as Translation between Augmented Natural Languages
Giovanni Paolini
Ben Athiwaratkun
Jason Krone
Jie Ma
Alessandro Achille
Rishita Anubhai
Cicero Nogueira dos Santos
Bing Xiang
Stefano Soatto
90
295
0
14 Jan 2021
Previous
123...111213
Next