ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
v1v2 (latest)

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXiv (abs)PDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 613 papers shown
Title
Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again
Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again
Bernal Jiménez Gutiérrez
Nikolas McNeal
Clay Washington
You Chen
Lang Li
Huan Sun
Yu-Chuan Su
105
157
0
16 Mar 2022
Things not Written in Text: Exploring Spatial Commonsense from Visual
  Signals
Things not Written in Text: Exploring Spatial Commonsense from Visual Signals
Xiao Liu
Da Yin
Yansong Feng
Dongyan Zhao
LRM
80
46
0
15 Mar 2022
Modular and Parameter-Efficient Multimodal Fusion with Prompting
Modular and Parameter-Efficient Multimodal Fusion with Prompting
Sheng Liang
Mengjie Zhao
Hinrich Schütze
90
45
0
15 Mar 2022
GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large
  Language Models
GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models
Archiki Prasad
Peter Hase
Xiang Zhou
Joey Tianyi Zhou
115
124
0
14 Mar 2022
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual
  Entailment
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment
Haoyu Song
Li Dong
Weinan Zhang
Ting Liu
Furu Wei
VLMCLIP
89
139
0
14 Mar 2022
Improved Universal Sentence Embeddings with Prompt-based Contrastive
  Learning and Energy-based Learning
Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning
Yuxin Jiang
Linhan Zhang
Wei Wang
SSL
59
47
0
14 Mar 2022
Continual Prompt Tuning for Dialog State Tracking
Continual Prompt Tuning for Dialog State Tracking
Qi Zhu
Bing Li
Fei Mi
Xiaoyan Zhu
Minlie Huang
CLLKELM
90
60
0
13 Mar 2022
HyperMixer: An MLP-based Low Cost Alternative to Transformers
HyperMixer: An MLP-based Low Cost Alternative to Transformers
Florian Mai
Arnaud Pannatier
Fabio Fehr
Haolin Chen
François Marelli
François Fleuret
James Henderson
77
11
0
07 Mar 2022
Pre-trained Token-replaced Detection Model as Few-shot Learner
Pre-trained Token-replaced Detection Model as Few-shot Learner
Zicheng Li
Shoushan Li
Guodong Zhou
77
9
0
07 Mar 2022
Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Shengnan An
Yifei Li
Zeqi Lin
Qian Liu
Bei Chen
Qiang Fu
Weizhu Chen
Nanning Zheng
Jian-Guang Lou
VLMAAML
88
43
0
07 Mar 2022
Dialogue Summaries as Dialogue States (DS2), Template-Guided
  Summarization for Few-shot Dialogue State Tracking
Dialogue Summaries as Dialogue States (DS2), Template-Guided Summarization for Few-shot Dialogue State Tracking
Jamin Shin
Hangyeol Yu
Hyeongdon Moon
Andrea Madotto
Juneyoung Park
79
29
0
03 Mar 2022
QaNER: Prompting Question Answering Models for Few-shot Named Entity
  Recognition
QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition
Andy T. Liu
Wei Xiao
Henghui Zhu
Dejiao Zhang
Shang-Wen Li
Andrew O. Arnold
76
27
0
03 Mar 2022
Do Prompts Solve NLP Tasks Using Natural Language?
Do Prompts Solve NLP Tasks Using Natural Language?
Sen Yang
Yunchen Zhang
Leyang Cui
Yue Zhang
LRM
80
4
0
02 Mar 2022
Attend, Memorize and Generate: Towards Faithful Table-to-Text Generation
  in Few Shots
Attend, Memorize and Generate: Towards Faithful Table-to-Text Generation in Few Shots
Wenting Zhao
Ye Liu
Yao Wan
Philip S. Yu
60
11
0
01 Mar 2022
EPPAC: Entity Pre-typing Relation Classification with Prompt AnswerCentralizing
Jiejun Tan
Wenbin Hu
Weiwei Liu
86
1
0
01 Mar 2022
SemSup: Semantic Supervision for Simple and Scalable Zero-shot
  Generalization
SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization
Austin W. Hanjie
Ameet Deshpande
Karthik Narasimhan
VLM
69
2
0
26 Feb 2022
Rethinking the Role of Demonstrations: What Makes In-Context Learning
  Work?
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
Sewon Min
Xinxi Lyu
Ari Holtzman
Mikel Artetxe
M. Lewis
Hannaneh Hajishirzi
Luke Zettlemoyer
LLMAGLRM
193
1,502
0
25 Feb 2022
Prompt-Learning for Short Text Classification
Prompt-Learning for Short Text Classification
Yi Zhu
Xinke Zhou
Jipeng Qiang
Yun Li
Yunhao Yuan
Xindong Wu
VLM
59
36
0
23 Feb 2022
Revisiting Parameter-Efficient Tuning: Are We Really There Yet?
Revisiting Parameter-Efficient Tuning: Are We Really There Yet?
Guanzheng Chen
Fangyu Liu
Zaiqiao Meng
Shangsong Liang
64
95
0
16 Feb 2022
P4E: Few-Shot Event Detection as Prompt-Guided Identification and
  Localization
P4E: Few-Shot Event Detection as Prompt-Guided Identification and Localization
Sha Li
Liyuan Liu
Yiqing Xie
Heng Ji
Jiawei Han
98
4
0
15 Feb 2022
Enhancing Cross-lingual Prompting with Dual Prompt Augmentation
Enhancing Cross-lingual Prompting with Dual Prompt Augmentation
Meng Zhou
Xin Li
Yuechun Jiang
Lidong Bing
LRM
69
6
0
15 Feb 2022
Semantic-Oriented Unlabeled Priming for Large-Scale Language Models
Semantic-Oriented Unlabeled Priming for Large-Scale Language Models
Yanchen Liu
Timo Schick
Hinrich Schütze
VLM
66
15
0
12 Feb 2022
InPars: Data Augmentation for Information Retrieval using Large Language
  Models
InPars: Data Augmentation for Information Retrieval using Large Language Models
L. Bonifacio
Hugo Queiroz Abonizio
Marzieh Fadaee
Rodrigo Nogueira
VLMRALM
91
62
0
10 Feb 2022
AdaPrompt: Adaptive Model Training for Prompt-based NLP
AdaPrompt: Adaptive Model Training for Prompt-based NLP
Yulong Chen
Yang Liu
Li Dong
Shuohang Wang
Chenguang Zhu
Michael Zeng
Yue Zhang
VLM
102
48
0
10 Feb 2022
Generating Training Data with Language Models: Towards Zero-Shot
  Language Understanding
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Yu Meng
Jiaxin Huang
Yu Zhang
Jiawei Han
SyDa
77
235
0
09 Feb 2022
Exploring the Limits of Domain-Adaptive Training for Detoxifying
  Large-Scale Language Models
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models
Wei Ping
Ming-Yu Liu
Chaowei Xiao
Peng Xu
M. Patwary
Mohammad Shoeybi
Yue Liu
Anima Anandkumar
Bryan Catanzaro
100
71
0
08 Feb 2022
PromptSource: An Integrated Development Environment and Repository for
  Natural Language Prompts
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Stephen H. Bach
Victor Sanh
Zheng-Xin Yong
Albert Webson
Colin Raffel
...
Khalid Almubarak
Xiangru Tang
Dragomir R. Radev
Mike Tian-Jian Jiang
Alexander M. Rush
VLM
350
355
0
02 Feb 2022
Retrieve-and-Fill for Scenario-based Task-Oriented Semantic Parsing
Retrieve-and-Fill for Scenario-based Task-Oriented Semantic Parsing
Akshat Shrivastava
Shrey Desai
Anchit Gupta
A. Elkahky
Aleksandr Livshits
Alexander Zotov
Ahmed Aly
171
6
0
02 Feb 2022
Protum: A New Method For Prompt Tuning Based on "[MASK]"
Protum: A New Method For Prompt Tuning Based on "[MASK]"
Pan He
Yuxi Chen
Yan Wang
Yanru Zhang
AAML
46
3
0
28 Jan 2022
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and
  Languages
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages
Emanuele Bugliarello
Fangyu Liu
Jonas Pfeiffer
Siva Reddy
Desmond Elliott
Edoardo Ponti
Ivan Vulić
MLLMVLMELM
117
64
0
27 Jan 2022
An Assessment of the Impact of OCR Noise on Language Models
An Assessment of the Impact of OCR Noise on Language Models
Konstantin Todorov
Giovanni Colavizza
49
7
0
26 Jan 2022
Black-box Prompt Learning for Pre-trained Language Models
Black-box Prompt Learning for Pre-trained Language Models
Shizhe Diao
Zhichao Huang
Ruijia Xu
Xuechun Li
Yong Lin
Xiao Zhou
Tong Zhang
VLMAAML
91
71
0
21 Jan 2022
Instance-aware Prompt Learning for Language Understanding and Generation
Instance-aware Prompt Learning for Language Understanding and Generation
Feihu Jin
Jinliang Lu
Jiajun Zhang
Chengqing Zong
57
33
0
18 Jan 2022
Black-Box Tuning for Language-Model-as-a-Service
Black-Box Tuning for Language-Model-as-a-Service
Tianxiang Sun
Yunfan Shao
Hong Qian
Xuanjing Huang
Xipeng Qiu
VLM
181
275
0
10 Jan 2022
ArT: All-round Thinker for Unsupervised Commonsense Question-Answering
ArT: All-round Thinker for Unsupervised Commonsense Question-Answering
Jiawei Wang
Hai Zhao
LLMAGLRM
79
3
0
26 Dec 2021
Efficient Large Scale Language Modeling with Mixtures of Experts
Efficient Large Scale Language Modeling with Mixtures of Experts
Mikel Artetxe
Shruti Bhosale
Naman Goyal
Todor Mihaylov
Myle Ott
...
Jeff Wang
Luke Zettlemoyer
Mona T. Diab
Zornitsa Kozareva
Ves Stoyanov
MoE
224
200
0
20 Dec 2021
Reframing Human-AI Collaboration for Generating Free-Text Explanations
Reframing Human-AI Collaboration for Generating Free-Text Explanations
Sarah Wiegreffe
Jack Hessel
Swabha Swayamdipta
Mark O. Riedl
Yejin Choi
77
149
0
16 Dec 2021
Analyzing the Limits of Self-Supervision in Handling Bias in Language
Analyzing the Limits of Self-Supervision in Handling Bias in Language
Lisa Bauer
Karthik Gopalakrishnan
Spandana Gella
Yang Liu
Joey Tianyi Zhou
Dilek Z. Hakkani-Tür
ELM
32
1
0
16 Dec 2021
NewsClaims: A New Benchmark for Claim Detection from News with Attribute
  Knowledge
NewsClaims: A New Benchmark for Claim Detection from News with Attribute Knowledge
R. Reddy
Sai Chetan Chinthakindi
Zhenhailong Wang
Yi R. Fung
Kathryn Conger
...
Martha Palmer
Preslav Nakov
Eduard H. Hovy
Kevin Small
Heng Ji
VLM
67
29
0
16 Dec 2021
Fine-Tuning Large Neural Language Models for Biomedical Natural Language
  Processing
Fine-Tuning Large Neural Language Models for Biomedical Natural Language Processing
Robert Tinn
Hao Cheng
Yu Gu
Naoto Usuyama
Xiaodong Liu
Tristan Naumann
Jianfeng Gao
Hoifung Poon
LM&MA
60
116
0
15 Dec 2021
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a
  Language-Model-as-a-Service Framework
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework
Mengjie Zhao
Fei Mi
Yasheng Wang
Minglei Li
Xin Jiang
Qun Liu
Hinrich Schütze
RALM
113
11
0
14 Dec 2021
Dependency Learning for Legal Judgment Prediction with a Unified
  Text-to-Text Transformer
Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer
Yunyun Huang
Xiaoyu Shen
Chuanyi Li
Jidong Ge
B. Luo
AILaw
77
20
0
13 Dec 2021
Prompt-based Zero-shot Relation Extraction with Semantic Knowledge
  Augmentation
Prompt-based Zero-shot Relation Extraction with Semantic Knowledge Augmentation
Jiaying Gong
Hoda Eldardiry
58
7
0
08 Dec 2021
True Few-Shot Learning with Prompts -- A Real-World Perspective
True Few-Shot Learning with Prompts -- A Real-World Perspective
Timo Schick
Hinrich Schütze
VLM
110
64
0
26 Nov 2021
Few-shot Named Entity Recognition with Cloze Questions
Few-shot Named Entity Recognition with Cloze Questions
V. Gatta
V. Moscato
Marco Postiglione
Giancarlo Sperlí
56
4
0
24 Nov 2021
Few-Shot Self-Rationalization with Natural Language Prompts
Few-Shot Self-Rationalization with Natural Language Prompts
Ana Marasović
Iz Beltagy
Doug Downey
Matthew E. Peters
LRM
89
110
0
16 Nov 2021
On Transferability of Prompt Tuning for Natural Language Processing
On Transferability of Prompt Tuning for Natural Language Processing
Yusheng Su
Xiaozhi Wang
Yujia Qin
Chi-Min Chan
Yankai Lin
...
Peng Li
Juanzi Li
Lei Hou
Maosong Sun
Jie Zhou
AAMLVLM
81
106
0
12 Nov 2021
A Survey on Green Deep Learning
A Survey on Green Deep Learning
Jingjing Xu
Wangchunshu Zhou
Zhiyi Fu
Hao Zhou
Lei Li
VLM
201
84
0
08 Nov 2021
CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
Subhabrata Mukherjee
Xiaodong Liu
Guoqing Zheng
Saghar Hosseini
Hao Cheng
Greg Yang
Christopher Meek
Ahmed Hassan Awadallah
Jianfeng Gao
ELM
70
11
0
04 Nov 2021
Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP
Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP
Trapit Bansal
K. Gunasekaran
Tong Wang
Tsendsuren Munkhdalai
Andrew McCallum
SSLOOD
96
20
0
02 Nov 2021
Previous
123...101112139
Next