ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
v1v2 (latest)

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXiv (abs)PDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 613 papers shown
Title
Universal Prompt Tuning for Graph Neural Networks
Universal Prompt Tuning for Graph Neural Networks
Taoran Fang
Yunchao Zhang
Yang Yang
Chunping Wang
Lei Chen
122
58
0
30 Sep 2022
What Makes Pre-trained Language Models Better Zero-shot Learners?
What Makes Pre-trained Language Models Better Zero-shot Learners?
Jinghui Lu
Dongsheng Zhu
Weidong Han
Rui Zhao
Brian Mac Namee
Fei Tan
87
24
0
30 Sep 2022
Bidirectional Language Models Are Also Few-shot Learners
Bidirectional Language Models Are Also Few-shot Learners
Ajay Patel
Bryan Li
Mohammad Sadegh Rasooli
Noah Constant
Colin Raffel
Chris Callison-Burch
LRM
140
47
0
29 Sep 2022
Promptagator: Few-shot Dense Retrieval From 8 Examples
Promptagator: Few-shot Dense Retrieval From 8 Examples
Zhuyun Dai
Vincent Zhao
Ji Ma
Yi Luan
Jianmo Ni
Jing Lu
A. Bakalov
Kelvin Guu
Keith B. Hall
Ming-Wei Chang
RALM
95
242
0
23 Sep 2022
MetaPrompting: Learning to Learn Better Prompts
MetaPrompting: Learning to Learn Better Prompts
Yutai Hou
Hongyuan Dong
Xinghao Wang
Bohan Li
Wanxiang Che
VLM
72
29
0
23 Sep 2022
Prompting for a conversation: How to control a dialog model?
Prompting for a conversation: How to control a dialog model?
Josef Valvoda
Yimai Fang
David Vandyke
219
5
0
22 Sep 2022
Efficient Few-Shot Learning Without Prompts
Efficient Few-Shot Learning Without Prompts
Lewis Tunstall
Nils Reimers
Unso Eun Seo Jo
Luke Bates
Daniel Korat
Moshe Wasserblat
Oren Pereg
VLM
86
196
0
22 Sep 2022
Selecting Better Samples from Pre-trained LLMs: A Case Study on Question
  Generation
Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
Xingdi Yuan
Tong Wang
Yen-Hsiang Wang
Emery Fine
Rania Abdelghani
Pauline Lucas
Hélene Sauzéon
Pierre-Yves Oudeyer
91
30
0
22 Sep 2022
WeLM: A Well-Read Pre-trained Language Model for Chinese
WeLM: A Well-Read Pre-trained Language Model for Chinese
Hui Su
Xiao Zhou
Houjin Yu
Xiaoyu Shen
Yuwen Chen
Zilin Zhu
Yang Yu
Jie Zhou
87
23
0
21 Sep 2022
Fairness Reprogramming
Fairness Reprogramming
Guanhua Zhang
Yihua Zhang
Yang Zhang
Wenqi Fan
Qing Li
Sijia Liu
Shiyu Chang
AAML
213
40
0
21 Sep 2022
Is More Data Better? Re-thinking the Importance of Efficiency in Abusive
  Language Detection with Transformers-Based Active Learning
Is More Data Better? Re-thinking the Importance of Efficiency in Abusive Language Detection with Transformers-Based Active Learning
Hannah Rose Kirk
Bertie Vidgen
Scott A. Hale
46
10
0
21 Sep 2022
A Few-shot Approach to Resume Information Extraction via Prompts
A Few-shot Approach to Resume Information Extraction via Prompts
Chengguang Gan
Tatsunori Mori
41
10
0
20 Sep 2022
Automatic Label Sequence Generation for Prompting Sequence-to-sequence
  Models
Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models
Zichun Yu
Tianyu Gao
Zhengyan Zhang
Yankai Lin
Zhiyuan Liu
Maosong Sun
Jie Zhou
VLMLRM
51
1
0
20 Sep 2022
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous
  Driving
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Xiwen Liang
Yangxin Wu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
103
37
0
19 Sep 2022
Does CLIP Know My Face?
Does CLIP Know My Face?
Dominik Hintersdorf
Lukas Struppek
Manuel Brack
Felix Friedrich
P. Schramowski
Kristian Kersting
VLM
60
11
0
15 Sep 2022
Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A
  Prompt-Based Uncertainty Propagation Approach
Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation Approach
Yue Yu
Rongzhi Zhang
Ran Xu
Jieyu Zhang
Jiaming Shen
Chao Zhang
103
21
0
15 Sep 2022
Language Chameleon: Transformation analysis between languages using
  Cross-lingual Post-training based on Pre-trained language models
Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models
Suhyune Son
Chanjun Park
Jungseob Lee
Midan Shim
Chanhee Lee
Yoonna Jang
Jaehyung Seo
Heu-Jeoung Lim
66
0
0
14 Sep 2022
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based
  Cross-Modal Generation
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
Yi Ma
Huan Yang
Bei Liu
Jianlong Fu
Jiaying Liu
DiffMMLLM
46
10
0
07 Sep 2022
Efficient Methods for Natural Language Processing: A Survey
Efficient Methods for Natural Language Processing: A Survey
Marcos Vinícius Treviso
Ji-Ung Lee
Tianchu Ji
Betty van Aken
Qingqing Cao
...
Emma Strubell
Niranjan Balasubramanian
Leon Derczynski
Iryna Gurevych
Roy Schwartz
151
114
0
31 Aug 2022
DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
Zhen-Quan Tang
Benyou Wang
Ting Yao
VLM
56
14
0
24 Aug 2022
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model
  Adaptation
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
VLMCLL
94
44
0
22 Aug 2022
Active PETs: Active Data Annotation Prioritisation for Few-Shot Claim
  Verification with Pattern Exploiting Training
Active PETs: Active Data Annotation Prioritisation for Few-Shot Claim Verification with Pattern Exploiting Training
Xia Zeng
A. Zubiaga
85
8
0
18 Aug 2022
Interactive and Visual Prompt Engineering for Ad-hoc Task Adaptation
  with Large Language Models
Interactive and Visual Prompt Engineering for Ad-hoc Task Adaptation with Large Language Models
Hendrik Strobelt
Albert Webson
Victor Sanh
Benjamin Hoover
Johanna Beyer
Hanspeter Pfister
Alexander M. Rush
VLM
78
141
0
16 Aug 2022
Reducing Retraining by Recycling Parameter-Efficient Prompts
Reducing Retraining by Recycling Parameter-Efficient Prompts
Brian Lester
Joshua Yurtsever
Siamak Shakeri
Noah Constant
51
12
0
10 Aug 2022
Improving Task Generalization via Unified Schema Prompt
Improving Task Generalization via Unified Schema Prompt
Wanjun Zhong
Yifan Gao
Ning Ding
Zhiyuan Liu
Ming Zhou
Jiahai Wang
Jian Yin
Nan Duan
75
8
0
05 Aug 2022
BEIKE NLP at SemEval-2022 Task 4: Prompt-Based Paragraph Classification
  for Patronizing and Condescending Language Detection
BEIKE NLP at SemEval-2022 Task 4: Prompt-Based Paragraph Classification for Patronizing and Condescending Language Detection
Yong Deng
Chenxiao Dou
Liangyu Chen
D. Miao
Xianghui Sun
Baochang Ma
Xiangang Li
33
9
0
02 Aug 2022
No More Fine-Tuning? An Experimental Evaluation of Prompt Tuning in Code
  Intelligence
No More Fine-Tuning? An Experimental Evaluation of Prompt Tuning in Code Intelligence
Chaozheng Wang
Yuanhang Yang
Cuiyun Gao
Yun Peng
Hongyu Zhang
Michael R. Lyu
AAML
115
142
0
24 Jul 2022
ELECTRA is a Zero-Shot Learner, Too
ELECTRA is a Zero-Shot Learner, Too
Shiwen Ni
Hung-Yu kao
67
9
0
17 Jul 2022
Aspect-specific Context Modeling for Aspect-based Sentiment Analysis
Aspect-specific Context Modeling for Aspect-based Sentiment Analysis
Fang Ma
Chen Zhang
Bo Zhang
Dawei Song
31
8
0
17 Jul 2022
Transformers are Adaptable Task Planners
Transformers are Adaptable Task Planners
Vidhi Jain
Yixin Lin
Eric Undersander
Yonatan Bisk
Akshara Rai
113
24
0
06 Jul 2022
Probing via Prompting
Probing via Prompting
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
109
13
0
04 Jul 2022
Few-Shot Stance Detection via Target-Aware Prompt Distillation
Few-Shot Stance Detection via Target-Aware Prompt Distillation
Yan Jiang
Jinhua Gao
Huawei Shen
Xueqi Cheng
72
27
0
27 Jun 2022
KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in
  Low-Resource NLP
KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in Low-Resource NLP
Yufei Wang
Jiayi Zheng
Can Xu
Xiubo Geng
Tao Shen
Chongyang Tao
Daxin Jiang
VLMMoE
57
2
0
21 Jun 2022
Zero-Shot Video Question Answering via Frozen Bidirectional Language
  Models
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
149
239
0
16 Jun 2022
Emergent Abilities of Large Language Models
Emergent Abilities of Large Language Models
Jason W. Wei
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
...
Tatsunori Hashimoto
Oriol Vinyals
Percy Liang
J. Dean
W. Fedus
ELMReLMLRM
312
2,522
0
15 Jun 2022
DynaMaR: Dynamic Prompt with Mask Token Representation
DynaMaR: Dynamic Prompt with Mask Token Representation
Xiaodi Sun
Sunny Rajagopalan
Priyank Nigam
Weiyi Lu
Yi Xu
Belinda Zeng
Trishul Chilimbi
42
1
0
07 Jun 2022
Learning to Ask Like a Physician
Learning to Ask Like a Physician
Eric P. Lehman
Vladislav Lialin
K. Y. Legaspi
Anne Janelle R. Sy
Patricia Therese S. Pile
...
Anna Rumshisky
Jenifer Liang
Preethi Raghavan
Leo Anthony Celi
Peter Szolovits
OOD
80
20
0
06 Jun 2022
Instance-wise Prompt Tuning for Pretrained Language Models
Instance-wise Prompt Tuning for Pretrained Language Models
Yuezihan Jiang
Hao Yang
Junyang Lin
Hanyu Zhao
An Yang
Chang Zhou
Hongxia Yang
Zhi-Xin Yang
Tengjiao Wang
VLM
59
7
0
04 Jun 2022
Prompt Injection: Parameterization of Fixed Inputs
Prompt Injection: Parameterization of Fixed Inputs
Eunbi Choi
Yongrae Jo
Joel Jang
Minjoon Seo
119
30
0
31 May 2022
Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained
  Models
Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models
Mengzhou Xia
Mikel Artetxe
Jingfei Du
Danqi Chen
Ves Stoyanov
49
6
0
30 May 2022
E2S2: Encoding-Enhanced Sequence-to-Sequence Pretraining for Language
  Understanding and Generation
E2S2: Encoding-Enhanced Sequence-to-Sequence Pretraining for Language Understanding and Generation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
93
27
0
30 May 2022
Few-shot Subgoal Planning with Language Models
Few-shot Subgoal Planning with Language Models
Lajanugen Logeswaran
Yao Fu
Moontae Lee
Honglak Lee
LRM
76
26
0
28 May 2022
Ground-Truth Labels Matter: A Deeper Look into Input-Label
  Demonstrations
Ground-Truth Labels Matter: A Deeper Look into Input-Label Demonstrations
Kang Min Yoo
Junyeob Kim
Sungmin Cho
Hyunsoo Cho
Hwiyeol Jo
Sang-Woo Lee
Sang-goo Lee
Taeuk Kim
102
129
0
25 May 2022
Asking the Right Questions in Low Resource Template Extraction
Asking the Right Questions in Low Resource Template Extraction
Nils Holzenberger
Yunmo Chen
Benjamin Van Durme
87
4
0
25 May 2022
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
Mingkai Deng
Jianyu Wang
Cheng-Ping Hsieh
Yihan Wang
Han Guo
Tianmin Shu
Meng Song
Eric Xing
Zhiting Hu
97
345
0
25 May 2022
GENEVA: Benchmarking Generalizability for Event Argument Extraction with
  Hundreds of Event Types and Argument Roles
GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles
Tanmay Parekh
I-Hung Hsu
Kuan-Hao Huang
Kai-Wei Chang
Nanyun Peng
106
27
0
25 May 2022
Structured Prompt Tuning
Structured Prompt Tuning
Chi-Liang Liu
Hung-yi Lee
Wen-tau Yih
49
3
0
24 May 2022
Large Language Models are Zero-Shot Reasoners
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLMLRM
572
4,077
0
24 May 2022
Many-Class Text Classification with Matching
Many-Class Text Classification with Matching
Yi-Fan Song
Yuxian Gu
Minlie Huang
VLM
29
1
0
23 May 2022
Sample Efficient Approaches for Idiomaticity Detection
Sample Efficient Approaches for Idiomaticity Detection
Dylan Phelps
Xu Fan
Edward Gow-Smith
Harish Tayyar Madabushi
Carolina Scarton
Aline Villavicencio
78
1
0
23 May 2022
Previous
123...789...111213
Next