ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
v1v2 (latest)

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXiv (abs)PDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 613 papers shown
Title
Learning Federated Visual Prompt in Null Space for MRI Reconstruction
Learning Federated Visual Prompt in Null Space for MRI Reconstruction
Chun-Mei Feng
Bangjun Li
Xinxing Xu
Yong Liu
Huazhu Fu
W. Zuo
FedML
99
47
0
28 Mar 2023
Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning
Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning
Vladislav Lialin
Vijeta Deshpande
Anna Rumshisky
104
179
0
28 Mar 2023
Analyzing the Performance of GPT-3.5 and GPT-4 in Grammatical Error
  Correction
Analyzing the Performance of GPT-3.5 and GPT-4 in Grammatical Error Correction
Steven Coyne
Keisuke Sakaguchi
Diana Galván-Sosa
M. Zock
Kentaro Inui
62
12
0
25 Mar 2023
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation
  in an Open World
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World
Qifan Yu
Juncheng Li
Yuehua Wu
Siliang Tang
Wei Ji
Yueting Zhuang
100
38
0
23 Mar 2023
Fairness: from the ethical principle to the practice of Machine Learning
  development as an ongoing agreement with stakeholders
Fairness: from the ethical principle to the practice of Machine Learning development as an ongoing agreement with stakeholders
Georgina Curto
F. Comim
FaML
23
1
0
22 Mar 2023
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization
  for Few-shot Generalization
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization
Kaihang Pan
Juncheng Billy Li
Hongye Song
Jun Lin
Xiaozhong Liu
Siliang Tang
OffRL
99
13
0
22 Mar 2023
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
Zheng-Long Liu
Yue Huang
Xiao-Xing Yu
Lu Zhang
Zihao Wu
...
Dinggang Shen
Quanzheng Li
Tianming Liu
Dajiang Zhu
Xiang Li
LM&MAMedIm
127
178
0
20 Mar 2023
Large Language Model Instruction Following: A Survey of Progresses and
  Challenges
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Renze Lou
Kai Zhang
Wenpeng Yin
ALMLRM
153
25
0
18 Mar 2023
SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language
  Models
SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models
Vithursan Thangarasa
Abhay Gupta
William Marshall
Tianda Li
Kevin Leong
D. DeCoste
Sean Lie
Shreyas Saxena
MoEAI4CE
73
22
0
18 Mar 2023
Data-centric Artificial Intelligence: A Survey
Data-centric Artificial Intelligence: A Survey
Daochen Zha
Zaid Pervaiz Bhat
Kwei-Herng Lai
Fan Yang
Zhimeng Jiang
Shaochen Zhong
Helen Zhou
119
214
0
17 Mar 2023
Automated Query Generation for Evidence Collection from Web Search
  Engines
Automated Query Generation for Evidence Collection from Web Search Engines
Nestor Prieto-Chavana
Julie Weeds
David J. Weir
HILM
53
1
0
15 Mar 2023
Model-tuning Via Prompts Makes NLP Models Adversarially Robust
Model-tuning Via Prompts Makes NLP Models Adversarially Robust
Mrigank Raman
Pratyush Maini
J. Zico Kolter
Zachary Chase Lipton
Danish Pruthi
AAML
71
17
0
13 Mar 2023
Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Yimeng Zhang
Xin Chen
Jinghan Jia
Sijia Liu
Ke Ding
92
27
0
09 Mar 2023
A Challenging Benchmark for Low-Resource Learning
A Challenging Benchmark for Low-Resource Learning
Yudong Wang
Chang Ma
Qingxiu Dong
Lingpeng Kong
Jingjing Xu
70
4
0
07 Mar 2023
OpenICL: An Open-Source Framework for In-context Learning
OpenICL: An Open-Source Framework for In-context Learning
Zhenyu Wu
Yaoxiang Wang
Jiacheng Ye
Jiangtao Feng
Jingjing Xu
Yu Qiao
Zhiyong Wu
67
53
0
06 Mar 2023
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Zhen Wang
Yikang Shen
Leonid Karlinsky
Rogerio Feris
Huan Sun
Yoon Kim
VLMVPVLM
96
118
0
06 Mar 2023
MathPrompter: Mathematical Reasoning using Large Language Models
MathPrompter: Mathematical Reasoning using Large Language Models
Shima Imani
Liang Du
H. Shrivastava
KELMReLMLRM
104
214
0
04 Mar 2023
Investigating the Translation Performance of a Large Multilingual
  Language Model: the Case of BLOOM
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM
Rachel Bawden
François Yvon
VLMLRM
90
65
0
03 Mar 2023
Modular Deep Learning
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMeOOD
159
80
0
22 Feb 2023
Mask-guided BERT for Few Shot Text Classification
Mask-guided BERT for Few Shot Text Classification
Wenxiong Liao
Zheng Liu
Haixing Dai
Zihao Wu
Yiyang Zhang
...
Dajiang Zhu
Tianming Liu
Sheng Li
Xiang Li
Hongmin Cai
VLM
93
41
0
21 Feb 2023
Can discrete information extraction prompts generalize across language
  models?
Can discrete information extraction prompts generalize across language models?
Nathanaël Carraz Rakotonirina
Roberto Dessì
Fabio Petroni
Sebastian Riedel
Marco Baroni
58
8
0
20 Feb 2023
Scalable Prompt Generation for Semi-supervised Learning with Language
  Models
Scalable Prompt Generation for Semi-supervised Learning with Language Models
Yuhang Zhou
Suraj Maharjan
Bei Liu
VLM
92
14
0
18 Feb 2023
Like a Good Nearest Neighbor: Practical Content Moderation and Text
  Classification
Like a Good Nearest Neighbor: Practical Content Moderation and Text Classification
Luke Bates
Iryna Gurevych
BDLAI4MH
84
4
0
17 Feb 2023
Gradient-Based Automated Iterative Recovery for Parameter-Efficient
  Tuning
Gradient-Based Automated Iterative Recovery for Parameter-Efficient Tuning
Maximilian Mozes
Tolga Bolukbasi
Ann Yuan
Frederick Liu
Nithum Thain
Lucas Dixon
57
5
0
13 Feb 2023
Distinguishability Calibration to In-Context Learning
Distinguishability Calibration to In-Context Learning
Hongjing Li
Hanqi Yan
Yanran Li
Li Qian
Yulan He
Lin Gui
77
2
0
13 Feb 2023
Lightweight Transformers for Clinical Natural Language Processing
Lightweight Transformers for Clinical Natural Language Processing
Omid Rohanian
Mohammadmahdi Nouriborji
Hannah Jauncey
Samaneh Kouchaki
Isaric Clinical Characterisation Group
Lei A. Clifton
L. Merson
David Clifton
MedImLM&MA
77
12
0
09 Feb 2023
Prompting for Multimodal Hateful Meme Classification
Prompting for Multimodal Hateful Meme Classification
Rui Cao
Roy Ka-wei Lee
Wen-Haw Chong
Jing Jiang
VLM
78
82
0
08 Feb 2023
What do Language Models know about word senses? Zero-Shot WSD with
  Language Models and Domain Inventories
What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories
Oscar Sainz
Oier López de Lacalle
Eneko Agirre
German Rigau
74
7
0
07 Feb 2023
Quantifying Context Mixing in Transformers
Quantifying Context Mixing in Transformers
Hosein Mohebbi
Willem H. Zuidema
Grzegorz Chrupała
Afra Alishahi
226
28
0
30 Jan 2023
Prompt-Based Editing for Text Style Transfer
Prompt-Based Editing for Text Style Transfer
Guoqing Luo
Yu Tong Han
Lili Mou
Mauajama Firdaus
91
26
0
27 Jan 2023
Multitask Instruction-based Prompting for Fallacy Recognition
Multitask Instruction-based Prompting for Fallacy Recognition
Tariq Alhindi
Tuhin Chakrabarty
Elena Musi
Smaranda Muresan
LRM
67
30
0
24 Jan 2023
Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt
  Learning for Automatic Scoring in Science Education
Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science Education
Xuansheng Wu
Xinyu He
Tianming Li
Ninghao Liu
Xiaoming Zhai
102
26
0
20 Jan 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with
  Multimodal Models
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
Zhiqiu Lin
Samuel Yu
Zhiyi Kuang
Deepak Pathak
Deva Ramana
VLM
152
116
0
16 Jan 2023
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
M Saiful Bari
Aston Zhang
Shuai Zheng
Xingjian Shi
Yi Zhu
Shafiq Joty
Mu Li
RALMVLMVPVLMLRM
92
5
0
21 Dec 2022
Zero-shot Triplet Extraction by Template Infilling
Zero-shot Triplet Extraction by Template Infilling
Bosung Kim
Hayate Iso
Nikita Bhutani
Estevam R. Hruschka
Ndapandula Nakashole
Tom Mitchell
ViT
58
10
0
21 Dec 2022
Little Red Riding Hood Goes Around the Globe:Crosslingual Story Planning
  and Generation with Large Language Models
Little Red Riding Hood Goes Around the Globe:Crosslingual Story Planning and Generation with Large Language Models
E. Razumovskaia
Joshua Maynez
Annie Louis
Mirella Lapata
Shashi Narayan
LRM
49
5
0
20 Dec 2022
Empowering Sentence Encoders with Prompting and Label Retrieval for
  Zero-shot Text Classification
Empowering Sentence Encoders with Prompting and Label Retrieval for Zero-shot Text Classification
Jimin Hong
Jungsoo Park
Daeyoung Kim
Seongjae Choi
Bokyung Son
Jaewoo Kang
73
3
0
20 Dec 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
Jian Yang
Shuming Ma
Li Dong
Shaohan Huang
Haoyang Huang
Yuwei Yin
Dongdong Zhang
Liqun Yang
Furu Wei
Zhoujun Li
SyDaAI4CE
70
25
0
20 Dec 2022
Large Language Models Are Reasoning Teachers
Large Language Models Are Reasoning Teachers
Namgyu Ho
Laura Schmid
Se-Young Yun
ReLMELMLRM
128
351
0
20 Dec 2022
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
Bairu Hou
J. O'Connor
Jacob Andreas
Shiyu Chang
Yang Zhang
VLM
57
44
0
19 Dec 2022
Language model acceptability judgements are not always robust to context
Language model acceptability judgements are not always robust to context
Koustuv Sinha
Jon Gauthier
Aaron Mueller
Kanishka Misra
Keren Fuentes
R. Levy
Adina Williams
90
18
0
18 Dec 2022
Pre-trained Language Models Can be Fully Zero-Shot Learners
Pre-trained Language Models Can be Fully Zero-Shot Learners
Xuandong Zhao
Siqi Ouyang
Zhiguo Yu
Ming-li Wu
Lei Li
VLMLRM
100
34
0
14 Dec 2022
From Cloze to Comprehension: Retrofitting Pre-trained Masked Language
  Model to Pre-trained Machine Reader
From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader
Weiwen Xu
Xin Li
Wenxuan Zhang
Meng Zhou
W. Lam
Luo Si
Lidong Bing
71
2
0
09 Dec 2022
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large
  Language Models
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
Chan Hee Song
Jiaman Wu
Clay Washington
Brian M Sadler
Wei-Lun Chao
Yu-Chuan Su
LLMAGLM&Ro
170
424
0
08 Dec 2022
Demystifying Prompts in Language Models via Perplexity Estimation
Demystifying Prompts in Language Models via Perplexity Estimation
Hila Gonen
Srini Iyer
Terra Blevins
Noah A. Smith
Luke Zettlemoyer
LRM
153
214
0
08 Dec 2022
Legal Prompting: Teaching a Language Model to Think Like a Lawyer
Legal Prompting: Teaching a Language Model to Think Like a Lawyer
Fang Yu
Lee Quartey
Frank Schilder
ELMLRM
54
69
0
02 Dec 2022
BadPrompt: Backdoor Attacks on Continuous Prompts
BadPrompt: Backdoor Attacks on Continuous Prompts
Xiangrui Cai
Haidong Xu
Sihan Xu
Ying Zhang
Xiaojie Yuan
SILM
78
67
0
27 Nov 2022
Global and Local Hierarchy-aware Contrastive Framework for Implicit
  Discourse Relation Recognition
Global and Local Hierarchy-aware Contrastive Framework for Implicit Discourse Relation Recognition
Yuxin Jiang
Linhan Zhang
Wei Wang
68
17
0
25 Nov 2022
Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt
Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt
Zhichao Yang
Sunjae Kwon
Zonghai Yao
Hongfeng Yu
71
18
0
24 Nov 2022
Multitask Vision-Language Prompt Tuning
Multitask Vision-Language Prompt Tuning
Sheng Shen
Shijia Yang
Tianjun Zhang
Bohan Zhai
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
VLMVPVLM
104
52
0
21 Nov 2022
Previous
123...567...111213
Next