ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXivPDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 606 papers shown
Title
Data-centric Artificial Intelligence: A Survey
Data-centric Artificial Intelligence: A Survey
Daochen Zha
Zaid Pervaiz Bhat
Kwei-Herng Lai
Fan Yang
Zhimeng Jiang
Shaochen Zhong
Xia Hu
27
192
0
17 Mar 2023
Automated Query Generation for Evidence Collection from Web Search
  Engines
Automated Query Generation for Evidence Collection from Web Search Engines
Nestor Prieto-Chavana
Julie Weeds
David J. Weir
HILM
13
1
0
15 Mar 2023
Model-tuning Via Prompts Makes NLP Models Adversarially Robust
Model-tuning Via Prompts Makes NLP Models Adversarially Robust
Mrigank Raman
Pratyush Maini
J. Zico Kolter
Zachary Chase Lipton
Danish Pruthi
AAML
38
17
0
13 Mar 2023
Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Yimeng Zhang
Xin Chen
Jinghan Jia
Sijia Liu
Ke Ding
23
25
0
09 Mar 2023
A Challenging Benchmark for Low-Resource Learning
A Challenging Benchmark for Low-Resource Learning
Yudong Wang
Chang Ma
Qingxiu Dong
Lingpeng Kong
Jingjing Xu
64
3
0
07 Mar 2023
OpenICL: An Open-Source Framework for In-context Learning
OpenICL: An Open-Source Framework for In-context Learning
Zhenyu Wu
Yaoxiang Wang
Jiacheng Ye
Jiangtao Feng
Jingjing Xu
Yu Qiao
Zhiyong Wu
29
49
0
06 Mar 2023
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Zhen Wang
Yikang Shen
Leonid Karlinsky
Rogerio Feris
Huan Sun
Yoon Kim
VLM
VPVLM
44
107
0
06 Mar 2023
MathPrompter: Mathematical Reasoning using Large Language Models
MathPrompter: Mathematical Reasoning using Large Language Models
Shima Imani
Liang Du
H. Shrivastava
KELM
ReLM
LRM
6
194
0
04 Mar 2023
Investigating the Translation Performance of a Large Multilingual
  Language Model: the Case of BLOOM
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM
Rachel Bawden
François Yvon
VLM
LRM
25
60
0
03 Mar 2023
Modular Deep Learning
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
E. Ponti
MoMe
OOD
32
73
0
22 Feb 2023
Mask-guided BERT for Few Shot Text Classification
Mask-guided BERT for Few Shot Text Classification
Wenxiong Liao
Zheng Liu
Haixing Dai
Zihao Wu
Yiyang Zhang
...
Dajiang Zhu
Tianming Liu
Sheng Li
Xiang Li
Hongmin Cai
VLM
47
39
0
21 Feb 2023
Can discrete information extraction prompts generalize across language
  models?
Can discrete information extraction prompts generalize across language models?
Nathanaël Carraz Rakotonirina
Roberto Dessì
Fabio Petroni
Sebastian Riedel
Marco Baroni
31
7
0
20 Feb 2023
Scalable Prompt Generation for Semi-supervised Learning with Language
  Models
Scalable Prompt Generation for Semi-supervised Learning with Language Models
Yuhang Zhou
Suraj Maharjan
Bei Liu
VLM
34
13
0
18 Feb 2023
Like a Good Nearest Neighbor: Practical Content Moderation and Text
  Classification
Like a Good Nearest Neighbor: Practical Content Moderation and Text Classification
Luke Bates
Iryna Gurevych
BDL
AI4MH
17
4
0
17 Feb 2023
Gradient-Based Automated Iterative Recovery for Parameter-Efficient
  Tuning
Gradient-Based Automated Iterative Recovery for Parameter-Efficient Tuning
Maximilian Mozes
Tolga Bolukbasi
Ann Yuan
Frederick Liu
Nithum Thain
Lucas Dixon
40
4
0
13 Feb 2023
Distinguishability Calibration to In-Context Learning
Distinguishability Calibration to In-Context Learning
Hongjing Li
Hanqi Yan
Yanran Li
Li Qian
Yulan He
Lin Gui
27
2
0
13 Feb 2023
Lightweight Transformers for Clinical Natural Language Processing
Lightweight Transformers for Clinical Natural Language Processing
Omid Rohanian
Mohammadmahdi Nouriborji
Hannah Jauncey
Samaneh Kouchaki
Isaric Clinical Characterisation Group
Lei A. Clifton
L. Merson
David A. Clifton
MedIm
LM&MA
24
12
0
09 Feb 2023
Prompting for Multimodal Hateful Meme Classification
Prompting for Multimodal Hateful Meme Classification
Rui Cao
Roy Ka-Wei Lee
Wen-Haw Chong
Jing Jiang
VLM
25
75
0
08 Feb 2023
What do Language Models know about word senses? Zero-Shot WSD with
  Language Models and Domain Inventories
What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories
Oscar Sainz
Oier López de Lacalle
Eneko Agirre
German Rigau
35
6
0
07 Feb 2023
Quantifying Context Mixing in Transformers
Quantifying Context Mixing in Transformers
Hosein Mohebbi
Willem H. Zuidema
Grzegorz Chrupała
A. Alishahi
168
24
0
30 Jan 2023
Prompt-Based Editing for Text Style Transfer
Prompt-Based Editing for Text Style Transfer
Guoqing Luo
Yu Tong Han
Lili Mou
Mauajama Firdaus
34
23
0
27 Jan 2023
Multitask Instruction-based Prompting for Fallacy Recognition
Multitask Instruction-based Prompting for Fallacy Recognition
Tariq Alhindi
Tuhin Chakrabarty
Elena Musi
Smaranda Muresan
LRM
12
30
0
24 Jan 2023
Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt
  Learning for Automatic Scoring in Science Education
Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science Education
Xuansheng Wu
Xinyu He
Tianming Li
Ninghao Liu
Xiaoming Zhai
27
26
0
20 Jan 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with
  Multimodal Models
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
Zhiqiu Lin
Samuel Yu
Zhiyi Kuang
Deepak Pathak
Deva Ramana
VLM
20
100
0
16 Jan 2023
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
M Saiful Bari
Aston Zhang
Shuai Zheng
Xingjian Shi
Yi Zhu
Chenyu You
Mu Li
RALM
VLM
VPVLM
LRM
48
5
0
21 Dec 2022
Zero-shot Triplet Extraction by Template Infilling
Zero-shot Triplet Extraction by Template Infilling
Bosung Kim
Hayate Iso
Nikita Bhutani
Estevam R. Hruschka
Ndapandula Nakashole
Tom Mitchell
ViT
24
10
0
21 Dec 2022
Little Red Riding Hood Goes Around the Globe:Crosslingual Story Planning
  and Generation with Large Language Models
Little Red Riding Hood Goes Around the Globe:Crosslingual Story Planning and Generation with Large Language Models
E. Razumovskaia
Joshua Maynez
Annie Louis
Mirella Lapata
Shashi Narayan
LRM
22
5
0
20 Dec 2022
Empowering Sentence Encoders with Prompting and Label Retrieval for
  Zero-shot Text Classification
Empowering Sentence Encoders with Prompting and Label Retrieval for Zero-shot Text Classification
Jimin Hong
Jungsoo Park
Daeyoung Kim
Seongjae Choi
Bokyung Son
Jaewoo Kang
24
3
0
20 Dec 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
Jian Yang
Shuming Ma
Li Dong
Shaohan Huang
Haoyang Huang
Yuwei Yin
Dongdong Zhang
Liqun Yang
Furu Wei
Zhoujun Li
SyDa
AI4CE
32
25
0
20 Dec 2022
Large Language Models Are Reasoning Teachers
Large Language Models Are Reasoning Teachers
Namgyu Ho
Laura Schmid
Se-Young Yun
ReLM
ELM
LRM
37
317
0
20 Dec 2022
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
Bairu Hou
J. O'Connor
Jacob Andreas
Shiyu Chang
Yang Zhang
VLM
21
44
0
19 Dec 2022
Language model acceptability judgements are not always robust to context
Language model acceptability judgements are not always robust to context
Koustuv Sinha
Jon Gauthier
Aaron Mueller
Kanishka Misra
Keren Fuentes
R. Levy
Adina Williams
21
17
0
18 Dec 2022
Pre-trained Language Models Can be Fully Zero-Shot Learners
Pre-trained Language Models Can be Fully Zero-Shot Learners
Xuandong Zhao
Siqi Ouyang
Zhiguo Yu
Ming-li Wu
Lei Li
VLM
LRM
37
34
0
14 Dec 2022
From Cloze to Comprehension: Retrofitting Pre-trained Masked Language
  Model to Pre-trained Machine Reader
From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader
Weiwen Xu
Xin Li
Wenxuan Zhang
Meng Zhou
W. Lam
Luo Si
Lidong Bing
27
2
0
09 Dec 2022
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large
  Language Models
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
Chan Hee Song
Jiaman Wu
Clay Washington
Brian M Sadler
Wei-Lun Chao
Yu-Chuan Su
LLMAG
LM&Ro
45
384
0
08 Dec 2022
Demystifying Prompts in Language Models via Perplexity Estimation
Demystifying Prompts in Language Models via Perplexity Estimation
Hila Gonen
Srini Iyer
Terra Blevins
Noah A. Smith
Luke Zettlemoyer
LRM
46
196
0
08 Dec 2022
Legal Prompting: Teaching a Language Model to Think Like a Lawyer
Legal Prompting: Teaching a Language Model to Think Like a Lawyer
Fang Yu
Lee Quartey
Frank Schilder
ELM
LRM
18
64
0
02 Dec 2022
BadPrompt: Backdoor Attacks on Continuous Prompts
BadPrompt: Backdoor Attacks on Continuous Prompts
Xiangrui Cai
Haidong Xu
Sihan Xu
Ying Zhang
Xiaojie Yuan
SILM
23
60
0
27 Nov 2022
Global and Local Hierarchy-aware Contrastive Framework for Implicit
  Discourse Relation Recognition
Global and Local Hierarchy-aware Contrastive Framework for Implicit Discourse Relation Recognition
Yuxin Jiang
Linhan Zhang
Wei Wang
38
17
0
25 Nov 2022
Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt
Multi-label Few-shot ICD Coding as Autoregressive Generation with Prompt
Zhichao Yang
Sunjae Kwon
Zonghai Yao
Hongfeng Yu
26
17
0
24 Nov 2022
Multitask Vision-Language Prompt Tuning
Multitask Vision-Language Prompt Tuning
Sheng Shen
Shijia Yang
Tianjun Zhang
Bohan Zhai
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
VLM
VPVLM
19
49
0
21 Nov 2022
UnifiedABSA: A Unified ABSA Framework Based on Multi-task Instruction
  Tuning
UnifiedABSA: A Unified ABSA Framework Based on Multi-task Instruction Tuning
Zengzhi Wang
Rui Xia
Jianfei Yu
26
11
0
20 Nov 2022
Is the Elephant Flying? Resolving Ambiguities in Text-to-Image
  Generative Models
Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models
Ninareh Mehrabi
Palash Goyal
Apurv Verma
Jwala Dhamala
Varun Kumar
Qian Hu
Kai-Wei Chang
R. Zemel
Aram Galstyan
Rahul Gupta
28
6
0
17 Nov 2022
On Measuring the Intrinsic Few-Shot Hardness of Datasets
On Measuring the Intrinsic Few-Shot Hardness of Datasets
Xinran Zhao
Shikhar Murty
Christopher D. Manning
11
5
0
16 Nov 2022
On the Compositional Generalization Gap of In-Context Learning
On the Compositional Generalization Gap of In-Context Learning
Arian Hosseini
Ankit Vani
Dzmitry Bahdanau
Alessandro Sordoni
Rameswar Panda
24
24
0
15 Nov 2022
MEAL: Stable and Active Learning for Few-Shot Prompting
MEAL: Stable and Active Learning for Few-Shot Prompting
Abdullatif Köksal
Timo Schick
Hinrich Schütze
27
25
0
15 Nov 2022
QAmeleon: Multilingual QA with Only 5 Examples
QAmeleon: Multilingual QA with Only 5 Examples
Priyanka Agrawal
Chris Alberti
Fantine Huot
Joshua Maynez
Ji Ma
Sebastian Ruder
Kuzman Ganchev
Dipanjan Das
Mirella Lapata
18
28
0
15 Nov 2022
A Universal Discriminator for Zero-Shot Generalization
A Universal Discriminator for Zero-Shot Generalization
Haike Xu
Zongyu Lin
Jing Zhou
Yanan Zheng
Zhilin Yang
AI4CE
21
14
0
15 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an
  Out-of-distribution Generalization Perspective
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Jindong Wang
Xingxu Xie
Yue Zhang
ELM
46
79
0
15 Nov 2022
Prompting Language Models for Linguistic Structure
Prompting Language Models for Linguistic Structure
Terra Blevins
Hila Gonen
Luke Zettlemoyer
LRM
35
40
0
15 Nov 2022
Previous
123...567...111213
Next