ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
v1v2 (latest)

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXiv (abs)PDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 613 papers shown
Title
UniPCM: Universal Pre-trained Conversation Model with Task-aware
  Automatic Prompt
UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt
Yucheng Cai
Wentao Ma
Yuchuan Wu
Shuzheng Si
Yuan Shao
Zhijian Ou
Yongbin Li
116
3
0
20 Sep 2023
ODSum: New Benchmarks for Open Domain Multi-Document Summarization
ODSum: New Benchmarks for Open Domain Multi-Document Summarization
Yijie Zhou
Kejian Shi
Wencai Zhang
Yixin Liu
Yilun Zhao
Arman Cohan
RALM
68
2
0
16 Sep 2023
How to Handle Different Types of Out-of-Distribution Scenarios in
  Computational Argumentation? A Comprehensive and Fine-Grained Field Study
How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field Study
Andreas Waldis
Yufang Hou
Iryna Gurevych
62
4
0
15 Sep 2023
Characterizing Latent Perspectives of Media Houses Towards Public
  Figures
Characterizing Latent Perspectives of Media Houses Towards Public Figures
S. Srivatsa
Srinath Srinivasa
44
0
0
12 Sep 2023
Detecting Natural Language Biases with Prompt-based Learning
Detecting Natural Language Biases with Prompt-based Learning
Md Abdul Aowal
Maliha T Islam
P. Mammen
Sandesh Shetty
51
1
0
11 Sep 2023
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Xinyi Wang
John Wieting
J. Clark
CLLALM
62
2
0
09 Sep 2023
Manifold-based Verbalizer Space Re-embedding for Tuning-free
  Prompt-based Classification
Manifold-based Verbalizer Space Re-embedding for Tuning-free Prompt-based Classification
Hao Wang
Sendong Zhao
Chi-Liang Liu
Nuwa Xi
Muzhen Cai
Bing Qin
Ting Liu
51
2
0
08 Sep 2023
BatchPrompt: Accomplish more with less
BatchPrompt: Accomplish more with less
Jianzhe Lin
Maurice Diesendruck
Liang Du
Robin Abraham
LRM
96
10
0
01 Sep 2023
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors
Chengkun Wei
Wenlong Meng
Zhikun Zhang
M. Chen
Ming-Hui Zhao
Wenjing Fang
Lei Wang
Zihui Zhang
Wenzhi Chen
AAML
56
11
0
26 Aug 2023
Advancing Relation Extraction through Language Probing with Exemplars
  from Set Co-Expansion
Advancing Relation Extraction through Language Probing with Exemplars from Set Co-Expansion
Yerong Li
Roxana Girju
60
0
0
18 Aug 2023
Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt
  Generation for Few-shot Learning
Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt Generation for Few-shot Learning
Chengzhengxu Li
Xiaoming Liu
Yichen Wang
Duyi Li
Y. Lan
Chao Shen
83
6
0
14 Aug 2023
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Anna Rogers
A. Luccioni
155
21
0
14 Aug 2023
Towards Instance-adaptive Inference for Federated Learning
Towards Instance-adaptive Inference for Federated Learning
Chunhui Feng
Kai Yu
Nian Liu
Xinxing Xu
Salman Khan
W. Zuo
FedML
66
12
0
11 Aug 2023
Prompt2Gaussia: Uncertain Prompt-learning for Script Event Prediction
Prompt2Gaussia: Uncertain Prompt-learning for Script Event Prediction
Shiyao Cui
Xin Cong
Shuaiyi Nie
Xuebin Wang
Tingwen Liu
Jinqiao Shi
48
0
0
04 Aug 2023
GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning
GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning
Soumyadeep Roy
Jonas Wallat
Sowmya S. Sundaram
Wolfgang Nejdl
Niloy Ganguly
62
3
0
29 Jul 2023
Multi-output Headed Ensembles for Product Item Classification
Multi-output Headed Ensembles for Product Item Classification
H. Shiokawa
Pradipto Das
Arthur R. Toth
Justin Chiu
23
0
0
29 Jul 2023
PromptMagician: Interactive Prompt Engineering for Text-to-Image
  Creation
PromptMagician: Interactive Prompt Engineering for Text-to-Image Creation
Yingchaojie Feng
Xingbo Wang
Kamkwai Wong
Sijia Wang
Yuhong Lu
Minfeng Zhu
Baicheng Wang
Wei Chen
DiffM
77
83
0
18 Jul 2023
Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language
  Models
Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models
Zhiyuan Peng
Xuyang Wu
Qifan Wang
Yihan Fang
VLMRALM
94
12
0
17 Jul 2023
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning?
  Insights from Cross-Lingual Language Understanding
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding
Bolei Ma
Ercong Nie
Helmut Schmid
Hinrich Schütze
AAMLVLMLRM
89
9
0
15 Jul 2023
Adapting an ASR Foundation Model for Spoken Language Assessment
Adapting an ASR Foundation Model for Spoken Language Assessment
Rao Ma
Mengjie Qian
Mark Gales
Kate Knill
56
14
0
13 Jul 2023
Attribute Controlled Dialogue Prompting
Attribute Controlled Dialogue Prompting
Runcheng Liu
Ahmad Rashid
I. Kobyzev
Mehdi Rezagholizadeh
Pascal Poupart
68
2
0
11 Jul 2023
Answering Ambiguous Questions via Iterative Prompting
Answering Ambiguous Questions via Iterative Prompting
Weiwei Sun
Hengyi Cai
Hongshen Chen
Pengjie Ren
Zhumin Chen
Maarten de Rijke
Zhaochun Ren
84
11
0
08 Jul 2023
Meta-training with Demonstration Retrieval for Efficient Few-shot
  Learning
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Aaron Mueller
Kanika Narang
Lambert Mathias
Qifan Wang
Hamed Firooz
RALM
72
3
0
30 Jun 2023
Topological Data Analysis Guided Segment Anything Model Prompt
  Optimization for Zero-Shot Segmentation in Biological Imaging
Topological Data Analysis Guided Segment Anything Model Prompt Optimization for Zero-Shot Segmentation in Biological Imaging
Ruben Glatt
Shusen Liu
43
4
0
30 Jun 2023
KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for
  Knowledge-Grounded Dialogue Generation
KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue Generation
Jiaqi Bai
Zhao Yan
Jian Yang
Xinnian Liang
Hongcheng Guo
Zhoujun Li
48
9
0
27 Jun 2023
DiversiGATE: A Comprehensive Framework for Reliable Large Language
  Models
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models
Shima Imani
Ali Beyram
H. Shrivastava
37
1
0
22 Jun 2023
Adversarial Robustness of Prompt-based Few-Shot Learning for Natural
  Language Understanding
Adversarial Robustness of Prompt-based Few-Shot Learning for Natural Language Understanding
Venkata Prabhakara Sarath Nookala
Gaurav Verma
Subhabrata Mukherjee
Srijan Kumar
ELM
129
6
0
19 Jun 2023
Multilingual Few-Shot Learning via Language Model Retrieval
Multilingual Few-Shot Learning via Language Model Retrieval
Genta Indra Winata
Liang-Kang Huang
Soumya Vadlamannati
Yash Chandarana
RALM
49
2
0
19 Jun 2023
Seen to Unseen: Exploring Compositional Generalization of
  Multi-Attribute Controllable Dialogue Generation
Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation
Weihao Zeng
Lulu Zhao
Keqing He
Ruotong Geng
Jingang Wang
Wei Wu
Weiran Xu
73
3
0
17 Jun 2023
ActiveGLAE: A Benchmark for Deep Active Learning with Transformers
ActiveGLAE: A Benchmark for Deep Active Learning with Transformers
Lukas Rauch
Matthias Aßenmacher
Denis Huseljic
Moritz Wirth
Bernd Bischl
Bernhard Sick
89
13
0
16 Jun 2023
Politeness Stereotypes and Attack Vectors: Gender Stereotypes in
  Japanese and Korean Language Models
Politeness Stereotypes and Attack Vectors: Gender Stereotypes in Japanese and Korean Language Models
Victor Steinborn
Antonis Maronikolakis
Hinrich Schütze
63
0
0
16 Jun 2023
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large
  Language Models
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models
Myles Foley
Ambrish Rawat
Taesung Lee
Yufang Hou
Gabriele Picco
Giulio Zizzo
DeLMO
138
6
0
15 Jun 2023
MetricPrompt: Prompting Model as a Relevance Metric for Few-shot Text
  Classification
MetricPrompt: Prompting Model as a Relevance Metric for Few-shot Text Classification
Hongyuan Dong
Weinan Zhang
Wanxiang Che
VLM
54
3
0
15 Jun 2023
Assisting Language Learners: Automated Trans-Lingual Definition
  Generation via Contrastive Prompt Learning
Assisting Language Learners: Automated Trans-Lingual Definition Generation via Contrastive Prompt Learning
Hengyuan Zhang
Dawei Li
Yanran Li
Chenming Shang
Chufan Shi
Yong Jiang
155
15
0
09 Jun 2023
COVER: A Heuristic Greedy Adversarial Attack on Prompt-based Learning in
  Language Models
COVER: A Heuristic Greedy Adversarial Attack on Prompt-based Learning in Language Models
Zihao Tan
Qingliang Chen
Wenbin Zhu
Yongjian Huang
AAMLSILM
91
3
0
09 Jun 2023
Bias Against 93 Stigmatized Groups in Masked Language Models and
  Downstream Sentiment Classification Tasks
Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks
Katelyn Mei
Sonia Fereidooni
Aylin Caliskan
87
56
0
08 Jun 2023
Artificial General Intelligence for Medical Imaging
Artificial General Intelligence for Medical Imaging
Xiang Li
Lu Zhang
Zihao Wu
Zheng Liu
Lin Zhao
...
Pingkuan Yan
Quanzheng Li
Wen Liu
Tianming Liu
Dinggang Shen
LM&MAAI4CE
140
42
0
08 Jun 2023
Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to
  Pre-trained Language Models Memories
Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models Memories
Shizhe Diao
Tianyang Xu
Ruijia Xu
Jiawei Wang
Tong Zhang
MoEAI4CE
55
41
0
08 Jun 2023
Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data
  Augmentation
Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation
Xiusi Chen
Yu Zhang
Jinliang Deng
Jyun-Yu Jiang
Wei Wang
57
12
0
07 Jun 2023
Prompt Space Optimizing Few-shot Reasoning Success with Large Language
  Models
Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models
Fobo Shi
Peijun Qing
Ke Wang
Nan Wang
Youbo Lei
H. Lu
Xiaodong Lin
Duantengchuan Li
VLMReLMLLMAGLRM
89
12
0
06 Jun 2023
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Momchil Hardalov
Pepa Atanasova
Todor Mihaylov
G. Angelova
K. Simov
P. Osenova
Ves Stoyanov
Ivan Koychev
Preslav Nakov
Dragomir R. Radev
ELMFedML
75
4
0
04 Jun 2023
Syntax-aware Hybrid prompt model for Few-shot multi-modal sentiment
  analysis
Syntax-aware Hybrid prompt model for Few-shot multi-modal sentiment analysis
Zikai Zhou
Haisong Feng
Baiyou Qiao
Gang Wu
Donghong Han
VLM
104
2
0
02 Jun 2023
In-Context Learning User Simulators for Task-Oriented Dialog Systems
In-Context Learning User Simulators for Task-Oriented Dialog Systems
Silvia Terragni
Modestas Filipavicius
Nghia Khau
Bruna Guedes
A. Manso
Roland Mathis
100
11
0
01 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
146
28
0
01 Jun 2023
Exploring Lottery Prompts for Pre-trained Language Models
Exploring Lottery Prompts for Pre-trained Language Models
Yulin Chen
Ning Ding
Xiaobin Wang
Shengding Hu
Haitao Zheng
Zhiyuan Liu
Pengjun Xie
VLMLRM
50
7
0
31 May 2023
Jointly Reparametrized Multi-Layer Adaptation for Efficient and Private
  Tuning
Jointly Reparametrized Multi-Layer Adaptation for Efficient and Private Tuning
Umang Gupta
Aram Galstyan
Greg Ver Steeg
53
2
0
30 May 2023
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive
  Prompt-Based Few-Shot Fine-Tuning
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning
Amirhossein Abaskohi
S. Rothe
Yadollah Yaghoobzadeh
VLM
97
18
0
29 May 2023
ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER
ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER
Amirhossein Layegh
A. H. Payberah
A. Soylu
Dumitru Roman
M. Matskin
VLM
80
8
0
29 May 2023
Ask an Expert: Leveraging Language Models to Improve Strategic Reasoning
  in Goal-Oriented Dialogue Models
Ask an Expert: Leveraging Language Models to Improve Strategic Reasoning in Goal-Oriented Dialogue Models
Qiang Zhang
Jason Naradowsky
Yusuke Miyao
ELM
91
35
0
29 May 2023
PromptNER: Prompt Locating and Typing for Named Entity Recognition
PromptNER: Prompt Locating and Typing for Named Entity Recognition
Yongliang Shen
Zeqi Tan
Shuhui Wu
Wenqi Zhang
Rongsheng Zhang
Yadong Xi
Weiming Lu
Yueting Zhuang
91
36
0
26 May 2023
Previous
12345...111213
Next