ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXivPDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 606 papers shown
Title
Advancing Relation Extraction through Language Probing with Exemplars
  from Set Co-Expansion
Advancing Relation Extraction through Language Probing with Exemplars from Set Co-Expansion
Yerong Li
Roxana Girju
36
0
0
18 Aug 2023
Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt
  Generation for Few-shot Learning
Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt Generation for Few-shot Learning
Chengzhengxu Li
Xiaoming Liu
Yichen Wang
Duyi Li
Y. Lan
Chao Shen
32
5
0
14 Aug 2023
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Anna Rogers
A. Luccioni
53
19
0
14 Aug 2023
Towards Instance-adaptive Inference for Federated Learning
Towards Instance-adaptive Inference for Federated Learning
Chunhui Feng
Kai Yu
Nian Liu
Xinxing Xu
Salman Khan
W. Zuo
FedML
39
11
0
11 Aug 2023
Prompt2Gaussia: Uncertain Prompt-learning for Script Event Prediction
Prompt2Gaussia: Uncertain Prompt-learning for Script Event Prediction
Shiyao Cui
Xin Cong
Shuaiyi Nie
Xuebin Wang
Tingwen Liu
Jinqiao Shi
22
0
0
04 Aug 2023
GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning
GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning
Soumyadeep Roy
Jonas Wallat
Sowmya S. Sundaram
Wolfgang Nejdl
Niloy Ganguly
30
3
0
29 Jul 2023
Multi-output Headed Ensembles for Product Item Classification
Multi-output Headed Ensembles for Product Item Classification
H. Shiokawa
Pradipto Das
Arthur R. Toth
Justin Chiu
8
0
0
29 Jul 2023
PromptMagician: Interactive Prompt Engineering for Text-to-Image
  Creation
PromptMagician: Interactive Prompt Engineering for Text-to-Image Creation
Yingchaojie Feng
Xingbo Wang
Kamkwai Wong
Sijia Wang
Yuhong Lu
Minfeng Zhu
Baicheng Wang
Wei Chen
DiffM
13
74
0
18 Jul 2023
Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language
  Models
Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models
Zhiyuan Peng
Xuyang Wu
Qifan Wang
Yihan Fang
VLM
RALM
46
11
0
17 Jul 2023
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning?
  Insights from Cross-Lingual Language Understanding
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding
Bolei Ma
Ercong Nie
Helmut Schmid
Hinrich Schütze
AAML
VLM
LRM
37
8
0
15 Jul 2023
Adapting an ASR Foundation Model for Spoken Language Assessment
Adapting an ASR Foundation Model for Spoken Language Assessment
Rao Ma
Mengjie Qian
Mark J. F. Gales
Kate Knill
19
11
0
13 Jul 2023
Attribute Controlled Dialogue Prompting
Attribute Controlled Dialogue Prompting
Runcheng Liu
Ahmad Rashid
I. Kobyzev
Mehdi Rezagholizadeh
Pascal Poupart
29
2
0
11 Jul 2023
Answering Ambiguous Questions via Iterative Prompting
Answering Ambiguous Questions via Iterative Prompting
Weiwei Sun
Hengyi Cai
Hongshen Chen
Pengjie Ren
Zhumin Chen
Maarten de Rijke
Z. Ren
45
9
0
08 Jul 2023
Meta-training with Demonstration Retrieval for Efficient Few-shot
  Learning
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Aaron Mueller
Kanika Narang
Lambert Mathias
Qifan Wang
Hamed Firooz
RALM
22
3
0
30 Jun 2023
Topological Data Analysis Guided Segment Anything Model Prompt
  Optimization for Zero-Shot Segmentation in Biological Imaging
Topological Data Analysis Guided Segment Anything Model Prompt Optimization for Zero-Shot Segmentation in Biological Imaging
Ruben Glatt
Shusen Liu
30
3
0
30 Jun 2023
KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for
  Knowledge-Grounded Dialogue Generation
KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue Generation
Jiaqi Bai
Zhao Yan
Jian Yang
Xinnian Liang
Hongcheng Guo
Zhoujun Li
18
9
0
27 Jun 2023
DiversiGATE: A Comprehensive Framework for Reliable Large Language
  Models
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models
Shima Imani
Ali Beyram
H. Shrivastava
21
1
0
22 Jun 2023
Adversarial Robustness of Prompt-based Few-Shot Learning for Natural
  Language Understanding
Adversarial Robustness of Prompt-based Few-Shot Learning for Natural Language Understanding
Venkata Prabhakara Sarath Nookala
Gaurav Verma
Subhabrata Mukherjee
Srijan Kumar
ELM
60
6
0
19 Jun 2023
Multilingual Few-Shot Learning via Language Model Retrieval
Multilingual Few-Shot Learning via Language Model Retrieval
Genta Indra Winata
Liang-Kang Huang
Soumya Vadlamannati
Yash Chandarana
RALM
39
2
0
19 Jun 2023
Seen to Unseen: Exploring Compositional Generalization of
  Multi-Attribute Controllable Dialogue Generation
Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation
Weihao Zeng
Lulu Zhao
Keqing He
Ruotong Geng
Jingang Wang
Wei Wu
Weiran Xu
40
3
0
17 Jun 2023
ActiveGLAE: A Benchmark for Deep Active Learning with Transformers
ActiveGLAE: A Benchmark for Deep Active Learning with Transformers
Lukas Rauch
Matthias Aßenmacher
Denis Huseljic
Moritz Wirth
Bernd Bischl
Bernhard Sick
36
11
0
16 Jun 2023
Politeness Stereotypes and Attack Vectors: Gender Stereotypes in
  Japanese and Korean Language Models
Politeness Stereotypes and Attack Vectors: Gender Stereotypes in Japanese and Korean Language Models
Victor Steinborn
Antonis Maronikolakis
Hinrich Schütze
31
0
0
16 Jun 2023
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large
  Language Models
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models
Myles Foley
Ambrish Rawat
Taesung Lee
Yufang Hou
Gabriele Picco
Giulio Zizzo
DeLMO
35
5
0
15 Jun 2023
MetricPrompt: Prompting Model as a Relevance Metric for Few-shot Text
  Classification
MetricPrompt: Prompting Model as a Relevance Metric for Few-shot Text Classification
Hongyuan Dong
Weinan Zhang
Wanxiang Che
VLM
21
2
0
15 Jun 2023
Assisting Language Learners: Automated Trans-Lingual Definition
  Generation via Contrastive Prompt Learning
Assisting Language Learners: Automated Trans-Lingual Definition Generation via Contrastive Prompt Learning
Hengyuan Zhang
Dawei Li
Yanran Li
Chenming Shang
Chufan Shi
Yong-jia Jiang
43
14
0
09 Jun 2023
COVER: A Heuristic Greedy Adversarial Attack on Prompt-based Learning in
  Language Models
COVER: A Heuristic Greedy Adversarial Attack on Prompt-based Learning in Language Models
Zihao Tan
Qingliang Chen
Wenbin Zhu
Yongjian Huang
AAML
SILM
28
3
0
09 Jun 2023
Bias Against 93 Stigmatized Groups in Masked Language Models and
  Downstream Sentiment Classification Tasks
Bias Against 93 Stigmatized Groups in Masked Language Models and Downstream Sentiment Classification Tasks
Katelyn Mei
Sonia Fereidooni
Aylin Caliskan
22
45
0
08 Jun 2023
Artificial General Intelligence for Medical Imaging
Artificial General Intelligence for Medical Imaging
Xiang Li
Lu Zhang
Zihao Wu
Zheng Liu
Lin Zhao
...
Pingkuan Yan
Quanzheng Li
Wei Liu
Tianming Liu
Dinggang Shen
LM&MA
AI4CE
19
40
0
08 Jun 2023
Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to
  Pre-trained Language Models Memories
Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models Memories
Shizhe Diao
Tianyang Xu
Ruijia Xu
Jiawei Wang
Tong Zhang
MoE
AI4CE
13
36
0
08 Jun 2023
Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data
  Augmentation
Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation
Xiusi Chen
Yu Zhang
Jinliang Deng
Jyun-Yu Jiang
Wei Wang
24
11
0
07 Jun 2023
Prompt Space Optimizing Few-shot Reasoning Success with Large Language
  Models
Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models
Fobo Shi
Peijun Qing
Ke Wang
Nan Wang
Youbo Lei
H. Lu
Xiaodong Lin
Duantengchuan Li
VLM
ReLM
LLMAG
LRM
29
11
0
06 Jun 2023
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Momchil Hardalov
Pepa Atanasova
Todor Mihaylov
G. Angelova
K. Simov
P. Osenova
Ves Stoyanov
Ivan Koychev
Preslav Nakov
Dragomir R. Radev
ELM
FedML
29
4
0
04 Jun 2023
Syntax-aware Hybrid prompt model for Few-shot multi-modal sentiment
  analysis
Syntax-aware Hybrid prompt model for Few-shot multi-modal sentiment analysis
Zikai Zhou
Haisong Feng
Baiyou Qiao
Gang Wu
Donghong Han
VLM
14
1
0
02 Jun 2023
In-Context Learning User Simulators for Task-Oriented Dialog Systems
In-Context Learning User Simulators for Task-Oriented Dialog Systems
Silvia Terragni
Modestas Filipavicius
Nghia Khau
Bruna Guedes
A. Manso
Roland Mathis
46
11
0
01 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
29
23
0
01 Jun 2023
Exploring Lottery Prompts for Pre-trained Language Models
Exploring Lottery Prompts for Pre-trained Language Models
Yulin Chen
Ning Ding
Xiaobin Wang
Shengding Hu
Haitao Zheng
Zhiyuan Liu
Pengjun Xie
VLM
LRM
24
7
0
31 May 2023
Jointly Reparametrized Multi-Layer Adaptation for Efficient and Private
  Tuning
Jointly Reparametrized Multi-Layer Adaptation for Efficient and Private Tuning
Umang Gupta
Aram Galstyan
Greg Ver Steeg
11
2
0
30 May 2023
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive
  Prompt-Based Few-Shot Fine-Tuning
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning
Amirhossein Abaskohi
S. Rothe
Yadollah Yaghoobzadeh
VLM
37
16
0
29 May 2023
ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER
ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER
Amirhossein Layegh
A. H. Payberah
A. Soylu
Dumitru Roman
M. Matskin
VLM
25
8
0
29 May 2023
Ask an Expert: Leveraging Language Models to Improve Strategic Reasoning
  in Goal-Oriented Dialogue Models
Ask an Expert: Leveraging Language Models to Improve Strategic Reasoning in Goal-Oriented Dialogue Models
Qiang Zhang
Jason Naradowsky
Yusuke Miyao
ELM
26
32
0
29 May 2023
PromptNER: Prompt Locating and Typing for Named Entity Recognition
PromptNER: Prompt Locating and Typing for Named Entity Recognition
Yongliang Shen
Zeqi Tan
Shuhui Wu
Wenqi Zhang
Rongsheng Zhang
Yadong Xi
Weiming Lu
Yueting Zhuang
41
33
0
26 May 2023
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained
  Transformer for Vision, Language, and Multimodal Tasks
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Kai Zhang
Jun Yu
Eashan Adhikarla
Rong-Er Zhou
Zhilin Yan
...
Xun Chen
Yong Chen
Quanzheng Li
Hongfang Liu
Lichao Sun
LM&MA
MedIm
37
157
0
26 May 2023
Exploring Automatically Perturbed Natural Language Explanations in
  Relation Extraction
Exploring Automatically Perturbed Natural Language Explanations in Relation Extraction
Wanyun Cui
Xingran Chen
LRM
AAML
38
0
0
24 May 2023
A Simple and Effective Framework for Strict Zero-Shot Hierarchical
  Classification
A Simple and Effective Framework for Strict Zero-Shot Hierarchical Classification
R. Bhambhoria
L. Chen
Xiao-Dan Zhu
21
3
0
24 May 2023
Estimating class separability of text embeddings with persistent
  homology
Estimating class separability of text embeddings with persistent homology
Kostis Gourgoulias
Najah F. Ghalyan
Maxime Labonne
Yash Satsangi
Sean J. Moran
Joseph Sabelja
35
0
0
24 May 2023
Frugal Prompting for Dialog Models
Frugal Prompting for Dialog Models
Bishal Santra
Sakya Basak
Abhinandan De
Manish Gupta
Pawan Goyal
30
2
0
24 May 2023
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual
  Transfer
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer
Akari Asai
Sneha Kudugunta
Xinyan Velocity Yu
Terra Blevins
Hila Gonen
Machel Reid
Yulia Tsvetkov
Sebastian Ruder
Hannaneh Hajishirzi
44
54
0
24 May 2023
Do prompt positions really matter?
Do prompt positions really matter?
Junyu Mao
Stuart E. Middleton
Mahesan Niranjan
VLM
31
3
0
23 May 2023
Active Learning Principles for In-Context Learning with Large Language
  Models
Active Learning Principles for In-Context Learning with Large Language Models
Katerina Margatina
Timo Schick
Nikolaos Aletras
Jane Dwivedi-Yu
30
39
0
23 May 2023
Can Language Models Understand Physical Concepts?
Can Language Models Understand Physical Concepts?
Lei Li
Jingjing Xu
Qingxiu Dong
Ce Zheng
Qi Liu
Lingpeng Kong
Xu Sun
ALM
33
18
0
23 May 2023
Previous
12345...111213
Next