ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
v1v2 (latest)

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXiv (abs)PDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 613 papers shown
Title
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained
  Transformer for Vision, Language, and Multimodal Tasks
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Kai Zhang
Jun Yu
Eashan Adhikarla
Rong Zhou
Zhilin Yan
...
Xun Chen
Yong Chen
Quanzheng Li
Hongfang Liu
Lichao Sun
LM&MAMedIm
103
184
0
26 May 2023
Exploring Automatically Perturbed Natural Language Explanations in
  Relation Extraction
Exploring Automatically Perturbed Natural Language Explanations in Relation Extraction
Wanyun Cui
Xingran Chen
LRMAAML
55
0
0
24 May 2023
A Simple and Effective Framework for Strict Zero-Shot Hierarchical
  Classification
A Simple and Effective Framework for Strict Zero-Shot Hierarchical Classification
R. Bhambhoria
Lei Chen
Xiao-Dan Zhu
84
4
0
24 May 2023
Estimating class separability of text embeddings with persistent
  homology
Estimating class separability of text embeddings with persistent homology
Kostis Gourgoulias
Najah F. Ghalyan
Maxime Labonne
Yash Satsangi
Sean J. Moran
Joseph Sabelja
81
1
0
24 May 2023
Frugal Prompting for Dialog Models
Frugal Prompting for Dialog Models
Bishal Santra
Sakya Basak
Abhinandan De
Manish Gupta
Pawan Goyal
34
2
0
24 May 2023
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual
  Transfer
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer
Akari Asai
Sneha Kudugunta
Xinyan Velocity Yu
Terra Blevins
Hila Gonen
Machel Reid
Yulia Tsvetkov
Sebastian Ruder
Hannaneh Hajishirzi
120
63
0
24 May 2023
Do prompt positions really matter?
Do prompt positions really matter?
Junyu Mao
Stuart E. Middleton
Mahesan Niranjan
VLM
65
6
0
23 May 2023
Active Learning Principles for In-Context Learning with Large Language
  Models
Active Learning Principles for In-Context Learning with Large Language Models
Katerina Margatina
Timo Schick
Nikolaos Aletras
Jane Dwivedi-Yu
113
44
0
23 May 2023
Can Language Models Understand Physical Concepts?
Can Language Models Understand Physical Concepts?
Lei Li
Jingjing Xu
Qingxiu Dong
Ce Zheng
Qi Liu
Lingpeng Kong
Xu Sun
ALM
61
22
0
23 May 2023
Robust Prompt Optimization for Large Language Models Against
  Distribution Shifts
Robust Prompt Optimization for Large Language Models Against Distribution Shifts
Moxin Li
Wenjie Wang
Fuli Feng
Yixin Cao
Jizhi Zhang
Tat-Seng Chua
OffRL
150
20
0
23 May 2023
Small Language Models Improve Giants by Rewriting Their Outputs
Small Language Models Improve Giants by Rewriting Their Outputs
Giorgos Vernikos
Arthur Bravzinskas
Jakub Adamek
Jonathan Mallinson
Aliaksei Severyn
Eric Malmi
BDLLRM
92
16
0
22 May 2023
Learning Easily Updated General Purpose Text Representations with
  Adaptable Task-Specific Prefixes
Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes
Kuan-Hao Huang
L Tan
Rui Hou
Sinong Wang
Amjad Almahairi
Ruty Rinott
AI4CE
78
0
0
22 May 2023
Interactive Data Synthesis for Systematic Vision Adaptation via
  LLMs-AIGCs Collaboration
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration
Qifan Yu
Juncheng Li
Wentao Ye
Siliang Tang
Yueting Zhuang
70
14
0
22 May 2023
Enhancing Cross-lingual Natural Language Inference by Soft Prompting
  with Multilingual Verbalizer
Enhancing Cross-lingual Natural Language Inference by Soft Prompting with Multilingual Verbalizer
Shuang Li
Xuming Hu
Aiwei Liu
Yawen Yang
Fukun Ma
Philip S. Yu
Lijie Wen
134
4
0
22 May 2023
Automated Few-shot Classification with Instruction-Finetuned Language
  Models
Automated Few-shot Classification with Instruction-Finetuned Language Models
Rami Aly
Xingjian Shi
Kaixiang Lin
Aston Zhang
A. Wilson
75
11
0
21 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large
  Language Models
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
136
6
0
21 May 2023
PromptNER: A Prompting Method for Few-shot Named Entity Recognition via
  k Nearest Neighbor Search
PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search
Mozhi Zhang
Hang Yan
Yaqian Zhou
Xipeng Qiu
68
10
0
20 May 2023
Hint of Thought prompting: an explainable and zero-shot approach to
  reasoning tasks with LLMs
Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs
IokTong Lei
Zhidong Deng
ReLMRALMLRM
50
4
0
19 May 2023
Zero-Shot Text Classification via Self-Supervised Tuning
Zero-Shot Text Classification via Self-Supervised Tuning
Chaoqun Liu
Wenxuan Zhang
Guizhen Chen
Xiaobao Wu
Anh Tuan Luu
Chip Hong Chang
Lidong Bing
VLM
78
11
0
19 May 2023
Efficient Prompting via Dynamic In-Context Learning
Efficient Prompting via Dynamic In-Context Learning
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ryan Cotterell
Mrinmaya Sachan
70
19
0
18 May 2023
"I'm fully who I am": Towards Centering Transgender and Non-Binary
  Voices to Measure Biases in Open Language Generation
"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation
Anaelia Ovalle
Palash Goyal
Jwala Dhamala
Zachary Jaggers
Kai-Wei Chang
Aram Galstyan
R. Zemel
Rahul Gupta
93
72
0
17 May 2023
CPL-NoViD: Context-Aware Prompt-based Learning for Norm Violation
  Detection in Online Communities
CPL-NoViD: Context-Aware Prompt-based Learning for Norm Violation Detection in Online Communities
Zihao He
Jonathan May
Kristina Lerman
107
3
0
16 May 2023
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed
  Opportunity
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Raman Dutt
Linus Ericsson
Pedro Sanchez
Sotirios A. Tsaftaris
Timothy M. Hospedales
MedIm
121
55
0
14 May 2023
Distinguish Before Answer: Generating Contrastive Explanation as
  Knowledge for Commonsense Question Answering
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering
Qianglong Chen
Guohai Xu
Mingshi Yan
Ji Zhang
Fei Huang
Luo Si
Yin Zhang
68
10
0
14 May 2023
Make Prompt-based Black-Box Tuning Colorful: Boosting Model
  Generalization from Three Orthogonal Perspectives
Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives
Qiushi Sun
Chengcheng Han
Nuo Chen
Renyu Zhu
Jing Gong
Xiang Li
Ming Gao
VLM
47
9
0
14 May 2023
Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual
  Transfer for Open-domain Dialogue Generation
Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation
Lei Liu
J. Huang
CLL
73
2
0
12 May 2023
When Giant Language Brains Just Aren't Enough! Domain Pizzazz with
  Knowledge Sparkle Dust
When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust
Minh Le Nguyen
Duy-Hung Nguyen
Shahab Sabahi
Hung Le
Jeffrey Yang
Hajime Hotta
83
1
0
12 May 2023
Musketeer: Joint Training for Multi-task Vision Language Model with Task
  Explanation Prompts
Musketeer: Joint Training for Multi-task Vision Language Model with Task Explanation Prompts
Zhaoyang Zhang
Yantao Shen
Kunyu Shi
Zhaowei Cai
Jun Fang
Siqi Deng
Hao Yang
Davide Modolo
Zhuowen Tu
Stefano Soatto
VLM
80
2
0
11 May 2023
How Good are Commercial Large Language Models on African Languages?
How Good are Commercial Large Language Models on African Languages?
Jessica Ojo
Kelechi Ogueji
83
5
0
11 May 2023
Investigating Forgetting in Pre-Trained Representations Through
  Continual Learning
Investigating Forgetting in Pre-Trained Representations Through Continual Learning
Yun Luo
Zhen Yang
Xuefeng Bai
Fandong Meng
Jie Zhou
Yue Zhang
CLLKELM
103
17
0
10 May 2023
Towards Weakly-Supervised Hate Speech Classification Across Datasets
Towards Weakly-Supervised Hate Speech Classification Across Datasets
Yiping Jin
Leo Wanner
Vishakha Kadam
A. Shvets
63
5
0
04 May 2023
Black-box Prompt Tuning with Subspace Learning
Black-box Prompt Tuning with Subspace Learning
Yuanhang Zheng
Zhixing Tan
Peng Li
Yang Liu
VLM
127
11
0
04 May 2023
PTP: Boosting Stability and Performance of Prompt Tuning with
  Perturbation-Based Regularizer
PTP: Boosting Stability and Performance of Prompt Tuning with Perturbation-Based Regularizer
Lichang Chen
Heng-Chiao Huang
Varun Madhavan
AAML
176
12
0
03 May 2023
Causal Interventions-based Few-Shot Named Entity Recognition
Causal Interventions-based Few-Shot Named Entity Recognition
Zhen Yang
Yongbin Liu
Chunping Ouyang
CML
71
0
0
03 May 2023
Few-shot Event Detection: An Empirical Study and a Unified View
Few-shot Event Detection: An Empirical Study and a Unified View
Yubo Ma
Zehao Wang
Yixin Cao
Aixin Sun
103
11
0
03 May 2023
Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner
Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner
Zhengxiang Shi
Aldo Lipani
VLMCLL
80
22
0
02 May 2023
From Words to Code: Harnessing Data for Program Synthesis from Natural
  Language
From Words to Code: Harnessing Data for Program Synthesis from Natural Language
Anirudh Khatry
Joyce Cahoon
Jordan Henkel
Shaleen Deep
Venkatesh Emani
...
Vu Le
Mohammad Raza
Sherry Shi
Mukul Singh
A. Tiwari
113
12
0
02 May 2023
HQP: A Human-Annotated Dataset for Detecting Online Propaganda
HQP: A Human-Annotated Dataset for Detecting Online Propaganda
Abdurahman Maarouf
Dominik Bär
Dominique Geissler
Stefan Feuerriegel
73
10
0
28 Apr 2023
$π$-Tuning: Transferring Multimodal Foundation Models with Optimal
  Multi-task Interpolation
πππ-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation
Chengyue Wu
Teng Wang
Yixiao Ge
Zeyu Lu
Rui-Zhi Zhou
Ying Shan
Ping Luo
MoMe
145
37
0
27 Apr 2023
Generation-driven Contrastive Self-training for Zero-shot Text
  Classification with Instruction-following LLM
Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM
Ruohong Zhang
Yau-Shian Wang
Yiming Yang
SyDa
49
10
0
24 Apr 2023
MasakhaNEWS: News Topic Classification for African languages
MasakhaNEWS: News Topic Classification for African languages
David Ifeoluwa Adelani
Marek Masiak
Israel Abebe Azime
Jesujoba Oluwadara Alabi
A. Tonja
...
Moges Ahmed Mehamed
Evrard Ngabire
Jules Jules
Ivan Ssenkungu
Pontus Stenetorp
114
24
0
19 Apr 2023
MixPro: Simple yet Effective Data Augmentation for Prompt-based Learning
MixPro: Simple yet Effective Data Augmentation for Prompt-based Learning
Bohan Li
Longxu Dou
Yutai Hou
Yunlong Feng
Honglin Mu
Qingfu Zhu
Qinghua Sun
Wanxiang Che
VLM
74
4
0
19 Apr 2023
Just Tell Me: Prompt Engineering in Business Process Management
Just Tell Me: Prompt Engineering in Business Process Management
Kiran Busch
Alexander Rochlitzer
Diana Sola
Henrik Leopold
86
29
0
14 Apr 2023
Global Prompt Cell: A Portable Control Module for Effective Prompt
  Tuning
Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning
Chi-Liang Liu
Hao Wang
Nuwa Xi
Sendong Zhao
Bing Qin
VLM
69
1
0
12 Apr 2023
chatClimate: Grounding Conversational AI in Climate Science
chatClimate: Grounding Conversational AI in Climate Science
S. Vaghefi
Qian Wang
V. Muccione
Jingwei Ni
Mathias Kraus
...
Tobias Schimanski
Chiara Colesanti-Senni
Nicolas Webersinke
Christrian Huggel
Markus Leippold
KELMAI4MHHILM
109
73
0
11 Apr 2023
Similarity-Aware Multimodal Prompt Learning for Fake News Detection
Similarity-Aware Multimodal Prompt Learning for Fake News Detection
Ye Jiang
Xiaomin Yu
Yimin Wang
Xiaoman Xu
Xingyi Song
Diana Maynard
83
27
0
09 Apr 2023
WikiGoldSK: Annotated Dataset, Baselines and Few-Shot Learning
  Experiments for Slovak Named Entity Recognition
WikiGoldSK: Annotated Dataset, Baselines and Few-Shot Learning Experiments for Slovak Named Entity Recognition
Dávid Suba
Marek Suppa
Jozef Kubík
Endre Hamerlik
Martin Takáč
54
0
0
08 Apr 2023
Revisiting Automated Prompting: Are We Actually Doing Better?
Revisiting Automated Prompting: Are We Actually Doing Better?
Yulin Zhou
Yiren Zhao
Ilia Shumailov
Robert D. Mullins
Y. Gal
117
8
0
07 Apr 2023
Sociocultural knowledge is needed for selection of shots in hate speech
  detection tasks
Sociocultural knowledge is needed for selection of shots in hate speech detection tasks
Antonis Maronikolakis
Abdullatif Köksal
Hinrich Schütze
75
0
0
04 Apr 2023
Learning to Name Classes for Vision and Language Models
Learning to Name Classes for Vision and Language Models
Sarah Parisot
Yongxin Yang
Jingyu Sun
VLM
85
10
0
04 Apr 2023
Previous
123456...111213
Next