ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
v1v2 (latest)

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXiv (abs)PDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 613 papers shown
Title
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization
  for Language Models
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models
Chengzhengxu Li
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Chen Liu
Y. Lan
Chao Shen
142
4
0
15 Jun 2024
Survey for Landing Generative AI in Social and E-commerce Recsys -- the
  Industry Perspectives
Survey for Landing Generative AI in Social and E-commerce Recsys -- the Industry Perspectives
Da Xu
Danqing Zhang
Guangyu Yang
Bo Yang
Shuyuan Xu
Lingling Zheng
Cindy Liang
36
3
0
10 Jun 2024
Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from
  Imperfect Teacher Models in Low-Budget Scenarios
Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios
Yuhang Zhou
Wei Ai
96
7
0
08 Jun 2024
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter
Jitai Hao
Weiwei Sun
Xin Xin
Qi Meng
Zhumin Chen
Pengjie Ren
Zhaochun Ren
MoE
73
4
0
07 Jun 2024
BERTs are Generative In-Context Learners
BERTs are Generative In-Context Learners
David Samuel
85
8
0
07 Jun 2024
LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text
  Classification
LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification
Chun Liu
Hongguang Zhang
Kainan Zhao
Xinghai Ju
Lin Yang
77
4
0
06 Jun 2024
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
David Ifeoluwa Adelani
Jessica Ojo
Israel Abebe Azime
Jian Yun Zhuang
Jesujoba Oluwadara Alabi
...
Salomey Osei
Sokhar Samb
Tadesse Kebede Guge
Pontus Stenetorp
Pontus Stenetorp
ELM
200
10
0
05 Jun 2024
Latent Logic Tree Extraction for Event Sequence Explanation from LLMs
Latent Logic Tree Extraction for Event Sequence Explanation from LLMs
Zitao Song
Chao Yang
Chaojie Wang
Bo An
Shuang Li
113
7
0
03 Jun 2024
Diffusion Model Patching via Mixture-of-Prompts
Diffusion Model Patching via Mixture-of-Prompts
Seokil Ham
Sangmin Woo
Jin-Young Kim
Hyojun Go
Byeongjun Park
Changick Kim
VLM
76
2
0
28 May 2024
ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios
ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios
Markus Bayer
Justin Lutz
Christian A. Reuter
125
7
0
17 May 2024
Potential and Limitations of LLMs in Capturing Structured Semantics: A
  Case Study on SRL
Potential and Limitations of LLMs in Capturing Structured Semantics: A Case Study on SRL
Ning Cheng
Zhaohui Yan
Ziming Wang
Zhijie Li
Jiaming Yu
Zilong Zheng
Kewei Tu
Jinan Xu
Wenjuan Han
56
6
0
10 May 2024
Interpretable Cross-Examination Technique (ICE-T): Using highly
  informative features to boost LLM performance
Interpretable Cross-Examination Technique (ICE-T): Using highly informative features to boost LLM performance
Goran Muric
Ben Delay
Steven Minton
62
1
0
08 May 2024
SEvenLLM: Benchmarking, Eliciting, and Enhancing Abilities of Large
  Language Models in Cyber Threat Intelligence
SEvenLLM: Benchmarking, Eliciting, and Enhancing Abilities of Large Language Models in Cyber Threat Intelligence
Hangyuan Ji
Jian Yang
Linzheng Chai
Chaoren Wei
Liqun Yang
...
Tianzhen Sun
Hongcheng Guo
Tongliang Li
Changyu Ren
Zhoujun Li
74
9
0
06 May 2024
Leveraging Prompt-Learning for Structured Information Extraction from
  Crohn's Disease Radiology Reports in a Low-Resource Language
Leveraging Prompt-Learning for Structured Information Extraction from Crohn's Disease Radiology Reports in a Low-Resource Language
L. Hazan
G. Focht
N. Gavrielov
R. Reichart
Talar Hagopian
M. Greer
R. Cytter-Kuint
Dan Turner
M. Freiman
MedIm
72
1
0
02 May 2024
StablePT: Towards Stable Prompting for Few-shot Learning via Input
  Separation
StablePT: Towards Stable Prompting for Few-shot Learning via Input Separation
Xiaoming Liu
Chen Liu
Zhaohan Zhang
Chengzhengxu Li
Longtian Wang
Y. Lan
Chao Shen
VLM
87
4
0
30 Apr 2024
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four
  Interactive Vision-Language Domains with Memory-Augmented Language Models
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models
Gabriel H. Sarch
Sahil Somani
Raghav Kapoor
Michael J. Tarr
Katerina Fragkiadaki
LM&RoLLMAG
95
4
0
29 Apr 2024
PromptCL: Improving Event Representation via Prompt Template and
  Contrastive Learning
PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning
Yubo Feng
Lishuang Li
Yi Xiang
Xueyang Qin
VLM
72
2
0
27 Apr 2024
Evaluation of Few-Shot Learning for Classification Tasks in the Polish
  Language
Evaluation of Few-Shot Learning for Classification Tasks in the Polish Language
Tsimur Hadeliya
D. Kajtoch
117
1
0
27 Apr 2024
Enabling Natural Zero-Shot Prompting on Encoder Models via
  Statement-Tuning
Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning
Ahmed Elshabrawy
Yongix Huang
Iryna Gurevych
Alham Fikri Aji
72
1
0
19 Apr 2024
Mitigating Language-Level Performance Disparity in mPLMs via Teacher
  Language Selection and Cross-lingual Self-Distillation
Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation
Haozhe Zhao
Zefan Cai
Shuzheng Si
Liang Chen
Yufeng He
Kaikai An
Baobao Chang
60
0
0
12 Apr 2024
On Unified Prompt Tuning for Request Quality Assurance in Public Code
  Review
On Unified Prompt Tuning for Request Quality Assurance in Public Code Review
Xinyu Chen
Lin Li
Rui Zhang
Peng Liang
84
1
0
11 Apr 2024
Plug and Play with Prompts: A Prompt Tuning Approach for Controlling
  Text Generation
Plug and Play with Prompts: A Prompt Tuning Approach for Controlling Text Generation
R. Ajwani
Zining Zhu
Jonathan Rose
Frank Rudzicz
39
1
0
08 Apr 2024
GPT-DETOX: An In-Context Learning-Based Paraphraser for Text
  Detoxification
GPT-DETOX: An In-Context Learning-Based Paraphraser for Text Detoxification
Ali Pesaranghader
Nikhil Verma
Manasa Bharadwaj
89
5
0
03 Apr 2024
Shortcuts Arising from Contrast: Effective and Covert Clean-Label
  Attacks in Prompt-Based Learning
Shortcuts Arising from Contrast: Effective and Covert Clean-Label Attacks in Prompt-Based Learning
Xiaopeng Xie
Ming Yan
Xiwen Zhou
Chenlong Zhao
Suli Wang
Yong Zhang
Joey Tianyi Zhou
AAML
89
0
0
30 Mar 2024
Language Models for Text Classification: Is In-Context Learning Enough?
Language Models for Text Classification: Is In-Context Learning Enough?
A. Edwards
Jose Camacho-Collados
LRM
87
24
0
26 Mar 2024
m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt
m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt
Jian Yang
Hongcheng Guo
Yuwei Yin
Jiaqi Bai
Bing Wang
Jiaheng Liu
Xinnian Liang
Linzheng Cahi
Liqun Yang
Zhoujun Li
71
10
0
26 Mar 2024
Concerned with Data Contamination? Assessing Countermeasures in Code
  Language Model
Concerned with Data Contamination? Assessing Countermeasures in Code Language Model
Jialun Cao
Wuqi Zhang
Shing-Chi Cheung
60
20
0
25 Mar 2024
$\textit{LinkPrompt}$: Natural and Universal Adversarial Attacks on
  Prompt-based Language Models
LinkPrompt\textit{LinkPrompt}LinkPrompt: Natural and Universal Adversarial Attacks on Prompt-based Language Models
Yue Xu
Wenjie Wang
SILMAAML
82
2
0
25 Mar 2024
Monotonic Paraphrasing Improves Generalization of Language Model
  Prompting
Monotonic Paraphrasing Improves Generalization of Language Model Prompting
Qin Liu
Fei Wang
Nan Xu
Tianyi Yan
Tao Meng
Muhao Chen
LRM
89
8
0
24 Mar 2024
AI and Memory Wall
AI and Memory Wall
A. Gholami
Z. Yao
Sehoon Kim
Coleman Hooper
Michael W. Mahoney
Kurt Keutzer
84
161
0
21 Mar 2024
Clinical information extraction for Low-resource languages with Few-shot
  learning using Pre-trained language models and Prompting
Clinical information extraction for Low-resource languages with Few-shot learning using Pre-trained language models and Prompting
Phillip Richter-Pechanski
Philipp Wiesenbach
Dominic M. Schwab
Christina Kiriakou
Nicolas Geis
Christoph Dieterich
Anette Frank
70
7
0
20 Mar 2024
Cross-Lingual Transfer for Natural Language Inference via Multilingual
  Prompt Translator
Cross-Lingual Transfer for Natural Language Inference via Multilingual Prompt Translator
Xiaoyu Qiu
Yuechen Wang
Jiaxin Shi
Wen-gang Zhou
Houqiang Li
LRM
91
3
0
19 Mar 2024
Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding
Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding
Zichen Wu
Hsiu-Yuan Huang
Fanyi Qu
Hao Sun
VLMMoE
83
5
0
17 Mar 2024
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias
  in Factual Knowledge Extraction
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction
Ziyang Xu
Keqin Peng
Liang Ding
Dacheng Tao
Xiliang Lu
74
10
0
15 Mar 2024
Unveiling the Generalization Power of Fine-Tuned Large Language Models
Unveiling the Generalization Power of Fine-Tuned Large Language Models
Haoran Yang
Yumeng Zhang
Jiaqi Xu
Hongyuan Lu
Pheng Ann Heng
Wai Lam
123
40
0
14 Mar 2024
Few-Shot Cross-Lingual Transfer for Prompting Large Language Models in
  Low-Resource Languages
Few-Shot Cross-Lingual Transfer for Prompting Large Language Models in Low-Resource Languages
Christopher Toukmaji
LRM
76
1
0
09 Mar 2024
DEEP-ICL: Definition-Enriched Experts for Language Model In-Context
  Learning
DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning
Xingwei Qu
Yiming Liang
Yucheng Wang
Tianyu Zheng
Tommy Yue
...
Jiajun Zhang
Wenhu Chen
Chenghua Lin
Jie Fu
Ge Zhang
55
2
0
07 Mar 2024
Exploring the Limitations of Large Language Models in Compositional
  Relation Reasoning
Exploring the Limitations of Large Language Models in Compositional Relation Reasoning
Jinman Zhao
Xueyan Zhang
BDLLRM
68
4
0
05 Mar 2024
OffensiveLang: A Community Based Implicit Offensive Language Dataset
OffensiveLang: A Community Based Implicit Offensive Language Dataset
Amit Das
Mostafa Rahgouy
Dongji Feng
Zheng Zhang
Tathagata Bhattacharya
...
Aman Chadha
Mary J. Sandage
Lauramarie Pope
Gerry V. Dozier
Cheryl Seals
92
2
0
04 Mar 2024
Derivative-Free Optimization for Low-Rank Adaptation in Large Language
  Models
Derivative-Free Optimization for Low-Rank Adaptation in Large Language Models
Feihu Jin
Yin Liu
Ying Tan
68
4
0
04 Mar 2024
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Benjamin Bergner
Andrii Skliar
Amelie Royer
Tijmen Blankevoort
Yuki Markus Asano
B. Bejnordi
101
7
0
26 Feb 2024
An Empirical Study of Challenges in Machine Learning Asset Management
An Empirical Study of Challenges in Machine Learning Asset Management
Zhimin Zhao
Yihao Chen
A. A. Bangash
Bram Adams
Ahmed E. Hassan
93
8
0
25 Feb 2024
PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables
  Parameter-Efficient Transfer Learning
PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning
Zhisheng Lin
Han Fu
Chenghao Liu
Zhuo Li
Jianling Sun
MoEMoMe
47
6
0
23 Feb 2024
The Impact of Demonstrations on Multilingual In-Context Learning: A
  Multidimensional Analysis
The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis
Miaoran Zhang
Vagrant Gautam
Mingyang Wang
Jesujoba Oluwadara Alabi
Xiaoyu Shen
Dietrich Klakow
Marius Mosbach
101
12
0
20 Feb 2024
Comparing Specialised Small and General Large Language Models on Text Classification: 100 Labelled Samples to Achieve Break-Even Performance
Comparing Specialised Small and General Large Language Models on Text Classification: 100 Labelled Samples to Achieve Break-Even Performance
Branislav Pecher
Ivan Srba
Maria Bielikova
ALM
100
8
0
20 Feb 2024
An Empirical Categorization of Prompting Techniques for Large Language
  Models: A Practitioner's Guide
An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide
Oluwole Fagbohun
Rachel M. Harrison
Anton Dereventsov
131
9
0
18 Feb 2024
GNNavi: Navigating the Information Flow in Large Language Models by
  Graph Neural Network
GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network
Shuzhou Yuan
Ercong Nie
Michael Farber
Helmut Schmid
Hinrich Schütze
78
3
0
18 Feb 2024
Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers
Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers
Shuzhou Yuan
Ercong Nie
Bolei Ma
Michael Farber
103
3
0
18 Feb 2024
Prompt-Based Bias Calibration for Better Zero/Few-Shot Learning of
  Language Models
Prompt-Based Bias Calibration for Better Zero/Few-Shot Learning of Language Models
Kang He
Yinghan Long
Kaushik Roy
119
2
0
15 Feb 2024
Federated Prompt-based Decision Transformer for Customized VR Services
  in Mobile Edge Computing System
Federated Prompt-based Decision Transformer for Customized VR Services in Mobile Edge Computing System
Tailin Zhou
Jiadong Yu
Jun Zhang
Danny H. K. Tsang
48
1
0
15 Feb 2024
Previous
12345...111213
Next