Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.07118
Cited By
v1
v2 (latest)
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
15 September 2020
Timo Schick
Hinrich Schütze
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"
50 / 613 papers shown
Title
Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs
Xun Wang
Jing Xu
Franziska Boenisch
Michael Backes
Christopher A. Choquette-Choo
Adam Dziedzic
AAML
17
0
0
19 Jun 2025
If Pigs Could Fly... Can LLMs Logically Reason Through Counterfactuals?
Ishwar B Balappanawar
Vamshi Krishna Bonagiri
Anish Joishy
Manas Gaur
K. Thirunarayan
Ponnurangam Kumaraguru
ReLM
LRM
32
0
0
28 May 2025
Dissecting Physics Reasoning in Small Language Models: A Multi-Dimensional Analysis from an Educational Perspective
Nicy Scaria
Silvester John Joseph Kennedy
Diksha Seth
Deepak N. Subramani
LRM
40
0
0
27 May 2025
S
2
^2
2
GPT-PINNs: Sparse and Small models for PDEs
Yajie Ji
Yanlai Chen
Shawn Koohy
10
0
0
25 May 2025
Federated Retrieval-Augmented Generation: A Systematic Mapping Study
Abhijit Chakraborty
Chahana Dahal
Vivek Gupta
182
0
0
24 May 2025
Distribution Prompting: Understanding the Expressivity of Language Models Through the Next-Token Distributions They Can Produce
Haojin Wang
Zining Zhu
Freda Shi
61
0
0
18 May 2025
Comparing Lexical and Semantic Vector Search Methods When Classifying Medical Documents
Lee Harris
43
0
0
16 May 2025
Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models
Dawid Wi'sniewski
Antoni Solarski
Artur Nowakowski
LRM
97
0
0
09 May 2025
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Wenhao Li
Bo Jin
Mingyi Hong
Changhong Lu
Xiangfeng Wang
156
0
0
07 May 2025
Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom
Rishika Sen
Sujoy Roychowdhury
Sumit Soman
H. G. Ranjani
Srikhetra Mohanty
126
0
0
28 Apr 2025
Active Few-Shot Learning for Text Classification
Saeed Ahmadnia
Arash Yousefi Jordehi
Mahsa Hosseini Khasheh Heyran
Seyed Abolghasem Mirroshandel
Owen Rambow
Cornelia Caragea
102
0
0
26 Feb 2025
MKE-Coder: Multi-Axial Knowledge with Evidence Verification in ICD Coding for Chinese EMRs
Xinxin You
Xien Liu
Xue Yang
Ziyi Wang
Ji Wu
92
0
0
19 Feb 2025
Prompt-Driven Continual Graph Learning
Qi Wang
Tianfei Zhou
Ye Yuan
Rui Mao
CLL
142
0
0
10 Feb 2025
In-Context Learning (and Unlearning) of Length Biases
S. Schoch
Yangfeng Ji
167
0
0
10 Feb 2025
IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates
Aissatou Diallo
Antonis Bikakis
Luke Dickens
Anthony Hunter
Rob Miller
LRM
102
0
0
05 Feb 2025
Memory-Efficient Fine-Tuning of Transformers via Token Selection
Antoine Simoulin
Namyong Park
Xiaoyi Liu
Grey Yang
193
1
0
31 Jan 2025
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration
Thomas Walshe
S. Moon
Chunyang Xiao
Yawwani Gunawardana
Fran Silavong
112
4
0
21 Jan 2025
Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking
Dina Pisarevskaya
Arkaitz Zubiaga
109
1
0
18 Jan 2025
Differentiable Prompt Learning for Vision Language Models
Zhenhan Huang
Tejaswini Pedapati
Pin-Yu Chen
Jianxi Gao
VLM
90
0
0
03 Jan 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
201
13
0
31 Dec 2024
CODECLEANER: Elevating Standards with A Robust Data Contamination Mitigation Toolkit
Jialun Cao
Songqiang Chen
Wuqi Zhang
Hau Ching Lo
Shing-Chi Cheung
61
1
0
16 Nov 2024
A Practical Guide to Fine-tuning Language Models with Limited Data
Márton Szép
Daniel Rueckert
Rüdiger von Eisenhart-Rothe
Florian Hinterwimmer
SyDa
ALM
130
2
0
14 Nov 2024
What Should Baby Models Read? Exploring Sample-Efficient Data Composition on Model Performance
Hong Meng Yam
Nathan J Paek
114
1
0
11 Nov 2024
Cephalo: Harnessing Heterogeneous GPU Clusters for Training Transformer Models
Runsheng Benson Guo
Utkarsh Anand
Arthur Chen
Khuzaima Daudjee
59
1
0
01 Nov 2024
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Shailaja Keyur Sampat
Mutsumi Nakamura
Shankar Kailas
Kartik Aggarwal
Mandy Zhou
Yezhou Yang
Chitta Baral
MLLM
CoGe
ReLM
VLM
LRM
78
0
0
17 Oct 2024
SLM-Mod: Small Language Models Surpass LLMs at Content Moderation
Xianyang Zhan
Agam Goyal
Yilun Chen
Eshwar Chandrasekharan
Koustuv Saha
AI4MH
442
5
0
17 Oct 2024
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Yu-Chen Lin
Wei-Hua Li
Jun-Cheng Chen
Chu-Song Chen
51
1
0
10 Oct 2024
Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy
Tagore Rao Kosireddy
Jeffrey D. Wall
Evan Lucas
51
1
0
09 Oct 2024
Manual Verbalizer Enrichment for Few-Shot Text Classification
Quang Anh Nguyen
Nadi Tomeh
M. Lebbah
Thierry Charnois
Hanene Azzag
Santiago Cordoba Muñoz
VLM
81
0
0
08 Oct 2024
SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics
Zhiwen You
Kanyao Han
Haotian Zhu
Bertram Ludäscher
Jana Diesner
60
1
0
02 Oct 2024
Exploring Gen-AI applications in building research and industry: A review
Hanlong Wan
Jian Zhang
Yan Chen
Weili Xu
Fan Feng
AI4CE
123
3
0
01 Oct 2024
Small Language Models: Survey, Measurements, and Insights
Zhenyan Lu
Xiang Li
Dongqi Cai
Rongjie Yi
Fangming Liu
Xiwen Zhang
Nicholas D. Lane
Mengwei Xu
ObjD
LRM
157
58
0
24 Sep 2024
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs
Guillermo Marco
Luz Rello
Julio Gonzalo
LM&MA
ALM
103
7
0
17 Sep 2024
Exploring the Potential of Large Language Models for Heterophilic Graphs
Yuxia Wu
Shujie Li
Yuan Fang
Chuan Shi
147
3
0
26 Aug 2024
Domain-specific long text classification from sparse relevant information
Célia DĆruz
J. Bereder
Frédéric Precioso
Michel Riveill
100
0
0
23 Aug 2024
SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Kai-Wei Chang
Haibin Wu
Yu-Kai Wang
Yuan-Kuei Wu
Hua Shen
Wei-Cheng Tseng
Iu-thing Kang
Shang-Wen Li
Hung-yi Lee
93
3
0
23 Aug 2024
LBC: Language-Based-Classifier for Out-Of-Variable Generalization
Kangjun Noh
Baekryun Seong
Hoyoon Byun
Youngjun Choi
Sungjin Song
Kyungwoo Song
76
0
0
20 Aug 2024
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Zhongyu Zhao
Menghang Dong
Rongyu Zhang
Wenzhao Zheng
Yunpeng Zhang
Huanrui Yang
Dalong Du
Kurt Keutzer
Shanghang Zhang
100
0
0
15 Aug 2024
Unlocking Efficiency: Adaptive Masking for Gene Transformer Models
Soumyadeep Roy
S. Sural
Niloy Ganguly
MedIm
72
0
0
13 Aug 2024
GPT-3 Powered Information Extraction for Building Robust Knowledge Bases
Ritabrata Roy Choudhury
Soumik Dey
62
1
0
31 Jul 2024
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian
S. Auriemma
Martina Miliani
Mauro Madeddu
Alessandro Bondielli
Lucia Passaro
Alessandro Lenci
63
0
0
30 Jul 2024
Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models
Jia-Hong Huang
Chao-Chun Yang
Yixian Shen
A. M. Pacces
Evangelos Kanoulas
ELM
AILaw
96
6
0
26 Jul 2024
Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery
Yuni Susanti
Michael Färber
74
3
0
26 Jul 2024
Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models
Ioana Buhnila
Aman Sinha
Mathieu Constant
LM&MA
54
1
0
23 Jul 2024
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL
Yunseon Choi
Sangmin Bae
Seonghyun Ban
Minchan Jeong
Wei Shen
Lei Song
Wei Xiong
Jiang Bian
Kee-Eung Kim
VLM
AAML
59
3
0
20 Jul 2024
Learning Program Behavioral Models from Synthesized Input-Output Pairs
Tural Mammadov
Dietrich Klakow
Alexander Koller
Andreas Zeller
99
3
0
11 Jul 2024
Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?
Nicy Scaria
Silvester John Joseph Kennedy
Deepak N. Subramani
MU
118
2
0
01 Jul 2024
Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization
Niyati Bafna
Kenton Murray
David Yarowsky
126
2
0
19 Jun 2024
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
Yuhang Zhou
Jing Zhu
Paiheng Xu
Xiaoyu Liu
Xiyao Wang
Danai Koutra
Wei Ai
Furong Huang
137
5
0
19 Jun 2024
Are Large Language Models a Good Replacement of Taxonomies?
Yushi Sun
Hao Xin
Kai Sun
Yongjun Xu
Xiao Yang
Xin Luna Dong
Nan Tang
Lei Chen
AI4MH
67
11
0
17 Jun 2024
1
2
3
4
...
11
12
13
Next