ResearchTrend.AI

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 613 papers shown
Title
Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs
Xun Wang
Jing Xu
Franziska Boenisch
Michael Backes
Christopher A. Choquette-Choo
Adam Dziedzic
AAML
17
0
0
19 Jun 2025
If Pigs Could Fly... Can LLMs Logically Reason Through Counterfactuals?
Ishwar B Balappanawar
Vamshi Krishna Bonagiri
Anish Joishy
Manas Gaur
K. Thirunarayan
Ponnurangam Kumaraguru
ReLM LRM
32
0
0
28 May 2025
Dissecting Physics Reasoning in Small Language Models: A Multi-Dimensional Analysis from an Educational Perspective
Nicy Scaria
Silvester John Joseph Kennedy
Diksha Seth
Deepak N. Subramani
LRM
40
0
0
27 May 2025
S$^2$GPT-PINNs: Sparse and Small models for PDEs
Yajie Ji
Yanlai Chen
Shawn Koohy
10
0
0
25 May 2025
Federated Retrieval-Augmented Generation: A Systematic Mapping Study
Abhijit Chakraborty
Chahana Dahal
Vivek Gupta
182
0
0
24 May 2025
Distribution Prompting: Understanding the Expressivity of Language Models Through the Next-Token Distributions They Can Produce
Haojin Wang
Zining Zhu
Freda Shi
61
0
0
18 May 2025
Comparing Lexical and Semantic Vector Search Methods When Classifying Medical Documents
Lee Harris
43
0
0
16 May 2025
Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models
Dawid Wiśniewski
Antoni Solarski
Artur Nowakowski
LRM
97
0
0
09 May 2025
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Wenhao Li
Bo Jin
Mingyi Hong
Changhong Lu
Xiangfeng Wang
156
0
0
07 May 2025
Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom
Rishika Sen
Sujoy Roychowdhury
Sumit Soman
H. G. Ranjani
Srikhetra Mohanty
126
0
0
28 Apr 2025
Active Few-Shot Learning for Text Classification
Saeed Ahmadnia
Arash Yousefi Jordehi
Mahsa Hosseini Khasheh Heyran
Seyed Abolghasem Mirroshandel
Owen Rambow
Cornelia Caragea
102
0
0
26 Feb 2025
MKE-Coder: Multi-Axial Knowledge with Evidence Verification in ICD Coding for Chinese EMRs
Xinxin You
Xien Liu
Xue Yang
Ziyi Wang
Ji Wu
92
0
0
19 Feb 2025
Prompt-Driven Continual Graph Learning
Qi Wang
Tianfei Zhou
Ye Yuan
Rui Mao
CLL
142
0
0
10 Feb 2025
In-Context Learning (and Unlearning) of Length Biases
S. Schoch
Yangfeng Ji
167
0
0
10 Feb 2025
IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates
Aissatou Diallo
Antonis Bikakis
Luke Dickens
Anthony Hunter
Rob Miller
LRM
102
0
0
05 Feb 2025
Memory-Efficient Fine-Tuning of Transformers via Token Selection
Antoine Simoulin
Namyong Park
Xiaoyi Liu
Grey Yang
193
1
0
31 Jan 2025
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration
Thomas Walshe
S. Moon
Chunyang Xiao
Yawwani Gunawardana
Fran Silavong
112
4
0
21 Jan 2025
Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking
Dina Pisarevskaya
Arkaitz Zubiaga
109
1
0
18 Jan 2025
Differentiable Prompt Learning for Vision Language Models
Zhenhan Huang
Tejaswini Pedapati
Pin-Yu Chen
Jianxi Gao
VLM
90
0
0
03 Jan 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
201
13
0
31 Dec 2024
CODECLEANER: Elevating Standards with A Robust Data Contamination Mitigation Toolkit
Jialun Cao
Songqiang Chen
Wuqi Zhang
Hau Ching Lo
Shing-Chi Cheung
61
1
0
16 Nov 2024
A Practical Guide to Fine-tuning Language Models with Limited Data
Márton Szép
Daniel Rueckert
Rüdiger von Eisenhart-Rothe
Florian Hinterwimmer
SyDa ALM
130
2
0
14 Nov 2024
What Should Baby Models Read? Exploring Sample-Efficient Data Composition on Model Performance
Hong Meng Yam
Nathan J Paek
114
1
0
11 Nov 2024
Cephalo: Harnessing Heterogeneous GPU Clusters for Training Transformer Models
Runsheng Benson Guo
Utkarsh Anand
Arthur Chen
Khuzaima Daudjee
59
1
0
01 Nov 2024
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Shailaja Keyur Sampat
Mutsumi Nakamura
Shankar Kailas
Kartik Aggarwal
Mandy Zhou
Yezhou Yang
Chitta Baral
MLLM CoGe ReLM VLM LRM
78
0
0
17 Oct 2024
SLM-Mod: Small Language Models Surpass LLMs at Content Moderation
Xianyang Zhan
Agam Goyal
Yilun Chen
Eshwar Chandrasekharan
Koustuv Saha
AI4MH
442
5
0
17 Oct 2024
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Yu-Chen Lin
Wei-Hua Li
Jun-Cheng Chen
Chu-Song Chen
51
1
0
10 Oct 2024
Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy
Tagore Rao Kosireddy
Jeffrey D. Wall
Evan Lucas
51
1
0
09 Oct 2024
Manual Verbalizer Enrichment for Few-Shot Text Classification
Quang Anh Nguyen
Nadi Tomeh
M. Lebbah
Thierry Charnois
Hanene Azzag
Santiago Cordoba Muñoz
VLM
81
0
0
08 Oct 2024
SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics
Zhiwen You
Kanyao Han
Haotian Zhu
Bertram Ludäscher
Jana Diesner
60
1
0
02 Oct 2024
Exploring Gen-AI applications in building research and industry: A review
Hanlong Wan
Jian Zhang
Yan Chen
Weili Xu
Fan Feng
AI4CE
123
3
0
01 Oct 2024
Small Language Models: Survey, Measurements, and Insights
Zhenyan Lu
Xiang Li
Dongqi Cai
Rongjie Yi
Fangming Liu
Xiwen Zhang
Nicholas D. Lane
Mengwei Xu
ObjD LRM
157
58
0
24 Sep 2024
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs
Guillermo Marco
Luz Rello
Julio Gonzalo
LM&MA ALM
103
7
0
17 Sep 2024
Exploring the Potential of Large Language Models for Heterophilic Graphs
Yuxia Wu
Shujie Li
Yuan Fang
Chuan Shi
147
3
0
26 Aug 2024
Domain-specific long text classification from sparse relevant information
Célia D'Cruz
J. Bereder
Frédéric Precioso
Michel Riveill
100
0
0
23 Aug 2024
SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Kai-Wei Chang
Haibin Wu
Yu-Kai Wang
Yuan-Kuei Wu
Hua Shen
Wei-Cheng Tseng
Iu-thing Kang
Shang-Wen Li
Hung-yi Lee
93
3
0
23 Aug 2024
LBC: Language-Based-Classifier for Out-Of-Variable Generalization
Kangjun Noh
Baekryun Seong
Hoyoon Byun
Youngjun Choi
Sungjin Song
Kyungwoo Song
76
0
0
20 Aug 2024
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Zhongyu Zhao
Menghang Dong
Rongyu Zhang
Wenzhao Zheng
Yunpeng Zhang
Huanrui Yang
Dalong Du
Kurt Keutzer
Shanghang Zhang
100
0
0
15 Aug 2024
Unlocking Efficiency: Adaptive Masking for Gene Transformer Models
Soumyadeep Roy
S. Sural
Niloy Ganguly
MedIm
72
0
0
13 Aug 2024
GPT-3 Powered Information Extraction for Building Robust Knowledge Bases
Ritabrata Roy Choudhury
Soumik Dey
62
1
0
31 Jul 2024
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian
S. Auriemma
Martina Miliani
Mauro Madeddu
Alessandro Bondielli
Lucia Passaro
Alessandro Lenci
63
0
0
30 Jul 2024
Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models
Jia-Hong Huang
Chao-Chun Yang
Yixian Shen
A. M. Pacces
Evangelos Kanoulas
ELM AILaw
96
6
0
26 Jul 2024
Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery
Yuni Susanti
Michael Färber
74
3
0
26 Jul 2024
Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models
Ioana Buhnila
Aman Sinha
Mathieu Constant
LM&MA
54
1
0
23 Jul 2024
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL
Yunseon Choi
Sangmin Bae
Seonghyun Ban
Minchan Jeong
Wei Shen
Lei Song
Wei Xiong
Jiang Bian
Kee-Eung Kim
VLM AAML
59
3
0
20 Jul 2024
Learning Program Behavioral Models from Synthesized Input-Output Pairs
Tural Mammadov
Dietrich Klakow
Alexander Koller
Andreas Zeller
99
3
0
11 Jul 2024
Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?
Nicy Scaria
Silvester John Joseph Kennedy
Deepak N. Subramani
MU
118
2
0
01 Jul 2024
Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization
Niyati Bafna
Kenton Murray
David Yarowsky
126
2
0
19 Jun 2024
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
Yuhang Zhou
Jing Zhu
Paiheng Xu
Xiaoyu Liu
Xiyao Wang
Danai Koutra
Wei Ai
Furong Huang
137
5
0
19 Jun 2024
Are Large Language Models a Good Replacement of Taxonomies?
Yushi Sun
Hao Xin
Kai Sun
Yongjun Xu
Xiao Yang
Xin Luna Dong
Nan Tang
Lei Chen
AI4MH
67
11
0
17 Jun 2024