ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXivPDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 606 papers shown
Title
Comparing Lexical and Semantic Vector Search Methods When Classifying Medical Documents
Comparing Lexical and Semantic Vector Search Methods When Classifying Medical Documents
Lee Harris
Philippe De Wilde
James Bentham
2
0
0
16 May 2025
Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models
Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models
Dawid Wi'sniewski
Antoni Solarski
Artur Nowakowski
LRM
29
0
0
09 May 2025
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows
Wenhao Li
Bo Jin
Mingyi Hong
Changhong Lu
Xiangfeng Wang
48
0
0
07 May 2025
Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom
Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom
Rishika Sen
Sujoy Roychowdhury
Sumit Soman
H. G. Ranjani
Srikhetra Mohanty
66
0
0
28 Apr 2025
Active Few-Shot Learning for Text Classification
Active Few-Shot Learning for Text Classification
Saeed Ahmadnia
Arash Yousefi Jordehi
Mahsa Hosseini Khasheh Heyran
Seyed Abolghasem Mirroshandel
Owen Rambow
Cornelia Caragea
63
0
0
26 Feb 2025
MKE-Coder: Multi-Axial Knowledge with Evidence Verification in ICD Coding for Chinese EMRs
MKE-Coder: Multi-Axial Knowledge with Evidence Verification in ICD Coding for Chinese EMRs
Xinxin You
Xien Liu
Xue Yang
Ziyi Wang
Ji Wu
49
0
0
19 Feb 2025
Prompt-Driven Continual Graph Learning
Prompt-Driven Continual Graph Learning
Qi Wang
Tianfei Zhou
Ye Yuan
Rui Mao
CLL
47
0
0
10 Feb 2025
In-Context Learning (and Unlearning) of Length Biases
In-Context Learning (and Unlearning) of Length Biases
S. Schoch
Yangfeng Ji
97
0
0
10 Feb 2025
IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates
IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates
Aissatou Diallo
Antonis Bikakis
Luke Dickens
Anthony Hunter
Rob Miller
LRM
36
0
0
05 Feb 2025
Memory-Efficient Fine-Tuning of Transformers via Token Selection
Memory-Efficient Fine-Tuning of Transformers via Token Selection
Antoine Simoulin
Namyong Park
Xiaoyi Liu
Grey Yang
115
0
0
31 Jan 2025
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration
Thomas Walshe
S. Moon
Chunyang Xiao
Yawwani Gunawardana
Fran Silavong
47
2
0
21 Jan 2025
Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking
Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking
Dina Pisarevskaya
Arkaitz Zubiaga
48
0
0
18 Jan 2025
Differentiable Prompt Learning for Vision Language Models
Zhenhan Huang
Tejaswini Pedapati
Pin-Yu Chen
Jianxi Gao
VLM
28
0
0
03 Jan 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
93
12
0
31 Dec 2024
CODECLEANER: Elevating Standards with A Robust Data Contamination Mitigation Toolkit
Jialun Cao
Songqiang Chen
Wuqi Zhang
Hau Ching Lo
Shing-Chi Cheung
39
0
0
16 Nov 2024
A Practical Guide to Fine-tuning Language Models with Limited Data
A Practical Guide to Fine-tuning Language Models with Limited Data
Márton Szép
Daniel Rueckert
Rüdiger von Eisenhart-Rothe
Florian Hinterwimmer
SyDa
ALM
49
2
0
14 Nov 2024
What Should Baby Models Read? Exploring Sample-Efficient Data
  Composition on Model Performance
What Should Baby Models Read? Exploring Sample-Efficient Data Composition on Model Performance
Hong Meng Yam
Nathan J Paek
46
1
0
11 Nov 2024
Cephalo: Harnessing Heterogeneous GPU Clusters for Training Transformer
  Models
Cephalo: Harnessing Heterogeneous GPU Clusters for Training Transformer Models
Runsheng Benson Guo
Utkarsh Anand
Arthur Chen
Khuzaima Daudjee
44
1
0
01 Nov 2024
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic
  Reasoning Tasks
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Shailaja Keyur Sampat
Mutsumi Nakamura
Shankar Kailas
Kartik Aggarwal
Mandy Zhou
Yezhou Yang
Chitta Baral
MLLM
CoGe
ReLM
VLM
LRM
37
0
0
17 Oct 2024
SLM-Mod: Small Language Models Surpass LLMs at Content Moderation
SLM-Mod: Small Language Models Surpass LLMs at Content Moderation
Xianyang Zhan
Agam Goyal
Yilun Chen
Eshwar Chandrasekharan
Koustuv Saha
AI4MH
150
0
0
17 Oct 2024
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Yu-Chen Lin
Wei-Hua Li
Jun-Cheng Chen
Chu-Song Chen
35
1
0
10 Oct 2024
Exploring the Readiness of Prominent Small Language Models for the
  Democratization of Financial Literacy
Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy
Tagore Rao Kosireddy
Jeffrey D. Wall
Evan Lucas
31
1
0
09 Oct 2024
Manual Verbalizer Enrichment for Few-Shot Text Classification
Manual Verbalizer Enrichment for Few-Shot Text Classification
Quang Anh Nguyen
Nadi Tomeh
M. Lebbah
Thierry Charnois
Hanene Azzag
Santiago Cordoba Muñoz
VLM
35
0
0
08 Oct 2024
SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization
  of Scientific Topics
SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics
Zhiwen You
Kanyao Han
Haotian Zhu
Bertram Ludäscher
Jana Diesner
35
1
0
02 Oct 2024
Exploring Gen-AI applications in building research and industry: A review
Exploring Gen-AI applications in building research and industry: A review
Hanlong Wan
Jian Zhang
Yan Chen
Weili Xu
Fan Feng
AI4CE
47
0
0
01 Oct 2024
Small Language Models: Survey, Measurements, and Insights
Small Language Models: Survey, Measurements, and Insights
Zhenyan Lu
Xiang Li
Dongqi Cai
Rongjie Yi
Fangming Liu
Xiwen Zhang
Nicholas D. Lane
Mengwei Xu
ObjD
LRM
58
36
0
24 Sep 2024
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs
Guillermo Marco
Luz Rello
Julio Gonzalo
LM&MA
ALM
51
6
0
17 Sep 2024
Exploring the Potential of Large Language Models for Heterophilic Graphs
Exploring the Potential of Large Language Models for Heterophilic Graphs
Yuxia Wu
Shujie Li
Yuan Fang
Chuan Shi
44
1
0
26 Aug 2024
Domain-specific long text classification from sparse relevant
  information
Domain-specific long text classification from sparse relevant information
Célia DĆruz
J. Bereder
Frédéric Precioso
Michel Riveill
37
0
0
23 Aug 2024
SpeechPrompt: Prompting Speech Language Models for Speech Processing
  Tasks
SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Kai-Wei Chang
Haibin Wu
Yu-Kai Wang
Yuan-Kuei Wu
Hua Shen
Wei-Cheng Tseng
Iu-thing Kang
Shang-Wen Li
Hung-yi Lee
53
3
0
23 Aug 2024
LBC: Language-Based-Classifier for Out-Of-Variable Generalization
LBC: Language-Based-Classifier for Out-Of-Variable Generalization
Kangjun Noh
Baekryun Seong
Hoyoon Byun
Youngjun Choi
Sungjin Song
Kyungwoo Song
31
0
0
20 Aug 2024
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large
  Language Models
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Zhongyu Zhao
Menghang Dong
Rongyu Zhang
Wenzhao Zheng
Yunpeng Zhang
Huanrui Yang
Dalong Du
Kurt Keutzer
Shanghang Zhang
51
0
0
15 Aug 2024
Unlocking Efficiency: Adaptive Masking for Gene Transformer Models
Unlocking Efficiency: Adaptive Masking for Gene Transformer Models
Soumyadeep Roy
S. Sural
Niloy Ganguly
MedIm
43
0
0
13 Aug 2024
GPT-3 Powered Information Extraction for Building Robust Knowledge Bases
GPT-3 Powered Information Extraction for Building Robust Knowledge Bases
Ritabrata Roy Choudhury
Soumik Dey
42
1
0
31 Jul 2024
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain
  Study in Italian
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian
S. Auriemma
Martina Miliani
Mauro Madeddu
Alessandro Bondielli
Lucia Passaro
Alessandro Lenci
40
0
0
30 Jul 2024
Optimizing Numerical Estimation and Operational Efficiency in the Legal
  Domain through Large Language Models
Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models
Jia-Hong Huang
Chao-Chun Yang
Yixian Shen
A. M. Pacces
Evangelos Kanoulas
ELM
AILaw
49
6
0
26 Jul 2024
Knowledge Graph Structure as Prompt: Improving Small Language Models
  Capabilities for Knowledge-based Causal Discovery
Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery
Yuni Susanti
Michael Färber
34
3
0
26 Jul 2024
Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases
  Generation with Small Language Models
Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models
Ioana Buhnila
Aman Sinha
Mathieu Constant
LM&MA
31
1
0
23 Jul 2024
Hard Prompts Made Interpretable: Sparse Entropy Regularization for
  Prompt Tuning with RL
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL
Yunseon Choi
Sangmin Bae
Seonghyun Ban
Minchan Jeong
Chuheng Zhang
Lei Song
Li Zhao
Jiang Bian
Kee-Eung Kim
VLM
AAML
36
3
0
20 Jul 2024
Learning Program Behavioral Models from Synthesized Input-Output Pairs
Learning Program Behavioral Models from Synthesized Input-Output Pairs
Tural Mammadov
Dietrich Klakow
Alexander Koller
Andreas Zeller
45
3
0
11 Jul 2024
Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization
Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization
Niyati Bafna
Kenton Murray
David Yarowsky
63
2
0
19 Jun 2024
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in
  Sequence-Level Knowledge Distillation
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
Yuhang Zhou
Jing Zhu
Paiheng Xu
Xiaoyu Liu
Xiyao Wang
Danai Koutra
Wei Ai
Furong Huang
81
4
0
19 Jun 2024
Are Large Language Models a Good Replacement of Taxonomies?
Are Large Language Models a Good Replacement of Taxonomies?
Yushi Sun
Hao Xin
Kai Sun
Yongjun Xu
Xiao Yang
Xin Luna Dong
Nan Tang
Lei Chen
AI4MH
38
7
0
17 Jun 2024
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization
  for Language Models
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models
Chengzhengxu Li
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Chen Liu
Y. Lan
Chao Shen
60
2
0
15 Jun 2024
Survey for Landing Generative AI in Social and E-commerce Recsys -- the
  Industry Perspectives
Survey for Landing Generative AI in Social and E-commerce Recsys -- the Industry Perspectives
Da Xu
Danqing Zhang
Guangyu Yang
Bo Yang
Shuyuan Xu
Lingling Zheng
Cindy Liang
32
2
0
10 Jun 2024
Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from
  Imperfect Teacher Models in Low-Budget Scenarios
Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios
Yuhang Zhou
Wei Ai
40
5
0
08 Jun 2024
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter
Jitai Hao
Weiwei Sun
Xin Xin
Qi Meng
Zhumin Chen
Pengjie Ren
Zhaochun Ren
MoE
42
2
0
07 Jun 2024
BERTs are Generative In-Context Learners
BERTs are Generative In-Context Learners
David Samuel
48
5
0
07 Jun 2024
LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text
  Classification
LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification
Chun Liu
Hongguang Zhang
Kainan Zhao
Xinghai Ju
Lin Yang
50
4
0
06 Jun 2024
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
David Ifeoluwa Adelani
Jessica Ojo
Israel Abebe Azime
Jian Yun Zhuang
Jesujoba Oluwadara Alabi
...
Salomey Osei
Sokhar Samb
Tadesse Kebede Guge
Pontus Stenetorp
Pontus Stenetorp
ELM
65
7
0
05 Jun 2024
1234...111213
Next