ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07118
  4. Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
v1v2 (latest)

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

15 September 2020
Timo Schick
Hinrich Schütze
ArXiv (abs)PDFHTML

Papers citing "It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"

50 / 613 papers shown
Title
How Secure Are Large Language Models (LLMs) for Navigation in Urban Environments?
How Secure Are Large Language Models (LLMs) for Navigation in Urban Environments?
Congcong Wen
Jiazhao Liang
Shuaihang Yuan
Hao Huang
Geeta Chandra Raju Bethala
Yu-Shen Liu
Mengyu Wang
Anthony Tzes
Yi Fang
AAML
95
6
0
14 Feb 2024
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal
  Foundation Models
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models
Maurice Diesendruck
Jianzhe Lin
Shima Imani
Gayathri Mahalingam
Mingyang Xu
Jie Zhao
27
2
0
13 Feb 2024
Exploring Low-Resource Medical Image Classification with Weakly
  Supervised Prompt Learning
Exploring Low-Resource Medical Image Classification with Weakly Supervised Prompt Learning
Fudan Zheng
Jindong Cao
Weijiang Yu
Zhiguang Chen
Nong Xiao
Yutong Lu
VLMMedIm
53
19
0
06 Feb 2024
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence
  Labeling Tasks
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks
Bolei Ma
Ercong Nie
Shuzhou Yuan
Helmut Schmid
Michael Farber
Frauke Kreuter
Hinrich Schütze
VLM
140
6
0
29 Jan 2024
MAPLE: Micro Analysis of Pairwise Language Evolution for Few-Shot Claim
  Verification
MAPLE: Micro Analysis of Pairwise Language Evolution for Few-Shot Claim Verification
Xia Zeng
A. Zubiaga
107
5
0
29 Jan 2024
The Typing Cure: Experiences with Large Language Model Chatbots for Mental Health Support
The Typing Cure: Experiences with Large Language Model Chatbots for Mental Health Support
Inhwa Song
Sachin R. Pendse
Neha Kumar
Munmun De Choudhury
AI4MH
79
17
0
25 Jan 2024
Cheap Learning: Maximising Performance of Language Models for Social
  Data Science Using Minimal Data
Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data
Leonardo Castro-Gonzalez
Yi-Ling Chung
Hannak Rose Kirk
John Francis
Angus R. Williams
Pica Johansson
Jonathan Bright
69
1
0
22 Jan 2024
Using LLM such as ChatGPT for Designing and Implementing a RISC
  Processor: Execution,Challenges and Limitations
Using LLM such as ChatGPT for Designing and Implementing a RISC Processor: Execution,Challenges and Limitations
S. Hossain
Aayush Gohil
Yizhou Wang
37
3
0
18 Jan 2024
Leveraging Biases in Large Language Models: "bias-kNN'' for Effective
  Few-Shot Learning
Leveraging Biases in Large Language Models: "bias-kNN'' for Effective Few-Shot Learning
Yong Zhang
Hanzhang Li
Zhitao Li
Ning Cheng
Ming Li
Jing Xiao
Jianzong Wang
80
3
0
18 Jan 2024
Prompting open-source and commercial language models for grammatical error correction of English learner text
Prompting open-source and commercial language models for grammatical error correction of English learner text
Christopher Davis
Andrew Caines
Oistein Andersen
Shiva Taslimipoor
H. Yannakoudakis
Zheng Yuan
Christopher Bryant
Marek Rei
P. Buttery
116
17
0
15 Jan 2024
Promptly Predicting Structures: The Return of Inference
Promptly Predicting Structures: The Return of Inference
Maitrey Mehta
Valentina Pyatkin
Vivek Srikumar
111
4
0
12 Jan 2024
A Novel Prompt-tuning Method: Incorporating Scenario-specific Concepts
  into a Verbalizer
A Novel Prompt-tuning Method: Incorporating Scenario-specific Concepts into a Verbalizer
Yong Ma
Senlin Luo
Yu-Ming Shang
Zhengjun Li
Yong Liu
VLM
45
2
0
10 Jan 2024
MERA: A Comprehensive LLM Evaluation in Russian
MERA: A Comprehensive LLM Evaluation in Russian
Alena Fenogenova
Artem Chervyakov
Nikita Martynov
Anastasia Kozlova
Maria Tikhonova
...
Nikita Savushkin
Polina Mikhailova
Denis Dimitrov
Alexander Panchenko
Sergey Markov
ELM
97
12
0
09 Jan 2024
Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced
  Zero/Few-Shot Forecasting of Multivariate Time Series
Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series
Vijay Ekambaram
Arindam Jati
Pankaj Dayama
Sumanta Mukherjee
Nam H. Nguyen
Wesley M. Gifford
Chandra Reddy
Jayant Kalagnanam
AI4TSVLM
170
35
0
08 Jan 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference
Understanding LLMs: A Comprehensive Overview from Training to Inference
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
...
Xintao Hu
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
SyDa
151
77
0
04 Jan 2024
Temporal Adaptive RGBT Tracking with Modality Prompt
Temporal Adaptive RGBT Tracking with Modality Prompt
Hongyu Wang
Xiaotao Liu
Yifan Li
Meng Sun
Dian Yuan
Jing Liu
89
37
0
02 Jan 2024
Building Efficient Universal Classifiers with Natural Language Inference
Building Efficient Universal Classifiers with Natural Language Inference
Moritz Laurer
W. Atteveldt
Andreu Casas
Kasper Welbers
85
8
0
29 Dec 2023
Semantic Draw Engineering for Text-to-Image Creation
Semantic Draw Engineering for Text-to-Image Creation
Yang Li
Huaqiang Jiang
Yangkai Wu
51
1
0
23 Dec 2023
Decoupling Representation and Knowledge for Few-Shot Intent
  Classification and Slot Filling
Decoupling Representation and Knowledge for Few-Shot Intent Classification and Slot Filling
Jie Han
Yixiong Zou
Yining Qi
Jun Wang
Wei Liu
Yao Wu
Tao Zhang
Ruixuan Li
VLM
68
0
0
21 Dec 2023
Batched Low-Rank Adaptation of Foundation Models
Batched Low-Rank Adaptation of Foundation Models
Yeming Wen
Swarat Chaudhuri
OffRL
91
21
0
09 Dec 2023
MUFFIN: Curating Multi-Faceted Instructions for Improving
  Instruction-Following
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
Renze Lou
Kai Zhang
Jian Xie
Yuxuan Sun
Janice Ahn
Hanzi Xu
Yu Su
Wenpeng Yin
111
30
0
05 Dec 2023
TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP
  Models via GPT4
TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4
Zihao Tan
Qingliang Chen
Yongjian Huang
Chen Liang
SILMAAML
70
5
0
29 Nov 2023
MultiGPrompt for Multi-Task Pre-Training and Prompting on Graphs
MultiGPrompt for Multi-Task Pre-Training and Prompting on Graphs
Xingtong Yu
Chang Zhou
Yuan Fang
Xinming Zhang
100
37
0
28 Nov 2023
Large Language Models are Few-Shot Training Example Generators: A Case
  Study in Fallacy Recognition
Large Language Models are Few-Shot Training Example Generators: A Case Study in Fallacy Recognition
Tariq Alhindi
Smaranda Muresan
Preslav Nakov
HILMLRM
87
5
0
16 Nov 2023
SQATIN: Supervised Instruction Tuning Meets Question Answering for
  Improved Dialogue NLU
SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU
E. Razumovskaia
Goran Glavaš
Anna Korhonen
Ivan Vulić
LRM
63
2
0
16 Nov 2023
Think Before You Speak: Cultivating Communication Skills of Large
  Language Models via Inner Monologue
Think Before You Speak: Cultivating Communication Skills of Large Language Models via Inner Monologue
Junkai Zhou
Liang Pang
Huawei Shen
Xueqi Cheng
LRM
54
6
0
13 Nov 2023
Translating Legalese: Enhancing Public Understanding of Court Opinions
  with Legal Summarizers
Translating Legalese: Enhancing Public Understanding of Court Opinions with Legal Summarizers
Elliott Ash
Aniket Kesari
Suresh Naidu
Lena Song
Dominik Stammbach
ELM
74
5
0
11 Nov 2023
Mini Minds: Exploring Bebeshka and Zlata Baby Models
Mini Minds: Exploring Bebeshka and Zlata Baby Models
Irina Proskurina
Guillaume Metzler
Julien Velcin
ALM
56
1
0
06 Nov 2023
The language of prompting: What linguistic properties make a prompt
  successful?
The language of prompting: What linguistic properties make a prompt successful?
Alina Leidinger
R. Rooij
Ekaterina Shutova
96
44
0
03 Nov 2023
Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation
Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation
Xue-mei Hu
Ce Zhang
Yi Zhang
Bowen Hai
Ke Yu
Zhihai He
MDEVLM
98
18
0
02 Nov 2023
ProMap: Effective Bilingual Lexicon Induction via Language Model
  Prompting
ProMap: Effective Bilingual Lexicon Induction via Language Model Prompting
Abdellah El Mekki
Muhammad Abdul-Mageed
El Moatez Billah Nagoudi
Ismail Berrada
A. Khoumsi
78
2
0
28 Oct 2023
BLESS: Benchmarking Large Language Models on Sentence Simplification
BLESS: Benchmarking Large Language Models on Sentence Simplification
Tannon Kew
Alison Chi
Laura Vásquez-Rodríguez
Sweta Agrawal
Dennis Aumiller
Fernando Alva-Manchego
Teven Le Scao
93
26
0
24 Oct 2023
A Communication Theory Perspective on Prompting Engineering Methods for
  Large Language Models
A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models
Yuanfeng Song
Yuanqin He
Xuefang Zhao
Hanlin Gu
Di Jiang
Haijun Yang
Lixin Fan
Qiang Yang
69
6
0
24 Oct 2023
Open-Ended Instructable Embodied Agents with Memory-Augmented Large
  Language Models
Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models
Gabriel H. Sarch
Yue Wu
Michael J. Tarr
Katerina Fragkiadaki
LM&RoLLMAG
111
19
0
23 Oct 2023
A Survey on Semantic Processing Techniques
A Survey on Semantic Processing Techniques
Rui Mao
Kai He
Xulang Zhang
Guanyi Chen
Jinjie Ni
Zonglin Yang
Min Zhang
95
34
0
22 Oct 2023
PromptMix: A Class Boundary Augmentation Method for Large Language Model
  Distillation
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Gaurav Sahu
Olga Vechtomova
Dzmitry Bahdanau
I. Laradji
VLM
104
27
0
22 Oct 2023
Transductive Learning for Textual Few-Shot Classification in API-based
  Embedding Models
Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models
Pierre Colombo
Victor Pellegrain
Malik Boudiaf
Victor Storchan
Myriam Tami
Ismail Ben Ayed
C´eline Hudelot
Pablo Piantanida
99
8
0
21 Oct 2023
Enhancing Zero-Shot Crypto Sentiment with Fine-tuned Language Model and
  Prompt Engineering
Enhancing Zero-Shot Crypto Sentiment with Fine-tuned Language Model and Prompt Engineering
Rahman S. M. Wahidur
Ishmam Tashdeed
Manjit Kaur
Heung-No Lee
ALM
89
17
0
20 Oct 2023
NameGuess: Column Name Expansion for Tabular Data
NameGuess: Column Name Expansion for Tabular Data
Jiani Zhang
Zhengyuan Shen
Balasubramaniam Srinivasan
Shen Wang
Huzefa Rangwala
George Karypis
47
6
0
19 Oct 2023
Label-Aware Automatic Verbalizer for Few-Shot Text Classification
Label-Aware Automatic Verbalizer for Few-Shot Text Classification
Thanakorn Thaminkaew
Piyawat Lertvittayakumjorn
P. Vateekul
VLM
44
1
0
19 Oct 2023
Chain-of-Thought Tuning: Masked Language Models can also Think Step By
  Step in Natural Language Understanding
Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding
Caoyun Fan
Jidong Tian
Yitian Li
Wenqing Chen
Hao He
Yaohui Jin
LRM
66
4
0
18 Oct 2023
Reformulating NLP tasks to Capture Longitudinal Manifestation of
  Language Disorders in People with Dementia
Reformulating NLP tasks to Capture Longitudinal Manifestation of Language Disorders in People with Dementia
Dimitris Gkoumas
Matthew Purver
Maria Liakata
63
2
0
15 Oct 2023
XAL: EXplainable Active Learning Makes Classifiers Better Low-resource
  Learners
XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners
Yun Luo
Zhen Yang
Fandong Meng
Yingjie Li
Fang Guo
Qinglin Qi
Jie Zhou
Yue Zhang
97
2
0
09 Oct 2023
Towards Better Chain-of-Thought Prompting Strategies: A Survey
Towards Better Chain-of-Thought Prompting Strategies: A Survey
Zihan Yu
Liang He
Zhen Wu
Xinyu Dai
Jiajun Chen
LRM
167
55
0
08 Oct 2023
Amortizing intractable inference in large language models
Amortizing intractable inference in large language models
Marvin Schmitt
Moksh Jain
Daniel Habermann
Younesse Kaddar
Ullrich Kothe
Stefan T. Radev
Nikolay Malkin
AIFinBDL
121
57
0
06 Oct 2023
Think before you speak: Training Language Models With Pause Tokens
Think before you speak: Training Language Models With Pause Tokens
Sachin Goyal
Ziwei Ji
A. S. Rawat
A. Menon
Sanjiv Kumar
Vaishnavh Nagarajan
LRM
111
122
0
03 Oct 2023
Intuitive or Dependent? Investigating LLMs' Behavior Style to
  Conflicting Prompts
Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts
Jiahao Ying
Yixin Cao
Kai Xiong
Yidong He
Long Cui
Yongbin Liu
46
13
0
29 Sep 2023
Prompt-and-Align: Prompt-Based Social Alignment for Few-Shot Fake News
  Detection
Prompt-and-Align: Prompt-Based Social Alignment for Few-Shot Fake News Detection
Jiaying Wu
Xinyu Chen
Haobin Yang
Qi Zhao
Yuhui Shi
AAML
85
12
0
28 Sep 2023
Defending Pre-trained Language Models as Few-shot Learners against
  Backdoor Attacks
Defending Pre-trained Language Models as Few-shot Learners against Backdoor Attacks
Zhaohan Xi
Tianyu Du
Changjiang Li
Ren Pang
S. Ji
Jinghui Chen
Fenglong Ma
Ting Wang
AAML
68
34
0
23 Sep 2023
Prompt Tuned Embedding Classification for Multi-Label Industry Sector
  Allocation
Prompt Tuned Embedding Classification for Multi-Label Industry Sector Allocation
V. Buchner
Lele Cao
Jan-Christoph Kalo
Vilhelm von Ehrenheim
VLM
83
2
0
21 Sep 2023
Previous
123456...111213
Next