Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.07118
Cited By
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
15 September 2020
Timo Schick
Hinrich Schütze
Re-assign community
ArXiv
PDF
HTML
Papers citing
"It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners"
50 / 606 papers shown
Title
Robust Prompt Optimization for Large Language Models Against Distribution Shifts
Moxin Li
Wenjie Wang
Fuli Feng
Yixin Cao
Jizhi Zhang
Tat-Seng Chua
OffRL
42
15
0
23 May 2023
Small Language Models Improve Giants by Rewriting Their Outputs
Giorgos Vernikos
Arthur Bravzinskas
Jakub Adamek
Jonathan Mallinson
Aliaksei Severyn
Eric Malmi
BDL
LRM
33
14
0
22 May 2023
Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes
Kuan-Hao Huang
L Tan
Rui Hou
Sinong Wang
Amjad Almahairi
Ruty Rinott
AI4CE
30
0
0
22 May 2023
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration
Qifan Yu
Juncheng Li
Wentao Ye
Siliang Tang
Yueting Zhuang
36
13
0
22 May 2023
Enhancing Cross-lingual Natural Language Inference by Soft Prompting with Multilingual Verbalizer
Shuang Li
Xuming Hu
Aiwei Liu
Yawen Yang
Fukun Ma
Philip S. Yu
Lijie Wen
33
4
0
22 May 2023
Automated Few-shot Classification with Instruction-Finetuned Language Models
Rami Aly
Xingjian Shi
Kaixiang Lin
Aston Zhang
A. Wilson
38
9
0
21 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
41
6
0
21 May 2023
PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search
Mozhi Zhang
Hang Yan
Yaqian Zhou
Xipeng Qiu
23
10
0
20 May 2023
Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs
IokTong Lei
Zhidong Deng
ReLM
RALM
LRM
27
4
0
19 May 2023
Zero-Shot Text Classification via Self-Supervised Tuning
Chaoqun Liu
Wenxuan Zhang
Guizhen Chen
Xiaobao Wu
A. Luu
Chip Hong Chang
Lidong Bing
VLM
37
11
0
19 May 2023
Efficient Prompting via Dynamic In-Context Learning
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ryan Cotterell
Mrinmaya Sachan
27
19
0
18 May 2023
"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation
Anaelia Ovalle
Palash Goyal
Jwala Dhamala
Zachary Jaggers
Kai-Wei Chang
Aram Galstyan
R. Zemel
Rahul Gupta
25
61
0
17 May 2023
CPL-NoViD: Context-Aware Prompt-based Learning for Norm Violation Detection in Online Communities
Zihao He
Jonathan May
Kristina Lerman
44
3
0
16 May 2023
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
Raman Dutt
Linus Ericsson
Pedro Sanchez
Sotirios A. Tsaftaris
Timothy M. Hospedales
MedIm
27
50
0
14 May 2023
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering
Qianglong Chen
Guohai Xu
Mingshi Yan
Ji Zhang
Fei Huang
Luo Si
Yin Zhang
21
9
0
14 May 2023
Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives
Qiushi Sun
Chengcheng Han
Nuo Chen
Renyu Zhu
Jing Gong
Xiang Li
Ming Gao
VLM
27
8
0
14 May 2023
Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation
Lei Liu
J. Huang
CLL
29
2
0
12 May 2023
When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust
Minh Le Nguyen
Duy-Hung Nguyen
Shahab Sabahi
Hung Le
Jeffrey Yang
Hajime Hotta
33
1
0
12 May 2023
Musketeer: Joint Training for Multi-task Vision Language Model with Task Explanation Prompts
Zhaoyang Zhang
Yantao Shen
Kunyu Shi
Zhaowei Cai
Jun Fang
Siqi Deng
Hao Yang
Davide Modolo
Z. Tu
Stefano Soatto
VLM
28
2
0
11 May 2023
How Good are Commercial Large Language Models on African Languages?
Jessica Ojo
Kelechi Ogueji
26
5
0
11 May 2023
Investigating Forgetting in Pre-Trained Representations Through Continual Learning
Yun Luo
Zhen Yang
Xuefeng Bai
Fandong Meng
Jie Zhou
Yue Zhang
CLL
KELM
27
16
0
10 May 2023
Towards Weakly-Supervised Hate Speech Classification Across Datasets
Yiping Jin
Leo Wanner
Vishakha Kadam
A. Shvets
34
5
0
04 May 2023
Black-box Prompt Tuning with Subspace Learning
Yuanhang Zheng
Zhixing Tan
Peng Li
Yang Liu
VLM
59
9
0
04 May 2023
PTP: Boosting Stability and Performance of Prompt Tuning with Perturbation-Based Regularizer
Lichang Chen
Heng-Chiao Huang
Varun Madhavan
AAML
124
11
0
03 May 2023
Causal Interventions-based Few-Shot Named Entity Recognition
Zhen Yang
Yongbin Liu
Chunping Ouyang
CML
24
0
0
03 May 2023
Few-shot Event Detection: An Empirical Study and a Unified View
Yubo Ma
Zehao Wang
Yixin Cao
Aixin Sun
60
10
0
03 May 2023
Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner
Zhengxiang Shi
Aldo Lipani
VLM
CLL
37
21
0
02 May 2023
From Words to Code: Harnessing Data for Program Synthesis from Natural Language
Anirudh Khatry
Joyce Cahoon
Jordan Henkel
Shaleen Deep
Venkatesh Emani
...
Vu Le
Mohammad Raza
Sherry Shi
Mukul Singh
A. Tiwari
39
12
0
02 May 2023
HQP: A Human-Annotated Dataset for Detecting Online Propaganda
Abdurahman Maarouf
Dominik Bär
Dominique Geissler
Stefan Feuerriegel
17
9
0
28 Apr 2023
π
π
π
-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation
Chengyue Wu
Teng Wang
Yixiao Ge
Zeyu Lu
Rui-Zhi Zhou
Ying Shan
Ping Luo
MoMe
88
35
0
27 Apr 2023
Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM
Ruohong Zhang
Yau-Shian Wang
Yiming Yang
SyDa
34
10
0
24 Apr 2023
MasakhaNEWS: News Topic Classification for African languages
David Ifeoluwa Adelani
Marek Masiak
Israel Abebe Azime
Jesujoba Oluwadara Alabi
A. Tonja
...
Moges Ahmed Mehamed
Evrard Ngabire
Jules Jules
Ivan Ssenkungu
Pontus Stenetorp
28
24
0
19 Apr 2023
MixPro: Simple yet Effective Data Augmentation for Prompt-based Learning
Bohan Li
Longxu Dou
Yutai Hou
Yunlong Feng
Honglin Mu
Qingfu Zhu
Qinghua Sun
Wanxiang Che
VLM
37
3
0
19 Apr 2023
Just Tell Me: Prompt Engineering in Business Process Management
Kiran Busch
Alexander Rochlitzer
Diana Sola
Henrik Leopold
31
29
0
14 Apr 2023
Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning
Chi-Liang Liu
Hao Wang
Nuwa Xi
Sendong Zhao
Bing Qin
VLM
16
1
0
12 Apr 2023
chatClimate: Grounding Conversational AI in Climate Science
S. Vaghefi
Qian Wang
V. Muccione
Jingwei Ni
Mathias Kraus
...
Tobias Schimanski
Chiara Colesanti-Senni
Nicolas Webersinke
Christrian Huggel
Markus Leippold
KELM
AI4MH
HILM
25
67
0
11 Apr 2023
Similarity-Aware Multimodal Prompt Learning for Fake News Detection
Ye Jiang
Xiaomin Yu
Yimin Wang
Xiaoman Xu
Xingyi Song
Diana Maynard
29
20
0
09 Apr 2023
WikiGoldSK: Annotated Dataset, Baselines and Few-Shot Learning Experiments for Slovak Named Entity Recognition
Dávid Suba
Marek Suppa
Jozef Kubík
Endre Hamerlik
Martin Takáč
32
0
0
08 Apr 2023
Revisiting Automated Prompting: Are We Actually Doing Better?
Yulin Zhou
Yiren Zhao
Ilia Shumailov
Robert D. Mullins
Y. Gal
29
8
0
07 Apr 2023
Sociocultural knowledge is needed for selection of shots in hate speech detection tasks
Antonis Maronikolakis
Abdullatif Köksal
Hinrich Schütze
43
0
0
04 Apr 2023
Learning to Name Classes for Vision and Language Models
Sarah Parisot
Yongxin Yang
Jingyu Sun
VLM
17
10
0
04 Apr 2023
Learning Federated Visual Prompt in Null Space for MRI Reconstruction
Chun-Mei Feng
Bangjun Li
Xinxing Xu
Yong Liu
Huazhu Fu
W. Zuo
FedML
32
42
0
28 Mar 2023
Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning
Vladislav Lialin
Vijeta Deshpande
Anna Rumshisky
45
167
0
28 Mar 2023
Analyzing the Performance of GPT-3.5 and GPT-4 in Grammatical Error Correction
Steven Coyne
Keisuke Sakaguchi
Diana Galván-Sosa
M. Zock
Kentaro Inui
24
12
0
25 Mar 2023
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World
Qifan Yu
Juncheng Li
Yuehua Wu
Siliang Tang
Wei Ji
Yueting Zhuang
30
34
0
23 Mar 2023
Fairness: from the ethical principle to the practice of Machine Learning development as an ongoing agreement with stakeholders
Georgina Curto
F. Comim
FaML
10
1
0
22 Mar 2023
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization
Kaihang Pan
Juncheng Billy Li
Hongye Song
Jun Lin
Xiaozhong Liu
Siliang Tang
OffRL
38
10
0
22 Mar 2023
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
Zheng-Long Liu
Yue Huang
Xiao-Xing Yu
Lu Zhang
Zihao Wu
...
Dinggang Shen
Quanzheng Li
Tianming Liu
Dajiang Zhu
Xiang Li
LM&MA
MedIm
33
171
0
20 Mar 2023
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Renze Lou
Kai Zhang
Wenpeng Yin
ALM
LRM
32
20
0
18 Mar 2023
SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models
Vithursan Thangarasa
Abhay Gupta
William Marshall
Tianda Li
Kevin Leong
D. DeCoste
Sean Lie
Shreyas Saxena
MoE
AI4CE
21
18
0
18 Mar 2023
Previous
1
2
3
4
5
6
...
11
12
13
Next