ConPET: Continual Parameter-Efficient Tuning for Large Language Models

ConPET: Continual Parameter-Efficient Tuning for Large Language Models

Papers citing "ConPET: Continual Parameter-Efficient Tuning for Large Language Models"

22 / 22 papers shown
Title
ELLE: Efficient Lifelong Pre-training for Emerging Data
ELLE: Efficient Lifelong Pre-training for Emerging Data
Yujia Qin
Jiajie Zhang
Yankai Lin
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
76
72
0
12 Mar 2022
MoEfication: Transformer Feed-forward Layers are Mixtures of Experts
MoEfication: Transformer Feed-forward Layers are Mixtures of Experts
Zhengyan Zhang
Yankai Lin
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
76
123
0
05 Oct 2021