Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.00751
Cited By
v1
v2 (latest)
Parameter-Efficient Transfer Learning for NLP
2 February 2019
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Parameter-Efficient Transfer Learning for NLP"
50 / 2,860 papers shown
Title
Continual Learning with Evolving Class Ontologies
Zhiqiu Lin
Deepak Pathak
Yu-Xiong Wang
Deva Ramanan
Shu Kong
CLL
83
9
0
10 Oct 2022
Revisiting adapters with adversarial training
Sylvestre-Alvise Rebuffi
Francesco Croce
Sven Gowal
AAML
71
17
0
10 Oct 2022
Hierarchical3D Adapters for Long Video-to-text Summarization
Pinelopi Papalampidi
Mirella Lapata
VGen
101
13
0
10 Oct 2022
SimSCOOD: Systematic Analysis of Out-of-Distribution Generalization in Fine-tuned Source Code Models
Hossein Hajipour
Ning Yu
Cristian-Alexandru Staicu
Mario Fritz
OODD
134
5
0
10 Oct 2022
Exploring Efficient-tuning Methods in Self-supervised Speech Models
Zih-Ching Chen
Chin-Lun Fu
Chih-Ying Liu
Shang-Wen Li
Hung-yi Lee
77
41
0
10 Oct 2022
Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization
Zonghan Yang
Xiaoyuan Yi
Peng Li
Yang Liu
Xing Xie
124
34
0
10 Oct 2022
XPrompt: Exploring the Extreme of Prompt Tuning
Fang Ma
Chen Zhang
Lei Ren
Jingang Wang
Qifan Wang
Wei Wu
Xiaojun Quan
Dawei Song
VLM
158
39
0
10 Oct 2022
Parameter-Efficient Tuning with Special Token Adaptation
Xiaoocong Yang
James Y. Huang
Wenxuan Zhou
Muhao Chen
99
12
0
10 Oct 2022
SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters
Shwai He
Liang Ding
Daize Dong
Miao Zhang
Dacheng Tao
MoE
139
91
0
09 Oct 2022
AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models
S. Kwon
Jeonghoon Kim
Jeongin Bae
Kang Min Yoo
Jin-Hwa Kim
Baeseong Park
Byeongwook Kim
Jung-Woo Ha
Nako Sung
Dongsoo Lee
MQ
125
32
0
08 Oct 2022
SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models
Omiros Pantazis
Gabriel J. Brostow
Kate E. Jones
Oisin Mac Aodha
VLM
84
42
0
07 Oct 2022
PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation
Haoqin Tu
Zhongliang Yang
Jinshuai Yang
Siyu Zhang
Yong Huang
54
7
0
07 Oct 2022
Polyhistor: Parameter-Efficient Multi-Task Adaptation for Dense Vision Tasks
Yen-Cheng Liu
Chih-Yao Ma
Junjiao Tian
Zijian He
Z. Kira
165
52
0
07 Oct 2022
Unsupervised Neural Stylistic Text Generation using Transfer learning and Adapters
Vinayshekhar Bannihatti Kumar
Rashmi Gangadharaiah
Dan Roth
71
1
0
07 Oct 2022
Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition
Somshubra Majumdar
Shantanu Acharya
Vitaly Lavrukhin
Boris Ginsburg
57
3
0
06 Oct 2022
Improving the Sample Efficiency of Prompt Tuning with Domain Adaptation
Xu Guo
Boyang Albert Li
Han Yu
VLM
121
24
0
06 Oct 2022
Reprogramming Pretrained Language Models for Antibody Sequence Infilling
Igor Melnyk
Vijil Chenthamarakshan
Pin-Yu Chen
Payel Das
Amit Dhurandhar
Inkit Padhi
Devleena Das
75
33
0
05 Oct 2022
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
428
1,103
0
05 Oct 2022
Granularity-aware Adaptation for Image Retrieval over Multiple Tasks
Jon Almazán
ByungSoo Ko
Geonmo Gu
Diane Larlus
Yannis Kalantidis
ObjD
VLM
98
7
0
05 Oct 2022
The (In)Effectiveness of Intermediate Task Training For Domain Adaptation and Cross-Lingual Transfer Learning
Sovesh Mohapatra
Somesh Mohapatra
74
0
0
03 Oct 2022
LPT: Long-tailed Prompt Tuning for Image Classification
Bowen Dong
Pan Zhou
Shuicheng Yan
W. Zuo
VPVLM
VLM
186
61
0
03 Oct 2022
Visual Prompt Tuning for Generative Transfer Learning
Kihyuk Sohn
Yuan Hao
José Lezama
Luisa F. Polanía
Huiwen Chang
Han Zhang
Irfan Essa
Lu Jiang
VPVLM
VLM
166
89
0
03 Oct 2022
The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection
Sondre Wold
KELM
62
4
0
03 Oct 2022
Towards a Unified View on Visual Parameter-Efficient Transfer Learning
Bruce X. B. Yu
Jianlong Chang
Lin Liu
Qi Tian
Changan Chen
VPVLM
VLM
115
36
0
03 Oct 2022
Differentially Private Optimization on Large Model at Small Cost
Zhiqi Bu
Yu Wang
Sheng Zha
George Karypis
130
55
0
30 Sep 2022
Differentially Private Bias-Term Fine-tuning of Foundation Models
Zhiqi Bu
Yu Wang
Sheng Zha
George Karypis
146
48
0
30 Sep 2022
Language-Family Adapters for Low-Resource Multilingual Neural Machine Translation
Alexandra Chronopoulou
Dario Stojanovski
Alexander Fraser
163
19
0
30 Sep 2022
Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification
Muhammad N. ElNokrashy
Badr AlKhamissi
Mona T. Diab
MoMe
90
5
0
30 Sep 2022
ConceptNet infused DialoGPT for Underlying Commonsense Understanding and Reasoning in Dialogue Response Generation
Ye Liu
Wolfgang Maier
Wolfgang Minker
Stefan Ultes
89
2
0
29 Sep 2022
A Multiagent Framework for the Asynchronous and Collaborative Extension of Multitask ML Systems
Andrea Gesmundo
104
2
0
29 Sep 2022
An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation
Yuqiao Wen
Yongchang Hao
Yanshuai Cao
Lili Mou
124
10
0
29 Sep 2022
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
ReLM
LRM
228
300
0
29 Sep 2022
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention
Ziyu Guo
Renrui Zhang
Longtian Qiu
Xianzheng Ma
Xupeng Miao
Xuming He
Tengjiao Wang
VLM
AAML
115
119
0
28 Sep 2022
Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video Grounding
Erica K. Shimomoto
Edison Marrese-Taylor
Hiroya Takamura
Ichiro Kobayashi
Hideki Nakayama
Yusuke Miyao
88
7
0
26 Sep 2022
An Empirical Study on Cross-X Transfer for Legal Judgment Prediction
Joel Niklaus
Matthias Sturmer
Ilias Chalkidis
ELM
AILaw
122
20
0
25 Sep 2022
Collaboration of Pre-trained Models Makes Better Few-shot Learner
Renrui Zhang
Bohao Li
Wei Zhang
Hao Dong
Hongsheng Li
Peng Gao
Yu Qiao
VLM
114
7
0
25 Sep 2022
Efficient Few-Shot Learning Without Prompts
Lewis Tunstall
Nils Reimers
Unso Eun Seo Jo
Luke Bates
Daniel Korat
Moshe Wasserblat
Oren Pereg
VLM
95
197
0
22 Sep 2022
EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation Metrics
Daniil Larionov
Jens Grunwald
Christoph Leiter
Steffen Eger
86
5
0
20 Sep 2022
Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models
Zichun Yu
Tianyu Gao
Zhengyan Zhang
Yankai Lin
Zhiyuan Liu
Maosong Sun
Jie Zhou
VLM
LRM
55
1
0
20 Sep 2022
Selective Token Generation for Few-shot Natural Language Generation
DaeJin Jo
Taehwan Kwon
Eun-Sol Kim
Sungwoong Kim
72
1
0
17 Sep 2022
A Continual Development Methodology for Large-scale Multitask Dynamic ML Systems
Andrea Gesmundo
66
18
0
15 Sep 2022
Parameter-Efficient Finetuning for Robust Continual Multilingual Learning
Kartikeya Badola
Shachi Dave
Partha P. Talukdar
CLL
KELM
105
9
0
14 Sep 2022
Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models
Suhyune Son
Chanjun Park
Jungseob Lee
Midan Shim
Chanhee Lee
Yoonna Jang
Jaehyung Seo
Heu-Jeoung Lim
79
0
0
14 Sep 2022
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation
A. Maharana
Darryl Hannan
Joey Tianyi Zhou
DiffM
112
83
0
13 Sep 2022
Every picture tells a story: Image-grounded controllable stylistic story generation
Holy Lovenia
Bryan Wilie
Romain Barraud
Samuel Cahyawijaya
Willy Chung
Pascale Fung
98
8
0
04 Sep 2022
Petals: Collaborative Inference and Fine-tuning of Large Models
Alexander Borzunov
Dmitry Baranchuk
Tim Dettmers
Max Ryabinin
Younes Belkada
Artem Chumachenko
Pavel Samygin
Colin Raffel
VLM
122
67
0
02 Sep 2022
Structural Bias for Aspect Sentiment Triplet Extraction
Chen Zhang
Lei Ren
Fang Ma
Jingang Wang
Wei Wu
Dawei Song
107
11
0
02 Sep 2022
Efficient Methods for Natural Language Processing: A Survey
Marcos Vinícius Treviso
Ji-Ung Lee
Tianchu Ji
Betty van Aken
Qingqing Cao
...
Emma Strubell
Niranjan Balasubramanian
Leon Derczynski
Iryna Gurevych
Roy Schwartz
174
114
0
31 Aug 2022
Continuous QA Learning with Structured Prompts
Yinhe Zheng
CLL
119
1
0
31 Aug 2022
To Adapt or to Fine-tune: A Case Study on Abstractive Summarization
Zheng Zhao
Pinzhen Chen
41
3
0
30 Aug 2022
Previous
1
2
3
...
47
48
49
...
56
57
58
Next