1902.00751
Parameter-Efficient Transfer Learning for NLP
2 February 2019
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
Papers citing "Parameter-Efficient Transfer Learning for NLP"
50 / 2,860 papers shown
LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Tuan Dinh
Yuchen Zeng
Ruisu Zhang
Ziqian Lin
Michael Gira
Shashank Rajput
Jy-yong Sohn
Dimitris Papailiopoulos
Kangwook Lee
LMTD
181
140
0
14 Jun 2022
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
Yi-Lin Sung
Jaemin Cho
Mohit Bansal
VLM
111
246
0
13 Jun 2022
Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning
Yanpeng Sun
Qiang Chen
Xiangyu He
Jian Wang
Haocheng Feng
Junyu Han
Errui Ding
Jian Cheng
Zechao Li
Jingdong Wang
95
57
0
13 Jun 2022
DeepEmotex: Classifying Emotion in Text Messages using Deep Transfer Learning
Maryam Hasan
Elke A. Rundensteiner
E. Agu
VLM
36
8
0
12 Jun 2022
Neural Prompt Search
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
VPVLM
VLM
110
152
0
09 Jun 2022
Neural Collapse: A Review on Modelling Principles and Generalization
Vignesh Kothapalli
160
82
0
08 Jun 2022
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning
Yu Jin Kim
Beong-woo Kwak
Youngwook Kim
Reinald Kim Amplayo
Seung-won Hwang
Jinyoung Yeo
LRM
66
14
0
08 Jun 2022
Making Large Language Models Better Reasoners with Step-Aware Verifier
Yifei Li
Zeqi Lin
Shizhuo Zhang
Qiang Fu
B. Chen
Jian-Guang Lou
Weizhu Chen
ReLM
LRM
125
230
0
06 Jun 2022
Exploring Cross-lingual Textual Style Transfer with Large Multilingual Language Models
Daniil Moskovskiy
Daryna Dementieva
Alexander Panchenko
73
3
0
05 Jun 2022
Instance-wise Prompt Tuning for Pretrained Language Models
Yuezihan Jiang
Hao Yang
Junyang Lin
Hanyu Zhao
An Yang
Chang Zhou
Hongxia Yang
Zhi-Xin Yang
Tengjiao Wang
VLM
65
7
0
04 Jun 2022
Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Virginia Adams
Sandeep Subramanian
Mike Chrzanowski
Oleksii Hrinchuk
Oleksii Kuchaiev
68
2
0
02 Jun 2022
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training
Yan Zeng
Wangchunshu Zhou
Ao Luo
Ziming Cheng
Xinsong Zhang
VLM
110
32
0
01 Jun 2022
A Cross-City Federated Transfer Learning Framework: A Case Study on Urban Region Profiling
Gaode Chen
Yijun Su
Xinghua Zhang
Anmin Hu
Guochun Chen
Siyuan Feng
Jinlin Xiang
Junbo Zhang
Yu Zheng
130
6
0
31 May 2022
Modular and On-demand Bias Mitigation with Attribute-Removal Subnetworks
Lukas Hauzenberger
Shahed Masoudian
Deepak Kumar
Markus Schedl
Navid Rekabsaz
95
18
0
30 May 2022
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition
Liang Zhang
Anwen Hu
Qin Jin
VLM
52
5
0
29 May 2022
Parameter-Efficient and Student-Friendly Knowledge Distillation
Jun Rao
Xv Meng
Liang Ding
Shuhan Qi
Dacheng Tao
99
51
0
28 May 2022
Can Foundation Models Help Us Achieve Perfect Secrecy?
Simran Arora
Christopher Ré
FedML
92
8
0
27 May 2022
Contextual Adapters for Personalized Speech Recognition in Neural Transducers
Kanthashree Mysore Sathyendra
Thejaswi Muniyappa
Feng-Ju Chang
Jing Liu
Jinru Su
Grant P. Strimel
Athanasios Mouchtaris
Siegfried Kunzmann
85
79
0
26 May 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Shoufa Chen
Chongjian Ge
Zhan Tong
Jiangliu Wang
Yibing Song
Jue Wang
Ping Luo
259
706
0
26 May 2022
An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems
Andrea Gesmundo
J. Dean
184
24
0
25 May 2022
Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts
Qinyuan Ye
Juan Zha
Xiang Ren
MoE
90
14
0
25 May 2022
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation
Tu Vu
Aditya Barua
Brian Lester
Daniel Cer
Mohit Iyyer
Noah Constant
CLL
108
66
0
25 May 2022
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
Mingkai Deng
Jianyu Wang
Cheng-Ping Hsieh
Yihan Wang
Han Guo
Tianmin Shu
Meng Song
Eric Xing
Zhiting Hu
99
345
0
25 May 2022
Memorization in NLP Fine-tuning Methods
Fatemehsadat Mireshghallah
Archit Uniyal
Tianhao Wang
David Evans
Taylor Berg-Kirkpatrick
AAML
146
43
0
25 May 2022
Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-Tuning
Mozhdeh Gheini
Xuezhe Ma
Jonathan May
101
5
0
25 May 2022
Adaptive multilingual speech recognition with pretrained models
Ngoc-Quan Pham
A. Waibel
Jan Niehues
VLM
72
23
0
24 May 2022
Enhancing Continual Learning with Global Prototypes: Counteracting Negative Representation Drift
Xueying Bai
Jinghuan Shang
Yifan Sun
Niranjan Balasubramanian
CLL
88
1
0
24 May 2022
Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer
Ahmet Üstün
Arianna Bisazza
G. Bouma
Gertjan van Noord
Sebastian Ruder
131
33
0
24 May 2022
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts
Akari Asai
Mohammadreza Salehi
Matthew E. Peters
Hannaneh Hajishirzi
219
102
0
24 May 2022
Representation Projection Invariance Mitigates Representation Collapse
Anastasia Razdaibiedina
A. Khetan
Zohar Karnin
Daniel Khashabi
Vishaal Kapoor
V. Madan
104
5
0
23 May 2022
When does Parameter-Efficient Transfer Learning Work for Machine Translation?
Ahmet Üstün
Asa Cooper Stickland
97
7
0
23 May 2022
Stop Filtering: Multi-View Attribute-Enhanced Dialogue Learning
Yiwei Li
Bin Sun
Shaoxiong Feng
Kan Li
63
3
0
23 May 2022
BBTv2: Towards a Gradient-Free Future with Large Language Models
Tianxiang Sun
Zhengfu He
Hong Qian
Yunhua Zhou
Xuanjing Huang
Xipeng Qiu
155
57
0
23 May 2022
Supporting Vision-Language Model Inference with Confounder-pruning Knowledge Prompt
Jiangmeng Li
Wenyi Mo
Jingyao Wang
Fuchun Sun
Changwen Zheng
Hui Xiong
Ji-Rong Wen
VLM
93
0
0
23 May 2022
Vector-Quantized Input-Contextualized Soft Prompts for Natural Language Understanding
Rishabh Bhardwaj
Amrita Saha
Guosheng Lin
Soujanya Poria
VLM
VPVLM
55
7
0
23 May 2022
Parameter-Efficient Sparsity for Large Language Models Fine-Tuning
Yuchao Li
Fuli Luo
Chuanqi Tan
Mengdi Wang
Songfang Huang
Shen Li
Junjie Bai
MQ
121
34
0
23 May 2022
muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems
Andrea Gesmundo
J. Dean
113
19
0
22 May 2022
Multilingual Machine Translation with Hyper-Adapters
Christos Baziotis
Mikel Artetxe
James Cross
Shruti Bhosale
127
23
0
22 May 2022
All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass
Jiaxin Huang
Tianqi Liu
Jialu Liu
Á. Lelkes
Cong Yu
Jiawei Han
63
1
0
22 May 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
297
368
0
21 May 2022
FedAdapter: Efficient Federated Learning for Modern NLP
Dongqi Cai
Yaozong Wu
Shangguang Wang
F. Lin
Mengwei Xu
FedML
AI4CE
74
23
0
20 May 2022
Can Foundation Models Wrangle Your Data?
A. Narayan
Ines Chami
Laurel J. Orr
Simran Arora
Christopher Ré
LMTD
AI4CE
247
231
0
20 May 2022
Phylogeny-Inspired Adaptation of Multilingual Models to New Languages
Fahim Faisal
Antonios Anastasopoulos
AI4CE
LRM
99
27
0
19 May 2022
Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Yang Xiang
Zhihua Wu
Weibao Gong
Siyu Ding
Xianjie Mo
...
Yue Yu
Ge Li
Yu Sun
Yanjun Ma
Dianhai Yu
75
5
0
19 May 2022
Vision Transformer Adapter for Dense Predictions
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
182
572
0
17 May 2022
Classification of Astronomical Bodies by Efficient Layer Fine-Tuning of Deep Neural Networks
Sabeesh Ethiraj
B. Bolla
94
8
0
14 May 2022
Unified Modeling of Multi-Domain Multi-Device ASR Systems
Soumyajit Mitra
Swayambhu Nath Ray
Bharat Padi
Arunasish Sen
Raghavendra Bilgi
Harish Arsikere
Shalini Ghosh
A. Srinivasamurthy
Sri Garimella
69
3
0
13 May 2022
Lifting the Curse of Multilinguality by Pre-training Modular Transformers
Jonas Pfeiffer
Naman Goyal
Xi Lin
Xian Li
James Cross
Sebastian Riedel
Mikel Artetxe
LRM
113
146
0
12 May 2022
AdaVAE: Exploring Adaptive GPT-2s in Variational Auto-Encoders for Language Modeling
Haoqin Tu
Zhongliang Yang
Jinshuai Yang
Yong Huang
42
12
0
12 May 2022
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
Haokun Liu
Derek Tam
Mohammed Muqeeth
Jay Mohta
Tenghao Huang
Mohit Bansal
Colin Raffel
132
944
0
11 May 2022