Parameter-Efficient Transfer Learning for NLP

2 February 2019
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly

Papers citing "Parameter-Efficient Transfer Learning for NLP"

50 / 2,860 papers shown
LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Tuan Dinh
Yuchen Zeng
Ruisu Zhang
Ziqian Lin
Michael Gira
Shashank Rajput
Jy-yong Sohn
Dimitris Papailiopoulos
Kangwook Lee
LMTD
181
140
0
14 Jun 2022
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLM
111
246
0
13 Jun 2022
Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning
Yanpeng Sun
Qiang Chen
Xiangyu He
Jian Wang
Haocheng Feng
Junyu Han
Errui Ding
Jian Cheng
Zechao Li
Jingdong Wang
95
57
0
13 Jun 2022
DeepEmotex: Classifying Emotion in Text Messages using Deep Transfer Learning
Maryam Hasan
Elke A. Rundensteiner
E. Agu
VLM
36
8
0
12 Jun 2022
Neural Prompt Search
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
VPVLM, VLM
110
152
0
09 Jun 2022
Neural Collapse: A Review on Modelling Principles and Generalization
Vignesh Kothapalli
160
82
0
08 Jun 2022
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning
Yu Jin Kim
Beong-woo Kwak
Youngwook Kim
Reinald Kim Amplayo
Seung-won Hwang
Jinyoung Yeo
LRM
66
14
0
08 Jun 2022
Making Large Language Models Better Reasoners with Step-Aware Verifier
Yifei Li
Zeqi Lin
Shizhuo Zhang
Qiang Fu
B. Chen
Jian-Guang Lou
Weizhu Chen
ReLM, LRM
125
230
0
06 Jun 2022
Exploring Cross-lingual Textual Style Transfer with Large Multilingual Language Models
Daniil Moskovskiy
Daryna Dementieva
Alexander Panchenko
73
3
0
05 Jun 2022
Instance-wise Prompt Tuning for Pretrained Language Models
Yuezihan Jiang
Hao Yang
Junyang Lin
Hanyu Zhao
An Yang
Chang Zhou
Hongxia Yang
Zhi-Xin Yang
Tengjiao Wang
VLM
65
7
0
04 Jun 2022
Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Virginia Adams
Sandeep Subramanian
Mike Chrzanowski
Oleksii Hrinchuk
Oleksii Kuchaiev
68
2
0
02 Jun 2022
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training
Yan Zeng
Wangchunshu Zhou
Ao Luo
Ziming Cheng
Xinsong Zhang
VLM
110
32
0
01 Jun 2022
A Cross-City Federated Transfer Learning Framework: A Case Study on Urban Region Profiling
Gaode Chen
Yijun Su
Xinghua Zhang
Anmin Hu
Guochun Chen
Siyuan Feng
Jinlin Xiang
Junbo Zhang
Yu Zheng
130
6
0
31 May 2022
Modular and On-demand Bias Mitigation with Attribute-Removal Subnetworks
Lukas Hauzenberger
Shahed Masoudian
Deepak Kumar
Markus Schedl
Navid Rekabsaz
95
18
0
30 May 2022
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition
Liang Zhang
Anwen Hu
Qin Jin
VLM
52
5
0
29 May 2022
Parameter-Efficient and Student-Friendly Knowledge Distillation
Jun Rao
Xv Meng
Liang Ding
Shuhan Qi
Dacheng Tao
99
51
0
28 May 2022
Can Foundation Models Help Us Achieve Perfect Secrecy?
Simran Arora
Christopher Ré
FedML
92
8
0
27 May 2022
Contextual Adapters for Personalized Speech Recognition in Neural Transducers
Kanthashree Mysore Sathyendra
Thejaswi Muniyappa
Feng-Ju Chang
Jing Liu
Jinru Su
Grant P. Strimel
Athanasios Mouchtaris
Siegfried Kunzmann
85
79
0
26 May 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Shoufa Chen
Chongjian Ge
Zhan Tong
Jiangliu Wang
Yibing Song
Jue Wang
Ping Luo
259
706
0
26 May 2022
An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems
Andrea Gesmundo
J. Dean
184
24
0
25 May 2022
Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts
Qinyuan Ye
Juan Zha
Xiang Ren
MoE
90
14
0
25 May 2022
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation
Tu Vu
Aditya Barua
Brian Lester
Daniel Cer
Mohit Iyyer
Noah Constant
CLL
108
66
0
25 May 2022
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
Mingkai Deng
Jianyu Wang
Cheng-Ping Hsieh
Yihan Wang
Han Guo
Tianmin Shu
Meng Song
Eric Xing
Zhiting Hu
99
345
0
25 May 2022
Memorization in NLP Fine-tuning Methods
Fatemehsadat Mireshghallah
Archit Uniyal
Tianhao Wang
David Evans
Taylor Berg-Kirkpatrick
AAML
146
43
0
25 May 2022
Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-Tuning
Mozhdeh Gheini
Xuezhe Ma
Jonathan May
101
5
0
25 May 2022
Adaptive multilingual speech recognition with pretrained models
Ngoc-Quan Pham
A. Waibel
Jan Niehues
VLM
72
23
0
24 May 2022
Enhancing Continual Learning with Global Prototypes: Counteracting Negative Representation Drift
Xueying Bai
Jinghuan Shang
Yifan Sun
Niranjan Balasubramanian
CLL
88
1
0
24 May 2022
Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer
Ahmet Üstün
Arianna Bisazza
G. Bouma
Gertjan van Noord
Sebastian Ruder
131
33
0
24 May 2022
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts
Akari Asai
Mohammadreza Salehi
Matthew E. Peters
Hannaneh Hajishirzi
219
102
0
24 May 2022
Representation Projection Invariance Mitigates Representation Collapse
Anastasia Razdaibiedina
A. Khetan
Zohar Karnin
Daniel Khashabi
Vishaal Kapoor
V. Madan
104
5
0
23 May 2022
When does Parameter-Efficient Transfer Learning Work for Machine Translation?
Ahmet Üstün
Asa Cooper Stickland
97
7
0
23 May 2022
Stop Filtering: Multi-View Attribute-Enhanced Dialogue Learning
Yiwei Li
Bin Sun
Shaoxiong Feng
Kan Li
63
3
0
23 May 2022
BBTv2: Towards a Gradient-Free Future with Large Language Models
Tianxiang Sun
Zhengfu He
Hong Qian
Yunhua Zhou
Xuanjing Huang
Xipeng Qiu
155
57
0
23 May 2022
Supporting Vision-Language Model Inference with Confounder-pruning Knowledge Prompt
Jiangmeng Li
Wenyi Mo
Jingyao Wang
Fuchun Sun
Changwen Zheng
Hui Xiong
Ji-Rong Wen
VLM
93
0
0
23 May 2022
Vector-Quantized Input-Contextualized Soft Prompts for Natural Language Understanding
Rishabh Bhardwaj
Amrita Saha
Guosheng Lin
Soujanya Poria
VLM, VPVLM
55
7
0
23 May 2022
Parameter-Efficient Sparsity for Large Language Models Fine-Tuning
Yuchao Li
Fuli Luo
Chuanqi Tan
Mengdi Wang
Songfang Huang
Shen Li
Junjie Bai
MQ
121
34
0
23 May 2022
muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems
Andrea Gesmundo
J. Dean
113
19
0
22 May 2022
Multilingual Machine Translation with Hyper-Adapters
Christos Baziotis
Mikel Artetxe
James Cross
Shruti Bhosale
127
23
0
22 May 2022
All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass
Jiaxin Huang
Tianqi Liu
Jialu Liu
Á. Lelkes
Cong Yu
Jiawei Han
63
1
0
22 May 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL, AI4TS
297
368
0
21 May 2022
FedAdapter: Efficient Federated Learning for Modern NLP
Dongqi Cai
Yaozong Wu
Shangguang Wang
F. Lin
Mengwei Xu
FedML, AI4CE
74
23
0
20 May 2022
Can Foundation Models Wrangle Your Data?
A. Narayan
Ines Chami
Laurel J. Orr
Simran Arora
Christopher Ré
LMTD, AI4CE
247
231
0
20 May 2022
Phylogeny-Inspired Adaptation of Multilingual Models to New Languages
Fahim Faisal
Antonios Anastasopoulos
AI4CE, LRM
99
27
0
19 May 2022
Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters
Yang Xiang
Zhihua Wu
Weibao Gong
Siyu Ding
Xianjie Mo
...
Yue Yu
Ge Li
Yu Sun
Yanjun Ma
Dianhai Yu
75
5
0
19 May 2022
Vision Transformer Adapter for Dense Predictions
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
182
572
0
17 May 2022
Classification of Astronomical Bodies by Efficient Layer Fine-Tuning of Deep Neural Networks
Sabeesh Ethiraj
B. Bolla
94
8
0
14 May 2022
Unified Modeling of Multi-Domain Multi-Device ASR Systems
Soumyajit Mitra
Swayambhu Nath Ray
Bharat Padi
Arunasish Sen
Raghavendra Bilgi
Harish Arsikere
Shalini Ghosh
A. Srinivasamurthy
Sri Garimella
69
3
0
13 May 2022
Lifting the Curse of Multilinguality by Pre-training Modular Transformers
Jonas Pfeiffer
Naman Goyal
Xi Lin
Xian Li
James Cross
Sebastian Riedel
Mikel Artetxe
LRM
113
146
0
12 May 2022
AdaVAE: Exploring Adaptive GPT-2s in Variational Auto-Encoders for Language Modeling
Haoqin Tu
Zhongliang Yang
Jinshuai Yang
Yong Huang
42
12
0
12 May 2022
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
Haokun Liu
Derek Tam
Mohammed Muqeeth
Jay Mohta
Tenghao Huang
Joey Tianyi Zhou
Colin Raffel
132
944
0
11 May 2022