ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.05874
  4. Cited By
Gradient Vaccine: Investigating and Improving Multi-task Optimization in
  Massively Multilingual Models

Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models

12 October 2020
Zirui Wang
Yulia Tsvetkov
Orhan Firat
Yuan Cao
ArXivPDFHTML

Papers citing "Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models"

28 / 128 papers shown
Title
Combining Modular Skills in Multitask Learning
Combining Modular Skills in Multitask Learning
E. Ponti
Alessandro Sordoni
Yoshua Bengio
Siva Reddy
MoE
12
37
0
28 Feb 2022
Structured Multi-task Learning for Molecular Property Prediction
Structured Multi-task Learning for Molecular Property Prediction
Shengchao Liu
Meng Qu
Zuobai Zhang
Huiyu Cai
Jian Tang
20
24
0
22 Feb 2022
mSLAM: Massively multilingual joint pre-training for speech and text
mSLAM: Massively multilingual joint pre-training for speech and text
Ankur Bapna
Colin Cherry
Yu Zhang
Ye Jia
Melvin Johnson
Yong Cheng
Simran Khanuja
Jason Riesa
Alexis Conneau
VLM
30
111
0
03 Feb 2022
Multi-Task Learning as a Bargaining Game
Multi-Task Learning as a Bargaining Game
Aviv Navon
Aviv Shamsian
Idan Achituve
Haggai Maron
Kenji Kawaguchi
Gal Chechik
Ethan Fetaya
25
140
0
02 Feb 2022
In Defense of the Unitary Scalarization for Deep Multi-Task Learning
In Defense of the Unitary Scalarization for Deep Multi-Task Learning
Vitaly Kurin
Alessandro De Palma
Ilya Kostrikov
Shimon Whiteson
M. P. Kumar
39
74
0
11 Jan 2022
Parameter Differentiation based Multilingual Neural Machine Translation
Parameter Differentiation based Multilingual Neural Machine Translation
Qian Wang
Jiajun Zhang
27
17
0
27 Dec 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
V. Aribandi
Yi Tay
Tal Schuster
J. Rao
H. Zheng
...
Jianmo Ni
Jai Gupta
Kai Hui
Sebastian Ruder
Donald Metzler
MoE
23
213
0
22 Nov 2021
Reasonable Effectiveness of Random Weighting: A Litmus Test for
  Multi-Task Learning
Reasonable Effectiveness of Random Weighting: A Litmus Test for Multi-Task Learning
Baijiong Lin
Feiyang Ye
Yu Zhang
Ivor W. Tsang
42
93
0
20 Nov 2021
An Empirical Study of Training End-to-End Vision-and-Language
  Transformers
An Empirical Study of Training End-to-End Vision-and-Language Transformers
Zi-Yi Dou
Yichong Xu
Zhe Gan
Jianfeng Wang
Shuohang Wang
...
Pengchuan Zhang
Lu Yuan
Nanyun Peng
Zicheng Liu
Michael Zeng
VLM
32
369
0
03 Nov 2021
Speech Representation Learning Through Self-supervised Pretraining And
  Multi-task Finetuning
Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning
Yi-Chen Chen
Shu-Wen Yang
Cheng-Kuang Lee
Simon See
Hung-yi Lee
SSL
19
12
0
18 Oct 2021
Deep Transfer Learning & Beyond: Transformer Language Models in
  Information Systems Research
Deep Transfer Learning & Beyond: Transformer Language Models in Information Systems Research
Ross Gruetzemacher
D. Paradice
25
30
0
18 Oct 2021
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation
  with Multi-Armed Bandits
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits
Julia Kreutzer
David Vilar
Artem Sokolov
49
15
0
13 Oct 2021
Balancing Average and Worst-case Accuracy in Multitask Learning
Balancing Average and Worst-case Accuracy in Multitask Learning
Paul Michel
Sebastian Ruder
Dani Yogatama
21
11
0
12 Oct 2021
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual
  Learning
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning
Seanie Lee
Haebeom Lee
Juho Lee
Sung Ju Hwang
MoMe
CLL
45
16
0
06 Oct 2021
Multi-Task Learning in Natural Language Processing: An Overview
Multi-Task Learning in Natural Language Processing: An Overview
Shijie Chen
Yu Zhang
Qiang Yang
AIMat
41
99
0
19 Sep 2021
A Conditional Generative Matching Model for Multi-lingual Reply
  Suggestion
A Conditional Generative Matching Model for Multi-lingual Reply Suggestion
Budhaditya Deb
Guoqing Zheng
Milad Shokouhi
Ahmed Hassan Awadallah
31
1
0
15 Sep 2021
Improving Multilingual Translation by Representation and Gradient
  Regularization
Improving Multilingual Translation by Representation and Gradient Regularization
Yilin Yang
Akiko Eriguchi
Alexandre Muzio
Prasad Tadepalli
Stefan Lee
Hany Hassan
47
40
0
10 Sep 2021
Efficiently Identifying Task Groupings for Multi-Task Learning
Efficiently Identifying Task Groupings for Multi-Task Learning
Christopher Fifty
Ehsan Amid
Zhe Zhao
Tianhe Yu
Rohan Anil
Chelsea Finn
213
238
1
10 Sep 2021
Distributionally Robust Multilingual Machine Translation
Distributionally Robust Multilingual Machine Translation
Chunting Zhou
Daniel Levy
Xian Li
Marjan Ghazvininejad
Graham Neubig
73
24
0
09 Sep 2021
Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural
  Machine Translation Training
Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural Machine Translation Training
Minghao Wu
Yitong Li
Meng Zhang
Liangyou Li
Gholamreza Haffari
Qun Liu
34
22
0
06 Sep 2021
Domain Generalization via Gradient Surgery
Domain Generalization via Gradient Surgery
Lucas Mansilla
Rodrigo Echeveste
Diego H. Milone
Enzo Ferrante
OOD
24
78
0
03 Aug 2021
Scaling End-to-End Models for Large-Scale Multilingual ASR
Scaling End-to-End Models for Large-Scale Multilingual ASR
Bo-wen Li
Ruoming Pang
Tara N. Sainath
Anmol Gulati
Yu Zhang
James Qin
Parisa Haghani
Yifan Jiang
Min Ma
Junwen Bai
CLL
34
76
0
30 Apr 2021
Adaptive Sparse Transformer for Multilingual Translation
Adaptive Sparse Transformer for Multilingual Translation
Hongyu Gong
Xian Li
Dmitriy Genzel
20
14
0
15 Apr 2021
RotoGrad: Gradient Homogenization in Multitask Learning
RotoGrad: Gradient Homogenization in Multitask Learning
Adrián Javaloy
Isabel Valera
21
86
0
03 Mar 2021
Measuring and Harnessing Transference in Multi-Task Learning
Measuring and Harnessing Transference in Multi-Task Learning
Christopher Fifty
Ehsan Amid
Zhe Zhao
Tianhe Yu
Rohan Anil
Chelsea Finn
28
15
0
29 Oct 2020
A Survey on Negative Transfer
A Survey on Negative Transfer
Wen Zhang
Lingfei Deng
Lei Zhang
Dongrui Wu
AAML
27
205
0
02 Sep 2020
Investigating Multilingual NMT Representations at Scale
Investigating Multilingual NMT Representations at Scale
Sneha Kudugunta
Ankur Bapna
Isaac Caswell
N. Arivazhagan
Orhan Firat
LRM
144
120
0
05 Sep 2019
Multi-Way, Multilingual Neural Machine Translation with a Shared
  Attention Mechanism
Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism
Orhan Firat
Kyunghyun Cho
Yoshua Bengio
LRM
AIMat
231
623
0
06 Jan 2016
Previous
123