ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.06748
  4. Cited By
Balancing Training for Multilingual Neural Machine Translation

Balancing Training for Multilingual Neural Machine Translation

14 April 2020
Xinyi Wang
Yulia Tsvetkov
Graham Neubig
ArXivPDFHTML

Papers citing "Balancing Training for Multilingual Neural Machine Translation"

50 / 70 papers shown
Title
HBO: Hierarchical Balancing Optimization for Fine-Tuning Large Language Models
HBO: Hierarchical Balancing Optimization for Fine-Tuning Large Language Models
Weixuan Wang
Minghao Wu
Barry Haddow
Alexandra Birch
14
0
0
18 May 2025
DRPruning: Efficient Large Language Model Pruning through
  Distributionally Robust Optimization
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
VLM
85
0
0
21 Nov 2024
What is Wrong with Perplexity for Long-context Language Modeling?
What is Wrong with Perplexity for Long-context Language Modeling?
Lizhe Fang
Yifei Wang
Zhaoyang Liu
Chenheng Zhang
Stefanie Jegelka
Jinyang Gao
Bolin Ding
Yisen Wang
69
6
0
31 Oct 2024
Optimizing the Training Schedule of Multilingual NMT using Reinforcement
  Learning
Optimizing the Training Schedule of Multilingual NMT using Reinforcement Learning
Alexis Allemann
Àlex R. Atrio
Andrei Popescu-Belis
36
0
0
08 Oct 2024
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
Tianjian Li
Haoran Xu
Weiting Tan
Kenton Murray
Daniel Khashabi
35
1
0
06 Oct 2024
Can the Variation of Model Weights be used as a Criterion for Self-Paced
  Multilingual NMT?
Can the Variation of Model Weights be used as a Criterion for Self-Paced Multilingual NMT?
Àlex R. Atrio
Alexis Allemann
Ljiljana Dolamic
Andrei Popescu-Belis
43
1
0
05 Oct 2024
NLIP_Lab-IITH Low-Resource MT System for WMT24 Indic MT Shared Task
NLIP_Lab-IITH Low-Resource MT System for WMT24 Indic MT Shared Task
Pramit Sahoo
Maharaj Brahma
Maunendra Sankar Desarkar
36
0
0
04 Oct 2024
Can Optimization Trajectories Explain Multi-Task Transfer?
Can Optimization Trajectories Explain Multi-Task Transfer?
David Mueller
Mark Dredze
Matthew Wiesner
63
1
0
26 Aug 2024
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large
  Language Models
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
Minghao Wu
Thuy-Trang Vu
Lizhen Qu
Gholamreza Haffari
31
5
0
13 Jun 2024
To Label or Not to Label: Hybrid Active Learning for Neural Machine
  Translation
To Label or Not to Label: Hybrid Active Learning for Neural Machine Translation
Abdul Hameed Azeemi
I. Qazi
Agha Ali Raza
AI4CE
23
2
0
14 Mar 2024
Aya Model: An Instruction Finetuned Open-Access Multilingual Language
  Model
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
Ahmet Üstün
Viraat Aryabumi
Zheng-Xin Yong
Wei-Yin Ko
Daniel D'souza
...
Shayne Longpre
Niklas Muennighoff
Marzieh Fadaee
Julia Kreutzer
Sara Hooker
ALM
ELM
SyDa
LRM
37
200
0
12 Feb 2024
Order Matters in the Presence of Dataset Imbalance for Multilingual
  Learning
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
Dami Choi
Derrick Xin
Hamid Dadkhahi
Justin Gilmer
Ankush Garg
Orhan Firat
Chih-Kuan Yeh
Andrew M. Dai
Behrooz Ghorbani
55
3
0
11 Dec 2023
Error Norm Truncation: Robust Training in the Presence of Data Noise for
  Text Generation Models
Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models
Tianjian Li
Haoran Xu
Philipp Koehn
Daniel Khashabi
Kenton W. Murray
38
4
0
02 Oct 2023
Neural Machine Translation for the Indigenous Languages of the Americas:
  An Introduction
Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction
Manuel Mager
Rajat Bhatnagar
Graham Neubig
Ngoc Thang Vu
Katharina Kann
30
10
0
11 Jun 2023
Towards Higher Pareto Frontier in Multilingual Machine Translation
Towards Higher Pareto Frontier in Multilingual Machine Translation
Yi-Chong Huang
Xiaocheng Feng
Xinwei Geng
Baohang Li
Bing Qin
43
9
0
25 May 2023
LIMIT: Language Identification, Misidentification, and Translation using
  Hierarchical Models in 350+ Languages
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages
M. Agarwal
Md Mahfuz Ibn Alam
Antonios Anastasopoulos
38
5
0
23 May 2023
A Pretrainer's Guide to Training Data: Measuring the Effects of Data
  Age, Domain Coverage, Quality, & Toxicity
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Shayne Longpre
Gregory Yauney
Emily Reif
Katherine Lee
Adam Roberts
...
Denny Zhou
Jason W. Wei
Kevin Robinson
David M. Mimno
Daphne Ippolito
31
150
0
22 May 2023
RECKONING: Reasoning through Dynamic Knowledge Encoding
RECKONING: Reasoning through Dynamic Knowledge Encoding
Zeming Chen
Gail Weiss
E. Mitchell
Asli Celikyilmaz
Antoine Bosselut
KELM
LRM
35
12
0
10 May 2023
Learning Language-Specific Layers for Multilingual Machine Translation
Learning Language-Specific Layers for Multilingual Machine Translation
Telmo Pires
Robin M. Schmidt
Yi-Hsiu Liao
Stephan Peitz
52
17
0
04 May 2023
UniMax: Fairer and more Effective Language Sampling for Large-Scale
  Multilingual Pretraining
UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Hyung Won Chung
Noah Constant
Xavier Garcia
Adam Roberts
Yi Tay
Sharan Narang
Orhan Firat
31
51
0
18 Apr 2023
On the Pareto Front of Multilingual Neural Machine Translation
On the Pareto Front of Multilingual Neural Machine Translation
Liang Chen
Shuming Ma
Dongdong Zhang
Furu Wei
Baobao Chang
MoE
23
5
0
06 Apr 2023
Towards Reliable Neural Machine Translation with Consistency-Aware
  Meta-Learning
Towards Reliable Neural Machine Translation with Consistency-Aware Meta-Learning
Rongxiang Weng
Qiang Wang
Wensen Cheng
Changfeng Zhu
Min Zhang
32
2
0
20 Mar 2023
Scaling Laws for Multilingual Neural Machine Translation
Scaling Laws for Multilingual Neural Machine Translation
Patrick Fernandes
Behrooz Ghorbani
Xavier Garcia
Markus Freitag
Orhan Firat
49
29
0
19 Feb 2023
Measuring The Impact Of Programming Language Distribution
Measuring The Impact Of Programming Language Distribution
Gabriel Orlanski
Kefan Xiao
Xavier Garcia
Jeffrey Hui
Joshua Howland
J. Malmaud
Jacob Austin
Rishah Singh
Michele Catasta
30
28
0
03 Feb 2023
Causes and Cures for Interference in Multilingual Translation
Causes and Cures for Interference in Multilingual Translation
Uri Shaham
Maha Elbayad
Vedanuj Goswami
Omer Levy
Shruti Bhosale
23
26
0
14 Dec 2022
Domain Curricula for Code-Switched MT at MixMT 2022
Domain Curricula for Code-Switched MT at MixMT 2022
Lekan Raheem
Maab Elrashid
34
1
0
31 Oct 2022
Forging Multiple Training Objectives for Pre-trained Language Models via
  Meta-Learning
Forging Multiple Training Objectives for Pre-trained Language Models via Meta-Learning
Hongqiu Wu
Ruixue Ding
Haizhen Zhao
Boli Chen
Pengjun Xie
Fei Huang
Min Zhang
MoMe
32
8
0
19 Oct 2022
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale
  African Languages
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African Languages
Wenxiang Jiao
Zhaopeng Tu
Jiarui Li
Wenxuan Wang
Jen-tse Huang
Shuming Shi
54
15
0
18 Oct 2022
You Can Have Your Data and Balance It Too: Towards Balanced and
  Efficient Multilingual Models
You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models
Tomasz Limisiewicz
Daniel Malkin
Gabriel Stanovsky
24
4
0
13 Oct 2022
Language Tokens: A Frustratingly Simple Approach Improves Zero-Shot
  Performance of Multilingual Translation
Language Tokens: A Frustratingly Simple Approach Improves Zero-Shot Performance of Multilingual Translation
Muhammad N. ElNokrashy
Amr Hendy
Mohamed Maher
Mohamed Afify
Hany Awadalla
25
2
0
11 Aug 2022
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional
  MoEs
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs
Jinguo Zhu
Xizhou Zhu
Wenhai Wang
Xiaohua Wang
Hongsheng Li
Xiaogang Wang
Jifeng Dai
MoMe
MoE
39
66
0
09 Jun 2022
Multilingual Neural Machine Translation with Deep Encoder and Multiple
  Shallow Decoders
Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow Decoders
Xiang Kong
Adithya Renduchintala
James Cross
Yuqing Tang
Jiatao Gu
Xian Li
31
32
0
05 Jun 2022
Unifying the Convergences in Multilingual Neural Machine Translation
Unifying the Convergences in Multilingual Neural Machine Translation
Yi-Chong Huang
Xiaocheng Feng
Xinwei Geng
Bing Qin
36
6
0
03 May 2022
Meta Learning for Natural Language Processing: A Survey
Meta Learning for Natural Language Processing: A Survey
Hung-yi Lee
Shang-Wen Li
Ngoc Thang Vu
57
42
0
03 May 2022
Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient
  Optimization in Few-Shot Cross-Lingual Transfer
Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer
Haoran Xu
Kenton W. Murray
32
12
0
29 Apr 2022
PAEG: Phrase-level Adversarial Example Generation for Neural Machine
  Translation
PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation
Juncheng Wan
Jian Yang
Shuming Ma
Dongdong Zhang
Weinan Zhang
Yong Yu
Zhoujun Li
SILM
AAML
24
5
0
06 Jan 2022
Multilingual Machine Translation Systems from Microsoft for WMT21 Shared
  Task
Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task
Jian Yang
Shuming Ma
Haoyang Huang
Dongdong Zhang
Li Dong
...
Alexandre Muzio
Saksham Singhal
Hany Awadalla
Xia Song
Furu Wei
35
45
0
03 Nov 2021
Tricks for Training Sparse Translation Models
Tricks for Training Sparse Translation Models
Dheeru Dua
Shruti Bhosale
Vedanuj Goswami
James Cross
M. Lewis
Angela Fan
MoE
150
19
0
15 Oct 2021
Multilingual Neural Machine Translation:Can Linguistic Hierarchies Help?
Multilingual Neural Machine Translation:Can Linguistic Hierarchies Help?
Fahimeh Saleh
Wray Buntine
Gholamreza Haffari
Lan Du
26
6
0
15 Oct 2021
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation
  with Multi-Armed Bandits
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits
Julia Kreutzer
David Vilar
Artem Sokolov
49
15
0
13 Oct 2021
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual
  Learning
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning
Seanie Lee
Haebeom Lee
Juho Lee
Sung Ju Hwang
MoMe
CLL
53
17
0
06 Oct 2021
Improving Multilingual Translation by Representation and Gradient
  Regularization
Improving Multilingual Translation by Representation and Gradient Regularization
Yilin Yang
Akiko Eriguchi
Alexandre Muzio
Prasad Tadepalli
Stefan Lee
Hany Hassan
47
41
0
10 Sep 2021
Distributionally Robust Multilingual Machine Translation
Distributionally Robust Multilingual Machine Translation
Chunting Zhou
Daniel Levy
Xian Li
Marjan Ghazvininejad
Graham Neubig
83
24
0
09 Sep 2021
Competence-based Curriculum Learning for Multilingual Machine
  Translation
Competence-based Curriculum Learning for Multilingual Machine Translation
Mingliang Zhang
Fandong Meng
Y. Tong
Jie Zhou
39
16
0
09 Sep 2021
Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural
  Machine Translation Training
Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural Machine Translation Training
Minghao Wu
Yitong Li
Meng Zhang
Liangyou Li
Gholamreza Haffari
Qun Liu
34
22
0
06 Sep 2021
Turning Tables: Generating Examples from Semi-structured Tables for
  Endowing Language Models with Reasoning Skills
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills
Ori Yoran
Alon Talmor
Jonathan Berant
ReLM
LRM
183
53
0
15 Jul 2021
A Survey on Low-Resource Neural Machine Translation
A Survey on Low-Resource Neural Machine Translation
Rui Wang
Xu Tan
Renqian Luo
Tao Qin
Tie-Yan Liu
3DV
43
58
0
09 Jul 2021
Neural Machine Translation for Low-Resource Languages: A Survey
Neural Machine Translation for Low-Resource Languages: A Survey
Surangika Ranathunga
E. Lee
Marjana Prifti Skenduli
Ravi Shekhar
Mehreen Alam
Rishemjit Kaur
45
237
0
29 Jun 2021
Minimax and Neyman-Pearson Meta-Learning for Outlier Languages
Minimax and Neyman-Pearson Meta-Learning for Outlier Languages
Edoardo Ponti
Rahul Aralikatte
Disha Shrivastava
Siva Reddy
Anders Søgaard
42
16
0
02 Jun 2021
Efficient Weight factorization for Multilingual Speech Recognition
Efficient Weight factorization for Multilingual Speech Recognition
Ngoc-Quan Pham
Tuan-Nam Nguyen
S. Stueker
A. Waibel
43
19
0
07 May 2021
12
Next