ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02889
  4. Cited By
Focus on the Target's Vocabulary: Masked Label Smoothing for Machine
  Translation
v1v2 (latest)

Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation

6 March 2022
Liang Chen
Runxin Xu
Baobao Chang
ArXiv (abs)PDFHTMLGithub (12★)

Papers citing "Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation"

18 / 18 papers shown
Title
DeltaLM: Encoder-Decoder Pre-training for Language Generation and
  Translation by Augmenting Pretrained Multilingual Encoders
DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders
Shuming Ma
Li Dong
Shaohan Huang
Dongdong Zhang
Alexandre Muzio
Saksham Singhal
Hany Awadalla
Xia Song
Furu Wei
SLRAI4CE
78
81
0
25 Jun 2021
Contrastive Learning for Many-to-many Multilingual Neural Machine
  Translation
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
Xiao Pan
Mingxuan Wang
Liwei Wu
Lei Li
92
206
0
20 May 2021
Semantic Label Smoothing for Sequence to Sequence Problems
Semantic Label Smoothing for Sequence to Sequence Problems
Michal Lukasik
Himanshu Jain
A. Menon
Seungyeon Kim
Srinadh Bhojanapalli
Felix X. Yu
Sanjiv Kumar
AI4TS
37
18
0
15 Oct 2020
Pre-training Multilingual Neural Machine Translation by Leveraging
  Alignment Information
Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information
Zehui Lin
Xiao Pan
Mingxuan Wang
Xipeng Qiu
Jiangtao Feng
Hao Zhou
Lei Li
58
128
0
07 Oct 2020
On the Inference Calibration of Neural Machine Translation
On the Inference Calibration of Neural Machine Translation
Shuo Wang
Zhaopeng Tu
Shuming Shi
Yang Liu
106
82
0
03 May 2020
Generalized Entropy Regularization or: There's Nothing Special about
  Label Smoothing
Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing
Clara Meister
Elizabeth Salesky
Ryan Cotterell
UQCV
49
61
0
02 May 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CEAIMat
128
1,816
0
22 Jan 2020
Shared-Private Bilingual Word Embeddings for Neural Machine Translation
Shared-Private Bilingual Word Embeddings for Neural Machine Translation
Xuebo Liu
Derek F. Wong
Yang Liu
Lidia S. Chao
Tong Xiao
Jingbo Zhu
83
37
0
07 Jun 2019
When Does Label Smoothing Help?
When Does Label Smoothing Help?
Rafael Müller
Simon Kornblith
Geoffrey E. Hinton
UQCV
212
1,958
0
06 Jun 2019
Effective Cross-lingual Transfer of Neural Machine Translation Models
  without Shared Vocabularies
Effective Cross-lingual Transfer of Neural Machine Translation Models without Shared Vocabularies
Yunsu Kim
Yingbo Gao
Hermann Ney
VLM
76
88
0
14 May 2019
MASS: Masked Sequence to Sequence Pre-training for Language Generation
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
136
966
0
07 May 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLMFaML
132
3,159
0
01 Apr 2019
Classical Structured Prediction Losses for Sequence to Sequence Learning
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
120
186
0
14 Nov 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
817
132,725
0
12 Jun 2017
Regularizing Neural Networks by Penalizing Confident Output
  Distributions
Regularizing Neural Networks by Penalizing Confident Output Distributions
Gabriel Pereyra
George Tucker
J. Chorowski
Lukasz Kaiser
Geoffrey E. Hinton
NoLa
168
1,141
0
23 Jan 2017
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Reward Augmented Maximum Likelihood for Neural Structured Prediction
Mohammad Norouzi
Samy Bengio
Zhiwen Chen
Navdeep Jaitly
M. Schuster
Yonghui Wu
Dale Schuurmans
104
253
0
01 Sep 2016
Rethinking the Inception Architecture for Computer Vision
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DVBDL
886
27,444
0
02 Dec 2015
Neural Machine Translation of Rare Words with Subword Units
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
244
7,765
0
31 Aug 2015
1