ResearchTrend.AI
Optimizing Deep Transformers for Chinese-Thai Low-Resource Translation

24 December 2022
Wenjie Hao
Hongfei Xu
Lingling Mu
Hongying Zan
    MoE

Papers citing "Optimizing Deep Transformers for Chinese-Thai Low-Resource Translation"

29 / 29 papers shown
Optimizing Deeper Transformers on Small Datasets
Peng Xu
Dhruv Kumar
Wei Yang
Wenjie Zi
Keyi Tang
Chenyang Huang
Jackie C.K. Cheung
S. Prince
Yanshuai Cao
AI4CE
89
69
0
30 Dec 2020
Learning Light-Weight Translation Models from Deep Transformer
Bei Li
Ziyang Wang
Hui Liu
Quan Du
Tong Xiao
Chunliang Zhang
Jingbo Zhu
VLM
154
40
0
27 Dec 2020
Shallow-to-Deep Training for Neural Machine Translation
Bei Li
Ziyang Wang
Hui Liu
Yufan Jiang
Quan Du
Tong Xiao
Huizhen Wang
Jingbo Zhu
61
49
0
08 Oct 2020
Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change
Hongfei Xu
Josef van Genabith
Deyi Xiong
Qiuhui Liu
32
11
0
05 May 2020
Multiscale Collaborative Deep Models for Neural Machine Translation
Xiangpeng Wei
Heng Yu
Yue Hu
Yue Zhang
Rongxiang Weng
Weihua Luo
68
29
0
29 Apr 2020
On Layer Normalization in the Transformer Architecture
Ruibin Xiong
Yunchang Yang
Di He
Kai Zheng
Shuxin Zheng
Chen Xing
Huishuai Zhang
Yanyan Lan
Liwei Wang
Tie-Yan Liu
AI4CE
145
996
0
12 Feb 2020
Lipschitz Constrained Parameter Initialization for Deep Transformers
Hongfei Xu
Qiuhui Liu
Josef van Genabith
Deyi Xiong
Jingyi Zhang
ODL
82
26
0
08 Nov 2019
On the use of BERT for Neural Machine Translation
Stéphane Clinchant
K. Jung
Vassilina Nikoulina
71
90
0
27 Sep 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
373
6,469
0
26 Sep 2019
Improving Deep Transformer with Depth-Scaled Initialization and Merged Attention
Biao Zhang
Ivan Titov
Rico Sennrich
55
103
0
29 Aug 2019
Depth Growing for Neural Machine Translation
Lijun Wu
Yiren Wang
Yingce Xia
Fei Tian
Fei Gao
Tao Qin
Jianhuang Lai
Tie-Yan Liu
55
41
0
03 Jul 2019
Learning Deep Transformer Models for Machine Translation
Qiang Wang
Bei Li
Tong Xiao
Jingbo Zhu
Changliang Li
Derek F. Wong
Lidia S. Chao
80
672
0
05 Jun 2019
Revisiting Low-Resource Neural Machine Translation: A Case Study
Rico Sennrich
Biao Zhang
63
223
0
28 May 2019
Pre-trained Language Model Representations for Language Generation
Sergey Edunov
Alexei Baevski
Michael Auli
69
129
0
22 Mar 2019
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
67
19
0
18 Mar 2019
Simple Fusion: Return of the Language Model
Felix Stahlberg
James Cross
Veselin Stoyanov
83
74
0
01 Sep 2018
Training Deeper Neural Machine Translation Models with Transparent Attention
Ankur Bapna
Mia Xu Chen
Orhan Firat
Yuan Cao
Yonghui Wu
79
139
0
22 Aug 2018
SwitchOut: an Efficient Data Augmentation Algorithm for Neural Machine Translation
Xinyi Wang
Hieu H. Pham
Zihang Dai
Graham Neubig
70
197
0
22 Aug 2018
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
Taku Kudo
John Richardson
206
3,531
0
19 Aug 2018
A Call for Clarity in Reporting BLEU Scores
Matt Post
179
2,998
0
23 Apr 2018
Phrase-Based & Neural Unsupervised Machine Translation
Guillaume Lample
Myle Ott
Alexis Conneau
Ludovic Denoyer
Marc'Aurelio Ranzato
91
682
0
20 Apr 2018
When and Why are Pre-trained Word Embeddings Useful for Neural Machine Translation?
Ye Qi
Devendra Singh Sachan
Matthieu Felix
Sarguna Padmanabhan
Graham Neubig
99
344
0
17 Apr 2018
Self-Attention with Relative Position Representations
Peter Shaw
Jakob Uszkoreit
Ashish Vaswani
182
2,299
0
06 Mar 2018
Improving Lexical Choice in Neural Machine Translation
Toan Q. Nguyen
David Chiang
58
86
0
03 Oct 2017
Six Challenges for Neural Machine Translation
Philipp Koehn
Rebecca Knowles
AAML
AIMat
377
1,225
0
12 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
792
132,454
0
12 Jun 2017
Data Augmentation for Low-Resource Neural Machine Translation
Marzieh Fadaee
Arianna Bisazza
Christof Monz
105
469
0
01 May 2017
Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich
Barry Haddow
Alexandra Birch
261
2,723
0
20 Nov 2015
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
235
7,760
0
31 Aug 2015