Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.02008
Cited By
Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change
5 May 2020
Hongfei Xu
Josef van Genabith
Deyi Xiong
Qiuhui Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change"
3 / 3 papers shown
Title
Optimizing Deep Transformers for Chinese-Thai Low-Resource Translation
Wenjie Hao
Hongfei Xu
Lingling Mu
Hongying Zan
MoE
18
4
0
24 Dec 2022
Small Batch Sizes Improve Training of Low-Resource Neural MT
Àlex R. Atrio
Andrei Popescu-Belis
35
6
0
20 Mar 2022
Rewiring the Transformer with Depth-Wise LSTMs
Hongfei Xu
Yang Song
Qiuhui Liu
Josef van Genabith
Deyi Xiong
37
6
0
13 Jul 2020
1