ResearchTrend.AI

Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change

5 May 2020
Hongfei Xu
Josef van Genabith
Deyi Xiong
Qiuhui Liu

Papers citing "Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change"

3 / 3 papers shown
Optimizing Deep Transformers for Chinese-Thai Low-Resource Translation
Wenjie Hao
Hongfei Xu
Lingling Mu
Hongying Zan
MoE
18
4
0
24 Dec 2022
Small Batch Sizes Improve Training of Low-Resource Neural MT
Àlex R. Atrio
Andrei Popescu-Belis
35
6
0
20 Mar 2022
Rewiring the Transformer with Depth-Wise LSTMs
Hongfei Xu
Yang Song
Qiuhui Liu
Josef van Genabith
Deyi Xiong
37
6
0
13 Jul 2020