ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.03265
  4. Cited By
On the Variance of the Adaptive Learning Rate and Beyond

On the Variance of the Adaptive Learning Rate and Beyond

8 August 2019
Liyuan Liu
Haoming Jiang
Pengcheng He
Weizhu Chen
Xiaodong Liu
Jianfeng Gao
Jiawei Han
    ODL
ArXivPDFHTML

Papers citing "On the Variance of the Adaptive Learning Rate and Beyond"

23 / 373 papers shown
Title
MeliusNet: Can Binary Neural Networks Achieve MobileNet-level Accuracy?
MeliusNet: Can Binary Neural Networks Achieve MobileNet-level Accuracy?
Joseph Bethge
Christian Bartz
Haojin Yang
Ying-Cong Chen
Christoph Meinel
MQ
33
91
0
16 Jan 2020
Invertible Generative Modeling using Linear Rational Splines
Invertible Generative Modeling using Linear Rational Splines
H. M. Dolatabadi
S. Erfani
C. Leckie
40
65
0
15 Jan 2020
Hippocampus Segmentation on Epilepsy and Alzheimer's Disease Studies
  with Multiple Convolutional Neural Networks
Hippocampus Segmentation on Epilepsy and Alzheimer's Disease Studies with Multiple Convolutional Neural Networks
Diedre Carmo
Bruna Silva
C. Yasuda
Letícia Rittner
R. Lotufo
45
45
0
14 Jan 2020
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling
  and Denoising
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising
Ziyi Yang
Chenguang Zhu
R. Gmyr
Michael Zeng
Xuedong Huang
Eric Darve
33
61
0
03 Jan 2020
Regularizing Deep Multi-Task Networks using Orthogonal Gradients
Regularizing Deep Multi-Task Networks using Orthogonal Gradients
Mihai Suteu
Yike Guo
29
59
0
14 Dec 2019
NASNet: A Neuron Attention Stage-by-Stage Net for Single Image Deraining
Xu Qin
Zhiling Wang
36
35
0
06 Dec 2019
EventGAN: Leveraging Large Scale Image Datasets for Event Cameras
EventGAN: Leveraging Large Scale Image Datasets for Event Cameras
A. Z. Zhu
ZiYun Wang
Kaung Khant
Kostas Daniilidis
GAN
39
45
0
03 Dec 2019
The Group Loss for Deep Metric Learning
The Group Loss for Deep Metric Learning
Ismail Elezi
Sebastiano Vascon
Alessandro Torcinovich
Marcello Pelillo
Laura Leal-Taixe
22
50
0
01 Dec 2019
Learning Rate Dropout
Learning Rate Dropout
Huangxing Lin
Weihong Zeng
Xinghao Ding
Yue Huang
Yihong Zhuang
John Paisley
ODL
29
9
0
30 Nov 2019
End-to-End Model-Free Reinforcement Learning for Urban Driving using
  Implicit Affordances
End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
49
205
0
25 Nov 2019
Technical report: supervised training of convolutional spiking neural
  networks with PyTorch
Technical report: supervised training of convolutional spiking neural networks with PyTorch
Romain Zimmer
Thomas Pellegrini
S. Singh
T. Masquelier
36
32
0
22 Nov 2019
Weakly Supervised Multi-Task Learning for Cell Detection and
  Segmentation
Weakly Supervised Multi-Task Learning for Cell Detection and Segmentation
Alireza Chamanzar
Yao Nie
27
53
0
27 Oct 2019
TreeCaps: Tree-Structured Capsule Networks for Program Source Code
  Processing
TreeCaps: Tree-Structured Capsule Networks for Program Source Code Processing
Vinoj Jayasundara
Nghi D. Q. Bui
Lingxiao Jiang
David Lo
28
16
0
27 Oct 2019
Filterbank design for end-to-end speech separation
Filterbank design for end-to-end speech separation
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
37
69
0
23 Oct 2019
Torchreid: A Library for Deep Learning Person Re-Identification in
  Pytorch
Torchreid: A Library for Deep Learning Person Re-Identification in Pytorch
Kaiyang Zhou
Tao Xiang
38
117
0
22 Oct 2019
Transformers without Tears: Improving the Normalization of
  Self-Attention
Transformers without Tears: Improving the Normalization of Self-Attention
Toan Q. Nguyen
Julian Salazar
55
226
0
14 Oct 2019
On Empirical Comparisons of Optimizers for Deep Learning
On Empirical Comparisons of Optimizers for Deep Learning
Dami Choi
Christopher J. Shallue
Zachary Nado
Jaehoon Lee
Chris J. Maddison
George E. Dahl
46
256
0
11 Oct 2019
On the adequacy of untuned warmup for adaptive optimization
On the adequacy of untuned warmup for adaptive optimization
Jerry Ma
Denis Yarats
59
70
0
09 Oct 2019
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
AAML
70
127
0
03 Sep 2019
Use What You Have: Video Retrieval Using Representations From
  Collaborative Experts
Use What You Have: Video Retrieval Using Representations From Collaborative Experts
Yang Liu
Samuel Albanie
Arsha Nagrani
Andrew Zisserman
41
387
0
31 Jul 2019
DeepShift: Towards Multiplication-Less Neural Networks
DeepShift: Towards Multiplication-Less Neural Networks
Mostafa Elhoushi
Zihao Chen
F. Shafiq
Ye Tian
Joey Yiwei Li
MQ
44
97
0
30 May 2019
Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for
  Regression Problems
Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for Regression Problems
Tianle Cai
Ruiqi Gao
Jikai Hou
Siyu Chen
Dong Wang
Di He
Zhihua Zhang
Liwei Wang
ODL
26
57
0
28 May 2019
Neutron: An Implementation of the Transformer Translation Model and its
  Variants
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
50
19
0
18 Mar 2019
Previous
12345678