Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1908.03265
Cited By
On the Variance of the Adaptive Learning Rate and Beyond
8 August 2019
Liyuan Liu
Haoming Jiang
Pengcheng He
Weizhu Chen
Xiaodong Liu
Jianfeng Gao
Jiawei Han
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On the Variance of the Adaptive Learning Rate and Beyond"
23 / 373 papers shown
Title
MeliusNet: Can Binary Neural Networks Achieve MobileNet-level Accuracy?
Joseph Bethge
Christian Bartz
Haojin Yang
Ying-Cong Chen
Christoph Meinel
MQ
33
91
0
16 Jan 2020
Invertible Generative Modeling using Linear Rational Splines
H. M. Dolatabadi
S. Erfani
C. Leckie
40
65
0
15 Jan 2020
Hippocampus Segmentation on Epilepsy and Alzheimer's Disease Studies with Multiple Convolutional Neural Networks
Diedre Carmo
Bruna Silva
C. Yasuda
Letícia Rittner
R. Lotufo
45
45
0
14 Jan 2020
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising
Ziyi Yang
Chenguang Zhu
R. Gmyr
Michael Zeng
Xuedong Huang
Eric Darve
33
61
0
03 Jan 2020
Regularizing Deep Multi-Task Networks using Orthogonal Gradients
Mihai Suteu
Yike Guo
29
59
0
14 Dec 2019
NASNet: A Neuron Attention Stage-by-Stage Net for Single Image Deraining
Xu Qin
Zhiling Wang
36
35
0
06 Dec 2019
EventGAN: Leveraging Large Scale Image Datasets for Event Cameras
A. Z. Zhu
ZiYun Wang
Kaung Khant
Kostas Daniilidis
GAN
39
45
0
03 Dec 2019
The Group Loss for Deep Metric Learning
Ismail Elezi
Sebastiano Vascon
Alessandro Torcinovich
Marcello Pelillo
Laura Leal-Taixe
22
50
0
01 Dec 2019
Learning Rate Dropout
Huangxing Lin
Weihong Zeng
Xinghao Ding
Yue Huang
Yihong Zhuang
John Paisley
ODL
29
9
0
30 Nov 2019
End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
49
205
0
25 Nov 2019
Technical report: supervised training of convolutional spiking neural networks with PyTorch
Romain Zimmer
Thomas Pellegrini
S. Singh
T. Masquelier
36
32
0
22 Nov 2019
Weakly Supervised Multi-Task Learning for Cell Detection and Segmentation
Alireza Chamanzar
Yao Nie
27
53
0
27 Oct 2019
TreeCaps: Tree-Structured Capsule Networks for Program Source Code Processing
Vinoj Jayasundara
Nghi D. Q. Bui
Lingxiao Jiang
David Lo
28
16
0
27 Oct 2019
Filterbank design for end-to-end speech separation
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
37
69
0
23 Oct 2019
Torchreid: A Library for Deep Learning Person Re-Identification in Pytorch
Kaiyang Zhou
Tao Xiang
38
117
0
22 Oct 2019
Transformers without Tears: Improving the Normalization of Self-Attention
Toan Q. Nguyen
Julian Salazar
55
226
0
14 Oct 2019
On Empirical Comparisons of Optimizers for Deep Learning
Dami Choi
Christopher J. Shallue
Zachary Nado
Jaehoon Lee
Chris J. Maddison
George E. Dahl
46
256
0
11 Oct 2019
On the adequacy of untuned warmup for adaptive optimization
Jerry Ma
Denis Yarats
59
70
0
09 Oct 2019
On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement
Morten Kolbæk
Zheng-Hua Tan
S. H. Jensen
Jesper Jensen
AAML
70
127
0
03 Sep 2019
Use What You Have: Video Retrieval Using Representations From Collaborative Experts
Yang Liu
Samuel Albanie
Arsha Nagrani
Andrew Zisserman
41
387
0
31 Jul 2019
DeepShift: Towards Multiplication-Less Neural Networks
Mostafa Elhoushi
Zihao Chen
F. Shafiq
Ye Tian
Joey Yiwei Li
MQ
44
97
0
30 May 2019
Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for Regression Problems
Tianle Cai
Ruiqi Gao
Jikai Hou
Siyu Chen
Dong Wang
Di He
Zhihua Zhang
Liwei Wang
ODL
26
57
0
28 May 2019
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
50
19
0
18 Mar 2019
Previous
1
2
3
4
5
6
7
8