On the Variance of the Adaptive Learning Rate and Beyond

8 August 2019

Xiaodong Liu

Papers citing "On the Variance of the Adaptive Learning Rate and Beyond"

23 / 373 papers shown

| Title | Authors | Tag | | Date |
|---|---|---|---|---|
| MeliusNet: Can Binary Neural Networks Achieve MobileNet-level Accuracy? | Joseph Bethge, Christian Bartz, Haojin Yang, Ying-Cong Chen, Christoph Meinel | MQ | 33 91 0 | 16 Jan 2020 |
| Invertible Generative Modeling using Linear Rational Splines | H. M. Dolatabadi, S. Erfani, C. Leckie | | 40 65 0 | 15 Jan 2020 |
| Hippocampus Segmentation on Epilepsy and Alzheimer's Disease Studies with Multiple Convolutional Neural Networks | Diedre Carmo, Bruna Silva, C. Yasuda, Letícia Rittner, R. Lotufo | | 45 45 0 | 14 Jan 2020 |
| TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising | Ziyi Yang, Chenguang Zhu, R. Gmyr, Michael Zeng, Xuedong Huang, Eric Darve | | 33 61 0 | 03 Jan 2020 |
| Regularizing Deep Multi-Task Networks using Orthogonal Gradients | Mihai Suteu, Yike Guo | | 29 59 0 | 14 Dec 2019 |
| NASNet: A Neuron Attention Stage-by-Stage Net for Single Image Deraining | Xu Qin, Zhiling Wang | | 36 35 0 | 06 Dec 2019 |
| EventGAN: Leveraging Large Scale Image Datasets for Event Cameras | A. Z. Zhu, ZiYun Wang, Kaung Khant, Kostas Daniilidis | GAN | 39 45 0 | 03 Dec 2019 |
| The Group Loss for Deep Metric Learning | Ismail Elezi, Sebastiano Vascon, Alessandro Torcinovich, Marcello Pelillo, Laura Leal-Taixe | | 22 50 0 | 01 Dec 2019 |
| Learning Rate Dropout | Huangxing Lin, Weihong Zeng, Xinghao Ding, Yue Huang, Yihong Zhuang, John Paisley | ODL | 29 9 0 | 30 Nov 2019 |
| End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances | Marin Toromanoff, É. Wirbel, Fabien Moutarde | OffRL | 49 205 0 | 25 Nov 2019 |
| Technical report: supervised training of convolutional spiking neural networks with PyTorch | Romain Zimmer, Thomas Pellegrini, S. Singh, T. Masquelier | | 36 32 0 | 22 Nov 2019 |
| Weakly Supervised Multi-Task Learning for Cell Detection and Segmentation | Alireza Chamanzar, Yao Nie | | 27 53 0 | 27 Oct 2019 |
| TreeCaps: Tree-Structured Capsule Networks for Program Source Code Processing | Vinoj Jayasundara, Nghi D. Q. Bui, Lingxiao Jiang, David Lo | | 28 16 0 | 27 Oct 2019 |
| Filterbank design for end-to-end speech separation | Manuel Pariente, Samuele Cornell, Antoine Deleforge, Emmanuel Vincent | | 37 69 0 | 23 Oct 2019 |
| Torchreid: A Library for Deep Learning Person Re-Identification in Pytorch | Kaiyang Zhou, Tao Xiang | | 38 117 0 | 22 Oct 2019 |
| Transformers without Tears: Improving the Normalization of Self-Attention | Toan Q. Nguyen, Julian Salazar | | 55 226 0 | 14 Oct 2019 |
| On Empirical Comparisons of Optimizers for Deep Learning | Dami Choi, Christopher J. Shallue, Zachary Nado, Jaehoon Lee, Chris J. Maddison, George E. Dahl | | 46 256 0 | 11 Oct 2019 |
| On the adequacy of untuned warmup for adaptive optimization | Jerry Ma, Denis Yarats | | 59 70 0 | 09 Oct 2019 |
| On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement | Morten Kolbæk, Zheng-Hua Tan, S. H. Jensen, Jesper Jensen | AAML | 70 127 0 | 03 Sep 2019 |
| Use What You Have: Video Retrieval Using Representations From Collaborative Experts | Yang Liu, Samuel Albanie, Arsha Nagrani, Andrew Zisserman | | 41 387 0 | 31 Jul 2019 |
| DeepShift: Towards Multiplication-Less Neural Networks | Mostafa Elhoushi, Zihao Chen, F. Shafiq, Ye Tian, Joey Yiwei Li | MQ | 44 97 0 | 30 May 2019 |
| Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for Regression Problems | Tianle Cai, Ruiqi Gao, Jikai Hou, Siyu Chen, Dong Wang, Di He, Zhihua Zhang, Liwei Wang | ODL | 26 57 0 | 28 May 2019 |
| Neutron: An Implementation of the Transformer Translation Model and its Variants | Hongfei Xu, Qiuhui Liu | | 50 19 0 | 18 Mar 2019 |