ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.07868
  4. Cited By
Weight Normalization: A Simple Reparameterization to Accelerate Training
  of Deep Neural Networks

Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks

25 February 2016
Tim Salimans
Diederik P. Kingma
    ODL
ArXivPDFHTML

Papers citing "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks"

50 / 957 papers shown
Title
Gradient-based Data Augmentation for Semi-Supervised Learning
Gradient-based Data Augmentation for Semi-Supervised Learning
H. Kaizuka
33
2
0
28 Mar 2020
Exemplar Normalization for Learning Deep Representation
Exemplar Normalization for Learning Deep Representation
Ruimao Zhang
Zhanglin Peng
Lingyun Wu
Zhuguo Li
Ping Luo
OOD
27
13
0
19 Mar 2020
PowerNorm: Rethinking Batch Normalization in Transformers
PowerNorm: Rethinking Batch Normalization in Transformers
Sheng Shen
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
BDL
24
16
0
17 Mar 2020
Geometric Approaches to Increase the Expressivity of Deep Neural
  Networks for MR Reconstruction
Geometric Approaches to Increase the Expressivity of Deep Neural Networks for MR Reconstruction
Eunju Cha
Gyutaek Oh
J. C. Ye
27
11
0
17 Mar 2020
DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning
DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning
Chi Zhang
Yujun Cai
Guosheng Lin
Chunhua Shen
VLM
26
110
0
15 Mar 2020
Extended Batch Normalization
Extended Batch Normalization
Chunjie Luo
Jianfeng Zhan
Lei Wang
Wanling Gao
12
14
0
12 Mar 2020
Embedding Propagation: Smoother Manifold for Few-Shot Classification
Embedding Propagation: Smoother Manifold for Few-Shot Classification
Pau Rodríguez
I. Laradji
Alexandre Drouin
Alexandre Lacoste
4
192
0
09 Mar 2020
TaskNorm: Rethinking Batch Normalization for Meta-Learning
TaskNorm: Rethinking Batch Normalization for Meta-Learning
J. Bronskill
Jonathan Gordon
James Requeima
Sebastian Nowozin
Richard Turner
71
89
0
06 Mar 2020
Self-Supervised Visual Learning by Variable Playback Speeds Prediction
  of a Video
Self-Supervised Visual Learning by Variable Playback Speeds Prediction of a Video
Hyeon Cho
Taehoon Kim
H. Chang
Wonjun Hwang
20
19
0
05 Mar 2020
Batch Normalization Provably Avoids Rank Collapse for Randomly
  Initialised Deep Networks
Batch Normalization Provably Avoids Rank Collapse for Randomly Initialised Deep Networks
Hadi Daneshmand
Jonas Köhler
Francis R. Bach
Thomas Hofmann
Aurelien Lucchi
OOD
ODL
8
4
0
03 Mar 2020
Training BatchNorm and Only BatchNorm: On the Expressive Power of Random
  Features in CNNs
Training BatchNorm and Only BatchNorm: On the Expressive Power of Random Features in CNNs
Jonathan Frankle
D. Schwab
Ari S. Morcos
20
139
0
29 Feb 2020
Batch Normalization Biases Residual Blocks Towards the Identity Function
  in Deep Networks
Batch Normalization Biases Residual Blocks Towards the Identity Function in Deep Networks
Soham De
Samuel L. Smith
ODL
27
20
0
24 Feb 2020
Tuning-free Plug-and-Play Proximal Algorithm for Inverse Imaging
  Problems
Tuning-free Plug-and-Play Proximal Algorithm for Inverse Imaging Problems
Kaixuan Wei
Angelica Aviles-Rivero
Jingwei Liang
Ying Fu
Carola-Bibiane Schönlieb
Hua Huang
21
103
0
22 Feb 2020
Adapted Center and Scale Prediction: More Stable and More Accurate
Adapted Center and Scale Prediction: More Stable and More Accurate
Wenhao Wang
23
24
0
20 Feb 2020
Boosting Adversarial Training with Hypersphere Embedding
Boosting Adversarial Training with Hypersphere Embedding
Tianyu Pang
Xiao Yang
Yinpeng Dong
Kun Xu
Jun Zhu
Hang Su
AAML
33
154
0
20 Feb 2020
Do We Really Need to Access the Source Data? Source Hypothesis Transfer
  for Unsupervised Domain Adaptation
Do We Really Need to Access the Source Data? Source Hypothesis Transfer for Unsupervised Domain Adaptation
Jian Liang
Dapeng Hu
Jiashi Feng
50
1,210
0
20 Feb 2020
On the Discrepancy between Density Estimation and Sequence Generation
On the Discrepancy between Density Estimation and Sequence Generation
Jason D. Lee
Dustin Tran
Orhan Firat
Kyunghyun Cho
8
11
0
17 Feb 2020
Augmented Normalizing Flows: Bridging the Gap Between Generative Flows
  and Latent Variable Models
Augmented Normalizing Flows: Bridging the Gap Between Generative Flows and Latent Variable Models
Chin-Wei Huang
Laurent Dinh
Aaron Courville
DRL
31
87
0
17 Feb 2020
Cross-Iteration Batch Normalization
Cross-Iteration Batch Normalization
Zhuliang Yao
Yu Cao
Shuxin Zheng
Gao Huang
Stephen Lin
19
85
0
13 Feb 2020
Deep Learning for Source Code Modeling and Generation: Models,
  Applications and Challenges
Deep Learning for Source Code Modeling and Generation: Models, Applications and Challenges
T. H. Le
Hao Chen
Muhammad Ali Babar
VLM
64
152
0
13 Feb 2020
Data-Driven Permanent Magnet Temperature Estimation in Synchronous
  Motors with Supervised Machine Learning
Data-Driven Permanent Magnet Temperature Estimation in Synchronous Motors with Supervised Machine Learning
Wilhelm Kirchgässner
Oliver Wallscheid
J. Böcker
23
68
0
17 Jan 2020
Reliable and Energy Efficient MLC STT-RAM Buffer for CNN Accelerators
Reliable and Energy Efficient MLC STT-RAM Buffer for CNN Accelerators
Masoomeh Jasemi
S. Hessabi
N. Bagherzadeh
11
9
0
14 Jan 2020
An Internal Covariate Shift Bounding Algorithm for Deep Neural Networks
  by Unitizing Layers' Outputs
An Internal Covariate Shift Bounding Algorithm for Deep Neural Networks by Unitizing Layers' Outputs
You Huang
Yuanlong Yu
10
6
0
09 Jan 2020
On Computation and Generalization of Generative Adversarial Imitation
  Learning
On Computation and Generalization of Generative Adversarial Imitation Learning
Minshuo Chen
Yizhou Wang
Tianyi Liu
Zhuoran Yang
Xingguo Li
Zhaoran Wang
T. Zhao
37
40
0
09 Jan 2020
Self-Orthogonality Module: A Network Architecture Plug-in for Learning
  Orthogonal Filters
Self-Orthogonality Module: A Network Architecture Plug-in for Learning Orthogonal Filters
Ziming Zhang
Wenchi Ma
Yuanwei Wu
Guanghui Wang
40
10
0
05 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent
  Advances, and Future Trends
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
34
81
0
02 Jan 2020
A Comprehensive and Modularized Statistical Framework for Gradient Norm
  Equality in Deep Neural Networks
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks
Zhaodong Chen
Lei Deng
Bangyan Wang
Guoqi Li
Yuan Xie
35
28
0
01 Jan 2020
Score and Lyrics-Free Singing Voice Generation
Score and Lyrics-Free Singing Voice Generation
Jen-Yu Liu
Yu-Hua Chen
Yin-Cheng Yeh
Yi-Hsuan Yang
27
22
0
26 Dec 2019
TRADI: Tracking deep neural network weight distributions for uncertainty
  estimation
TRADI: Tracking deep neural network weight distributions for uncertainty estimation
Gianni Franchi
Andrei Bursuc
Emanuel Aldea
Séverine Dubuisson
Isabelle Bloch
UQCV
26
51
0
24 Dec 2019
Group-Connected Multilayer Perceptron Networks
Group-Connected Multilayer Perceptron Networks
Mohammad Kachuee
Sajad Darabi
Shayan Fazeli
Majid Sarrafzadeh
AI4CE
22
1
0
20 Dec 2019
Optimization for deep learning: theory and algorithms
Optimization for deep learning: theory and algorithms
Ruoyu Sun
ODL
27
168
0
19 Dec 2019
Direction Concentration Learning: Enhancing Congruency in Machine
  Learning
Direction Concentration Learning: Enhancing Congruency in Machine Learning
Yan Luo
Yongkang Wong
Mohan Kankanhalli
Qi Zhao
23
12
0
17 Dec 2019
Generative Teaching Networks: Accelerating Neural Architecture Search by
  Learning to Generate Synthetic Training Data
Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data
F. Such
Aditya Rawal
Joel Lehman
Kenneth O. Stanley
Jeff Clune
DD
19
155
0
17 Dec 2019
Local Context Normalization: Revisiting Local Normalization
Local Context Normalization: Revisiting Local Normalization
Anthony Ortiz
Caleb Robinson
Dan Morris
O. Fuentes
Christopher Kiekintveld
Mahmudulla Hassan
Nebojsa Jojic
11
25
0
12 Dec 2019
To Balance or Not to Balance: A Simple-yet-Effective Approach for
  Learning with Long-Tailed Distributions
To Balance or Not to Balance: A Simple-yet-Effective Approach for Learning with Long-Tailed Distributions
Junjie Zhang
Lingqiao Liu
Peng Wang
Chunhua Shen
21
25
0
10 Dec 2019
AugMix: A Simple Data Processing Method to Improve Robustness and
  Uncertainty
AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty
Dan Hendrycks
Norman Mu
E. D. Cubuk
Barret Zoph
Justin Gilmer
Balaji Lakshminarayanan
OOD
UQCV
48
1,280
0
05 Dec 2019
RTN: Reparameterized Ternary Network
RTN: Reparameterized Ternary Network
Yuhang Li
Xin Dong
Shanghang Zhang
Haoli Bai
Yuanpeng Chen
Wei Wang
MQ
16
28
0
04 Dec 2019
Analyzing and Improving the Image Quality of StyleGAN
Analyzing and Improving the Image Quality of StyleGAN
Tero Karras
S. Laine
M. Aittala
Janne Hellsten
J. Lehtinen
Timo Aila
GAN
84
5,732
0
03 Dec 2019
Viewpoint-Aware Loss with Angular Regularization for Person
  Re-Identification
Viewpoint-Aware Loss with Angular Regularization for Person Re-Identification
Zhihui Zhu
Xinyang Jiang
Feng Zheng
Xiao-Wei Guo
Feiyue Huang
Weishi Zheng
Xing Sun
17
71
0
03 Dec 2019
WaveFlow: A Compact Flow-based Model for Raw Audio
WaveFlow: A Compact Flow-based Model for Raw Audio
Ming-Yu Liu
Kainan Peng
Kexin Zhao
Z. Song
23
116
0
03 Dec 2019
Flow Contrastive Estimation of Energy-Based Models
Flow Contrastive Estimation of Energy-Based Models
Ruiqi Gao
Erik Nijkamp
Diederik P. Kingma
Zhen Xu
Andrew M. Dai
Ying Nian Wu
GAN
19
112
0
02 Dec 2019
Mean Shift Rejection: Training Deep Neural Networks Without Minibatch
  Statistics or Normalization
Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization
B. Ruff
Taylor Beck
Joscha Bach
14
3
0
29 Nov 2019
Orthogonal Wasserstein GANs
Orthogonal Wasserstein GANs
J. Müller
Reinhard Klein
Michael Weinmann
48
9
0
29 Nov 2019
Orthogonal Convolutional Neural Networks
Orthogonal Convolutional Neural Networks
Jiayun Wang
Yubei Chen
Rudrasis Chakraborty
Stella X. Yu
27
186
0
27 Nov 2019
Invert to Learn to Invert
Invert to Learn to Invert
P. Putzky
Max Welling
11
75
0
25 Nov 2019
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question
  Answering
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering
Akari Asai
Kazuma Hashimoto
Hannaneh Hajishirzi
R. Socher
Caiming Xiong
RALM
KELM
LRM
26
283
0
24 Nov 2019
Rethinking Normalization and Elimination Singularity in Neural Networks
Rethinking Normalization and Elimination Singularity in Neural Networks
Siyuan Qiao
Huiyu Wang
Chenxi Liu
Wei Shen
Alan Yuille
22
10
0
21 Nov 2019
Filter Response Normalization Layer: Eliminating Batch Dependence in the
  Training of Deep Neural Networks
Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Networks
Saurabh Singh
Shankar Krishnan
UQCV
74
125
0
21 Nov 2019
Approximated Orthonormal Normalisation in Training Neural Networks
Approximated Orthonormal Normalisation in Training Neural Networks
Guoqiang Zhang
Kenta Niwa
W. Kleijn
11
3
0
21 Nov 2019
Implicit Regularization and Convergence for Weight Normalization
Implicit Regularization and Convergence for Weight Normalization
Xiaoxia Wu
Yan Sun
Tongzheng Ren
Shanshan Wu
Zhiyuan Li
Suriya Gunasekar
Rachel A. Ward
Qiang Liu
25
21
0
18 Nov 2019
Previous
123...111213...181920
Next