ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.07868
  4. Cited By
Weight Normalization: A Simple Reparameterization to Accelerate Training
  of Deep Neural Networks

Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks

25 February 2016
Tim Salimans
Diederik P. Kingma
    ODL
ArXivPDFHTML

Papers citing "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks"

50 / 957 papers shown
Title
L2 Regularization versus Batch and Weight Normalization
L2 Regularization versus Batch and Weight Normalization
Twan van Laarhoven
8
291
0
16 Jun 2017
von Mises-Fisher Mixture Model-based Deep learning: Application to Face
  Verification
von Mises-Fisher Mixture Model-based Deep learning: Application to Face Verification
Abul Hasnat
Julien Bohné
Jonathan Milgram
S. Gentric
Liming Luke Chen
CVBM
39
91
0
13 Jun 2017
Deep Control - a simple automatic gain control for memory efficient and
  high performance training of deep convolutional neural networks
Deep Control - a simple automatic gain control for memory efficient and high performance training of deep convolutional neural networks
B. Ruff
ODL
19
0
0
13 Jun 2017
Sliced Wasserstein Generative Models
Sliced Wasserstein Generative Models
Jiqing Wu
Zhiwu Huang
Dinesh Acharya
Wen Li
Janine Thoma
Danda Pani Paudel
Luc Van Gool
DiffM
22
0
0
08 Jun 2017
Self-Normalizing Neural Networks
Self-Normalizing Neural Networks
G. Klambauer
Thomas Unterthiner
Andreas Mayr
Sepp Hochreiter
61
2,483
0
08 Jun 2017
DiracNets: Training Very Deep Neural Networks Without Skip-Connections
DiracNets: Training Very Deep Neural Networks Without Skip-Connections
Sergey Zagoruyko
N. Komodakis
UQCV
OOD
11
117
0
01 Jun 2017
Learning to Generate Chairs with Generative Adversarial Nets
Learning to Generate Chairs with Generative Adversarial Nets
E. Zamyatin
Andrey Filchenkov
GAN
13
9
0
29 May 2017
Deep Complex Networks
Deep Complex Networks
C. Trabelsi
O. Bilaniuk
Ying Zhang
Dmitriy Serdyuk
Sandeep Subramanian
J. F. Santos
Soroush Mehri
Negar Rostamzadeh
Yoshua Bengio
C. Pal
39
824
0
27 May 2017
Flow-GAN: Combining Maximum Likelihood and Adversarial Learning in
  Generative Models
Flow-GAN: Combining Maximum Likelihood and Adversarial Learning in Generative Models
Aditya Grover
Manik Dhar
Stefano Ermon
GAN
36
24
0
24 May 2017
Diminishing Batch Normalization
Diminishing Batch Normalization
Yintai Ma
Diego Klabjan
25
15
0
22 May 2017
Probabilistic Image Colorization
Probabilistic Image Colorization
Amelie Royer
Alexander Kolesnikov
Christoph H. Lampert
17
44
0
11 May 2017
Convolutional Sequence to Sequence Learning
Convolutional Sequence to Sequence Learning
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
19
3,263
0
08 May 2017
Parseval Networks: Improving Robustness to Adversarial Examples
Parseval Networks: Improving Robustness to Adversarial Examples
Moustapha Cissé
Piotr Bojanowski
Edouard Grave
Yann N. Dauphin
Nicolas Usunier
AAML
54
796
0
28 Apr 2017
A Neural Network model with Bidirectional Whitening
A Neural Network model with Bidirectional Whitening
Y. Fujimoto
T. Ohira
24
4
0
24 Apr 2017
NormFace: L2 Hypersphere Embedding for Face Verification
NormFace: L2 Hypersphere Embedding for Face Verification
Feng Wang
Xiang Xiang
Jian Cheng
Alan Yuille
3DH
CVBM
10
739
0
21 Apr 2017
On the Effects of Batch and Weight Normalization in Generative
  Adversarial Networks
On the Effects of Batch and Weight Normalization in Generative Adversarial Networks
Sitao Xiang
Hao Li
GAN
27
82
0
13 Apr 2017
Simplified Stochastic Feedforward Neural Networks
Simplified Stochastic Feedforward Neural Networks
Kimin Lee
Jaehyung Kim
S. Chong
Jinwoo Shin
13
3
0
11 Apr 2017
Learning to Generate Reviews and Discovering Sentiment
Learning to Generate Reviews and Discovering Sentiment
Alec Radford
Rafal Jozefowicz
Ilya Sutskever
38
503
0
05 Apr 2017
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
27
255
0
03 Apr 2017
Block-Matching Convolutional Neural Network for Image Denoising
Block-Matching Convolutional Neural Network for Image Denoising
Byeongyong Ahn
N. Cho
SupR
24
45
0
03 Apr 2017
Improved Training of Wasserstein GANs
Improved Training of Wasserstein GANs
Ishaan Gulrajani
Faruk Ahmed
Martín Arjovsky
Vincent Dumoulin
Aaron Courville
GAN
16
9,467
0
31 Mar 2017
One-Shot Imitation Learning
One-Shot Imitation Learning
Yan Duan
Marcin Andrychowicz
Bradly C. Stadie
Jonathan Ho
Jonas Schneider
Ilya Sutskever
Pieter Abbeel
Wojciech Zaremba
OffRL
9
679
0
21 Mar 2017
Arbitrary Style Transfer in Real-time with Adaptive Instance
  Normalization
Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
Xun Huang
Serge J. Belongie
OOD
13
4,300
0
20 Mar 2017
Second-order Convolutional Neural Networks
Second-order Convolutional Neural Networks
Kaicheng Yu
Mathieu Salzmann
8
51
0
20 Mar 2017
Sharp Minima Can Generalize For Deep Nets
Sharp Minima Can Generalize For Deep Nets
Laurent Dinh
Razvan Pascanu
Samy Bengio
Yoshua Bengio
ODL
46
755
0
15 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
362
11,684
0
09 Mar 2017
All You Need is Beyond a Good Init: Exploring Better Solution for
  Training Extremely Deep Convolutional Neural Networks with Orthonormality and
  Modulation
All You Need is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks with Orthonormality and Modulation
Di Xie
Jiang Xiong
Shiliang Pu
19
181
0
06 Mar 2017
Mean teachers are better role models: Weight-averaged consistency
  targets improve semi-supervised deep learning results
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results
Antti Tarvainen
Harri Valpola
OOD
MoMe
264
1,275
0
06 Mar 2017
Boosted Generative Models
Boosted Generative Models
Aditya Grover
Stefano Ermon
12
51
0
27 Feb 2017
Dynamic Word Embeddings
Dynamic Word Embeddings
Robert Bamler
Stephan Mandt
BDL
161
231
0
27 Feb 2017
Cosine Normalization: Using Cosine Similarity Instead of Dot Product in
  Neural Networks
Cosine Normalization: Using Cosine Similarity Instead of Dot Product in Neural Networks
Chunjie Luo
Jianfeng Zhan
Lei Wang
Qiang Yang
24
198
0
20 Feb 2017
Batch Renormalization: Towards Reducing Minibatch Dependence in
  Batch-Normalized Models
Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models
Sergey Ioffe
BDL
8
537
0
10 Feb 2017
Optimization on Product Submanifolds of Convolution Kernels
Optimization on Product Submanifolds of Convolution Kernels
Mete Ozay
Takayuki Okatani
AAML
23
0
0
22 Jan 2017
Modeling Grasp Motor Imagery through Deep Conditional Generative Models
Modeling Grasp Motor Imagery through Deep Conditional Generative Models
M. Veres
M. Moussa
Graham W. Taylor
GAN
21
37
0
11 Jan 2017
Language Modeling with Gated Convolutional Networks
Language Modeling with Gated Convolutional Networks
Yann N. Dauphin
Angela Fan
Michael Auli
David Grangier
48
2,357
0
23 Dec 2016
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
Soroush Mehri
Kundan Kumar
Ishaan Gulrajani
Rithesh Kumar
Shubham Jain
Jose M. R. Sotelo
Aaron Courville
Yoshua Bengio
15
594
0
22 Dec 2016
Neuromorphic Deep Learning Machines
Neuromorphic Deep Learning Machines
Emre Neftci
C. Augustine
Somnath Paul
Georgios Detorakis
BDL
132
257
0
16 Dec 2016
Improving training of deep neural networks via Singular Value Bounding
Improving training of deep neural networks via Singular Value Bounding
Kui Jia
AAML
25
90
0
18 Nov 2016
Normalizing the Normalizers: Comparing and Extending Network
  Normalization Schemes
Normalizing the Normalizers: Comparing and Extending Network Normalization Schemes
Mengye Ren
Renjie Liao
R. Urtasun
Fabian H. Sinz
R. Zemel
24
81
0
14 Nov 2016
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
RL2^22: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
33
1,007
0
09 Nov 2016
Variational Lossy Autoencoder
Variational Lossy Autoencoder
Xi Chen
Diederik P. Kingma
Tim Salimans
Yan Duan
Prafulla Dhariwal
John Schulman
Ilya Sutskever
Pieter Abbeel
DRL
SSL
GAN
30
671
0
08 Nov 2016
Entropy-SGD: Biasing Gradient Descent Into Wide Valleys
Entropy-SGD: Biasing Gradient Descent Into Wide Valleys
Pratik Chaudhari
A. Choromańska
Stefano Soatto
Yann LeCun
Carlo Baldassi
C. Borgs
J. Chayes
Levent Sagun
R. Zecchina
ODL
37
763
0
06 Nov 2016
Learning Continuous Semantic Representations of Symbolic Expressions
Learning Continuous Semantic Representations of Symbolic Expressions
Miltiadis Allamanis
Pankajan Chanthirasegaran
Pushmeet Kohli
Charles Sutton
CLL
NAI
31
99
0
04 Nov 2016
Deep Convolutional Neural Network Design Patterns
Deep Convolutional Neural Network Design Patterns
L. Smith
Nicholay Topin
AI4CE
OOD
38
59
0
02 Nov 2016
Optimization on Submanifolds of Convolution Kernels in CNNs
Optimization on Submanifolds of Convolution Kernels in CNNs
Mete Ozay
Takayuki Okatani
48
46
0
22 Oct 2016
Streaming Normalization: Towards Simpler and More Biologically-plausible
  Normalizations for Online and Recurrent Learning
Streaming Normalization: Towards Simpler and More Biologically-plausible Normalizations for Online and Recurrent Learning
Q. Liao
Kenji Kawaguchi
T. Poggio
16
28
0
19 Oct 2016
Temporal Ensembling for Semi-Supervised Learning
Temporal Ensembling for Semi-Supervised Learning
S. Laine
Timo Aila
UQCV
18
2,522
0
07 Oct 2016
Multiplicative LSTM for sequence modelling
Multiplicative LSTM for sequence modelling
Ben Krause
Liang Lu
Iain Murray
Steve Renals
27
208
0
26 Sep 2016
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
17
10,301
0
21 Jul 2016
Adjusting for Dropout Variance in Batch Normalization and Weight
  Initialization
Adjusting for Dropout Variance in Batch Normalization and Weight Initialization
Dan Hendrycks
Kevin Gimpel
ODL
16
9
0
08 Jul 2016
Previous
123...181920
Next