arXiv: 1607.06450
Layer Normalization

21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton

Papers citing "Layer Normalization"

50 / 5,502 papers shown
Low-Shot Learning with Imprinted Weights
Hang Qi
Matthew A. Brown
D. Lowe
VLM
15
12
0
19 Dec 2017
Video Object Detection with an Aligned Spatial-Temporal Memory
Fanyi Xiao
Yong Jae Lee
49
189
0
18 Dec 2017
Deep Neural Generative Model of Functional MRI Images for Psychiatric Disorder Diagnosis
Takashi Matsubara
T. Tashiro
K. Uehara
MedIm
21
42
0
18 Dec 2017
Sockeye: A Toolkit for Neural Machine Translation
Felix Hieber
Tobias Domhan
Michael J. Denkowski
David Vilar
Artem Sokolov
Ann Clifton
Matt Post
13
215
0
15 Dec 2017
The exploding gradient problem demystified - definition, prevalence, impact, origin, tradeoffs, and solutions
George Philipp
D. Song
J. Carbonell
ODL
35
46
0
15 Dec 2017
Deep Prior
Alexandre Lacoste
Thomas Boquet
Negar Rostamzadeh
Boris N. Oreshkin
Wonchang Chung
David M. Krueger
SSL
UQCV
VLM
OOD
BDL
18
7
0
13 Dec 2017
Stochastic Answer Networks for Machine Reading Comprehension
Xiaodong Liu
Yelong Shen
Kevin Duh
Jianfeng Gao
RALM
10
197
0
10 Dec 2017
Modulating and attending the source image during encoding improves Multimodal Translation
Jean-Benoit Delbrouck
Stéphane Dupont
17
20
0
09 Dec 2017
Semi-Supervised Learning with IPM-based GANs: an Empirical Study
Tom Sercu
Youssef Mroueh
GAN
25
1
0
07 Dec 2017
Distance-based Self-Attention Network for Natural Language Inference
Jinbae Im
Sungzoon Cho
43
76
0
06 Dec 2017
Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Yujun Lin
Song Han
Huizi Mao
Yu Wang
W. Dally
44
1,386
0
05 Dec 2017
Deep Semantic Role Labeling with Self-Attention
Zhixing Tan
Mingxuan Wang
Jun Xie
Yidong Chen
X. Shi
27
308
0
05 Dec 2017
Face Translation between Images and Videos using Identity-aware CycleGAN
Zhiwu Huang
Bernhard Kratzwald
D. Paudel
Jiqing Wu
Luc Van Gool
CVBM
37
6
0
04 Dec 2017
Improving Visually Grounded Sentence Representations with Self-Attention
Kang Min Yoo
Youhyun Shin
Sang-goo Lee
29
5
0
02 Dec 2017
Improving Video Generation for Multi-functional Applications
Bernhard Kratzwald
Zhiwu Huang
D. Paudel
Acharya Dinesh
Luc Van Gool
DiffM
37
13
0
30 Nov 2017
Automating Vehicles by Deep Reinforcement Learning using Task Separation with Hill Climbing
M. Plessen
19
13
0
29 Nov 2017
AttGAN: Facial Attribute Editing by Only Changing What You Want
Zhenliang He
W. Zuo
Meina Kan
Shiguang Shan
Xilin Chen
GAN
CVBM
36
699
0
29 Nov 2017
Plan, Attend, Generate: Planning for Sequence-to-Sequence Models
Francis Dutil
Çağlar Gülçehre
Adam Trischler
Yoshua Bengio
23
12
0
28 Nov 2017
Large-scale Point Cloud Semantic Segmentation with Superpoint Graphs
Loic Landrieu
M. Simonovsky
GNN
3DPC
92
1,240
0
27 Nov 2017
Visual Feature Attribution using Wasserstein GANs
Christian F. Baumgartner
Lisa M. Koch
K. Tezcan
Jia Xi Ang
E. Konukoglu
GAN
MedIm
44
145
0
24 Nov 2017
Wasserstein Introspective Neural Networks
Kwonjoon Lee
Weijian Xu
Fan Fan
Z. Tu
27
57
0
24 Nov 2017
Exploiting temporal information for 3D pose estimation
Mir Rayat Imtiaz Hossain
James J. Little
3DH
16
308
0
23 Nov 2017
Hello Edge: Keyword Spotting on Microcontrollers
Yundong Zhang
Naveen Suda
Liangzhen Lai
Vikas Chandra
27
429
0
20 Nov 2017
Run, skeleton, run: skeletal model in a physics-based simulation
Mikhail Pavlov
Sergey Kolesnikov
Sergey Plis
AI4CE
18
14
0
18 Nov 2017
Learning to Find Good Correspondences
K. M. Yi
Eduard Trulls
Y. Ono
Vincent Lepetit
Mathieu Salzmann
Pascal Fua
3DV
14
477
0
16 Nov 2017
Data Augmentation Generative Adversarial Networks
Antreas Antoniou
Amos Storkey
Harrison Edwards
MedIm
GAN
65
1,066
0
12 Nov 2017
The Lifted Matrix-Space Model for Semantic Composition
Woojin Chung
Sheng-Fu Wang
Samuel R. Bowman
37
6
0
09 Nov 2017
Compression-aware Training of Deep Networks
J. Álvarez
Mathieu Salzmann
21
172
0
07 Nov 2017
Single Image Super-Resolution Using Lightweight CNN with Maxout Units
Jae-Seok Choi
Munchurl Kim
SupR
30
2
0
07 Nov 2017
Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net
Anirudh Goyal
Nan Rosemary Ke
Surya Ganguli
Yoshua Bengio
DiffM
35
55
0
07 Nov 2017
Weighted Transformer Network for Machine Translation
Karim Ahmed
N. Keskar
R. Socher
27
133
0
06 Nov 2017
Wider and Deeper, Cheaper and Faster: Tensorized LSTMs for Sequence Learning
Zhen He
Shaobing Gao
Liang Xiao
Daxue Liu
Hangen He
David Barber
AIMat
37
64
0
05 Nov 2017
Neural Language Modeling by Jointly Learning Syntax and Lexicon
Yikang Shen
Zhouhan Lin
Chin-Wei Huang
Aaron Courville
38
178
0
02 Nov 2017
TasNet: time-domain audio separation network for real-time, single-channel speech separation
Yi Luo
N. Mesgarani
19
621
0
01 Nov 2017
Regularization for Deep Learning: A Taxonomy
J. Kukačka
Vladimir Golkov
Daniel Cremers
33
335
0
29 Oct 2017
Speeding up Context-based Sentence Representation Learning with Non-autoregressive Convolutional Decoding
Shuai Tang
Hailin Jin
Chen Fang
Zhaowen Wang
V. D. Sa
SSL
19
6
0
28 Oct 2017
Progressive Growing of GANs for Improved Quality, Stability, and Variation
Tero Karras
Timo Aila
S. Laine
J. Lehtinen
GAN
68
7,281
0
27 Oct 2017
Rotational Unit of Memory
Rumen Dangovski
L. Jing
Marin Soljacic
11
7
0
26 Oct 2017
Relative Transfer Function Inverse Regression from Low Dimensional Manifold
Ziteng Wang
Emmanuel Vincent
Yonghong Yan
19
1
0
25 Oct 2017
ContextVP: Fully Context-Aware Video Prediction
Wonmin Byeon
Qin Wang
R. Srivastava
Petros Koumoutsakos
28
8
0
23 Oct 2017
Attending to All Mention Pairs for Full Abstract Biological Relation Extraction
Pat Verga
Emma Strubell
O. Shai
Andrew McCallum
3DV
16
11
0
23 Oct 2017
Multi-Task Domain Adaptation for Deep Learning of Instance Grasping from Simulation
Kuan Fang
Yunfei Bai
Stefan Hinterstoißer
Silvio Savarese
Mrinal Kalakrishnan
OOD
25
116
0
17 Oct 2017
Searching for Activation Functions
Prajit Ramachandran
Barret Zoph
Quoc V. Le
27
599
0
16 Oct 2017
Low-Rank RNN Adaptation for Context-Aware Language Modeling
Aaron Jaech
Mari Ostendorf
25
25
0
06 Oct 2017
Projection Based Weight Normalization for Deep Neural Networks
Lei Huang
Xianglong Liu
B. Lang
Bo-wen Li
28
18
0
06 Oct 2017
Dilated Recurrent Neural Networks
Shiyu Chang
Yang Zhang
Wei Han
Mo Yu
Xiaoxiao Guo
Wei Tan
Xiaodong Cui
Michael Witbrock
M. Hasegawa-Johnson
Thomas S. Huang
35
298
0
05 Oct 2017
Training Feedforward Neural Networks with Standard Logistic Activations is Feasible
Emanuele Sansone
F. D. De Natale
24
4
0
03 Oct 2017
Generative Adversarial Mapping Networks
Jianbo Guo
Guangxiang Zhu
Jian Li
GAN
20
3
0
28 Sep 2017
Riemannian approach to batch normalization
Minhyung Cho
Jaehyung Lee
29
93
0
27 Sep 2017
Comparison of Batch Normalization and Weight Normalization Algorithms for the Large-scale Image Classification
Igor Gitman
Boris Ginsburg
8
65
0
24 Sep 2017