ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1607.06450
  4. Cited By
Layer Normalization

Layer Normalization

21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
ArXivPDFHTML

Papers citing "Layer Normalization"

50 / 5,515 papers shown
Title
Rethinking the Usage of Batch Normalization and Dropout in the Training
  of Deep Neural Networks
Rethinking the Usage of Batch Normalization and Dropout in the Training of Deep Neural Networks
Guangyong Chen
Pengfei Chen
Yujun Shi
Chang-Yu Hsieh
B. Liao
Shengyu Zhang
OOD
11
80
0
15 May 2019
Online Normalization for Training Neural Networks
Online Normalization for Training Neural Networks
Vitaliy Chiley
I. Sharapov
Atli Kosson
Urs Koster
R. Reece
S. D. L. Fuente
Vishal Subbiah
Michael James
OnRL
26
55
0
15 May 2019
Language Modeling with Deep Transformers
Language Modeling with Deep Transformers
Kazuki Irie
Albert Zeyer
Ralf Schluter
Hermann Ney
KELM
46
171
0
10 May 2019
Data-dependent Sample Complexity of Deep Neural Networks via Lipschitz
  Augmentation
Data-dependent Sample Complexity of Deep Neural Networks via Lipschitz Augmentation
Colin Wei
Tengyu Ma
25
109
0
09 May 2019
Deep Closest Point: Learning Representations for Point Cloud
  Registration
Deep Closest Point: Learning Representations for Point Cloud Registration
Yue Wang
Justin Solomon
3DPC
34
840
0
08 May 2019
MetaPred: Meta-Learning for Clinical Risk Prediction with Limited
  Patient Electronic Health Records
MetaPred: Meta-Learning for Clinical Risk Prediction with Limited Patient Electronic Health Records
Xi Sheryl Zhang
Fengyi Tang
H. H. Dodge
Jiayu Zhou
Fei Wang
14
108
0
08 May 2019
ShapeGlot: Learning Language for Shape Differentiation
ShapeGlot: Learning Language for Shape Differentiation
Panos Achlioptas
Judy Fan
Robert D. Hawkins
Noah D. Goodman
Leonidas J. Guibas
36
83
0
08 May 2019
Local Light Field Fusion: Practical View Synthesis with Prescriptive
  Sampling Guidelines
Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines
B. Mildenhall
Pratul P. Srinivasan
Rodrigo Ortiz Cayon
N. Kalantari
R. Ramamoorthi
Ren Ng
Abhishek Kar
34
486
0
02 May 2019
Investigation of F0 conditioning and Fully Convolutional Networks in
  Variational Autoencoder based Voice Conversion
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion
Wen-Chin Huang
Yi-Chiao Wu
Chen-Chou Lo
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
Tomoki Toda
Yu Tsao
H. Wang
DRL
27
13
0
02 May 2019
Learn to synthesize and synthesize to learn
Learn to synthesize and synthesize to learn
Behzad Bozorgtabar
Mohammad Saeed Rad
H. K. Ekenel
Jean-Philippe Thiran
GAN
CVBM
6
19
0
01 May 2019
Very Deep Self-Attention Networks for End-to-End Speech Recognition
Very Deep Self-Attention Networks for End-to-End Speech Recognition
Ngoc-Quan Pham
T. Nguyen
Jan Niehues
Markus Müller
Sebastian Stüker
A. Waibel
28
161
0
30 Apr 2019
A self-attention based deep learning method for lesion attribute
  detection from CT reports
A self-attention based deep learning method for lesion attribute detection from CT reports
Yifan Peng
Ke Yan
V. Sandfort
Ronald M. Summers
Zhiyong Lu
MedIm
27
18
0
30 Apr 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
31
227
0
25 Apr 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural
  Speaker Separation
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation
Yuzhou Liu
DeLiang Wang
32
157
0
25 Apr 2019
Learning Discriminative Features Via Weights-biased Softmax Loss
Learning Discriminative Features Via Weights-biased Softmax Loss
Xiaobin Li
Weiqiang Wang
22
22
0
25 Apr 2019
Generating Long Sequences with Sparse Transformers
Generating Long Sequences with Sparse Transformers
R. Child
Scott Gray
Alec Radford
Ilya Sutskever
36
1,854
0
23 Apr 2019
Switchable Whitening for Deep Representation Learning
Switchable Whitening for Deep Representation Learning
Xingang Pan
Xiaohang Zhan
Jianping Shi
Xiaoou Tang
Ping Luo
32
149
0
22 Apr 2019
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for
  Natural Language Understanding
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
FedML
13
181
0
20 Apr 2019
Unifying Question Answering, Text Classification, and Regression via
  Span Extraction
Unifying Question Answering, Text Classification, and Regression via Span Extraction
N. Keskar
Bryan McCann
Caiming Xiong
R. Socher
BDL
16
21
0
19 Apr 2019
Code-Switching for Enhancing NMT with Pre-Specified Translation
Code-Switching for Enhancing NMT with Pre-Specified Translation
Kai Song
Yue Zhang
Heng Yu
Weihua Luo
Kun Wang
Min Zhang
35
116
0
19 Apr 2019
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
Shijie Wu
Mark Dredze
VLM
SSeg
44
670
0
19 Apr 2019
Deep Parametric Shape Predictions using Distance Fields
Deep Parametric Shape Predictions using Distance Fields
Dmitriy Smirnov
Matthew Fisher
Vladimir G. Kim
Richard Y. Zhang
Justin Solomon
22
54
0
18 Apr 2019
Neural-Attention-Based Deep Learning Architectures for Modeling Traffic
  Dynamics on Lane Graphs
Neural-Attention-Based Deep Learning Architectures for Modeling Traffic Dynamics on Lane Graphs
Matthew A. Wright
Simon F. G. Ehlers
R. Horowitz
AI4CE
GNN
14
4
0
18 Apr 2019
Modulating Image Restoration with Continual Levels via Adaptive Feature
  Modification Layers
Modulating Image Restoration with Continual Levels via Adaptive Feature Modification Layers
Jingwen He
Chao Dong
Yu Qiao
SupR
33
91
0
17 Apr 2019
Neural Message Passing for Multi-Label Classification
Neural Message Passing for Multi-Label Classification
Jack Lanchantin
Arshdeep Sekhon
Yanjun Qi
33
38
0
17 Apr 2019
Doc2EDAG: An End-to-End Document-level Framework for Chinese Financial
  Event Extraction
Doc2EDAG: An End-to-End Document-level Framework for Chinese Financial Event Extraction
Shun Zheng
Wei Cao
Wenyuan Xu
Jiang Bian
15
163
0
16 Apr 2019
Personalized Re-ranking for Recommendation
Personalized Re-ranking for Recommendation
Changhua Pei
Yi Zhang
Yongfeng Zhang
Fei Sun
Xiao Lin
Hanxiao Sun
Jian Wu
Peng Jiang
Wenwu Ou
OffRL
25
6
0
15 Apr 2019
BERT4Rec: Sequential Recommendation with Bidirectional Encoder
  Representations from Transformer
BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer
Fei Sun
Jun Liu
Jian Wu
Changhua Pei
Xiao Lin
Wenwu Ou
Peng Jiang
BDL
HAI
17
2,099
0
14 Apr 2019
M2H-GAN: A GAN-based Mapping from Machine to Human Transcripts for
  Speech Understanding
M2H-GAN: A GAN-based Mapping from Machine to Human Transcripts for Speech Understanding
Titouan Parcollet
Mohamed Morchid
Xavier Bost
G. Linarès
GAN
17
2
0
13 Apr 2019
EvalNorm: Estimating Batch Normalization Statistics for Evaluation
EvalNorm: Estimating Batch Normalization Statistics for Evaluation
Saurabh Singh
Abhinav Shrivastava
26
51
0
12 Apr 2019
Deep Learning for System Trace Restoration
Deep Learning for System Trace Restoration
Ilia Sucholutsky
Apurva Narayan
Matthias Schonlau
S. Fischmeister
AI4TS
8
7
0
10 Apr 2019
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning
  from Unknown Cameras
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras
A. Gordon
Hanhan Li
Rico Jonschkowski
A. Angelova
MDE
30
363
0
10 Apr 2019
Unsupervised Recurrent Neural Network Grammars
Unsupervised Recurrent Neural Network Grammars
Yoon Kim
Alexander M. Rush
Lei Yu
A. Kuncoro
Chris Dyer
Gábor Melis
LRM
RALM
SSL
32
115
0
07 Apr 2019
SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for
  Unsupervised Abstractive Sentence Compression
SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression
Christos Baziotis
Ion Androutsopoulos
Ioannis Konstas
Alexandros Potamianos
25
83
0
07 Apr 2019
Instance-Level Meta Normalization
Instance-Level Meta Normalization
Songhao Jia
Ding-Jie Chen
Hwann-Tzong Chen
24
20
0
06 Apr 2019
Iterative Normalization: Beyond Standardization towards Efficient
  Whitening
Iterative Normalization: Beyond Standardization towards Efficient Whitening
Lei Huang
Yi Zhou
Fan Zhu
Li Liu
Ling Shao
32
140
0
06 Apr 2019
Modeling Point Clouds with Self-Attention and Gumbel Subset Sampling
Modeling Point Clouds with Self-Attention and Gumbel Subset Sampling
Jiancheng Yang
Qiang Zhang
Bingbing Ni
Linguo Li
Jinxian Liu
Mengdie Zhou
Qi Tian
3DPC
38
379
0
06 Apr 2019
Jasper: An End-to-End Convolutional Neural Acoustic Model
Jasper: An End-to-End Convolutional Neural Acoustic Model
Jason Chun Lok Li
Vitaly Lavrukhin
Boris Ginsburg
Ryan Leary
Oleksii Kuchaiev
Jonathan M. Cohen
Huyen Nguyen
R. Gadde
DRL
VLM
AuLLM
19
264
0
05 Apr 2019
Convolutional Self-Attention Networks
Convolutional Self-Attention Networks
Baosong Yang
Longyue Wang
Derek F. Wong
Lidia S. Chao
Zhaopeng Tu
24
124
0
05 Apr 2019
Modeling Recurrence for Transformer
Modeling Recurrence for Transformer
Jie Hao
Xing Wang
Baosong Yang
Longyue Wang
Jinfeng Zhang
Zhaopeng Tu
45
85
0
05 Apr 2019
Regularizing Activation Distribution for Training Binarized Deep
  Networks
Regularizing Activation Distribution for Training Binarized Deep Networks
Ruizhou Ding
Ting-Wu Chin
Z. Liu
Diana Marculescu
MQ
30
145
0
04 Apr 2019
Learning to Reason: Leveraging Neural Networks for Approximate DNF
  Counting
Learning to Reason: Leveraging Neural Networks for Approximate DNF Counting
Ralph Abboud
.Ismail .Ilkan Ceylan
Thomas Lukasiewicz
38
28
0
04 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable
  Convolutions
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
28
95
0
04 Apr 2019
Differentiable Sampling with Flexible Reference Word Order for Neural
  Machine Translation
Differentiable Sampling with Flexible Reference Word Order for Neural Machine Translation
Weijia Xu
Xing Niu
Marine Carpuat
26
10
0
04 Apr 2019
Constrained Generative Adversarial Networks for Interactive Image
  Generation
Constrained Generative Adversarial Networks for Interactive Image Generation
Eric Heim
GAN
28
26
0
03 Apr 2019
Towards annotation-efficient segmentation via image-to-image translation
Towards annotation-efficient segmentation via image-to-image translation
Eugene Vorontsov
Pavlo Molchanov
Christopher Beckham
Jan Kautz
Samuel Kadoury
MedIm
20
13
0
02 Apr 2019
Regional Homogeneity: Towards Learning Transferable Universal
  Adversarial Perturbations Against Defenses
Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses
Yingwei Li
S. Bai
Cihang Xie
Zhenyu A. Liao
Xiaohui Shen
Alan Yuille
AAML
47
50
0
01 Apr 2019
Making Neural Machine Reading Comprehension Faster
Making Neural Machine Reading Comprehension Faster
Debajyoti Chatterjee
AIMat
21
9
0
29 Mar 2019
Micro-Batch Training with Batch-Channel Normalization and Weight
  Standardization
Micro-Batch Training with Batch-Channel Normalization and Weight Standardization
Siyuan Qiao
Huiyu Wang
Chenxi Liu
Wei Shen
Alan Yuille
MQ
32
144
0
25 Mar 2019
Fine-tune BERT for Extractive Summarization
Fine-tune BERT for Extractive Summarization
Yang Liu
17
479
0
25 Mar 2019
Previous
123...100101102...109110111
Next