Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.06450
Cited By
Layer Normalization
21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Layer Normalization"
50 / 5,515 papers shown
Title
DAWN: Dual Augmented Memory Network for Unsupervised Video Object Tracking
Zhenmei Shi
Haoyang Fang
Yu-Wing Tai
Chi-Keung Tang
21
2
0
02 Aug 2019
Retrosynthesis with Attention-Based NMT Model and Chemical Analysis of the "Wrong" Predictions
H. Duan
Ling Wang
Chengyun Zhang
Jianjun Li
22
29
0
02 Aug 2019
Universal Transforming Geometric Network
Jin Li
20
9
0
02 Aug 2019
An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation
Vincent Michalski
Vikram S. Voleti
Samira Ebrahimi Kahou
Anthony Ortiz
Pascal Vincent
C. Pal
Doina Precup
BDL
30
6
0
31 Jul 2019
On Mutual Information Maximization for Representation Learning
Michael Tschannen
Josip Djolonga
Paul Kishan Rubenstein
Sylvain Gelly
Mario Lucic
SSL
43
487
0
31 Jul 2019
Expectation-Maximization Attention Networks for Semantic Segmentation
Xia Li
Zhisheng Zhong
Jianlong Wu
Yibo Yang
Zhouchen Lin
Hong Liu
3DV
3DPC
14
553
0
31 Jul 2019
Representation Degeneration Problem in Training Natural Language Generation Models
Jun Gao
Di He
Xu Tan
Tao Qin
Liwei Wang
Tie-Yan Liu
15
263
0
28 Jul 2019
Weakly Supervised Domain Detection
Yumo Xu
Mirella Lapata
35
9
0
26 Jul 2019
Expressive Graph Informer Networks
Jaak Simm
Adam Arany
E. Brouwer
Yves Moreau
GNN
28
2
0
25 Jul 2019
DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks
Zehui Lin
Pengfei Liu
Luyao Huang
Junkun Chen
Xipeng Qiu
Xuanjing Huang
3DPC
16
44
0
25 Jul 2019
Adaptive Noise Injection: A Structure-Expanding Regularization for RNN
Rui Li
Kai Shuang
Mengyu Gu
Sen Su
17
0
0
25 Jul 2019
U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation
Junho Kim
Minjae Kim
Hyeonwoo Kang
Kwanghee Lee
ViT
6
551
0
25 Jul 2019
SpanBERT: Improving Pre-training by Representing and Predicting Spans
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
75
1,945
0
24 Jul 2019
Channel Normalization in Convolutional Neural Network avoids Vanishing Gradients
Zhenwei Dai
Reinhard Heckel
12
24
0
22 Jul 2019
Switchable Normalization for Learning-to-Normalize Deep Representation
Ping Luo
Ruimao Zhang
Jiamin Ren
Zhanglin Peng
Jingyu Li
30
73
0
22 Jul 2019
Construct Dynamic Graphs for Hand Gesture Recognition via Spatial-Temporal Attention
Yuxiao Chen
Long Zhao
Xi Peng
Jianbo Yuan
Dimitris N. Metaxas
21
87
0
20 Jul 2019
SUMBT: Slot-Utterance Matching for Universal and Scalable Belief Tracking
Hwaran Lee
Jinsik Lee
Tae-Yoon Kim
12
162
0
17 Jul 2019
Single-bit-per-weight deep convolutional neural networks without batch-normalization layers for embedded systems
Mark D Mcdonnell
Hesham Mostafa
Runchun Wang
Andre van Schaik
MQ
36
2
0
16 Jul 2019
Meta-Learning for Black-box Optimization
T. Vishnu
Pankaj Malhotra
Jyoti Narwariya
L. Vig
Gautam M. Shroff
18
18
0
16 Jul 2019
Adversarial Video Generation on Complex Datasets
Aidan Clark
Jeff Donahue
Karen Simonyan
VGen
GAN
29
74
0
15 Jul 2019
Visual Tracking via Dynamic Memory Networks
Tianyu Yang
Antoni B. Chan
26
55
0
12 Jul 2019
Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling
Yuping Luo
Huazhe Xu
Tengyu Ma
SSL
26
13
0
12 Jul 2019
R-Transformer: Recurrent Neural Network Enhanced Transformer
Z. Wang
Yao Ma
Zitao Liu
Jiliang Tang
ViT
24
105
0
12 Jul 2019
Privileged Features Distillation at Taobao Recommendations
Chen Xu
Quan Li
Junfeng Ge
Jinyang Gao
Xiaoyong Yang
Changhua Pei
Fei Sun
Jian Wu
Hanxiao Sun
Wenwu Ou
21
67
0
11 Jul 2019
Order and Chaos: NTK views on DNN Normalization, Checkerboard and Boundary Artifacts
Arthur Jacot
Franck Gabriel
François Ged
Clément Hongler
19
23
0
11 Jul 2019
Positional Normalization
Boyi Li
Felix Wu
Kilian Q. Weinberger
Serge J. Belongie
24
91
0
09 Jul 2019
Multilingual Universal Sentence Encoder for Semantic Retrieval
Yinfei Yang
Daniel Cer
Amin Ahmad
Mandy Guo
Jax Law
...
Steve Yuan
Chris Tar
Yun-hsuan Sung
B. Strope
R. Kurzweil
3DV
37
475
0
09 Jul 2019
Learning to Optimize Domain Specific Normalization for Domain Generalization
Seonguk Seo
Yumin Suh
Dongwan Kim
Geeho Kim
Jongwoo Han
Bohyung Han
AI4CE
36
243
0
09 Jul 2019
Mean Spectral Normalization of Deep Neural Networks for Embedded Automation
Anand Subramanian
N. Chong
17
2
0
09 Jul 2019
A Bi-directional Transformer for Musical Chord Recognition
Jonggwon Park
Kyoyun Choi
Sungwook Jeon
Dokyun Kim
Jonghun Park
24
38
0
05 Jul 2019
ACNe: Attentive Context Normalization for Robust Permutation-Equivariant Learning
Weiwei Sun
Wei Jiang
Eduard Trulls
Andrea Tagliasacchi
K. M. Yi
3DPC
22
20
0
04 Jul 2019
Dimensional Reweighting Graph Convolutional Networks
Xu Zou
Qiuye Jia
Jianwei Zhang
Chang Zhou
Hongxia Yang
Jie Tang
GNN
28
8
0
04 Jul 2019
AMI-Net+: A Novel Multi-Instance Neural Network for Medical Diagnosis from Incomplete and Imbalanced Data
Zeyuan Wang
Josiah Poon
S. Poon
22
7
0
03 Jul 2019
Compositional Structure Learning for Sequential Video Data
Kyoung-Woon On
Eun-Sol Kim
Y. Heo
Byoung-Tak Zhang
19
1
0
03 Jul 2019
Augmenting Self-attention with Persistent Memory
Sainbayar Sukhbaatar
Edouard Grave
Guillaume Lample
Hervé Jégou
Armand Joulin
RALM
KELM
29
135
0
02 Jul 2019
A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning
Yo Joong Choe
Jiyeon Ham
Kyubyong Park
Yeoil Yoon
39
81
0
02 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
22
111
0
02 Jul 2019
Disentangled Makeup Transfer with Generative Adversarial Network
Honglun Zhang
Wenqing Chen
Hao He
Yaohui Jin
GAN
31
26
0
02 Jul 2019
Few-Shot Representation Learning for Out-Of-Vocabulary Words
Ziniu Hu
Ting-Li Chen
Kai-Wei Chang
Yizhou Sun
31
76
0
01 Jul 2019
Multilingual Bottleneck Features for Query by Example Spoken Term Detection
Dhananjay Ram
Lesly Miculicich
H. Bourlard
11
17
0
30 Jun 2019
Learning Manifold Patch-Based Representations of Man-Made Shapes
Dmitriy Smirnov
Mikhail Bessmeltsev
Justin Solomon
SSL
3DPC
31
18
0
28 Jun 2019
GNN-FiLM: Graph Neural Networks with Feature-wise Linear Modulation
Marc Brockschmidt
30
134
0
28 Jun 2019
Localizing Unseen Activities in Video via Image Query
Zhu Zhang
Zhou Zhao
Zhijie Lin
Jingkuan Song
Deng Cai
ViT
21
13
0
28 Jun 2019
ARMIN: Towards a More Efficient and Light-weight Recurrent Memory Network
Zhangheng Li
Jia-Xing Zhong
Jingjia Huang
Tao Zhang
Thomas H. Li
Ge Li
25
2
0
28 Jun 2019
A Concise Model for Multi-Criteria Chinese Word Segmentation with Transformer Encoder
Xipeng Qiu
Hengzhi Pei
Hang Yan
Xuanjing Huang
14
13
0
28 Jun 2019
RUSLAN: Russian Spoken Language Corpus for Speech Synthesis
Lenar Gabdrakhmanov
Rustem Garaev
E. Razinkov
31
9
0
26 Jun 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu
Jun Yu
Yuhao Cui
Dacheng Tao
Q. Tian
36
798
0
25 Jun 2019
Compound Probabilistic Context-Free Grammars for Grammar Induction
Yoon Kim
Chris Dyer
Alexander M. Rush
11
151
0
24 Jun 2019
Learning Waveform-Based Acoustic Models using Deep Variational Convolutional Neural Networks
Dino Oglic
Zoran Cvetkovic
Peter Sollich
BDL
12
8
0
23 Jun 2019
Stacked Capsule Autoencoders
Adam R. Kosiorek
S. Sabour
Yee Whye Teh
Geoffrey E. Hinton
OCL
19
262
0
17 Jun 2019
Previous
1
2
3
...
98
99
100
...
109
110
111
Next