Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.06450
Cited By
Layer Normalization
21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Layer Normalization"
50 / 5,517 papers shown
Title
Audiovisual Transformer Architectures for Large-Scale Classification and Synchronization of Weakly Labeled Audio Events
Wim Boes
Hugo Van hamme
21
17
0
02 Dec 2019
Multi-Scale Self-Attention for Text Classification
Qipeng Guo
Xipeng Qiu
Pengfei Liu
Xiangyang Xue
Zheng Zhang
ViT
15
63
0
02 Dec 2019
ST-GRAT: A Novel Spatio-temporal Graph Attention Network for Accurately Forecasting Dynamically Changing Road Speed
Cheonbok Park
Chunggi Lee
Hyojin Bahng
Taeyun Won
Kihwan Kim
Seungmin Jin
Sungahn Ko
Jaegul Choo
GNN
AI4TS
16
33
0
29 Nov 2019
Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization
B. Ruff
Taylor Beck
Joscha Bach
14
3
0
29 Nov 2019
Orthogonal Wasserstein GANs
J. Müller
Reinhard Klein
Michael Weinmann
58
9
0
29 Nov 2019
Contrastive Learning of Structured World Models
Thomas Kipf
Elise van der Pol
Max Welling
OCL
DRL
28
278
0
27 Nov 2019
ConCare: Personalized Clinical Feature Embedding via Capturing the Healthcare Context
Liantao Ma
Chaohe Zhang
Yasha Wang
Wenjie Ruan
Jiantao Wang
Wen Tang
Xinyu Ma
Xin Gao
Junyi Gao
25
152
0
27 Nov 2019
Orthogonal Convolutional Neural Networks
Jiayun Wang
Yubei Chen
Rudrasis Chakraborty
Stella X. Yu
27
188
0
27 Nov 2019
Music Source Separation in the Waveform Domain
Alexandre Défossez
Nicolas Usunier
Léon Bottou
Francis R. Bach
16
266
0
27 Nov 2019
AdaSample: Adaptive Sampling of Hard Positives for Descriptor Learning
Xinyu Zhang
Le Zhang
Zao-Yi Zheng
Yun-Hai Liu
Jiawang Bian
Ming-Ming Cheng
19
6
0
27 Nov 2019
Adversarial Deep Reinforcement Learning based Adaptive Moving Target Defense
Taha Eghtesad
Yevgeniy Vorobeychik
Aron Laszka
AAML
21
8
0
27 Nov 2019
Autoencoding Undirected Molecular Graphs With Neural Networks
Jeppe Johan Waarkjaer Olsen
Peter Ebert Christensen
Martin Hangaard Hansen
alexander rosenberg johansen
AI4CE
19
0
0
26 Nov 2019
Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs
Van-Quang Nguyen
Masanori Suganuma
Takayuki Okatani
16
7
0
26 Nov 2019
Region Normalization for Image Inpainting
Tao Yu
Zongyu Guo
Xin Jin
Shilin Wu
Zhibo Chen
Weiping Li
Zhizheng Zhang
Sen Liu
26
182
0
23 Nov 2019
Unsupervised Keyword Extraction for Full-sentence VQA
Kohei Uehara
Tatsuya Harada
30
1
0
23 Nov 2019
TreeGen: A Tree-Based Transformer Architecture for Code Generation
Zeyu Sun
Qihao Zhu
Yingfei Xiong
Yican Sun
Lili Mou
Lu Zhang
25
174
0
22 Nov 2019
Rethinking Normalization and Elimination Singularity in Neural Networks
Siyuan Qiao
Huiyu Wang
Chenxi Liu
Wei Shen
Alan Yuille
22
10
0
21 Nov 2019
Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Networks
Saurabh Singh
Shankar Krishnan
UQCV
74
126
0
21 Nov 2019
Approximated Orthonormal Normalisation in Training Neural Networks
Guoqiang Zhang
Kenta Niwa
W. Kleijn
11
3
0
21 Nov 2019
Controlling Neural Machine Translation Formality with Synthetic Supervision
Xing Niu
Marine Carpuat
43
35
0
20 Nov 2019
End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures
Gabriel Synnaeve
Qiantong Xu
Jacob Kahn
Tatiana Likhomanenko
Edouard Grave
Vineel Pratap
Anuroop Sriram
Vitaliy Liptchinsky
R. Collobert
SSL
AI4TS
36
246
0
19 Nov 2019
Attention-Privileged Reinforcement Learning
Sasha Salter
Dushyant Rao
Markus Wulfmeier
R. Hadsell
Ingmar Posner
23
8
0
19 Nov 2019
Neural Network based End-to-End Query by Example Spoken Term Detection
Dhananjay Ram
Lesly Miculicich
H. Bourlard
22
25
0
19 Nov 2019
Implicit Regularization and Convergence for Weight Normalization
Xiaoxia Wu
Yan Sun
Tongzheng Ren
Shanshan Wu
Zhiyuan Li
Suriya Gunasekar
Rachel A. Ward
Qiang Liu
28
21
0
18 Nov 2019
Multi-task Sentence Encoding Model for Semantic Retrieval in Question Answering Systems
Qiang Huang
Jianhui Bu
Weijian Xie
Shengwen Yang
Weijia Wu
Liping Liu
32
17
0
18 Nov 2019
Understanding and Improving Layer Normalization
Jingjing Xu
Xu Sun
Zhiyuan Zhang
Guangxiang Zhao
Junyang Lin
FAtt
47
342
0
16 Nov 2019
Improving Graph Neural Network Representations of Logical Formulae with Subgraph Pooling
Mayank Agarwal
Ibrahim Abdelaziz
Cristina Cornelio
Veronika Thost
Lingfei Wu
Kenneth D. Forbus
Achille Fokoue
NAI
AI4CE
GNN
116
36
0
15 Nov 2019
Sequential Recommendation with Relation-Aware Kernelized Self-Attention
Mingi Ji
Weonyoung Joo
Kyungwoo Song
Yoon-Yeong Kim
Il-Chul Moon
AI4TS
24
30
0
15 Nov 2019
Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA
Ronghang Hu
Amanpreet Singh
Trevor Darrell
Marcus Rohrbach
32
195
0
14 Nov 2019
Understanding the Disharmony between Weight Normalization Family and Weight Decay:
ε
−
ε-
ε
−
shifted
L
2
L_2
L
2
Regularizer
Li Xiang
Chen Shuo
Xia Yan
Yang Jian
34
2
0
14 Nov 2019
ZiMM: a deep learning model for long term and blurry relapses with non-clinical claims data
A. Kabeshova
Yiyang Yu
Bertrand Lukacs
Emmanuel Bacry
Stéphane Gaïffas
VLM
MedIm
22
2
0
13 Nov 2019
word2ket: Space-efficient Word Embeddings inspired by Quantum Entanglement
Ali (Aliakbar) Panahi
Seyran Saeedi
Tom Arodz
21
29
0
12 Nov 2019
Attending to Entities for Better Text Understanding
Pengxiang Cheng
K. Erk
LRM
24
37
0
11 Nov 2019
BP-Transformer: Modelling Long-Range Context via Binary Partitioning
Zihao Ye
Qipeng Guo
Quan Gan
Xipeng Qiu
Zheng Zhang
28
77
0
11 Nov 2019
RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers
Bailin Wang
Richard Shin
Xiaodong Liu
Oleksandr Polozov
Matthew Richardson
22
574
0
10 Nov 2019
Improving Transformer Models by Reordering their Sublayers
Ofir Press
Noah A. Smith
Omer Levy
22
87
0
10 Nov 2019
Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks
Trapit Bansal
Rishikesh Jha
Andrew McCallum
SSL
21
118
0
10 Nov 2019
Speaker Adaptation for Attention-Based End-to-End Speech Recognition
Zhong Meng
Yashesh Gaur
Jinyu Li
Jiawei Liu
29
38
0
09 Nov 2019
Recurrent Neural Network Transducer for Audio-Visual Speech Recognition
Takaki Makino
H. Liao
Yannis Assael
Brendan Shillingford
Basi García
Otavio Braga
Olivier Siohan
32
129
0
08 Nov 2019
Lipschitz Constrained Parameter Initialization for Deep Transformers
Hongfei Xu
Qiuhui Liu
Josef van Genabith
Deyi Xiong
Jingyi Zhang
ODL
31
26
0
08 Nov 2019
Ruminating Word Representations with Random Noised Masker
Hwiyeol Jo
Byoung-Tak Zhang
48
0
0
08 Nov 2019
Turbo Autoencoder: Deep learning based channel codes for point-to-point communication channels
Yihan Jiang
Hyeji Kim
Himanshu Asnani
Sreeram Kannan
Sewoong Oh
Pramod Viswanath
35
134
0
08 Nov 2019
Sequence-Aware Factorization Machines for Temporal Predictive Analytics
Tong Chen
Hongzhi Yin
Quoc Viet Hung Nguyen
Wen-Chih Peng
Xue Li
Xiaofang Zhou
11
63
0
07 Nov 2019
Making the Best Use of Review Summary for Sentiment Analysis
Sen Yang
Leyang Cui
Jun Xie
Yue Zhang
33
0
0
07 Nov 2019
Asynchronous Online Federated Learning for Edge Devices with Non-IID Data
Yujing Chen
Yue Ning
Martin Slawski
Huzefa Rangwala
FedML
26
56
0
05 Nov 2019
Learning One-Shot Imitation from Humans without Humans
Alessandro Bonardi
Stephen James
Andrew J. Davison
32
79
0
04 Nov 2019
Image-Conditioned Graph Generation for Road Network Extraction
Davide Belli
Thomas Kipf
GNN
24
40
0
31 Oct 2019
An Augmented Transformer Architecture for Natural Language Generation Tasks
Hailiang Li
Adele Y. C. Wang
Yang Liu
Du Tang
Zhibin Lei
Wenye Li
ViT
21
12
0
30 Oct 2019
Ordered Memory
Songlin Yang
Shawn Tan
Seyedarian Hosseini
Zhouhan Lin
Alessandro Sordoni
Aaron Courville
24
23
0
29 Oct 2019
Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis
Mingrui Yuan
Z. Duan
9
1
0
29 Oct 2019
Previous
1
2
3
...
95
96
97
...
109
110
111
Next