Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.06450
Cited By
Layer Normalization
21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Layer Normalization"
50 / 5,502 papers shown
Title
Self-Attentional Acoustic Models
Matthias Sperber
Jan Niehues
Graham Neubig
Sebastian Stüker
A. Waibel
22
151
0
26 Mar 2018
Low-Resource Speech-to-Text Translation
Sameer Bansal
Herman Kamper
Karen Livescu
Adam Lopez
Sharon Goldwater
19
56
0
24 Mar 2018
Group Normalization
Yuxin Wu
Kaiming He
45
3,596
0
22 Mar 2018
Attention, Learn to Solve Routing Problems!
W. Kool
H. V. Hoof
Max Welling
28
1,175
0
22 Mar 2018
A Comprehensive Analysis of Deep Regression
Stéphane Lathuilière
Pablo Mesejo
Xavier Alameda-Pineda
Radu Horaud
BDL
16
294
0
22 Mar 2018
Stacked Cross Attention for Image-Text Matching
Kuang-Huei Lee
Xi Chen
G. Hua
Houdong Hu
Xiaodong He
30
1,140
0
21 Mar 2018
Learning Dynamic Memory Networks for Object Tracking
Tianyu Yang
Antoni B. Chan
27
284
0
20 Mar 2018
SentEval: An Evaluation Toolkit for Universal Sentence Representations
Alexis Conneau
Douwe Kiela
37
630
0
14 Mar 2018
Learning to Explore with Meta-Policy Gradient
Tianbing Xu
Qiang Liu
Liang Zhao
Jian Peng
14
54
0
13 Mar 2018
Fast Decoding in Sequence Models using Discrete Latent Variables
Łukasz Kaiser
Aurko Roy
Ashish Vaswani
Niki Parmar
Samy Bengio
Jakob Uszkoreit
Noam M. Shazeer
16
230
0
09 Mar 2018
Compositional Attention Networks for Machine Reasoning
Drew A. Hudson
Christopher D. Manning
BDL
OOD
LRM
32
572
0
08 Mar 2018
Self-Attention with Relative Position Representations
Peter Shaw
Jakob Uszkoreit
Ashish Vaswani
17
2,251
0
06 Mar 2018
Norm matters: efficient and accurate normalization schemes in deep networks
Elad Hoffer
Ron Banner
Itay Golan
Daniel Soudry
OffRL
17
178
0
05 Mar 2018
Neural Architectures for Open-Type Relation Argument Extraction
Benjamin Roth
Costanza Conforti
Nina Poerner
Sanjeev Kumar Karn
Hinrich Schütze
BDL
13
13
0
05 Mar 2018
Memorization Precedes Generation: Learning Unsupervised GANs with Memory Networks
Youngjin Kim
Minjung Kim
Gunhee Kim
GAN
26
40
0
05 Mar 2018
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
DRL
42
4,724
0
04 Mar 2018
Not All Samples Are Created Equal: Deep Learning with Importance Sampling
Angelos Katharopoulos
F. Fleuret
9
507
0
02 Mar 2018
Ring loss: Convex Feature Normalization for Face Recognition
Yutong Zheng
Dipan K. Pal
Marios Savvides
CVBM
19
198
0
28 Feb 2018
Simultaneously Self-Attending to All Mentions for Full-Abstract Biological Relation Extraction
Pat Verga
Emma Strubell
Andrew McCallum
11
255
0
28 Feb 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
35
445
0
28 Feb 2018
Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse Coding
Dong Liu
Ke Sun
Zhangyang Wang
Runsheng Liu
Zhengjun Zha
24
12
0
28 Feb 2018
L1-Norm Batch Normalization for Efficient Training of Deep Neural Networks
Shuang Wu
Guoqi Li
Lei Deng
Liu Liu
Yuan Xie
Luping Shi
17
117
0
27 Feb 2018
Train Feedfoward Neural Network with Layer-wise Adaptive Rate via Approximating Back-matching Propagation
Huishuai Zhang
Wei-neng Chen
Tie-Yan Liu
17
6
0
27 Feb 2018
Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network
Zizhao Zhang
Yuanpu Xie
L. Yang
EGVM
32
304
0
26 Feb 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
C. Riquelme
George Tucker
Jasper Snoek
BDL
41
365
0
26 Feb 2018
Disentangling by Factorising
Hyunjik Kim
A. Mnih
CoGe
OOD
12
1,327
0
16 Feb 2018
Spectral Normalization for Generative Adversarial Networks
Takeru Miyato
Toshiki Kataoka
Masanori Koyama
Yuichi Yoshida
ODL
38
4,399
0
16 Feb 2018
Image Transformer
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
ViT
42
1,660
0
15 Feb 2018
CNN+LSTM Architecture for Speech Emotion Recognition with Data Augmentation
Caroline Etienne
Guillaume Fidanza
Andrei Petrovskii
Laurence Devillers
B. Schmauch
19
99
0
15 Feb 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
14
11,481
0
15 Feb 2018
Learning a SAT Solver from Single-Bit Supervision
Daniel Selsam
Matthew Lamm
Benedikt Bünz
Percy Liang
L. D. Moura
D. Dill
NAI
14
416
0
11 Feb 2018
Batch Kalman Normalization: Towards Training Deep Neural Networks with Micro-Batches
Guangrun Wang
Jiefeng Peng
Ping Luo
Xinjiang Wang
Liang Lin
29
18
0
09 Feb 2018
Generating Triples with Adversarial Networks for Scene Graph Construction
Matthew Klawonn
Eric Heim
GAN
GNN
32
22
0
07 Feb 2018
Clustering and Unsupervised Anomaly Detection with L2 Normalized Deep Auto-Encoder Representations
Çağlar Aytekin
Xingyang Ni
Francesco Cricri
Emre B. Aksu
SSL
UQCV
18
146
0
01 Feb 2018
Discrete Autoencoders for Sequence Models
Lukasz Kaiser
Samy Bengio
BDL
22
49
0
29 Jan 2018
Statistically Motivated Second Order Pooling
Kaicheng Yu
Mathieu Salzmann
24
42
0
23 Jan 2018
Composite Functional Gradient Learning of Generative Adversarial Models
Rie Johnson
Tong Zhang
GAN
35
14
0
19 Jan 2018
Evidential Occupancy Grid Map Augmentation using Deep Learning
Sascha Wirges
Felix Hartenbach
Christoph Stiller
17
24
0
16 Jan 2018
Unsupervised Cipher Cracking Using Discrete GANs
Aidan Gomez
Sicong Huang
Ivan Zhang
Bryan M. Li
Muhammad Osama
Lukasz Kaiser
GAN
25
59
0
15 Jan 2018
Predicting Movie Genres Based on Plot Summaries
Q. Hoang
15
24
0
15 Jan 2018
Predicting Future Lane Changes of Other Highway Vehicles using RNN-based Deep Models
Sajan Patel
Brent A. Griffin
Kristofer D. Kusano
Jason J. Corso
19
30
0
12 Jan 2018
Deep Episodic Memory: Encoding, Recalling, and Predicting Episodic Experiences for Robot Action Execution
Jonas Rothfuss
Fabio Ferreira
E. Aksoy
You Zhou
Tamim Asfour
38
37
0
12 Jan 2018
Enhancing Underwater Imagery using Generative Adversarial Networks
C. Fabbri
M. Islam
Junaed Sattar
GAN
24
561
0
11 Jan 2018
Model-Based Action Exploration for Learning Dynamic Motion Skills
Glen Berseth
M. van de Panne
25
0
0
11 Jan 2018
Improved English to Russian Translation by Neural Suffix Prediction
Kai Song
Yue Zhang
Min Zhang
Weihua Luo
16
10
0
11 Jan 2018
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
27
1,096
0
02 Jan 2018
CNN Is All You Need
Qiming Chen
R. Wu
AIMat
8
17
0
27 Dec 2017
Find the Conversation Killers: a Predictive Study of Thread-ending Posts
Yunhao Jiao
Cheng Li
Fei Wu
Qiaozhu Mei
14
17
0
22 Dec 2017
On the Diversity of Realistic Image Synthesis
Zichen Yang
Haifeng Liu
Deng Cai
27
6
0
20 Dec 2017
A Flexible Approach to Automated RNN Architecture Generation
Martin Schrimpf
Stephen Merity
James Bradbury
R. Socher
21
15
0
20 Dec 2017
Previous
1
2
3
...
106
107
108
109
110
111
Next