1607.06450
Layer Normalization
21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
Papers citing "Layer Normalization"
50 / 5,502 papers shown
Title
EraseReLU: A Simple Way to Ease the Training of Deep Convolution Neural Networks
Xuanyi Dong
Guoliang Kang
Kun Zhan
Yi Yang
11
16
0
22 Sep 2017
Learning to update Auto-associative Memory in Recurrent Neural Networks for Improving Sequence Memorization
Wei Zhang
Bowen Zhou
14
12
0
19 Sep 2017
Language Modeling with Highway LSTM
Gakuto Kurata
Bhuvana Ramabhadran
G. Saon
A. Sethy
AI4TS
21
38
0
19 Sep 2017
One-Shot Visual Imitation Learning via Meta-Learning
Chelsea Finn
Tianhe Yu
Tianhao Zhang
Pieter Abbeel
Sergey Levine
SSL
16
554
0
14 Sep 2017
Shifting Mean Activation Towards Zero with Bipolar Activation Functions
L. Eidnes
Arild Nøkland
26
18
0
12 Sep 2017
Proportionate gradient updates with PercentDelta
Sami Abu-El-Haija
36
7
0
24 Aug 2017
Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses
Ryan J. Lowe
Michael Noseworthy
Iulian Serban
Nicolas Angelard-Gontier
Yoshua Bengio
Joelle Pineau
11
370
0
23 Aug 2017
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Victor Campos
Brendan Jou
Xavier Giró-i-Nieto
Jordi Torres
Shih-Fu Chang
21
217
0
22 Aug 2017
Neural Machine Translation with Extended Context
Jörg Tiedemann
Yves Scherrer
6
249
0
20 Aug 2017
The Helsinki Neural Machine Translation System
Robert Östling
Yves Scherrer
Jörg Tiedemann
Gongbo Tang
Tommi Nieminen
DRL
11
20
0
20 Aug 2017
SMASH: One-Shot Model Architecture Search through HyperNetworks
Andrew Brock
Theodore Lim
J. Ritchie
Nick Weston
22
761
0
17 Aug 2017
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
35
2,824
0
09 Aug 2017
Regularizing and Optimizing LSTM Language Models
Stephen Merity
N. Keskar
R. Socher
60
1,091
0
07 Aug 2017
Revisiting Activation Regularization for Language RNNs
Stephen Merity
Bryan McCann
R. Socher
33
44
0
03 Aug 2017
The University of Edinburgh's Neural MT Systems for WMT17
Rico Sennrich
Alexandra Birch
Anna Currey
Ulrich Germann
Barry Haddow
Kenneth Heafield
Antonio Valerio Miceli Barone
Philip Williams
MoE
14
182
0
02 Aug 2017
An Effective Training Method For Deep Convolutional Neural Network
Yangzhou Jiang
Zeyang Dou
Qun Hao
Jie Cao
Kun Gao
Xi Chen
32
0
0
31 Jul 2017
Learning Algorithms for Active Learning
Philip Bachman
Alessandro Sordoni
Adam Trischler
VLM
21
155
0
31 Jul 2017
Photographic Image Synthesis with Cascaded Refinement Networks
Qifeng Chen
V. Koltun
26
946
0
28 Jul 2017
Dual Rectified Linear Units (DReLUs): A Replacement for Tanh Activation Functions in Quasi-Recurrent Neural Networks
Fréderic Godin
Jonas Degrave
J. Dambre
W. D. Neve
MU
15
46
0
25 Jul 2017
Improving Robustness of Feature Representations to Image Deformations using Powered Convolution in CNNs
Zhun Sun
Mete Ozay
Takayuki Okatani
16
3
0
25 Jul 2017
Deep Architectures for Neural Machine Translation
Antonio Valerio Miceli Barone
Jindřich Helcl
Rico Sennrich
Barry Haddow
Alexandra Birch
19
111
0
24 Jul 2017
Learning Transferable Architectures for Scalable Image Recognition
Barret Zoph
Vijay Vasudevan
Jonathon Shlens
Quoc V. Le
54
5,553
0
21 Jul 2017
Learning Visually Grounded Sentence Representations
Douwe Kiela
Alexis Conneau
Allan Jabri
Maximilian Nickel
SSL
21
69
0
19 Jul 2017
Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition
Taesup Kim
Inchul Song
Yoshua Bengio
28
65
0
19 Jul 2017
LIUM Machine Translation Systems for WMT17 News Translation Task
Mercedes García-Martínez
Ozan Caglayan
Walid Aransa
Adrien Bardet
Fethi Bougares
Loïc Barrault
26
15
0
14 Jul 2017
LIUM-CVC Submissions for WMT17 Multimodal Translation Task
Ozan Caglayan
Walid Aransa
Adrien Bardet
Mercedes García-Martínez
Fethi Bougares
Loïc Barrault
Marc Masana
Luis Herranz
Joost van de Weijer
28
57
0
14 Jul 2017
UTS submission to Google YouTube-8M Challenge 2017
Linchao Zhu
Yanbin Liu
Yi Yang
23
5
0
13 Jul 2017
Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration
Rouhollah Rahmatizadeh
P. Abolghasemi
Ladislau Bölöni
Sergey Levine
25
254
0
10 Jul 2017
Aggregating Frame-level Features for Large-Scale Video Classification
Shaoxiang Chen
Xi Wang
Yongyi Tang
Xinpeng Chen
Zuxuan Wu
Yu-Gang Jiang
18
22
0
04 Jul 2017
Multiscale sequence modeling with a learned dictionary
B. V. Merrienboer
Amartya Sanyal
Hugo Larochelle
Yoshua Bengio
26
10
0
03 Jul 2017
The YouTube-8M Kaggle Competition: Challenges and Methods
Haosheng Zou
Kun Xu
Jialian Li
Jun Zhu
8
13
0
28 Jun 2017
Encoding Video and Label Priors for Multi-label Video Classification on YouTube-8M dataset
Seil Na
Youngjae Yu
Sangho Lee
Jisung Kim
Gunhee Kim
19
12
0
24 Jun 2017
L2 Regularization versus Batch and Weight Normalization
Twan van Laarhoven
8
292
0
16 Jun 2017
One Model To Learn Them All
Lukasz Kaiser
Aidan Gomez
Noam M. Shazeer
Ashish Vaswani
Niki Parmar
Llion Jones
Jakob Uszkoreit
VLM
ViT
19
333
0
16 Jun 2017
Hardware-efficient on-line learning through pipelined truncated-error backpropagation in binary-state networks
H. Elsayed
Bruno U. Pedroni
Sadique Sheik
Gert Cauwenberghs
21
8
0
15 Jun 2017
Plan, Attend, Generate: Character-level Neural Machine Translation with Planning in the Decoder
Çağlar Gülçehre
Francis Dutil
Adam Trischler
Yoshua Bengio
11
7
0
13 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
21
127,963
0
12 Jun 2017
Online Learning for Neural Machine Translation Post-editing
Álvaro Peris
L. Cebrián
F. Casacuberta
OnRL
19
33
0
10 Jun 2017
Depthwise Separable Convolutions for Neural Machine Translation
Lukasz Kaiser
Aidan Gomez
François Chollet
33
278
0
09 Jun 2017
Self-Normalizing Neural Networks
G. Klambauer
Thomas Unterthiner
Andreas Mayr
Sepp Hochreiter
76
2,483
0
08 Jun 2017
Deep Learning: Generalization Requires Deep Compositional Feature Space Design
Mrinal Haloi
MLT
OOD
11
3
0
06 Jun 2017
Parameter Space Noise for Exploration
Matthias Plappert
Rein Houthooft
Prafulla Dhariwal
Szymon Sidor
Richard Y. Chen
Xi Chen
Tamim Asfour
Pieter Abbeel
Marcin Andrychowicz
15
593
0
06 Jun 2017
NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems
Ozan Caglayan
Mercedes García-Martínez
Adrien Bardet
Walid Aransa
Fethi Bougares
Loïc Barrault
27
65
0
01 Jun 2017
Fisher GAN
Youssef Mroueh
Tom Sercu
GAN
AI4CE
22
132
0
26 May 2017
Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs
Jean Maillard
S. Clark
Dani Yogatama
26
87
0
25 May 2017
Flow-GAN: Combining Maximum Likelihood and Adversarial Learning in Generative Models
Aditya Grover
Manik Dhar
Stefano Ermon
GAN
39
24
0
24 May 2017
Adaptive Detrending to Accelerate Convolutional Gated Recurrent Unit Training for Contextual Video Recognition
Minju Jung
Haanvid Lee
Jun Tani
AI4TS
25
42
0
24 May 2017
Fast-Slow Recurrent Neural Networks
Asier Mujika
Florian Meier
Angelika Steger
14
76
0
24 May 2017
Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit
B. Maginnis
Pierre Harvey Richemond
AI4TS
16
1
0
23 May 2017
Diminishing Batch Normalization
Yintai Ma
Diego Klabjan
28
15
0
22 May 2017