Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.06450
Cited By
Layer Normalization
21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Layer Normalization"
50 / 5,513 papers shown
Title
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Lionel Blondé
Alexandros Kalousis
GAN
13
47
0
06 Sep 2018
Dual Ask-Answer Network for Machine Reading Comprehension
Hang Xiao
Feng Wang
Jianfeng Yan
Jingyao Zheng
24
8
0
06 Sep 2018
Document-Level Neural Machine Translation with Hierarchical Attention Networks
Lesly Miculicich
Dhananjay Ram
Nikolaos Pappas
James Henderson
AIMat
26
267
0
05 Sep 2018
Parameter Sharing Methods for Multilingual Self-Attentional Translation Models
Devendra Singh Sachan
Graham Neubig
MoE
42
114
0
01 Sep 2018
Microsoft's Submission to the WMT2018 News Translation Task: How I Learned to Stop Worrying and Love the Data
Marcin Junczys-Dowmunt
13
38
0
01 Sep 2018
Contextual Encoding for Translation Quality Estimation
Junjie Hu
Wei-Cheng Chang
Yuexin Wu
Graham Neubig
22
3
0
01 Sep 2018
A Self-Attention Network for Hierarchical Data Structures with an Application to Claims Management
Leander Löw
Martin Spindler
E. Brechmann
11
0
0
30 Aug 2018
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
Colin Cherry
George F. Foster
Ankur Bapna
Orhan Firat
Wolfgang Macherey
25
94
0
29 Aug 2018
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
Wen-Chin Huang
Hsin-Te Hwang
Yu-Huai Peng
Yu Tsao
H. Wang
27
43
0
29 Aug 2018
The University of Cambridge's Machine Translation Systems for WMT18
Felix Stahlberg
Adria de Gispert
Bill Byrne
21
20
0
28 Aug 2018
Semi-Autoregressive Neural Machine Translation
Chunqi Wang
Ji Zhang
Haiqing Chen
19
88
0
26 Aug 2018
Self-Attentive Sequential Recommendation
Wang-Cheng Kang
Julian McAuley
HAI
BDL
12
2,355
0
20 Aug 2018
A Simple Convolutional Generative Network for Next Item Recommendation
Fajie Yuan
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
Xiangnan He
31
547
0
15 Aug 2018
Collapse of Deep and Narrow Neural Nets
Lu Lu
Yanhui Su
George Karniadakis
ODL
27
153
0
15 Aug 2018
MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics
Xinchen Yan
Akash Rastogi
Ruben Villegas
Kalyan Sunkavalli
Eli Shechtman
Sunil Hadap
Ersin Yumer
Honglak Lee
30
150
0
14 Aug 2018
Ancient-Modern Chinese Translation with a Large Training Dataset
Dayiheng Liu
Jiancheng Lv
Kexin Yang
Qian Qu
24
13
0
11 Aug 2018
Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks
Nikola B. Kovachki
Andrew M. Stuart
BDL
44
136
0
10 Aug 2018
CT Super-resolution GAN Constrained by the Identical, Residual, and Cycle Learning Ensemble(GAN-CIRCLE)
Chenyu You
Guang Li
Yi Zhang
Xiaoliu Zhang
Hongming Shan
...
Zhuiyang Zhang
W. Cong
Michael W. Vannier
P. Saha
Ge Wang
MedIm
GAN
SupR
21
400
0
10 Aug 2018
Character-Level Language Modeling with Deeper Self-Attention
Rami Al-Rfou
Dokook Choe
Noah Constant
Mandy Guo
Llion Jones
33
387
0
09 Aug 2018
Deep Factorised Inverse-Sketching
Kaiyue Pang
Da Li
Jifei Song
Yi-Zhe Song
Tao Xiang
Timothy M. Hospedales
24
18
0
07 Aug 2018
A Review of Learning with Deep Generative Models from Perspective of Graphical Modeling
Zhijian Ou
31
16
0
05 Aug 2018
Visual Reasoning with Multi-hop Feature Modulation
Florian Strub
Mathieu Seurin
Ethan Perez
H. D. Vries
Jérémie Mary
Philippe Preux
Aaron Courville
Olivier Pietquin
28
26
0
03 Aug 2018
News Session-Based Recommendations using Deep Neural Networks
Gabriel de Souza P. Moreira
F. Ferreira
A. Cunha
12
80
0
31 Jul 2018
Doubly Attentive Transformer Machine Translation
Hasan Sait Arslan
Mark Fishel
G. Anbarjafari
35
13
0
30 Jul 2018
Human Motion Analysis with Deep Metric Learning
Huseyin Coskun
D. Tan
Sailesh Conjeti
Nassir Navab
Federico Tombari
11
49
0
30 Jul 2018
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
62
700
0
29 Jul 2018
Effectiveness of Scaled Exponentially-Regularized Linear Units (SERLUs)
Guoqiang Zhang
Hao Li
11
10
0
26 Jul 2018
Iterative Amortized Inference
Joseph Marino
Yisong Yue
Stephan Mandt
BDL
DRL
26
165
0
24 Jul 2018
Recurrent Neural Networks for Long and Short-Term Sequential Recommendation
Kiewan Villatel
E. Smirnova
Jérémie Mary
Philippe Preux
HAI
11
26
0
23 Jul 2018
Recent Advances in Deep Learning: An Overview
Matiur Rahman Minar
Jibon Naher
VLM
29
116
0
21 Jul 2018
Learning Heuristics for Quantified Boolean Formulas through Deep Reinforcement Learning
Gil Lederman
M. Rabe
Edward A. Lee
S. Seshia
13
38
0
20 Jul 2018
An Efficient End-to-End Neural Model for Handwritten Text Recognition
Arindam Chowdhury
L. Vig
21
80
0
20 Jul 2018
Scheduling Computation Graphs of Deep Learning Models on Manycore CPUs
Linpeng Tang
Yida Wang
Theodore L. Willke
Kai Li
GNN
21
22
0
16 Jul 2018
A Large-Scale Study on Regularization and Normalization in GANs
Karol Kurach
Mario Lucic
Xiaohua Zhai
Marcin Michalski
Sylvain Gelly
AI4CE
33
155
0
12 Jul 2018
Universal Transformers
Mostafa Dehghani
Stephan Gouws
Oriol Vinyals
Jakob Uszkoreit
Lukasz Kaiser
37
745
0
10 Jul 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
53
10,050
0
10 Jul 2018
Revisiting the Hierarchical Multiscale LSTM
Ákos Kádár
Marc-Alexandre Côté
Grzegorz Chrupała
A. Alishahi
22
13
0
10 Jul 2018
Position-aware Self-attention with Relative Positional Encodings for Slot Filling
I. Bilan
Benjamin Roth
23
22
0
09 Jul 2018
Robust and Scalable Differentiable Neural Computer for Question Answering
Jörg Franke
Jan Niehues
A. Waibel
OOD
21
24
0
07 Jul 2018
Product-based Neural Networks for User Response Prediction over Multi-field Categorical Data
Yanru Qu
Bohui Fang
Weinan Zhang
Ruiming Tang
Minzhe Niu
Huifeng Guo
Yong Yu
Xiuqiang He
17
179
0
01 Jul 2018
Differentiable Learning-to-Normalize via Switchable Normalization
Ping Luo
Jiamin Ren
Zhanglin Peng
Ruimao Zhang
Jingyu Li
11
176
0
28 Jun 2018
Adaptive Blending Units: Trainable Activation Functions for Deep Neural Networks
L. R. Sütfeld
Flemming Brieger
Holger Finger
S. Füllhase
G. Pipa
28
28
0
26 Jun 2018
Color Constancy by Reweighting Image Feature Maps
J. Qiu
Haisong Xu
Z. Ye
14
22
0
25 Jun 2018
Countdown Regression: Sharp and Calibrated Survival Predictions
Anand Avati
Tony Duan
Sharon Zhou
Kenneth Jung
N. Shah
A. Ng
22
55
0
21 Jun 2018
DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural Architectures
Jin-Dong Dong
A. Cheng
Da-Cheng Juan
Wei Wei
Min Sun
25
181
0
21 Jun 2018
BFGAN: Backward and Forward Generative Adversarial Networks for Lexically Constrained Sentence Generation
Dayiheng Liu
Jie Fu
Qian Qu
Jiancheng Lv
GAN
14
35
0
21 Jun 2018
The Natural Language Decathlon: Multitask Learning as Question Answering
Bryan McCann
N. Keskar
Caiming Xiong
R. Socher
AIMat
MLLM
BDL
25
641
0
20 Jun 2018
Uncertainty in Multitask Transfer Learning
Alexandre Lacoste
Boris N. Oreshkin
Wonchang Chung
Thomas Boquet
Negar Rostamzadeh
David M. Krueger
BDL
UQCV
SSL
24
21
0
20 Jun 2018
Learning to Update for Object Tracking with Recurrent Meta-learner
Bi Li
Wenxuan Xie
Wenjun Zeng
Wenyu Liu
27
25
0
19 Jun 2018
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin
Linhao Dong
Shiyu Zhou
Wei Chen
Bo Xu
24
22
0
17 Jun 2018
Previous
1
2
3
...
104
105
106
...
109
110
111
Next