Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.06450
Cited By
Layer Normalization
21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Layer Normalization"
50 / 5,515 papers shown
Title
Improving speech emotion recognition via Transformer-based Predictive Coding through transfer learning
Zheng Lian
Ya Li
J. Tao
Jian Huang
ViT
13
18
0
11 Nov 2018
Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series
Qiang Zhang
Kyle Birkeland
Yaodong Yang
Yixiao Liu
33
9
0
08 Nov 2018
Neural Phrase-to-Phrase Machine Translation
Jiangtao Feng
Lingpeng Kong
Po-Sen Huang
Chong-Jun Wang
Da Huang
Jiayuan Mao
Kan Qiao
Dengyong Zhou
AIMat
24
14
0
06 Nov 2018
Compact Personalized Models for Neural Machine Translation
Joern Wuebker
A. Paz
N. Ravid
VLM
21
56
0
05 Nov 2018
Bi-Directional Differentiable Input Reconstruction for Low-Resource Neural Machine Translation
Xing Niu
Weijia Xu
Marine Carpuat
19
17
0
02 Nov 2018
Abstractive Summarization of Reddit Posts with Multi-level Memory Networks
Byeongchang Kim
Hyunwoo J. Kim
Gunhee Kim
31
182
0
02 Nov 2018
Towards Linear Time Neural Machine Translation with Capsule Networks
Mingxuan Wang
Jun Xie
Zhixing Tan
Jinsong Su
Deyi Xiong
Lei Li
AIMat
24
27
0
01 Nov 2018
Hybrid Self-Attention Network for Machine Translation
Kaitao Song
Tan Xu
Furong Peng
Jianfeng Lu
21
12
0
01 Nov 2018
Dial2Desc: End-to-end Dialogue Description Generation
Haojie Pan
Junpei Zhou
Zhou Zhao
Yan Liu
Deng Cai
Min Yang
VLM
18
14
0
01 Nov 2018
A task in a suit and a tie: paraphrase generation with semantic augmentation
Su Wang
Rahul Gupta
Nancy Chang
Jason Baldridge
19
65
0
31 Oct 2018
Convolutional Self-Attention Network
Baosong Yang
Longyue Wang
Derek F. Wong
Lidia S. Chao
Zhaopeng Tu
BDL
19
11
0
31 Oct 2018
Machine Translation between Vietnamese and English: an Empirical Study
Hong-Hai Phan-Vu
Viet-Trung Tran
V. Nguyen
Hoang-Vu Dang
Phan-Thuan Do
20
17
0
30 Oct 2018
A Simple Recurrent Unit with Reduced Tensor Product Representations
Shuai Tang
P. Smolensky
V. D. Sa
22
2
0
29 Oct 2018
A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
Akhilesh Deepak Gotmare
N. Keskar
Caiming Xiong
R. Socher
ODL
19
275
0
29 Oct 2018
Towards Principled Uncertainty Estimation for Deep Neural Networks
Richard E. Harang
Ethan M. Rudd
BDL
UQCV
36
6
0
29 Oct 2018
Convolutional neural networks with extra-classical receptive fields
Brian Hu
Stefan Mihalas
20
3
0
27 Oct 2018
One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
Tianhe Yu
Pieter Abbeel
Sergey Levine
Chelsea Finn
13
68
0
25 Oct 2018
Batch Normalization Sampling
Zhaodong Chen
Lei Deng
Guoqi Li
Jiawei Sun
Xing Hu
Xin Ma
Yuan Xie
21
0
0
25 Oct 2018
Exploiting Deep Representations for Neural Machine Translation
Zi-Yi Dou
Zhaopeng Tu
Xing Wang
Shuming Shi
Tong Zhang
44
92
0
24 Oct 2018
Ain't Nobody Got Time For Coding: Structure-Aware Program Synthesis From Natural Language
J. Bednarek
K. Piaskowski
K. Krawiec
6
12
0
23 Oct 2018
Towards Universal Dialogue State Tracking
Liliang Ren
Kaige Xie
Lu Chen
Kai Yu
16
121
0
22 Oct 2018
A Fully Attention-Based Information Retriever
Alvaro H. C. Correia
Jorge Luiz Moreira Silva
Thiago de C. Martins
Fabio Gagliardi Cozman
36
4
0
22 Oct 2018
Lightweight Convolutional Approaches to Reading Comprehension on SQuAD
T. Bell
Benjamin Penchas
14
3
0
19 Oct 2018
Investigating Object Compositionality in Generative Adversarial Networks
Sjoerd van Steenkiste
Karol Kurach
Jürgen Schmidhuber
Sylvain Gelly
GAN
OCL
29
20
0
17 Oct 2018
Sequence-to-Sequence Acoustic Modeling for Voice Conversion
Jing-Xuan Zhang
Zhenhua Ling
Li-Juan Liu
Yuan Jiang
Lirong Dai
19
129
0
16 Oct 2018
MeanSum: A Neural Model for Unsupervised Multi-document Abstractive Summarization
Eric Chu
Peter J. Liu
20
19
0
12 Oct 2018
One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
T. Paine
Sergio Gomez Colmenarejo
Ziyun Wang
Scott E. Reed
Y. Aytar
...
Matthew W. Hoffman
Gabriel Barth-Maron
Serkan Cabi
David Budden
Nando de Freitas
OffRL
22
26
0
11 Oct 2018
Image Captioning as Neural Machine Translation Task in SOCKEYE
Loris Bazzani
Tobias Domhan
Felix Hieber
VLM
19
2
0
09 Oct 2018
Task-Embedded Control Networks for Few-Shot Imitation Learning
Stephen James
Michael Bloesch
Andrew J. Davison
38
135
0
08 Oct 2018
On Self Modulation for Generative Adversarial Networks
Ting Chen
Mario Lucic
N. Houlsby
Sylvain Gelly
GAN
27
104
0
02 Oct 2018
Improving Sentence Representations with Consensus Maximisation
Shuai Tang
V. D. Sa
SSL
AI4TS
37
4
0
02 Oct 2018
Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks
Juho Lee
Yoonho Lee
Jungtaek Kim
Adam R. Kosiorek
Seungjin Choi
Yee Whye Teh
23
274
0
01 Oct 2018
Open-Ended Content-Style Recombination Via Leakage Filtering
Karl Ridgeway
Michael C. Mozer
DRL
VLM
22
2
0
28 Sep 2018
SALSA-TEXT : self attentive latent space based adversarial text generation
Jules Gagnon-Marchand
Hamed Sadeghi
Md. Akmal Haidar
Mehdi Rezagholizadeh
19
18
0
28 Sep 2018
Learning Robust, Transferable Sentence Representations for Text Classification
Wasi Uddin Ahmad
Xueying Bai
Nanyun Peng
Kai-Wei Chang
AI4TS
OOD
25
5
0
28 Sep 2018
Batch-normalized Recurrent Highway Networks
Chi Zhang
Thang Nguyen
Shagan Sah
R. Ptucha
A. Loui
C. Salvaggio
25
8
0
26 Sep 2018
Utilizing Class Information for Deep Network Representation Shaping
Daeyoung Choi
Wonjong Rhee
16
2
0
25 Sep 2018
Learning to Read by Spelling: Towards Unsupervised Text Recognition
Ankush Gupta
Andrea Vedaldi
Andrew Zisserman
SSL
25
20
0
23 Sep 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
69
1,754
0
20 Sep 2018
Latent Topic Conversational Models
Tsung-Hsien Wen
Minh-Thang Luong
BDL
24
4
0
19 Sep 2018
Removing the Feature Correlation Effect of Multiplicative Noise
Zijun Zhang
Yining Zhang
Zongpeng Li
13
8
0
19 Sep 2018
Learning Universal Sentence Representations with Mean-Max Attention Autoencoder
Minghua Zhang
Yunfang Wu
W. Li
Wei Li
SSL
27
27
0
18 Sep 2018
Switching Isotropic and Directional Exploration with Parameter Space Noise in Deep Reinforcement Learning
Izumi Karino
Kazutoshi Tanaka
Ryuma Niiyama
Yasuo Kuniyoshi
19
3
0
18 Sep 2018
A Domain Agnostic Normalization Layer for Unsupervised Adversarial Domain Adaptation
Rob Romijnders
Panagiotis Meletis
Gijs Dubbelman
AI4CE
21
27
0
14 Sep 2018
Zero-Shot Cross-lingual Classification Using Multilingual Neural Machine Translation
Akiko Eriguchi
Melvin Johnson
Orhan Firat
Hideto Kazawa
Wolfgang Macherey
22
62
0
12 Sep 2018
Multitask Learning on Graph Neural Networks: Learning Multiple Graph Centrality Measures with a Unified Network
Pedro H. C. Avelar
Henrique Lemos
Marcelo O. R. Prates
Luís C. Lamb
45
17
0
11 Sep 2018
Unsupervised Stylish Image Description Generation via Domain Layer Norm
Cheng Kuan Chen
Zhufeng Pan
Min Sun
Ming Liu
23
29
0
11 Sep 2018
Normalization in Training U-Net for 2D Biomedical Semantic Segmentation
Xiao-Yun Zhou
Guang-Zhong Yang
18
77
0
11 Sep 2018
Exploiting Invertible Decoders for Unsupervised Sentence Representation Learning
Shuai Tang
V. D. Sa
SSL
13
1
0
08 Sep 2018
Learning to Solve NP-Complete Problems - A Graph Neural Network for Decision TSP
Marcelo O. R. Prates
Pedro H. C. Avelar
Henrique Lemos
Luís C. Lamb
Moshe Y. Vardi
GNN
30
176
0
08 Sep 2018
Previous
1
2
3
...
103
104
105
...
109
110
111
Next