ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1607.06450
  4. Cited By
Layer Normalization

Layer Normalization

21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
ArXivPDFHTML

Papers citing "Layer Normalization"

50 / 5,513 papers shown
Title
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Sample-Efficient Imitation Learning via Generative Adversarial Nets
Lionel Blondé
Alexandros Kalousis
GAN
13
47
0
06 Sep 2018
Dual Ask-Answer Network for Machine Reading Comprehension
Dual Ask-Answer Network for Machine Reading Comprehension
Hang Xiao
Feng Wang
Jianfeng Yan
Jingyao Zheng
24
8
0
06 Sep 2018
Document-Level Neural Machine Translation with Hierarchical Attention
  Networks
Document-Level Neural Machine Translation with Hierarchical Attention Networks
Lesly Miculicich
Dhananjay Ram
Nikolaos Pappas
James Henderson
AIMat
26
267
0
05 Sep 2018
Parameter Sharing Methods for Multilingual Self-Attentional Translation
  Models
Parameter Sharing Methods for Multilingual Self-Attentional Translation Models
Devendra Singh Sachan
Graham Neubig
MoE
42
114
0
01 Sep 2018
Microsoft's Submission to the WMT2018 News Translation Task: How I
  Learned to Stop Worrying and Love the Data
Microsoft's Submission to the WMT2018 News Translation Task: How I Learned to Stop Worrying and Love the Data
Marcin Junczys-Dowmunt
13
38
0
01 Sep 2018
Contextual Encoding for Translation Quality Estimation
Contextual Encoding for Translation Quality Estimation
Junjie Hu
Wei-Cheng Chang
Yuexin Wu
Graham Neubig
22
3
0
01 Sep 2018
A Self-Attention Network for Hierarchical Data Structures with an
  Application to Claims Management
A Self-Attention Network for Hierarchical Data Structures with an Application to Claims Management
Leander Löw
Martin Spindler
E. Brechmann
11
0
0
30 Aug 2018
Revisiting Character-Based Neural Machine Translation with Capacity and
  Compression
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
Colin Cherry
George F. Foster
Ankur Bapna
Orhan Firat
Wolfgang Macherey
25
94
0
29 Aug 2018
Voice Conversion Based on Cross-Domain Features Using Variational Auto
  Encoders
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders
Wen-Chin Huang
Hsin-Te Hwang
Yu-Huai Peng
Yu Tsao
H. Wang
27
43
0
29 Aug 2018
The University of Cambridge's Machine Translation Systems for WMT18
The University of Cambridge's Machine Translation Systems for WMT18
Felix Stahlberg
Adria de Gispert
Bill Byrne
21
20
0
28 Aug 2018
Semi-Autoregressive Neural Machine Translation
Semi-Autoregressive Neural Machine Translation
Chunqi Wang
Ji Zhang
Haiqing Chen
19
88
0
26 Aug 2018
Self-Attentive Sequential Recommendation
Self-Attentive Sequential Recommendation
Wang-Cheng Kang
Julian McAuley
HAI
BDL
12
2,355
0
20 Aug 2018
A Simple Convolutional Generative Network for Next Item Recommendation
A Simple Convolutional Generative Network for Next Item Recommendation
Fajie Yuan
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
Xiangnan He
31
547
0
15 Aug 2018
Collapse of Deep and Narrow Neural Nets
Collapse of Deep and Narrow Neural Nets
Lu Lu
Yanhui Su
George Karniadakis
ODL
27
153
0
15 Aug 2018
MT-VAE: Learning Motion Transformations to Generate Multimodal Human
  Dynamics
MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics
Xinchen Yan
Akash Rastogi
Ruben Villegas
Kalyan Sunkavalli
Eli Shechtman
Sunil Hadap
Ersin Yumer
Honglak Lee
30
150
0
14 Aug 2018
Ancient-Modern Chinese Translation with a Large Training Dataset
Ancient-Modern Chinese Translation with a Large Training Dataset
Dayiheng Liu
Jiancheng Lv
Kexin Yang
Qian Qu
24
13
0
11 Aug 2018
Ensemble Kalman Inversion: A Derivative-Free Technique For Machine
  Learning Tasks
Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks
Nikola B. Kovachki
Andrew M. Stuart
BDL
44
136
0
10 Aug 2018
CT Super-resolution GAN Constrained by the Identical, Residual, and
  Cycle Learning Ensemble(GAN-CIRCLE)
CT Super-resolution GAN Constrained by the Identical, Residual, and Cycle Learning Ensemble(GAN-CIRCLE)
Chenyu You
Guang Li
Yi Zhang
Xiaoliu Zhang
Hongming Shan
...
Zhuiyang Zhang
W. Cong
Michael W. Vannier
P. Saha
Ge Wang
MedIm
GAN
SupR
21
400
0
10 Aug 2018
Character-Level Language Modeling with Deeper Self-Attention
Character-Level Language Modeling with Deeper Self-Attention
Rami Al-Rfou
Dokook Choe
Noah Constant
Mandy Guo
Llion Jones
33
387
0
09 Aug 2018
Deep Factorised Inverse-Sketching
Deep Factorised Inverse-Sketching
Kaiyue Pang
Da Li
Jifei Song
Yi-Zhe Song
Tao Xiang
Timothy M. Hospedales
24
18
0
07 Aug 2018
A Review of Learning with Deep Generative Models from Perspective of
  Graphical Modeling
A Review of Learning with Deep Generative Models from Perspective of Graphical Modeling
Zhijian Ou
31
16
0
05 Aug 2018
Visual Reasoning with Multi-hop Feature Modulation
Visual Reasoning with Multi-hop Feature Modulation
Florian Strub
Mathieu Seurin
Ethan Perez
H. D. Vries
Jérémie Mary
Philippe Preux
Aaron Courville
Olivier Pietquin
28
26
0
03 Aug 2018
News Session-Based Recommendations using Deep Neural Networks
News Session-Based Recommendations using Deep Neural Networks
Gabriel de Souza P. Moreira
F. Ferreira
A. Cunha
12
80
0
31 Jul 2018
Doubly Attentive Transformer Machine Translation
Doubly Attentive Transformer Machine Translation
Hasan Sait Arslan
Mark Fishel
G. Anbarjafari
35
13
0
30 Jul 2018
Human Motion Analysis with Deep Metric Learning
Human Motion Analysis with Deep Metric Learning
Huseyin Coskun
D. Tan
Sailesh Conjeti
Nassir Navab
Federico Tombari
11
49
0
30 Jul 2018
Speaker Recognition from Raw Waveform with SincNet
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
62
700
0
29 Jul 2018
Effectiveness of Scaled Exponentially-Regularized Linear Units (SERLUs)
Effectiveness of Scaled Exponentially-Regularized Linear Units (SERLUs)
Guoqiang Zhang
Hao Li
11
10
0
26 Jul 2018
Iterative Amortized Inference
Iterative Amortized Inference
Joseph Marino
Yisong Yue
Stephan Mandt
BDL
DRL
26
165
0
24 Jul 2018
Recurrent Neural Networks for Long and Short-Term Sequential
  Recommendation
Recurrent Neural Networks for Long and Short-Term Sequential Recommendation
Kiewan Villatel
E. Smirnova
Jérémie Mary
Philippe Preux
HAI
11
26
0
23 Jul 2018
Recent Advances in Deep Learning: An Overview
Recent Advances in Deep Learning: An Overview
Matiur Rahman Minar
Jibon Naher
VLM
29
116
0
21 Jul 2018
Learning Heuristics for Quantified Boolean Formulas through Deep
  Reinforcement Learning
Learning Heuristics for Quantified Boolean Formulas through Deep Reinforcement Learning
Gil Lederman
M. Rabe
Edward A. Lee
S. Seshia
13
38
0
20 Jul 2018
An Efficient End-to-End Neural Model for Handwritten Text Recognition
An Efficient End-to-End Neural Model for Handwritten Text Recognition
Arindam Chowdhury
L. Vig
21
80
0
20 Jul 2018
Scheduling Computation Graphs of Deep Learning Models on Manycore CPUs
Scheduling Computation Graphs of Deep Learning Models on Manycore CPUs
Linpeng Tang
Yida Wang
Theodore L. Willke
Kai Li
GNN
21
22
0
16 Jul 2018
A Large-Scale Study on Regularization and Normalization in GANs
A Large-Scale Study on Regularization and Normalization in GANs
Karol Kurach
Mario Lucic
Xiaohua Zhai
Marcin Michalski
Sylvain Gelly
AI4CE
33
155
0
12 Jul 2018
Universal Transformers
Universal Transformers
Mostafa Dehghani
Stephan Gouws
Oriol Vinyals
Jakob Uszkoreit
Lukasz Kaiser
37
745
0
10 Jul 2018
Representation Learning with Contrastive Predictive Coding
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
53
10,050
0
10 Jul 2018
Revisiting the Hierarchical Multiscale LSTM
Revisiting the Hierarchical Multiscale LSTM
Ákos Kádár
Marc-Alexandre Côté
Grzegorz Chrupała
A. Alishahi
22
13
0
10 Jul 2018
Position-aware Self-attention with Relative Positional Encodings for
  Slot Filling
Position-aware Self-attention with Relative Positional Encodings for Slot Filling
I. Bilan
Benjamin Roth
23
22
0
09 Jul 2018
Robust and Scalable Differentiable Neural Computer for Question
  Answering
Robust and Scalable Differentiable Neural Computer for Question Answering
Jörg Franke
Jan Niehues
A. Waibel
OOD
21
24
0
07 Jul 2018
Product-based Neural Networks for User Response Prediction over
  Multi-field Categorical Data
Product-based Neural Networks for User Response Prediction over Multi-field Categorical Data
Yanru Qu
Bohui Fang
Weinan Zhang
Ruiming Tang
Minzhe Niu
Huifeng Guo
Yong Yu
Xiuqiang He
17
179
0
01 Jul 2018
Differentiable Learning-to-Normalize via Switchable Normalization
Differentiable Learning-to-Normalize via Switchable Normalization
Ping Luo
Jiamin Ren
Zhanglin Peng
Ruimao Zhang
Jingyu Li
11
176
0
28 Jun 2018
Adaptive Blending Units: Trainable Activation Functions for Deep Neural
  Networks
Adaptive Blending Units: Trainable Activation Functions for Deep Neural Networks
L. R. Sütfeld
Flemming Brieger
Holger Finger
S. Füllhase
G. Pipa
28
28
0
26 Jun 2018
Color Constancy by Reweighting Image Feature Maps
Color Constancy by Reweighting Image Feature Maps
J. Qiu
Haisong Xu
Z. Ye
14
22
0
25 Jun 2018
Countdown Regression: Sharp and Calibrated Survival Predictions
Countdown Regression: Sharp and Calibrated Survival Predictions
Anand Avati
Tony Duan
Sharon Zhou
Kenneth Jung
N. Shah
A. Ng
22
55
0
21 Jun 2018
DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural
  Architectures
DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural Architectures
Jin-Dong Dong
A. Cheng
Da-Cheng Juan
Wei Wei
Min Sun
25
181
0
21 Jun 2018
BFGAN: Backward and Forward Generative Adversarial Networks for
  Lexically Constrained Sentence Generation
BFGAN: Backward and Forward Generative Adversarial Networks for Lexically Constrained Sentence Generation
Dayiheng Liu
Jie Fu
Qian Qu
Jiancheng Lv
GAN
14
35
0
21 Jun 2018
The Natural Language Decathlon: Multitask Learning as Question Answering
The Natural Language Decathlon: Multitask Learning as Question Answering
Bryan McCann
N. Keskar
Caiming Xiong
R. Socher
AIMat
MLLM
BDL
25
641
0
20 Jun 2018
Uncertainty in Multitask Transfer Learning
Uncertainty in Multitask Transfer Learning
Alexandre Lacoste
Boris N. Oreshkin
Wonchang Chung
Thomas Boquet
Negar Rostamzadeh
David M. Krueger
BDL
UQCV
SSL
24
21
0
20 Jun 2018
Learning to Update for Object Tracking with Recurrent Meta-learner
Learning to Update for Object Tracking with Recurrent Meta-learner
Bi Li
Wenxuan Xie
Wenjun Zeng
Wenyu Liu
27
25
0
19 Jun 2018
Extending Recurrent Neural Aligner for Streaming End-to-End Speech
  Recognition in Mandarin
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin
Linhao Dong
Shiyu Zhou
Wei Chen
Bo Xu
24
22
0
17 Jun 2018
Previous
123...104105106...109110111
Next