Layer Normalization

21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton

Papers citing "Layer Normalization"

50 / 5,502 papers shown
Self-Attentional Acoustic Models
Matthias Sperber
Jan Niehues
Graham Neubig
Sebastian Stüker
A. Waibel
22
151
0
26 Mar 2018
Low-Resource Speech-to-Text Translation
Sameer Bansal
Herman Kamper
Karen Livescu
Adam Lopez
Sharon Goldwater
19
56
0
24 Mar 2018
Group Normalization
Yuxin Wu
Kaiming He
45
3,596
0
22 Mar 2018
Attention, Learn to Solve Routing Problems!
W. Kool
H. V. Hoof
Max Welling
28
1,175
0
22 Mar 2018
A Comprehensive Analysis of Deep Regression
Stéphane Lathuilière
Pablo Mesejo
Xavier Alameda-Pineda
Radu Horaud
BDL
16
294
0
22 Mar 2018
Stacked Cross Attention for Image-Text Matching
Kuang-Huei Lee
Xi Chen
G. Hua
Houdong Hu
Xiaodong He
30
1,140
0
21 Mar 2018
Learning Dynamic Memory Networks for Object Tracking
Tianyu Yang
Antoni B. Chan
27
284
0
20 Mar 2018
SentEval: An Evaluation Toolkit for Universal Sentence Representations
Alexis Conneau
Douwe Kiela
37
630
0
14 Mar 2018
Learning to Explore with Meta-Policy Gradient
Tianbing Xu
Qiang Liu
Liang Zhao
Jian Peng
14
54
0
13 Mar 2018
Fast Decoding in Sequence Models using Discrete Latent Variables
Łukasz Kaiser
Aurko Roy
Ashish Vaswani
Niki Parmar
Samy Bengio
Jakob Uszkoreit
Noam M. Shazeer
16
230
0
09 Mar 2018
Compositional Attention Networks for Machine Reasoning
Drew A. Hudson
Christopher D. Manning
BDL
OOD
LRM
32
572
0
08 Mar 2018
Self-Attention with Relative Position Representations
Peter Shaw
Jakob Uszkoreit
Ashish Vaswani
17
2,251
0
06 Mar 2018
Norm matters: efficient and accurate normalization schemes in deep networks
Elad Hoffer
Ron Banner
Itay Golan
Daniel Soudry
OffRL
17
178
0
05 Mar 2018
Neural Architectures for Open-Type Relation Argument Extraction
Benjamin Roth
Costanza Conforti
Nina Poerner
Sanjeev Kumar Karn
Hinrich Schütze
BDL
13
13
0
05 Mar 2018
Memorization Precedes Generation: Learning Unsupervised GANs with Memory Networks
Youngjin Kim
Minjung Kim
Gunhee Kim
GAN
26
40
0
05 Mar 2018
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
DRL
42
4,724
0
04 Mar 2018
Not All Samples Are Created Equal: Deep Learning with Importance Sampling
Angelos Katharopoulos
F. Fleuret
9
507
0
02 Mar 2018
Ring loss: Convex Feature Normalization for Face Recognition
Yutong Zheng
Dipan K. Pal
Marios Savvides
CVBM
19
198
0
28 Feb 2018
Simultaneously Self-Attending to All Mentions for Full-Abstract Biological Relation Extraction
Pat Verga
Emma Strubell
Andrew McCallum
11
255
0
28 Feb 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
35
445
0
28 Feb 2018
Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse Coding
Dong Liu
Ke Sun
Zhangyang Wang
Runsheng Liu
Zhengjun Zha
24
12
0
28 Feb 2018
L1-Norm Batch Normalization for Efficient Training of Deep Neural Networks
Shuang Wu
Guoqi Li
Lei Deng
Liu Liu
Yuan Xie
Luping Shi
17
117
0
27 Feb 2018
Train Feedfoward Neural Network with Layer-wise Adaptive Rate via Approximating Back-matching Propagation
Huishuai Zhang
Wei-neng Chen
Tie-Yan Liu
17
6
0
27 Feb 2018
Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network
Zizhao Zhang
Yuanpu Xie
L. Yang
EGVM
32
304
0
26 Feb 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
C. Riquelme
George Tucker
Jasper Snoek
BDL
41
365
0
26 Feb 2018
Disentangling by Factorising
Hyunjik Kim
A. Mnih
CoGe
OOD
12
1,327
0
16 Feb 2018
Spectral Normalization for Generative Adversarial Networks
Takeru Miyato
Toshiki Kataoka
Masanori Koyama
Yuichi Yoshida
ODL
38
4,399
0
16 Feb 2018
Image Transformer
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
ViT
42
1,660
0
15 Feb 2018
CNN+LSTM Architecture for Speech Emotion Recognition with Data Augmentation
Caroline Etienne
Guillaume Fidanza
Andrei Petrovskii
Laurence Devillers
B. Schmauch
19
99
0
15 Feb 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
14
11,481
0
15 Feb 2018
Learning a SAT Solver from Single-Bit Supervision
Daniel Selsam
Matthew Lamm
Benedikt Bünz
Percy Liang
L. D. Moura
D. Dill
NAI
14
416
0
11 Feb 2018
Batch Kalman Normalization: Towards Training Deep Neural Networks with Micro-Batches
Guangrun Wang
Jiefeng Peng
Ping Luo
Xinjiang Wang
Liang Lin
29
18
0
09 Feb 2018
Generating Triples with Adversarial Networks for Scene Graph Construction
Matthew Klawonn
Eric Heim
GAN
GNN
32
22
0
07 Feb 2018
Clustering and Unsupervised Anomaly Detection with L2 Normalized Deep Auto-Encoder Representations
Çağlar Aytekin
Xingyang Ni
Francesco Cricri
Emre B. Aksu
SSL
UQCV
18
146
0
01 Feb 2018
Discrete Autoencoders for Sequence Models
Lukasz Kaiser
Samy Bengio
BDL
22
49
0
29 Jan 2018
Statistically Motivated Second Order Pooling
Kaicheng Yu
Mathieu Salzmann
24
42
0
23 Jan 2018
Composite Functional Gradient Learning of Generative Adversarial Models
Rie Johnson
Tong Zhang
GAN
35
14
0
19 Jan 2018
Evidential Occupancy Grid Map Augmentation using Deep Learning
Sascha Wirges
Felix Hartenbach
Christoph Stiller
17
24
0
16 Jan 2018
Unsupervised Cipher Cracking Using Discrete GANs
Aidan Gomez
Sicong Huang
Ivan Zhang
Bryan M. Li
Muhammad Osama
Lukasz Kaiser
GAN
25
59
0
15 Jan 2018
Predicting Movie Genres Based on Plot Summaries
Q. Hoang
15
24
0
15 Jan 2018
Predicting Future Lane Changes of Other Highway Vehicles using RNN-based Deep Models
Sajan Patel
Brent A. Griffin
Kristofer D. Kusano
Jason J. Corso
19
30
0
12 Jan 2018
Deep Episodic Memory: Encoding, Recalling, and Predicting Episodic Experiences for Robot Action Execution
Jonas Rothfuss
Fabio Ferreira
E. Aksoy
You Zhou
Tamim Asfour
38
37
0
12 Jan 2018
Enhancing Underwater Imagery using Generative Adversarial Networks
C. Fabbri
M. Islam
Junaed Sattar
GAN
24
561
0
11 Jan 2018
Model-Based Action Exploration for Learning Dynamic Motion Skills
Glen Berseth
M. van de Panne
25
0
0
11 Jan 2018
Improved English to Russian Translation by Neural Suffix Prediction
Kai Song
Yue Zhang
Min Zhang
Weihua Luo
16
10
0
11 Jan 2018
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
27
1,096
0
02 Jan 2018
CNN Is All You Need
Qiming Chen
R. Wu
AIMat
8
17
0
27 Dec 2017
Find the Conversation Killers: a Predictive Study of Thread-ending Posts
Yunhao Jiao
Cheng Li
Fei Wu
Qiaozhu Mei
14
17
0
22 Dec 2017
On the Diversity of Realistic Image Synthesis
Zichen Yang
Haifeng Liu
Deng Cai
27
6
0
20 Dec 2017
A Flexible Approach to Automated RNN Architecture Generation
Martin Schrimpf
Stephen Merity
James Bradbury
R. Socher
21
15
0
20 Dec 2017