ResearchTrend.AI
Layer Normalization

21 July 2016
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
arXiv: 1607.06450

Papers citing "Layer Normalization"

50 / 5,515 papers shown
Improving speech emotion recognition via Transformer-based Predictive Coding through transfer learning
Zheng Lian
Ya Li
J. Tao
Jian Huang
ViT
13
18
0
11 Nov 2018
Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series
Qiang Zhang
Kyle Birkeland
Yaodong Yang
Yixiao Liu
33
9
0
08 Nov 2018
Neural Phrase-to-Phrase Machine Translation
Jiangtao Feng
Lingpeng Kong
Po-Sen Huang
Chong-Jun Wang
Da Huang
Jiayuan Mao
Kan Qiao
Dengyong Zhou
AIMat
24
14
0
06 Nov 2018
Compact Personalized Models for Neural Machine Translation
Joern Wuebker
A. Paz
N. Ravid
VLM
21
56
0
05 Nov 2018
Bi-Directional Differentiable Input Reconstruction for Low-Resource Neural Machine Translation
Xing Niu
Weijia Xu
Marine Carpuat
19
17
0
02 Nov 2018
Abstractive Summarization of Reddit Posts with Multi-level Memory Networks
Byeongchang Kim
Hyunwoo J. Kim
Gunhee Kim
31
182
0
02 Nov 2018
Towards Linear Time Neural Machine Translation with Capsule Networks
Mingxuan Wang
Jun Xie
Zhixing Tan
Jinsong Su
Deyi Xiong
Lei Li
AIMat
24
27
0
01 Nov 2018
Hybrid Self-Attention Network for Machine Translation
Kaitao Song
Tan Xu
Furong Peng
Jianfeng Lu
21
12
0
01 Nov 2018
Dial2Desc: End-to-end Dialogue Description Generation
Haojie Pan
Junpei Zhou
Zhou Zhao
Yan Liu
Deng Cai
Min Yang
VLM
18
14
0
01 Nov 2018
A task in a suit and a tie: paraphrase generation with semantic augmentation
Su Wang
Rahul Gupta
Nancy Chang
Jason Baldridge
19
65
0
31 Oct 2018
Convolutional Self-Attention Network
Baosong Yang
Longyue Wang
Derek F. Wong
Lidia S. Chao
Zhaopeng Tu
BDL
19
11
0
31 Oct 2018
Machine Translation between Vietnamese and English: an Empirical Study
Hong-Hai Phan-Vu
Viet-Trung Tran
V. Nguyen
Hoang-Vu Dang
Phan-Thuan Do
20
17
0
30 Oct 2018
A Simple Recurrent Unit with Reduced Tensor Product Representations
Shuai Tang
P. Smolensky
V. D. Sa
22
2
0
29 Oct 2018
A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
Akhilesh Deepak Gotmare
N. Keskar
Caiming Xiong
R. Socher
ODL
19
275
0
29 Oct 2018
Towards Principled Uncertainty Estimation for Deep Neural Networks
Richard E. Harang
Ethan M. Rudd
BDL
UQCV
36
6
0
29 Oct 2018
Convolutional neural networks with extra-classical receptive fields
Brian Hu
Stefan Mihalas
20
3
0
27 Oct 2018
One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
Tianhe Yu
Pieter Abbeel
Sergey Levine
Chelsea Finn
13
68
0
25 Oct 2018
Batch Normalization Sampling
Zhaodong Chen
Lei Deng
Guoqi Li
Jiawei Sun
Xing Hu
Xin Ma
Yuan Xie
21
0
0
25 Oct 2018
Exploiting Deep Representations for Neural Machine Translation
Zi-Yi Dou
Zhaopeng Tu
Xing Wang
Shuming Shi
Tong Zhang
44
92
0
24 Oct 2018
Ain't Nobody Got Time For Coding: Structure-Aware Program Synthesis From Natural Language
J. Bednarek
K. Piaskowski
K. Krawiec
6
12
0
23 Oct 2018
Towards Universal Dialogue State Tracking
Liliang Ren
Kaige Xie
Lu Chen
Kai Yu
16
121
0
22 Oct 2018
A Fully Attention-Based Information Retriever
Alvaro H. C. Correia
Jorge Luiz Moreira Silva
Thiago de C. Martins
Fabio Gagliardi Cozman
36
4
0
22 Oct 2018
Lightweight Convolutional Approaches to Reading Comprehension on SQuAD
T. Bell
Benjamin Penchas
14
3
0
19 Oct 2018
Investigating Object Compositionality in Generative Adversarial Networks
Sjoerd van Steenkiste
Karol Kurach
Jürgen Schmidhuber
Sylvain Gelly
GAN
OCL
29
20
0
17 Oct 2018
Sequence-to-Sequence Acoustic Modeling for Voice Conversion
Jing-Xuan Zhang
Zhenhua Ling
Li-Juan Liu
Yuan Jiang
Lirong Dai
19
129
0
16 Oct 2018
MeanSum: A Neural Model for Unsupervised Multi-document Abstractive Summarization
Eric Chu
Peter J. Liu
20
19
0
12 Oct 2018
One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
T. Paine
Sergio Gomez Colmenarejo
Ziyun Wang
Scott E. Reed
Y. Aytar
...
Matthew W. Hoffman
Gabriel Barth-Maron
Serkan Cabi
David Budden
Nando de Freitas
OffRL
22
26
0
11 Oct 2018
Image Captioning as Neural Machine Translation Task in SOCKEYE
Loris Bazzani
Tobias Domhan
Felix Hieber
VLM
19
2
0
09 Oct 2018
Task-Embedded Control Networks for Few-Shot Imitation Learning
Stephen James
Michael Bloesch
Andrew J. Davison
38
135
0
08 Oct 2018
On Self Modulation for Generative Adversarial Networks
Ting Chen
Mario Lucic
N. Houlsby
Sylvain Gelly
GAN
27
104
0
02 Oct 2018
Improving Sentence Representations with Consensus Maximisation
Shuai Tang
V. D. Sa
SSL
AI4TS
37
4
0
02 Oct 2018
Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks
Juho Lee
Yoonho Lee
Jungtaek Kim
Adam R. Kosiorek
Seungjin Choi
Yee Whye Teh
23
274
0
01 Oct 2018
Open-Ended Content-Style Recombination Via Leakage Filtering
Karl Ridgeway
Michael C. Mozer
DRL
VLM
22
2
0
28 Sep 2018
SALSA-TEXT : self attentive latent space based adversarial text generation
Jules Gagnon-Marchand
Hamed Sadeghi
Md. Akmal Haidar
Mehdi Rezagholizadeh
19
18
0
28 Sep 2018
Learning Robust, Transferable Sentence Representations for Text Classification
Wasi Uddin Ahmad
Xueying Bai
Nanyun Peng
Kai-Wei Chang
AI4TS
OOD
25
5
0
28 Sep 2018
Batch-normalized Recurrent Highway Networks
Chi Zhang
Thang Nguyen
Shagan Sah
R. Ptucha
A. Loui
C. Salvaggio
25
8
0
26 Sep 2018
Utilizing Class Information for Deep Network Representation Shaping
Daeyoung Choi
Wonjong Rhee
16
2
0
25 Sep 2018
Learning to Read by Spelling: Towards Unsupervised Text Recognition
Ankush Gupta
Andrea Vedaldi
Andrew Zisserman
SSL
25
20
0
23 Sep 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
69
1,754
0
20 Sep 2018
Latent Topic Conversational Models
Tsung-Hsien Wen
Minh-Thang Luong
BDL
24
4
0
19 Sep 2018
Removing the Feature Correlation Effect of Multiplicative Noise
Zijun Zhang
Yining Zhang
Zongpeng Li
13
8
0
19 Sep 2018
Learning Universal Sentence Representations with Mean-Max Attention Autoencoder
Minghua Zhang
Yunfang Wu
W. Li
Wei Li
SSL
27
27
0
18 Sep 2018
Switching Isotropic and Directional Exploration with Parameter Space Noise in Deep Reinforcement Learning
Izumi Karino
Kazutoshi Tanaka
Ryuma Niiyama
Yasuo Kuniyoshi
19
3
0
18 Sep 2018
A Domain Agnostic Normalization Layer for Unsupervised Adversarial Domain Adaptation
Rob Romijnders
Panagiotis Meletis
Gijs Dubbelman
AI4CE
21
27
0
14 Sep 2018
Zero-Shot Cross-lingual Classification Using Multilingual Neural Machine Translation
Akiko Eriguchi
Melvin Johnson
Orhan Firat
Hideto Kazawa
Wolfgang Macherey
22
62
0
12 Sep 2018
Multitask Learning on Graph Neural Networks: Learning Multiple Graph Centrality Measures with a Unified Network
Pedro H. C. Avelar
Henrique Lemos
Marcelo O. R. Prates
Luís C. Lamb
45
17
0
11 Sep 2018
Unsupervised Stylish Image Description Generation via Domain Layer Norm
Cheng Kuan Chen
Zhufeng Pan
Min Sun
Ming Liu
23
29
0
11 Sep 2018
Normalization in Training U-Net for 2D Biomedical Semantic Segmentation
Xiao-Yun Zhou
Guang-Zhong Yang
18
77
0
11 Sep 2018
Exploiting Invertible Decoders for Unsupervised Sentence Representation Learning
Shuai Tang
V. D. Sa
SSL
13
1
0
08 Sep 2018
Learning to Solve NP-Complete Problems - A Graph Neural Network for Decision TSP
Marcelo O. R. Prates
Pedro H. C. Avelar
Henrique Lemos
Luís C. Lamb
Moshe Y. Vardi
GNN
30
176
0
08 Sep 2018