Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.07561
Cited By
Training Deeper Neural Machine Translation Models with Transparent Attention
22 August 2018
Ankur Bapna
Mengzhao Chen
Orhan Firat
Yuan Cao
Yonghui Wu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Training Deeper Neural Machine Translation Models with Transparent Attention"
27 / 27 papers shown
Title
Lipschitz Constrained Parameter Initialization for Deep Transformers
Hongfei Xu
Qiuhui Liu
Josef van Genabith
Deyi Xiong
Jingyi Zhang
ODL
59
26
0
08 Nov 2019
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation
Mengzhao Chen
Orhan Firat
Ankur Bapna
Melvin Johnson
Wolfgang Macherey
...
Niki Parmar
M. Schuster
Zhifeng Chen
Yonghui Wu
Macduff Hughes
AIMat
52
457
0
26 Apr 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
139
11,520
0
15 Feb 2018
The exploding gradient problem demystified - definition, prevalence, impact, origin, tradeoffs, and solutions
George Philipp
D. Song
J. Carbonell
ODL
64
46
0
15 Dec 2017
Deep Architectures for Neural Machine Translation
Antonio Valerio Miceli Barone
Jindřich Helcl
Rico Sennrich
Barry Haddow
Alexandra Birch
38
111
0
24 Jul 2017
SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability
M. Raghu
Justin Gilmer
J. Yosinski
Jascha Narain Sohl-Dickstein
DRL
33
31
0
19 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
514
129,831
0
12 Jun 2017
Convolutional Sequence to Sequence Learning
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
124
3,279
0
08 May 2017
Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU
Jacob Devlin
51
36
0
04 May 2017
Deep Neural Machine Translation with Linear Associative Unit
Mingxuan Wang
Zhengdong Lu
Jie Zhou
Qun Liu
40
54
0
02 May 2017
Massive Exploration of Neural Machine Translation Architectures
D. Britz
Anna Goldie
Minh-Thang Luong
Quoc V. Le
56
516
0
11 Mar 2017
Skip Connections Eliminate Singularities
Emin Orhan
Xaq Pitkow
53
25
0
31 Jan 2017
Identity Matters in Deep Learning
Moritz Hardt
Tengyu Ma
OOD
76
399
0
14 Nov 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
830
6,768
0
26 Sep 2016
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
PINN
3DV
675
36,599
0
25 Aug 2016
Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation
Jie Zhou
Ying Cao
Xuguang Wang
Peng Li
Wenyuan Xu
AIMat
42
215
0
14 Jun 2016
Density estimation using Real NVP
Falong Shen
Jascha Sohl-Dickstein
Gang Zeng
DRL
51
45
0
27 May 2016
Deep Networks with Stochastic Depth
Gao Huang
Yu Sun
Zhuang Liu
Daniel Sedra
Kilian Q. Weinberger
165
2,344
0
30 Mar 2016
Learning Functions: When Is Deep Better Than Shallow
H. Mhaskar
Q. Liao
T. Poggio
56
144
0
03 Mar 2016
Benefits of depth in neural networks
Matus Telgarsky
304
605
0
14 Feb 2016
The Power of Depth for Feedforward Neural Networks
Ronen Eldan
Ohad Shamir
168
731
0
12 Dec 2015
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.6K
192,638
0
10 Dec 2015
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
172
7,683
0
31 Aug 2015
Highway Networks
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
133
1,765
0
03 May 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.1K
149,474
0
22 Dec 2014
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
317
20,491
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
413
27,205
0
01 Sep 2014
1