Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1409.0473
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
Neural Machine Translation by Jointly Learning to Align and Translate
1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Machine Translation by Jointly Learning to Align and Translate"
50 / 8,358 papers shown
Title
From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification
André F. T. Martins
Ramón Fernández Astudillo
217
726
0
05 Feb 2016
EIE: Efficient Inference Engine on Compressed Deep Neural Network
Song Han
Xingyu Liu
Huizi Mao
Jing Pu
A. Pedram
M. Horowitz
W. Dally
156
2,467
0
04 Feb 2016
Survey on the attention based RNN model and its applications in computer vision
Feng Wang
David Tax
AI4TS
AIMat
74
114
0
25 Jan 2016
Long Short-Term Memory-Networks for Machine Reading
Jianpeng Cheng
Li Dong
Mirella Lapata
AIMat
RALM
121
1,123
0
25 Jan 2016
A Taxonomy of Deep Convolutional Neural Nets for Computer Vision
Suraj Srinivas
Ravi Kiran Sarvadevabhatla
Konda Reddy Mopuri
N. Prabhu
S. Kruthiventi
R. Venkatesh Babu
OOD
69
216
0
25 Jan 2016
Modeling Coverage for Neural Machine Translation
Zhaopeng Tu
Zhengdong Lu
Yang Liu
Xiaohua Liu
Hang Li
108
748
0
19 Jan 2016
Conversion of Artificial Recurrent Neural Networks to Spiking Neural Networks for Low-power Neuromorphic Hardware
P. U. Diehl
Guido Zarrella
A. Cassidy
Bruno U. Pedroni
Emre Neftci
89
219
0
16 Jan 2016
Multimodal Pivots for Image Caption Translation
Julian Hitschler
Shigehiko Schamoni
Stefan Riezler
160
97
0
15 Jan 2016
Implicit Distortion and Fertility Models for Attention-based Encoder-Decoder NMT Model
Shi Feng
Shujie Liu
Mu Li
M. Zhou
123
44
0
13 Jan 2016
Language to Logical Form with Neural Attention
Li Dong
Mirella Lapata
AI4CE
NAI
138
730
0
06 Jan 2016
Incorporating Structural Alignment Biases into an Attentional Neural Translation Model
Trevor Cohn
Cong Duy Vu Hoang
Ekaterina Vymolova
Kaisheng Yao
Chris Dyer
Gholamreza Haffari
80
174
0
06 Jan 2016
Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism
Orhan Firat
Kyunghyun Cho
Yoshua Bengio
LRM
AIMat
277
627
0
06 Jan 2016
Mutual Information and Diverse Decoding Improve Neural Machine Translation
Jiwei Li
Dan Jurafsky
82
120
0
04 Jan 2016
Learning Natural Language Inference with LSTM
Shuohang Wang
Jing Jiang
114
446
0
30 Dec 2015
Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems
Colin Raffel
D. Ellis
CLL
84
305
0
29 Dec 2015
Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency
Shiliang Zhang
Cong Liu
Hui Jiang
Si Wei
Lirong Dai
Yu Hu
91
76
0
28 Dec 2015
Morphological Inflection Generation Using Character Sequence to Sequence Learning
Manaal Faruqui
Yulia Tsvetkov
Graham Neubig
Chris Dyer
71
137
0
18 Dec 2015
A Survey of Available Corpora for Building Data-Driven Dialogue Systems
Iulian Serban
Ryan J. Lowe
Peter Henderson
Laurent Charlin
Joelle Pineau
78
342
0
17 Dec 2015
Semi-supervised Question Retrieval with Gated Convolutions
Tao Lei
Hrishikesh Joshi
Regina Barzilay
Tommi Jaakkola
K. Tymoshenko
Alessandro Moschitti
Lluís Màrquez i Villodre
RALM
101
107
0
17 Dec 2015
ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs
Wenpeng Yin
Hinrich Schütze
Bing Xiang
Bowen Zhou
101
945
0
16 Dec 2015
DNA-Level Splice Junction Prediction using Deep Recurrent Neural Networks
Byunghan Lee
Taehoon Lee
Byunggook Na
Sungroh Yoon
47
43
0
16 Dec 2015
Strategies for Training Large Vocabulary Neural Language Models
Welin Chen
David Grangier
Michael Auli
VLM
70
139
0
15 Dec 2015
Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation
Yong Cheng
Shiqi Shen
Zhongjun He
W. He
Hua Wu
Maosong Sun
Yang Liu
91
73
0
15 Dec 2015
RNN Fisher Vectors for Action Recognition and Image Annotation
Guy Lev
Gil Sadeh
Benjamin Klein
Lior Wolf
55
164
0
12 Dec 2015
Distilling Knowledge from Deep Networks with Applications to Healthcare Domain
Zhengping Che
S. Purushotham
R. Khemani
Yan Liu
78
139
0
11 Dec 2015
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
175
2,979
0
08 Dec 2015
Minimum Risk Training for Neural Machine Translation
Shiqi Shen
Yong Cheng
Zhongjun He
W. He
Hua Wu
Maosong Sun
Yang Liu
153
469
0
08 Dec 2015
Deep Attention Recurrent Q-Network
Ivan Sorokin
Alexey Seleznev
Mikhail Pavlov
A. Fedorov
Anastasiia Ignateva
70
152
0
05 Dec 2015
Neural Generative Question Answering
Jun Yin
Xin Jiang
Zhengdong Lu
Lifeng Shang
Hang Li
Xiaoming Li
108
216
0
04 Dec 2015
Effective LSTMs for Target-Dependent Sentiment Classification
Duyu Tang
Bing Qin
Xiaocheng Feng
Ting Liu
114
889
0
03 Dec 2015
Neural Enquirer: Learning to Query Tables with Natural Language
Pengcheng Yin
Zhengdong Lu
Hang Li
B. Kao
LMTD
96
41
0
03 Dec 2015
Multilingual Language Processing From Bytes
D. Gillick
Clifford Brunk
Oriol Vinyals
A. Subramanya
93
223
0
01 Dec 2015
A Deep Architecture for Semantic Matching with Multiple Positional Sentence Representations
Shengxian Wan
Yanyan Lan
Jiafeng Guo
Jun Xu
Liang Pang
Xueqi Cheng
96
347
0
26 Nov 2015
Recurrent Instance Segmentation
Bernardino Romera-Paredes
Philip Torr
SSeg
92
328
0
25 Nov 2015
Natural Language Understanding with Distributed Representation
Kyunghyun Cho
GNN
BDL
86
55
0
24 Nov 2015
On the Generalization Error Bounds of Neural Networks under Diversity-Inducing Mutual Angular Regularization
P. Xie
Yuntian Deng
Eric Xing
113
28
0
23 Nov 2015
ReSeg: A Recurrent Neural Network-based Model for Semantic Segmentation
Francesco Visin
Marco Ciccone
Adriana Romero
Kyle Kastner
Kyunghyun Cho
Yoshua Bengio
Matteo Matteucci
Aaron Courville
VLM
SSeg
97
251
0
22 Nov 2015
Evaluating Prerequisite Qualities for Learning End-to-End Dialog Systems
Jesse Dodge
Andreea Gane
Xiang Zhang
Antoine Bordes
S. Chopra
Alexander H. Miller
Arthur Szlam
Jason Weston
ELM
105
198
0
21 Nov 2015
Adding Gradient Noise Improves Learning for Very Deep Networks
Arvind Neelakantan
Luke Vilnis
Quoc V. Le
Ilya Sutskever
Lukasz Kaiser
Karol Kurach
James Martens
AI4CE
ODL
85
545
0
21 Nov 2015
Sequence Level Training with Recurrent Neural Networks
MarcÁurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
138
1,620
0
20 Nov 2015
Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich
Barry Haddow
Alexandra Birch
282
2,724
0
20 Nov 2015
Task Loss Estimation for Sequence Prediction
Dzmitry Bahdanau
Dmitriy Serdyuk
Philemon Brakel
Nan Rosemary Ke
J. Chorowski
Aaron Courville
Yoshua Bengio
113
33
0
19 Nov 2015
Towards Principled Unsupervised Learning
Ilya Sutskever
Rafal Jozefowicz
Karol Gregor
Danilo Jimenez Rezende
Timothy Lillicrap
Oriol Vinyals
OOD
SSL
98
49
0
19 Nov 2015
Delving Deeper into Convolutional Networks for Learning Video Representations
Nicolas Ballas
L. Yao
C. Pal
Aaron Courville
MDE
106
703
0
19 Nov 2015
Binding via Reconstruction Clustering
Klaus Greff
R. Srivastava
Jürgen Schmidhuber
OCL
98
40
0
19 Nov 2015
Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition
Suyoun Kim
Ian Lane
71
26
0
19 Nov 2015
Neural Random-Access Machines
Karol Kurach
Marcin Andrychowicz
Ilya Sutskever
OOD
BDL
76
156
0
19 Nov 2015
Generating Sentences from a Continuous Space
Samuel R. Bowman
Luke Vilnis
Oriol Vinyals
Andrew M. Dai
Rafal Jozefowicz
Samy Bengio
DRL
119
2,367
0
19 Nov 2015
Multi-task Sequence to Sequence Learning
Minh-Thang Luong
Quoc V. Le
Ilya Sutskever
Oriol Vinyals
Lukasz Kaiser
AIMat
116
808
0
19 Nov 2015
Deep Learning for Tactile Understanding From Visual and Haptic Data
Yang Gao
Lisa Anne Hendricks
Katherine J. Kuchenbecker
Trevor Darrell
96
245
0
19 Nov 2015
Previous
1
2
3
...
164
165
166
167
168
Next