Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 8,379 papers shown
Sharp Minima Can Generalize For Deep Nets
Laurent Dinh
Razvan Pascanu
Samy Bengio
Yoshua Bengio
ODL
147
774
0
15 Mar 2017
Emergence of Grounded Compositional Language in Multi-Agent Populations
Igor Mordatch
Pieter Abbeel
LLMAG
173
704
0
15 Mar 2017
Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets
Zhen-Le Yang
Wei Chen
Feng Wang
Bo Xu
GAN, AI4CE
89
170
0
15 Mar 2017
Learned Optimizers that Scale and Generalize
Olga Wichrowska
Niru Maheswaranathan
Matthew W. Hoffman
Sergio Gomez Colmenarejo
Misha Denil
Nando de Freitas
Jascha Narain Sohl-Dickstein
AI4CE
94
284
0
14 Mar 2017
DRAGNN: A Transition-based Framework for Dynamically Connected Neural Networks
Lingpeng Kong
Chris Alberti
D. Andor
Ivan Bogatyy
David J. Weiss
GNN
76
34
0
13 Mar 2017
Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs
Michael Gygli
Mohammad Norouzi
A. Angelova
TDI
147
68
0
13 Mar 2017
Nematus: a Toolkit for Neural Machine Translation
Rico Sennrich
Orhan Firat
Kyunghyun Cho
Alexandra Birch
Barry Haddow
...
Marcin Junczys-Dowmunt
Samuel Läubli
Antonio Valerio Miceli Barone
Jozef Mokry
Maria Nadejde
68
407
0
13 Mar 2017
End-to-End Learning of Geometry and Context for Deep Stereo Regression
Alex Kendall
H. Martirosyan
Saumitro Dasgupta
Peter Henry
Ryan Kennedy
Abraham Bachrach
Adam Bry
3DV, 3DPC, MDE
100
1,339
0
13 Mar 2017
Massive Exploration of Neural Machine Translation Architectures
D. Britz
Anna Goldie
Minh-Thang Luong
Quoc V. Le
97
519
0
11 Mar 2017
Learning to Remember Rare Events
Lukasz Kaiser
Ofir Nachum
Aurko Roy
Samy Bengio
RALM, CLL
144
366
0
09 Mar 2017
Linguistic Knowledge as Memory for Recurrent Neural Networks
Bhuwan Dhingra
Zhilin Yang
William W. Cohen
Ruslan Salakhutdinov
RALM
111
37
0
07 Mar 2017
Data Noising as Smoothing in Neural Network Language Models
Ziang Xie
Sida I. Wang
Jiwei Li
Daniel Levy
Allen Nie
Dan Jurafsky
A. Ng
83
239
0
07 Mar 2017
Neural Machine Translation and Sequence-to-sequence Models: A Tutorial
Graham Neubig
AIMat
104
173
0
05 Mar 2017
Axiomatic Attribution for Deep Networks
Mukund Sundararajan
Ankur Taly
Qiqi Yan
OOD, FAtt
213
6,048
0
04 Mar 2017
Machine Learning on Sequential Data Using a Recurrent Weighted Average
Jared Ostmeyer
L. Cowell
61
32
0
03 Mar 2017
Exponential Moving Average Model in Parallel Speech Recognition Training
Xudong Tian
Jun Zhang
Zejun Ma
Yi He
Juan Wei
48
4
0
03 Mar 2017
Toward Controlled Generation of Text
Zhiting Hu
Zichao Yang
Xiaodan Liang
Ruslan Salakhutdinov
Eric Xing
260
990
0
02 Mar 2017
Robust Spatial Filtering with Graph Convolutional Neural Networks
F. Such
Shagan Sah
Miguel Domínguez
Suhas Pillai
Chao Zhang
A. Michael
N. Cahill
R. Ptucha
GNN
117
140
0
02 Mar 2017
Evolving Deep Neural Networks
Risto Miikkulainen
J. Liang
Elliot Meyerson
Aditya Rawal
Daniel Fink
...
B. Raju
Hormoz Shahrzad
Arshak Navruzyan
Nigel P. Duffy
Babak Hodjat
124
891
0
01 Mar 2017
Improving the Neural GPU Architecture for Algorithm Learning
Kārlis Freivalds
Renars Liepins
162
43
0
28 Feb 2017
Neural Map: Structured Memory for Deep Reinforcement Learning
Emilio Parisotto
Ruslan Salakhutdinov
101
261
0
27 Feb 2017
Analyzing and Exploiting NARX Recurrent Neural Networks for Long-Term Dependencies
R. DiPietro
Christian Rupprecht
Nassir Navab
Gregory Hager
44
26
0
24 Feb 2017
On the Origin of Deep Learning
Haohan Wang
Bhiksha Raj
MedIm, 3DV, VLM
145
225
0
24 Feb 2017
Sequence Modeling via Segmentations
Chong-Jun Wang
Yining Wang
Po-Sen Huang
Abdel-rahman Mohamed
Dengyong Zhou
Li Deng
111
45
0
24 Feb 2017
Multi-Context Attention for Human Pose Estimation
Xiao Chu
Wei Yang
Wanli Ouyang
Cheng Ma
Alan Yuille
Xiaogang Wang
3DH
105
645
0
24 Feb 2017
Are Emojis Predictable?
Horacio Saggion
Miguel Ballesteros
Francesco Barbieri
49
123
0
23 Feb 2017
Training a Subsampling Mechanism in Expectation
Colin Raffel
Dieterich Lawson
53
4
0
22 Feb 2017
Memory Matching Networks for Genomic Sequence Classification
Jack Lanchantin
Ritambhara Singh
Yanjun Qi
20
3
0
22 Feb 2017
Data Distillation for Controlling Specificity in Dialogue Generation
Jiwei Li
Will Monroe
Dan Jurafsky
84
22
0
22 Feb 2017
Enabling Multi-Source Neural Machine Translation By Concatenating Source Sentences In Multiple Languages
Raj Dabre
Fabien Cromierès
Sadao Kurohashi
96
32
0
20 Feb 2017
An Attention-Based Deep Net for Learning to Rank
Baiyang Wang
Diego Klabjan
57
14
0
20 Feb 2017
MAT: A Multimodal Attentive Translator for Image Captioning
Chang Liu
F. Sun
Changhu Wang
Feng Wang
Alan Yuille
93
59
0
18 Feb 2017
Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection
Tharindu Fernando
Simon Denman
Sridha Sridharan
Clinton Fookes
HAI
84
336
0
18 Feb 2017
soc2seq: Social Embedding meets Conversation Model
Parminder Bhatia
Marsal Gavaldà
Arash Einolghozati
62
8
0
17 Feb 2017
Generative Temporal Models with Memory
Mevlana Gemici
Chia-Chun Hung
Adam Santoro
Greg Wayne
S. Mohamed
Danilo Jimenez Rezende
David Amos
Timothy Lillicrap
72
57
0
15 Feb 2017
Frustratingly Short Attention Spans in Neural Language Modeling
Michal Daniluk
Tim Rocktäschel
Johannes Welbl
Sebastian Riedel
111
112
0
15 Feb 2017
Batch Policy Gradient Methods for Improving Neural Conversation Models
Kirthevasan Kandasamy
Yoram Bachrach
Ryota Tomioka
Daniel Tarlow
David Carter
OffRL
66
37
0
10 Feb 2017
Trainable Greedy Decoding for Neural Machine Translation
Jiatao Gu
Kyunghyun Cho
Victor O.K. Li
165
74
0
08 Feb 2017
A Hybrid Convolutional Variational Autoencoder for Text Generation
Stanislau Semeniuta
Aliaksei Severyn
Erhardt Barth
103
253
0
08 Feb 2017
Neural Machine Translation with Source-Side Latent Graph Parsing
Kazuma Hashimoto
Yoshimasa Tsuruoka
BDL
110
48
0
08 Feb 2017
Deep Learning with Dynamic Computation Graphs
Moshe Looks
Marcello Herreshoff
DeLesley S. Hutchins
Peter Norvig
GNN, AI4CE
96
135
0
07 Feb 2017
Beam Search Strategies for Neural Machine Translation
Markus Freitag
Yaser Al-Onaizan
127
396
0
06 Feb 2017
Neural Semantic Parsing over Multiple Knowledge-bases
Jonathan Herzig
Jonathan Berant
82
57
0
06 Feb 2017
Attentional Network for Visual Object Detection
Kota Hara
Ming-Yuan Liu
Oncel Tuzel
Amir-massoud Farahmand
ObjD
115
28
0
06 Feb 2017
All-but-the-Top: Simple and Effective Postprocessing for Word Representations
Jiaqi Mu
S. Bhat
Pramod Viswanath
102
311
0
05 Feb 2017
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation
Iacer Calixto
Qun Liu
N. Campbell
174
183
0
04 Feb 2017
Predicting Target Language CCG Supertags Improves Neural Machine Translation
Maria Nadejde
Siva Reddy
Rico Sennrich
Tomasz Dwojak
Marcin Junczys-Dowmunt
Philipp Koehn
Alexandra Birch
96
81
0
03 Feb 2017
Multilingual Multi-modal Embeddings for Natural Language Processing
Iacer Calixto
Qun Liu
N. Campbell
62
19
0
03 Feb 2017
Structured Attention Networks
Yoon Kim
Carl Denton
Luong Hoang
Alexander M. Rush
146
463
0
03 Feb 2017
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
49
38
0
02 Feb 2017