Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Progressively Diffused Networks for Semantic Image Segmentation
Ruimao Zhang
Wei Yang
Zhanglin Peng
Xiaogang Wang
Liang Lin
SSeg
23
3
0
20 Feb 2017
Person Search with Natural Language Description
Shuang Li
Tong Xiao
Hongsheng Li
Bolei Zhou
Dayu Yue
Xiaogang Wang
109
397
0
19 Feb 2017
MAT: A Multimodal Attentive Translator for Image Captioning
Chang Liu
F. Sun
Changhu Wang
Feng Wang
Alan Yuille
93
59
0
18 Feb 2017
Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection
Tharindu Fernando
Simon Denman
Sridha Sridharan
Clinton Fookes
HAI
84
336
0
18 Feb 2017
Experiment Segmentation in Scientific Discourse as Clause-level Structured Prediction using Recurrent Neural Networks
Pradeep Dasigi
Gully A. Burns
Eduard H. Hovy
A. Waard
35
27
0
17 Feb 2017
Frustratingly Short Attention Spans in Neural Language Modeling
Michal Daniluk
Tim Rocktaschel
Johannes Welbl
Sebastian Riedel
113
112
0
15 Feb 2017
Gated Multimodal Units for Information Fusion
John Arevalo
Thamar Solorio
Manuel Montes-y-Gómez
Fabio Gonzalez
108
382
0
07 Feb 2017
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation
Iacer Calixto
Qun Liu
N. Campbell
174
183
0
04 Feb 2017
Structured Attention Networks
Yoon Kim
Carl Denton
Luong Hoang
Alexander M. Rush
152
463
0
03 Feb 2017
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
49
38
0
02 Feb 2017
Deep Reinforcement Learning for Visual Object Tracking in Videos
Da Zhang
H. Maei
Xin Eric Wang
Yuan-fang Wang
151
117
0
31 Jan 2017
Memory Augmented Neural Networks with Wormhole Connections
Çağlar Gülçehre
A. Chandar
Yoshua Bengio
102
63
0
30 Jan 2017
Supervised Deep Sparse Coding Networks
Xiaoxia Sun
Nasser M. Nasrabadi
T. Tran
BDL
94
15
0
29 Jan 2017
Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation
N. Mostafazadeh
Chris Brockett
W. Dolan
Michel Galley
Jianfeng Gao
Georgios P. Spithourakis
Lucy Vanderwende
111
183
0
28 Jan 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
346
1,550
0
25 Jan 2017
Incorporating Global Visual Features into Attention-Based Neural Machine Translation
Iacer Calixto
Qun Liu
Nick Campbell
136
156
0
23 Jan 2017
Understanding the Effective Receptive Field in Deep Convolutional Neural Networks
Wenjie Luo
Yujia Li
R. Urtasun
R. Zemel
HAI
106
1,813
0
15 Jan 2017
Simplified Gating in Long Short-term Memory (LSTM) Recurrent Neural Networks
Yuzhen Lu
F. Salem
39
39
0
12 Jan 2017
Comprehension-guided referring expressions
Ruotian Luo
Gregory Shakhnarovich
ObjD
107
171
0
12 Jan 2017
Attention-Based Multimodal Fusion for Video Description
Chiori Hori
Takaaki Hori
Teng-Yok Lee
Kazuhiro Sumi
J. Hershey
Tim K. Marks
95
361
0
11 Jan 2017
Context-aware Captions from Context-agnostic Supervision
Ramakrishna Vedantam
Samy Bengio
Kevin Patrick Murphy
Devi Parikh
Gal Chechik
96
152
0
11 Jan 2017
Towards Decoding as Continuous Optimization in Neural Machine Translation
Cong Duy Vu Hoang
Gholamreza Haffari
Trevor Cohn
AI4CE
89
42
0
11 Jan 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
356
1,900
0
10 Jan 2017
Textual Entailment with Structured Attentions and Composition
Kai Zhao
Liang Huang
Mingbo Ma
87
28
0
04 Jan 2017
Dynamic Deep Neural Networks: Optimizing Accuracy-Efficiency Trade-offs by Selective Execution
Lanlan Liu
Jia Deng
119
206
0
02 Jan 2017
Aspect-augmented Adversarial Networks for Domain Adaptation
Yuan Zhang
Regina Barzilay
Tommi Jaakkola
119
96
0
01 Jan 2017
Feedback Networks
Amir Zamir
Te-Lin Wu
Lin Sun
Bokui (William) Shen
Jitendra Malik
Silvio Savarese
95
211
0
30 Dec 2016
FastMask: Segment Multi-scale Object Candidates in One Shot
Hexiang Hu
Shiyi Lan
Yuning Jiang
Zhimin Cao
Fei Sha
SSeg
3DPC
86
28
0
28 Dec 2016
Robust LSTM-Autoencoders for Face De-Occlusion in the Wild
F. Zhao
Jiashi Feng
Jian-jun Zhao
Wenhan Yang
Shuicheng Yan
CVBM
71
140
0
27 Dec 2016
Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: the LUNA16 challenge
A. Setio
A. Traverso
Thomas de Bel
Moira S. N. Berens
C. V. D. Bogaard
...
Jef Vandemeulebroucke
N. Walasek
G. Zuidhof
Bram van Ginneken
Colin Jacobs
142
1,093
0
23 Dec 2016
Understanding Image and Text Simultaneously: a Dual Vision-Language Machine Comprehension Task
Nan Ding
Sebastian Goodman
Fei Sha
Radu Soricut
VLM
85
9
0
22 Dec 2016
Re-evaluating Automatic Metrics for Image Captioning
Mert Kilickaya
Aykut Erdem
Nazli Ikizler-Cinbis
Erkut Erdem
66
181
0
22 Dec 2016
A Context-aware Attention Network for Interactive Question Answering
Huayu Li
Martin Renqiang Min
Yong Ge
Asim Kadav
65
69
0
22 Dec 2016
Top-down Visual Saliency Guided by Captions
Vasili Ramanishka
Abir Das
Jianming Zhang
Kate Saenko
87
143
0
21 Dec 2016
Multi-Agent Cooperation and the Emergence of (Natural) Language
Angeliki Lazaridou
A. Peysakhovich
Marco Baroni
LLMAG
167
434
0
21 Dec 2016
An Empirical Study of Language CNN for Image Captioning
Jiuxiang Gu
G. Wang
Jianfei Cai
Tsuhan Chen
95
134
0
21 Dec 2016
Action-Driven Object Detection with Top-Down Visual Attentions
Donggeun Yoo
Sunggyun Park
K. Paeng
Joon-Young Lee
In So Kweon
ObjD
48
6
0
20 Dec 2016
Automatic Generation of Grounded Visual Questions
Shijie Zhang
Zhuang Li
Shaodi You
Zhenglu Yang
Jiawan Zhang
OOD
79
79
0
20 Dec 2016
Large-Scale Image Retrieval with Attentive Deep Local Features
Hyeonwoo Noh
A. Araújo
Jack Sim
Tobias Weyand
Bohyung Han
3DV
145
777
0
19 Dec 2016
Few-Shot Object Recognition from Machine-Labeled Web Images
Zhongwen Xu
Linchao Zhu
Yi Yang
VLM
86
66
0
19 Dec 2016
Learning to predict where to look in interactive environments using deep recurrent q-learning
Seyed Sajad Mousavi
Michael Schukat
Enda Howley
Ali Borji
N. Mozayani
59
31
0
17 Dec 2016
Delta Networks for Optimized Recurrent Network Computation
Daniel Neil
Junhaeng Lee
T. Delbruck
Shih-Chii Liu
106
66
0
16 Dec 2016
CSVideoNet: A Real-time End-to-end Learning Framework for High-frame-rate Video Compressive Sensing
Kai Xu
Fengbo Ren
64
8
0
15 Dec 2016
Recurrent Image Captioner: Describing Images with Spatial-Invariant Transformation and Attention Filtering
Hao Liu
Yang Yang
Fumin Shen
Lixin Duan
Heng Tao Shen
65
9
0
15 Dec 2016
Single Image Action Recognition using Semantic Body Part Actions
Zhichen Zhao
Huimin Ma
Shaodi You
75
74
0
14 Dec 2016
End-to-End Deep Reinforcement Learning for Lane Keeping Assist
Ahmad El-Sallab
Mohammed Abdou
E. Perot
S. Yogamani
77
176
0
13 Dec 2016
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer
Sergey Zagoruyko
N. Komodakis
152
2,598
0
12 Dec 2016
Empirical Evaluation of A New Approach to Simplifying Long Short-term Memory (LSTM)
Yuzhen Lu
24
2
0
12 Dec 2016
VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering
Marc Bolaños
Álvaro Peris
F. Casacuberta
Petia Radeva
68
6
0
12 Dec 2016
Text-guided Attention Model for Image Captioning
Jonghwan Mun
Minsu Cho
Bohyung Han
VLM
59
93
0
12 Dec 2016
Previous
1
2
3
...
63
64
65
...
69
70
71
Next