1502.03044
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDL
AI4TS
97
116
0
20 Feb 2020
Stroke Constrained Attention Network for Online Handwritten Mathematical Expression Recognition
Jiaming Wang
Jun Du
Jianshu Zhang
74
24
0
20 Feb 2020
A Convolutional Baseline for Person Re-Identification Using Vision and Language Descriptions
Ammarah Farooq
Muhammad Awais
F. Yan
J. Kittler
A. Akbari
S. S. Khalid
119
8
0
20 Feb 2020
Deep Fusion of Local and Non-Local Features for Precision Landslide Recognition
Qing Zhu
Lin Chen
Han Hu
Binzhi Xu
Yeting Zhang
Haifeng Li
18
10
0
20 Feb 2020
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
Tianlang Chen
Jiebo Luo
72
69
0
20 Feb 2020
When Radiology Report Generation Meets Knowledge Graph
Yixiao Zhang
Xiaosong Wang
Ziyue Xu
Qihang Yu
Alan Yuille
Daguang Xu
MedIm
90
305
0
19 Feb 2020
CAUSE: Learning Granger Causality from Event Sequences using Attribution Methods
W. Zhang
Thomas Kobber Panum
S. Jha
P. Chalasani
David Page
CML
AI4TS
83
49
0
18 Feb 2020
LocoGAN -- Locally Convolutional GAN
Lukasz Struski
Szymon Knop
Jacek Tabor
Wiktor Daniec
Przemysław Spurek
GAN
47
10
0
18 Feb 2020
MAST: A Memory-Augmented Self-supervised Tracker
Zihang Lai
Erika Lu
Weidi Xie
VOS
120
186
0
18 Feb 2020
Neural Attentive Multiview Machines
Oren Barkan
Ori Katz
Noam Koenigstein
HAI
61
18
0
18 Feb 2020
Text Classification with Lexicon from PreAttention Mechanism
Qingbiao Li
Chunhua Wu
K. Zheng
VLM
41
0
0
18 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
122
140
0
18 Feb 2020
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework
C. Sur
124
7
0
16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)
C. Sur
56
17
0
15 Feb 2020
Sparse and Structured Visual Attention
Pedro Henrique Martins
S. Becker
Zita Marinho
Michael Arens
81
8
0
13 Feb 2020
SpotNet: Self-Attention Multi-Task Network for Object Detection
Hughes Perreault
Guillaume-Alexandre Bilodeau
Nicolas Saunier
Maguelonne Héritier
177
44
0
13 Feb 2020
Deep Learning for Source Code Modeling and Generation: Models, Applications and Challenges
T. H. Le
Hao Chen
Muhammad Ali Babar
VLM
147
155
0
13 Feb 2020
HAN-ECG: An Interpretable Atrial Fibrillation Detection Model Using Hierarchical Attention Networks
Sajad Mousavi
Fatemeh Afghah
U. Rajendra
55
97
0
12 Feb 2020
Vision-based Fight Detection from Surveillance Cameras
Seymanur Akti
G. A. Tataroglu
H. K. Ekenel
51
78
0
11 Feb 2020
What Changed Your Mind: The Roles of Dynamic Topics and Discourse in Argumentation Process
Jichuan Zeng
Jing Li
Yulan He
Cuiyun Gao
Michael R. Lyu
Irwin King
68
16
0
10 Feb 2020
Blank Language Models
T. Shen
Victor Quach
Regina Barzilay
Tommi Jaakkola
291
73
0
08 Feb 2020
Attentive Group Equivariant Convolutional Networks
David W. Romero
Erik J. Bekkers
Jakub M. Tomczak
Mark Hoogendoorn
117
91
0
07 Feb 2020
Multimodal Matching Transformer for Live Commenting
Chaoqun Duan
Lei Cui
Shuming Ma
Furu Wei
Conghui Zhu
Tiejun Zhao
33
12
0
07 Feb 2020
Exploiting Temporal Coherence for Multi-modal Video Categorization
Palash Goyal
Saurabh Sahu
Shalini Ghosh
Chul Lee
38
1
0
07 Feb 2020
The Costs and Benefits of Goal-Directed Attention in Deep Convolutional Neural Networks
Xiaoliang Luo
Brett D. Roads
Bradley C. Love
49
18
0
06 Feb 2020
GIM: Gaussian Isolation Machines
Guy Amit
Ishai Rosenberg
Mosh Levy
Ron Bitton
A. Shabtai
Yuval Elovici
55
0
0
06 Feb 2020
Lossless Attention in Convolutional Networks for Facial Expression Recognition in the Wild
Chuan Wang
R. Hu
Min Hu
Jiang-Dong Liu
Ting-fei Ren
Shan He
Ming Jiang
Jing Miao
CVBM
38
5
0
31 Jan 2020
Teaching Machines to Converse
Jiwei Li
98
4
0
31 Jan 2020
Convolutional Hierarchical Attention Network for Query-Focused Video Summarization
Shuwen Xiao
Zhou Zhao
Zijian Zhang
Ziyu Guan
Deng Cai
82
48
0
31 Jan 2020
Dual Convolutional LSTM Network for Referring Image Segmentation
Linwei Ye
Zhi Liu
Yang Wang
84
46
0
30 Jan 2020
Evaluating the Progress of Deep Learning for Visual Relational Concepts
Sebastian Stabinger
Peer David
J. Piater
A. Rodríguez-Sánchez
86
19
0
29 Jan 2020
aiTPR: Attribute Interaction-Tensor Product Representation for Image Caption
C. Sur
47
8
0
27 Jan 2020
Uncertainty based Class Activation Maps for Visual Question Answering
Badri N. Patro
Mayank Lunayach
Vinay P. Namboodiri
FAtt
UQCV
44
1
0
23 Jan 2020
Deep Bayesian Network for Visual Question Generation
Badri N. Patro
V. Kurmi
Sandeep Kumar
Vinay P. Namboodiri
BDL
52
18
0
23 Jan 2020
Robust Explanations for Visual Question Answering
Badri N. Patro
Shivansh Pate
Vinay P. Namboodiri
OOD
AAML
73
19
0
23 Jan 2020
Visual Summary of Value-level Feature Attribution in Prediction Classes with Recurrent Neural Networks
Chuan-Chi Wang
Xumeng Wang
K. Ma
FAtt
HAI
42
1
0
23 Jan 2020
Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models
M. Farazi
Salman H. Khan
Nick Barnes
81
18
0
20 Jan 2020
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Ruiyi Zhang
Changyou Chen
Zhe Gan
Zheng Wen
Wenlin Wang
Lawrence Carin
81
7
0
20 Jan 2020
Human-Aware Motion Deblurring
Ziyi Shen
Wenguan Wang
Xiankai Lu
Jianbing Shen
Haibin Ling
Tingfa Xu
Ling Shao
3DH
116
290
0
19 Jan 2020
Text-to-Image Generation with Attention Based Recurrent Neural Networks
Tehseen Zia
Shahan Arif
Shakeeb Murtaza
M. A. Ullah
35
7
0
18 Jan 2020
Multi-step Joint-Modality Attention Network for Scene-Aware Dialogue System
Yun-Wei Chu
Kuan-Yen Lin
Chao-Chun Hsu
Lun-Wei Ku
140
22
0
17 Jan 2020
Multimodal Story Generation on Plural Images
Jing Jiang
DiffM
18
0
0
16 Jan 2020
Delving Deeper into the Decoder for Video Captioning
Haoran Chen
Jianmin Li
Xiaolin Hu
73
35
0
16 Jan 2020
A "Network Pruning Network" Approach to Deep Model Compression
Vinay Kumar Verma
Pravendra Singh
Vinay P. Namboodiri
Piyush Rai
3DPC
VLM
58
8
0
15 Jan 2020
Show, Recall, and Tell: Image Captioning with Recall Mechanism
Li Wang
Zechen Bai
Yonghua Zhang
Hongtao Lu
79
67
0
15 Jan 2020
Visual Storytelling via Predicting Anchor Word Embeddings in the Stories
Bowen Zhang
Hexiang Hu
Fei Sha
46
6
0
13 Jan 2020
In Defense of Grid Features for Visual Question Answering
Huaizu Jiang
Ishan Misra
Marcus Rohrbach
Erik Learned-Miller
Xinlei Chen
OOD
ObjD
105
320
0
10 Jan 2020
Visual Question Answering on 360° Images
Shih-Han Chou
Wei-Lun Chao
Wei-Sheng Lai
Min Sun
Ming-Hsuan Yang
59
22
0
10 Jan 2020
On Interpretability of Artificial Neural Networks: A Survey
Fenglei Fan
Jinjun Xiong
Mengzhou Li
Ge Wang
AAML
AI4CE
96
318
0
08 Jan 2020
Res3ATN -- Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos
Naina Dhingra
A. Kunz
3DPC
SLR
82
36
0
04 Jan 2020