Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
ECG Segmentation by Neural Networks: Errors and Correction
Iana Sereda
Sergey Alekseev
A. Koneva
R. Kataev
Grigory V. Osipov
UQCV
61
27
0
26 Dec 2018
Hierarchical LSTMs with Adaptive Attention for Visual Captioning
Jingkuan Song
Xiangpeng Li
Lianli Gao
Heng Tao Shen
111
223
0
26 Dec 2018
3D PersonVLAD: Learning Deep Global Representations for Video-based Person Re-identification
Lin Wu
Yang Wang
Ling Shao
Ming Wang
3DPC
101
94
0
26 Dec 2018
Attention Branch Network: Learning of Attention Mechanism for Visual Explanation
Hiroshi Fukui
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
XAI
FAtt
95
410
0
25 Dec 2018
TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network
Yipeng Sun
Chengquan Zhang
Zuming Huang
Jiaming Liu
Junyu Han
Errui Ding
67
62
0
24 Dec 2018
AVRA: Automatic Visual Ratings of Atrophy from MRI images using Recurrent Convolutional Neural Networks
G. Mårtensson
D. Ferreira
L. Cavallin
J.-Sebastian Muehlboeck
L. Wahlund
Chunliang Wang
E. Westman
82
21
0
23 Dec 2018
End-to-End Classification of Reverberant Rooms using DNNs
C. Papayiannis
C. Evers
Patrick A. Naylor
97
12
0
21 Dec 2018
LEAFAGE: Example-based and Feature importance-based Explanationsfor Black-box ML models
Ajaya Adhikari
David Tax
R. Satta
M. Faeth
FAtt
122
11
0
21 Dec 2018
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
Guoyun Tu
Yanwei Fu
Boyang Albert Li
Jiarui Gao
Yu-Gang Jiang
Xiangyang Xue
36
29
0
21 Dec 2018
nocaps: novel object captioning at scale
Harsh Agrawal
Karan Desai
Yufei Wang
Xinlei Chen
Rishabh Jain
Mark Johnson
Dhruv Batra
Devi Parikh
Stefan Lee
Peter Anderson
VLM
179
488
0
20 Dec 2018
A Comparison of LSTMs and Attention Mechanisms for Forecasting Financial Time Series
Thomas Hollis
Antoine Viscardi
S. Yi
AI4TS
50
19
0
18 Dec 2018
Toward Multimodal Model-Agnostic Meta-Learning
Risto Vuorio
Shao-Hua Sun
Hexiang Hu
Joseph J. Lim
97
32
0
18 Dec 2018
Attention-based Recurrent Neural Network for Urban Vehicle Trajectory Prediction
Seongjin Choi
Jiwon Kim
H. Yeo
HAI
50
60
0
18 Dec 2018
A Tutorial on Deep Latent Variable Models of Natural Language
Yoon Kim
Sam Wiseman
Alexander M. Rush
BDL
VLM
124
42
0
17 Dec 2018
Attending Category Disentangled Global Context for Image Classification
Keke Tang
Guodong Wei
Runnan Chen
Jie Zhu
Zhaoquan Gu
Wenping Wang
42
0
0
17 Dec 2018
Grounded Video Description
Luowei Zhou
Yannis Kalantidis
Xinlei Chen
Jason J. Corso
Marcus Rohrbach
102
193
0
17 Dec 2018
PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency Detection
Nian Liu
Junwei Han
Ming-Hsuan Yang
SSeg
99
103
0
15 Dec 2018
Inverse Cooking: Recipe Generation from Food Images
Amaia Salvador
M. Drozdzal
Xavier Giró-i-Nieto
Adriana Romero
101
148
0
14 Dec 2018
On Attention Modules for Audio-Visual Synchronization
Naji Khosravan
Shervin Ardeshir
R. Puri
73
21
0
14 Dec 2018
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition
Hao Huang
Luowei Zhou
Wei Zhang
Jason J. Corso
Chenliang Xu
59
3
0
13 Dec 2018
Visual Social Relationship Recognition
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
59
27
0
13 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
114
368
0
13 Dec 2018
Coarse-to-fine: A RNN-based hierarchical attention model for vehicle re-identification
Xiu-Shen Wei
Chen-Da Liu-Zhang
Lingqiao Liu
Chunhua Shen
Jianxin Wu
122
43
0
11 Dec 2018
On the Dimensionality of Word Embedding
Zi Yin
Yuanyuan Shen
83
193
0
11 Dec 2018
Spatial Knowledge Distillation to aid Visual Reasoning
Somak Aditya
Rudra Saha
Yezhou Yang
Chitta Baral
79
15
0
10 Dec 2018
A Structured Model For Action Detection
Yubo Zhang
P. Tokmakov
M. Hebert
Cordelia Schmid
122
101
0
09 Dec 2018
Real-Time Referring Expression Comprehension by Single-Stage Grounding Network
Xinpeng Chen
Lin Ma
Jingyuan Chen
Zequn Jie
Wen Liu
Jiebo Luo
ObjD
90
113
0
09 Dec 2018
Attend More Times for Image Captioning
Jiajun Du
Yu Qin
Hongtao Lu
Yonghua Zhang
VLM
75
5
0
08 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
91
20
0
07 Dec 2018
Verification of deep probabilistic models
Krishnamurthy Dvijotham
M. Garnelo
Alhussein Fawzi
Pushmeet Kohli
76
23
0
06 Dec 2018
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
215
710
0
06 Dec 2018
Recursive Visual Attention in Visual Dialog
Yulei Niu
Hanwang Zhang
Manli Zhang
Jianhong Zhang
Zhiwu Lu
Ji-Rong Wen
112
119
0
06 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
193
705
0
06 Dec 2018
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
Long Chen
Hanwang Zhang
Jun Xiao
Xiangnan He
Shiliang Pu
Shih-Fu Chang
102
159
0
06 Dec 2018
Summarizing Videos with Attention
Jiri Fajtl
Hajar Sadeghi Sokeh
Vasileios Argyriou
D. Monekosso
Paolo Remagnino
102
191
0
05 Dec 2018
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
140
234
0
05 Dec 2018
Complete the Look: Scene-based Complementary Product Recommendation
Wang-Cheng Kang
Eric Kim
J. Leskovec
Charles R. Rosenberg
Julian McAuley
105
77
0
04 Dec 2018
e-SNLI: Natural Language Inference with Natural Language Explanations
Oana-Maria Camburu
Tim Rocktaschel
Thomas Lukasiewicz
Phil Blunsom
LRM
447
643
0
04 Dec 2018
Pre-Defined Sparse Neural Networks with Hardware Acceleration
Sourya Dey
Kuan-Wen Huang
Peter A. Beerel
K. Chugg
114
25
0
04 Dec 2018
Improving Clinical Predictions through Unsupervised Time Series Representation Learning
Xinrui Lyu
Matthias Huser
Stephanie L. Hyland
George Zerveas
Gunnar Rätsch
SSL
OOD
AI4TS
76
43
0
02 Dec 2018
Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization
Siyuan Qiao
Zhe Lin
Jianming Zhang
Alan Yuille
65
23
0
02 Dec 2018
Learning to Caption Images through a Lifetime by Asking Questions
Tingke Shen
Amlan Kar
Sanja Fidler
105
31
0
01 Dec 2018
FineFool: Fine Object Contour Attack via Attention
Jinyin Chen
Haibin Zheng
Hui Xiong
Mengmeng Su
AAML
60
3
0
01 Dec 2018
From Known to the Unknown: Transferring Knowledge to Answer Questions about Novel Visual and Semantic Concepts
M. Farazi
Salman H Khan
Nick Barnes
58
13
0
30 Nov 2018
Deep Multimodal Learning: An Effective Method for Video Classification
Tianqi Zhao
26
4
0
30 Nov 2018
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
179
1,279
0
30 Nov 2018
Generating Easy-to-Understand Referring Expressions for Target Identifications
Mikihiro Tanaka
Takayuki Itamochi
Kenichi Narioka
Ikuro Sato
Yoshitaka Ushiku
Tatsuya Harada
74
1
0
29 Nov 2018
MAMNet: Multi-path Adaptive Modulation Network for Image Super-Resolution
Jun-Hyuk Kim
Jun-Ho Choi
Manri Cheon
Jong-Seok Lee
SupR
66
49
0
29 Nov 2018
Multi-level Multimodal Common Semantic Space for Image-Phrase Grounding
Hassan Akbari
Svebor Karaman
Surabhi Bhargava
Brian Chen
Carl Vondrick
Shih-Fu Chang
68
83
0
28 Nov 2018
Neural Sign Language Translation based on Human Keypoint Estimation
Sang-Ki Ko
Chang Jo Kim
Hyedong Jung
Choongsang Cho
SLR
118
213
0
28 Nov 2018
Previous
1
2
3
...
46
47
48
...
69
70
71
Next