Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Deep Ancient Roman Republican Coin Classification via Feature Fusion and Attention
Hafeez Anwar
Saeed Anwar
S. Zambanini
Fatih Porikli
45
7
0
26 Aug 2019
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Iro Laina
Christian Rupprecht
Nassir Navab
SSL
80
103
0
25 Aug 2019
Learning Similarity Conditions Without Explicit Supervision
Reuben Tan
Mariya I. Vasileva
Kate Saenko
Bryan A. Plummer
SSL
52
76
0
22 Aug 2019
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
J. Aneja
Harsh Agrawal
Dhruv Batra
Alex Schwing
BDL
VLM
85
66
0
22 Aug 2019
Entropy-Enhanced Multimodal Attention Model for Scene-Aware Dialogue Generation
Kuan-Yen Lin
Chao-Chun Hsu
Yun-Nung Chen
Lun-Wei Ku
VGen
62
20
0
22 Aug 2019
Multiple instance dense connected convolution neural network for aerial image scene classification
Qi Bi
K. Qin
Zhili Li
Han Zhang
Kai Xu
99
114
0
22 Aug 2019
Improving Captioning for Low-Resource Languages by Cycle Consistency
Yike Wu
Shiwan Zhao
Jia Chen
Ying Zhang
Xiaojie Yuan
Zhong Su
49
8
0
21 Aug 2019
Saccader: Improving Accuracy of Hard Attention Models for Vision
Gamaleldin F. Elsayed
Simon Kornblith
Quoc V. Le
VLM
103
73
0
20 Aug 2019
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Hao Tan
Joey Tianyi Zhou
VLM
MLLM
254
2,499
0
20 Aug 2019
Towards High-Resolution Salient Object Detection
Yi Zeng
Pingping Zhang
Jianming Zhang
Zhe Lin
Huchuan Lu
88
202
0
20 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
94
25
0
19 Aug 2019
Attention on Attention for Image Captioning
Lun Huang
Wenmin Wang
Jie Chen
Xiao-Yong Wei
89
837
0
19 Aug 2019
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation
H. Emami
Majid Moradi Aliabadi
Ming Dong
R. Chinnam
GAN
82
173
0
19 Aug 2019
TDAM: a Topic-Dependent Attention Model for Sentiment Analysis
Gabriele Pergola
Lin Gui
Yulan He
68
58
0
18 Aug 2019
Language Features Matter: Effective Language Representations for Vision-Language Tasks
Andrea Burns
Reuben Tan
Kate Saenko
Stan Sclaroff
Bryan A. Plummer
VLM
58
27
0
17 Aug 2019
U-CAM: Visual Explanation using Uncertainty based Class Activation Maps
Badri N. Patro
Mayank Lunayach
Shivansh Patel
Vinay P. Namboodiri
FAtt
UQCV
124
76
0
17 Aug 2019
Learning Deep Representations by Mutual Information for Person Re-identification
Peng Chen
Tong Jia
Pengfei Wu
Jianjun Wu
Dongyue Chen
SSL
100
5
0
16 Aug 2019
Mixed High-Order Attention Network for Person Re-Identification
Binghui Chen
Weihong Deng
Jiani Hu
CVBM
102
357
0
16 Aug 2019
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards
Yuqing Song
Shizhe Chen
Yida Zhao
Qin Jin
SSL
57
41
0
15 Aug 2019
Towards Diverse and Accurate Image Captions via Reinforcing Determinantal Point Process
Qingzhong Wang
Antoni B. Chan
61
7
0
14 Aug 2019
Attention is not not Explanation
Sarah Wiegreffe
Yuval Pinter
XAI
AAML
FAtt
137
915
0
13 Aug 2019
Atlas: A Dataset and Benchmark for E-commerce Clothing Product Categorization
Venkatesh Umaashankar
Girish Shanmugam
Aditi Prakash
23
8
0
12 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
109
38
0
12 Aug 2019
Sentence Specified Dynamic Video Thumbnail Generation
Yiitan Yuan
Lin Ma
Wenwu Zhu
77
30
0
12 Aug 2019
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
Tan Wang
Xing Xu
Yang Yang
Alan Hanjalic
Heng Tao Shen
Jingkuan Song
62
150
0
12 Aug 2019
SCAR: Spatial-/Channel-wise Attention Regression Networks for Crowd Counting
Junyu Gao
Qi. Wang
Yuan. Yuan
68
197
0
10 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
69
82
0
10 Aug 2019
Transferable Representation Learning in Vision-and-Language Navigation
Haoshuo Huang
Vihan Jain
Harsh Mehta
Alexander Ku
Gabriel Ilharco
Jason Baldridge
Eugene Ie
LM&Ro
90
89
0
09 Aug 2019
Recognizing Part Attributes with Insufficient Data
Xiangyu Zhao
Yi Yang
Feng Zhou
Xiao Tan
Yuchen Yuan
Sid Ying-Ze Bao
Ying Nian Wu
51
20
0
09 Aug 2019
Towards Generating Stylized Image Captions via Adversarial Training
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
Len Hamey
GAN
70
18
0
08 Aug 2019
Image Captioning using Facial Expression and Attention
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
CVBM
72
10
0
08 Aug 2019
Scene-based Factored Attention for Image Captioning
Chen Shen
Rongrong Ji
Fuhai Chen
Xiaoshuai Sun
Xiangming Li
47
0
0
07 Aug 2019
Aligning Linguistic Words and Visual Semantic Units for Image Captioning
Longteng Guo
Jing Liu
Jinhui Tang
Jiangwei Li
W. Luo
Hanqing Lu
83
102
0
06 Aug 2019
REAPS: Towards Better Recognition of Fine-grained Images by Region Attending and Part Sequencing
Peng Zhang
Xinyu Zhu
Zhanzhan Cheng
Shuigeng Zhou
Yi Niu
111
1
0
06 Aug 2019
Cascaded Revision Network for Novel Object Captioning
Qianyu Feng
Yu Wu
Hehe Fan
C. Yan
Yezhou Yang
55
35
0
06 Aug 2019
ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal
Bin Ding
Chengjiang Long
Ling Zhang
Chunxia Xiao
GAN
3DH
93
152
0
04 Aug 2019
Permutation-invariant Feature Restructuring for Correlation-aware Image Set-based Recognition
Xiaofeng Liu
Zhenhua Guo
Site Li
Lingsheng Kong
P. Jia
J. You
B. V. Kumar
CVBM
99
32
0
03 Aug 2019
DAWN: Dual Augmented Memory Network for Unsupervised Video Object Tracking
Zhenmei Shi
Haoyang Fang
Yu-Wing Tai
Chi-Keung Tang
51
2
0
02 Aug 2019
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation
Jing Wang
Yingwei Pan
Ting Yao
Jinhui Tang
Tao Mei
VLM
BDL
DiffM
67
36
0
01 Aug 2019
DEDUCE: Diverse scEne Detection methods in Unseen Challenging Environments
Anwesan Pal
Carlos Nieto-Granda
H. Christensen
51
23
0
01 Aug 2019
Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation
Yadan Luo
Zi Huang
Zheng Zhang
Ziwei Wang
Jingjing Li
Yang Yang
71
40
0
01 Aug 2019
Image Captioning with Unseen Objects
B. Demirel
R. G. Cinbis
Nazli Ikizler-Cinbis
VLM
120
16
0
31 Jul 2019
Local Interpretation Methods to Machine Learning Using the Domain of the Feature Space
T. Botari
Rafael Izbicki
A. Carvalho
FAtt
55
12
0
31 Jul 2019
Ablate, Variate, and Contemplate: Visual Analytics for Discovering Neural Architectures
Dylan Cashman
Adam Perer
Remco Chang
Hendrik Strobelt
KELM
72
29
0
30 Jul 2019
LEAF-QA: Locate, Encode & Attend for Figure Question Answering
Ritwick Chaudhry
Sumit Shekhar
Utkarsh Gupta
Pranav Maneriker
Prann Bansal
Ajay Joshi
LMTD
55
89
0
30 Jul 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
Cheng Zhang
Wei-Lun Chao
D. Xuan
77
51
0
28 Jul 2019
Hybrid-Attention based Decoupled Metric Learning for Zero-Shot Image Retrieval
Binghui Chen
Weihong Deng
VLM
FedML
57
56
0
27 Jul 2019
Supervised and Unsupervised Neural Approaches to Text Readability
Matej Martinc
Senja Pollak
Marko Robnik-Šikonja
99
145
0
26 Jul 2019
Cooperative image captioning
Gilad Vered
Gal Oren
Yuval Atzmon
Gal Chechik
56
2
0
26 Jul 2019
Visual Interaction with Deep Learning Models through Collaborative Semantic Inference
Sebastian Gehrmann
Hendrik Strobelt
Robert Krüger
Hanspeter Pfister
Alexander M. Rush
HAI
101
58
0
24 Jul 2019
Previous
1
2
3
...
39
40
41
...
69
70
71
Next