ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Deep Ancient Roman Republican Coin Classification via Feature Fusion and
  Attention
Deep Ancient Roman Republican Coin Classification via Feature Fusion and Attention
Hafeez Anwar
Saeed Anwar
S. Zambanini
Fatih Porikli
45
7
0
26 Aug 2019
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
Iro Laina
Christian Rupprecht
Nassir Navab
SSL
80
103
0
25 Aug 2019
Learning Similarity Conditions Without Explicit Supervision
Learning Similarity Conditions Without Explicit Supervision
Reuben Tan
Mariya I. Vasileva
Kate Saenko
Bryan A. Plummer
SSL
52
76
0
22 Aug 2019
Sequential Latent Spaces for Modeling the Intention During Diverse Image
  Captioning
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
J. Aneja
Harsh Agrawal
Dhruv Batra
Alex Schwing
BDLVLM
85
66
0
22 Aug 2019
Entropy-Enhanced Multimodal Attention Model for Scene-Aware Dialogue
  Generation
Entropy-Enhanced Multimodal Attention Model for Scene-Aware Dialogue Generation
Kuan-Yen Lin
Chao-Chun Hsu
Yun-Nung Chen
Lun-Wei Ku
VGen
62
20
0
22 Aug 2019
Multiple instance dense connected convolution neural network for aerial
  image scene classification
Multiple instance dense connected convolution neural network for aerial image scene classification
Qi Bi
K. Qin
Zhili Li
Han Zhang
Kai Xu
99
114
0
22 Aug 2019
Improving Captioning for Low-Resource Languages by Cycle Consistency
Improving Captioning for Low-Resource Languages by Cycle Consistency
Yike Wu
Shiwan Zhao
Jia Chen
Ying Zhang
Xiaojie Yuan
Zhong Su
49
8
0
21 Aug 2019
Saccader: Improving Accuracy of Hard Attention Models for Vision
Saccader: Improving Accuracy of Hard Attention Models for Vision
Gamaleldin F. Elsayed
Simon Kornblith
Quoc V. Le
VLM
103
73
0
20 Aug 2019
LXMERT: Learning Cross-Modality Encoder Representations from
  Transformers
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Hao Tan
Joey Tianyi Zhou
VLMMLLM
254
2,499
0
20 Aug 2019
Towards High-Resolution Salient Object Detection
Towards High-Resolution Salient Object Detection
Yi Zeng
Pingping Zhang
Jianming Zhang
Zhe Lin
Huchuan Lu
88
202
0
20 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information
  Bottleneck
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
94
25
0
19 Aug 2019
Attention on Attention for Image Captioning
Attention on Attention for Image Captioning
Lun Huang
Wenmin Wang
Jie Chen
Xiao-Yong Wei
89
837
0
19 Aug 2019
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation
H. Emami
Majid Moradi Aliabadi
Ming Dong
R. Chinnam
GAN
82
173
0
19 Aug 2019
TDAM: a Topic-Dependent Attention Model for Sentiment Analysis
TDAM: a Topic-Dependent Attention Model for Sentiment Analysis
Gabriele Pergola
Lin Gui
Yulan He
68
58
0
18 Aug 2019
Language Features Matter: Effective Language Representations for
  Vision-Language Tasks
Language Features Matter: Effective Language Representations for Vision-Language Tasks
Andrea Burns
Reuben Tan
Kate Saenko
Stan Sclaroff
Bryan A. Plummer
VLM
58
27
0
17 Aug 2019
U-CAM: Visual Explanation using Uncertainty based Class Activation Maps
U-CAM: Visual Explanation using Uncertainty based Class Activation Maps
Badri N. Patro
Mayank Lunayach
Shivansh Patel
Vinay P. Namboodiri
FAttUQCV
124
76
0
17 Aug 2019
Learning Deep Representations by Mutual Information for Person
  Re-identification
Learning Deep Representations by Mutual Information for Person Re-identification
Peng Chen
Tong Jia
Pengfei Wu
Jianjun Wu
Dongyue Chen
SSL
100
5
0
16 Aug 2019
Mixed High-Order Attention Network for Person Re-Identification
Mixed High-Order Attention Network for Person Re-Identification
Binghui Chen
Weihong Deng
Jiani Hu
CVBM
102
357
0
16 Aug 2019
Unpaired Cross-lingual Image Caption Generation with Self-Supervised
  Rewards
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards
Yuqing Song
Shizhe Chen
Yida Zhao
Qin Jin
SSL
57
41
0
15 Aug 2019
Towards Diverse and Accurate Image Captions via Reinforcing
  Determinantal Point Process
Towards Diverse and Accurate Image Captions via Reinforcing Determinantal Point Process
Qingzhong Wang
Antoni B. Chan
61
7
0
14 Aug 2019
Attention is not not Explanation
Attention is not not Explanation
Sarah Wiegreffe
Yuval Pinter
XAIAAMLFAtt
137
915
0
13 Aug 2019
Atlas: A Dataset and Benchmark for E-commerce Clothing Product
  Categorization
Atlas: A Dataset and Benchmark for E-commerce Clothing Product Categorization
Venkatesh Umaashankar
Girish Shanmugam
Aditi Prakash
23
8
0
12 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language
  Interactions
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
109
38
0
12 Aug 2019
Sentence Specified Dynamic Video Thumbnail Generation
Sentence Specified Dynamic Video Thumbnail Generation
Yiitan Yuan
Lin Ma
Wenwu Zhu
77
30
0
12 Aug 2019
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
Tan Wang
Xing Xu
Yang Yang
Alan Hanjalic
Heng Tao Shen
Jingkuan Song
62
150
0
12 Aug 2019
SCAR: Spatial-/Channel-wise Attention Regression Networks for Crowd
  Counting
SCAR: Spatial-/Channel-wise Attention Regression Networks for Crowd Counting
Junyu Gao
Qi. Wang
Yuan. Yuan
68
197
0
10 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
69
82
0
10 Aug 2019
Transferable Representation Learning in Vision-and-Language Navigation
Transferable Representation Learning in Vision-and-Language Navigation
Haoshuo Huang
Vihan Jain
Harsh Mehta
Alexander Ku
Gabriel Ilharco
Jason Baldridge
Eugene Ie
LM&Ro
90
89
0
09 Aug 2019
Recognizing Part Attributes with Insufficient Data
Recognizing Part Attributes with Insufficient Data
Xiangyu Zhao
Yi Yang
Feng Zhou
Xiao Tan
Yuchen Yuan
Sid Ying-Ze Bao
Ying Nian Wu
51
20
0
09 Aug 2019
Towards Generating Stylized Image Captions via Adversarial Training
Towards Generating Stylized Image Captions via Adversarial Training
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
Len Hamey
GAN
70
18
0
08 Aug 2019
Image Captioning using Facial Expression and Attention
Image Captioning using Facial Expression and Attention
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
CVBM
72
10
0
08 Aug 2019
Scene-based Factored Attention for Image Captioning
Scene-based Factored Attention for Image Captioning
Chen Shen
Rongrong Ji
Fuhai Chen
Xiaoshuai Sun
Xiangming Li
47
0
0
07 Aug 2019
Aligning Linguistic Words and Visual Semantic Units for Image Captioning
Aligning Linguistic Words and Visual Semantic Units for Image Captioning
Longteng Guo
Jing Liu
Jinhui Tang
Jiangwei Li
W. Luo
Hanqing Lu
83
102
0
06 Aug 2019
REAPS: Towards Better Recognition of Fine-grained Images by Region
  Attending and Part Sequencing
REAPS: Towards Better Recognition of Fine-grained Images by Region Attending and Part Sequencing
Peng Zhang
Xinyu Zhu
Zhanzhan Cheng
Shuigeng Zhou
Yi Niu
111
1
0
06 Aug 2019
Cascaded Revision Network for Novel Object Captioning
Cascaded Revision Network for Novel Object Captioning
Qianyu Feng
Yu Wu
Hehe Fan
C. Yan
Yezhou Yang
55
35
0
06 Aug 2019
ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow
  Detection and Removal
ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal
Bin Ding
Chengjiang Long
Ling Zhang
Chunxia Xiao
GAN3DH
93
152
0
04 Aug 2019
Permutation-invariant Feature Restructuring for Correlation-aware Image
  Set-based Recognition
Permutation-invariant Feature Restructuring for Correlation-aware Image Set-based Recognition
Xiaofeng Liu
Zhenhua Guo
Site Li
Lingsheng Kong
P. Jia
J. You
B. V. Kumar
CVBM
99
32
0
03 Aug 2019
DAWN: Dual Augmented Memory Network for Unsupervised Video Object
  Tracking
DAWN: Dual Augmented Memory Network for Unsupervised Video Object Tracking
Zhenmei Shi
Haoyang Fang
Yu-Wing Tai
Chi-Keung Tang
51
2
0
02 Aug 2019
Convolutional Auto-encoding of Sentence Topics for Image Paragraph
  Generation
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation
Jing Wang
Yingwei Pan
Ting Yao
Jinhui Tang
Tao Mei
VLMBDLDiffM
67
36
0
01 Aug 2019
DEDUCE: Diverse scEne Detection methods in Unseen Challenging
  Environments
DEDUCE: Diverse scEne Detection methods in Unseen Challenging Environments
Anwesan Pal
Carlos Nieto-Granda
H. Christensen
51
23
0
01 Aug 2019
Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph
  Generation
Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation
Yadan Luo
Zi Huang
Zheng Zhang
Ziwei Wang
Jingjing Li
Yang Yang
71
40
0
01 Aug 2019
Image Captioning with Unseen Objects
Image Captioning with Unseen Objects
B. Demirel
R. G. Cinbis
Nazli Ikizler-Cinbis
VLM
120
16
0
31 Jul 2019
Local Interpretation Methods to Machine Learning Using the Domain of the
  Feature Space
Local Interpretation Methods to Machine Learning Using the Domain of the Feature Space
T. Botari
Rafael Izbicki
A. Carvalho
FAtt
55
12
0
31 Jul 2019
Ablate, Variate, and Contemplate: Visual Analytics for Discovering
  Neural Architectures
Ablate, Variate, and Contemplate: Visual Analytics for Discovering Neural Architectures
Dylan Cashman
Adam Perer
Remco Chang
Hendrik Strobelt
KELM
72
29
0
30 Jul 2019
LEAF-QA: Locate, Encode & Attend for Figure Question Answering
LEAF-QA: Locate, Encode & Attend for Figure Question Answering
Ritwick Chaudhry
Sumit Shekhar
Utkarsh Gupta
Pranav Maneriker
Prann Bansal
Ajay Joshi
LMTD
55
89
0
30 Jul 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question
  Answering
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
Cheng Zhang
Wei-Lun Chao
D. Xuan
77
51
0
28 Jul 2019
Hybrid-Attention based Decoupled Metric Learning for Zero-Shot Image
  Retrieval
Hybrid-Attention based Decoupled Metric Learning for Zero-Shot Image Retrieval
Binghui Chen
Weihong Deng
VLMFedML
57
56
0
27 Jul 2019
Supervised and Unsupervised Neural Approaches to Text Readability
Supervised and Unsupervised Neural Approaches to Text Readability
Matej Martinc
Senja Pollak
Marko Robnik-Šikonja
99
145
0
26 Jul 2019
Cooperative image captioning
Cooperative image captioning
Gilad Vered
Gal Oren
Yuval Atzmon
Gal Chechik
56
2
0
26 Jul 2019
Visual Interaction with Deep Learning Models through Collaborative
  Semantic Inference
Visual Interaction with Deep Learning Models through Collaborative Semantic Inference
Sebastian Gehrmann
Hendrik Strobelt
Robert Krüger
Hanspeter Pfister
Alexander M. Rush
HAI
101
58
0
24 Jul 2019
Previous
123...394041...697071
Next