ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
BSDAR: Beam Search Decoding with Attention Reward in Neural Keyphrase
  Generation
BSDAR: Beam Search Decoding with Attention Reward in Neural Keyphrase Generation
Iftitahu Ni'mah
Vlado Menkovski
Mykola Pechenizkiy
44
2
0
17 Sep 2019
Learning to Deceive with Attention-Based Explanations
Learning to Deceive with Attention-Based Explanations
Danish Pruthi
Mansi Gupta
Bhuwan Dhingra
Graham Neubig
Zachary Chase Lipton
118
194
0
17 Sep 2019
Inverse Visual Question Answering with Multi-Level Attentions
Inverse Visual Question Answering with Multi-Level Attentions
Yaser Alwatter
Yuhong Guo
BDL
39
1
0
17 Sep 2019
Controllable Text-to-Image Generation
Controllable Text-to-Image Generation
Bowen Li
Xiaojuan Qi
Thomas Lukasiewicz
Philip Torr
GAN
152
357
0
16 Sep 2019
Motion Guided Attention for Video Salient Object Detection
Motion Guided Attention for Video Salient Object Detection
Haofeng Li
Guanqi Chen
Guanbin Li
Yizhou Yu
128
167
0
16 Sep 2019
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable
  Makeup Transfer
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer
Wentao Jiang
Si Liu
Chen Gao
Jie Cao
Ran He
Jiashi Feng
Shuicheng Yan
CVBM
76
130
0
16 Sep 2019
Deep Collaborative Filtering with Multi-Aspect Information in
  Heterogeneous Networks
Deep Collaborative Filtering with Multi-Aspect Information in Heterogeneous Networks
C. Shi
Xiaotian Han
Li Song
Tianlin Li
Senzhang Wang
Junping Du
Philip S. Yu
144
101
0
14 Sep 2019
SANVis: Visual Analytics for Understanding Self-Attention Networks
SANVis: Visual Analytics for Understanding Self-Attention Networks
Cheonbok Park
Inyoup Na
Yongjang Jo
Sungbok Shin
J. Yoo
Bum Chul Kwon
Jian Zhao
Hyungjong Noh
Yeonsoo Lee
Jaegul Choo
HAI
80
40
0
13 Sep 2019
Understanding LSTM -- a tutorial into Long Short-Term Memory Recurrent
  Neural Networks
Understanding LSTM -- a tutorial into Long Short-Term Memory Recurrent Neural Networks
R. C. Staudemeyer
Eric Rothstein Morris
67
498
0
12 Sep 2019
Speculative Beam Search for Simultaneous Translation
Speculative Beam Search for Simultaneous Translation
Renjie Zheng
Mingbo Ma
Baigong Zheng
Liang Huang
98
24
0
12 Sep 2019
Human Visual Attention Prediction Boosts Learning & Performance of
  Autonomous Driving Agents
Human Visual Attention Prediction Boosts Learning & Performance of Autonomous Driving Agents
Alexander Makrigiorgos
A. Shafti
Alex Harston
Julien Gérard
A. Faisal
55
14
0
11 Sep 2019
PDANet: Polarity-consistent Deep Attention Network for Fine-grained
  Visual Emotion Regression
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression
Sicheng Zhao
Zizhou Jia
Hui Chen
Leida Li
Guiguang Ding
Kurt Keutzer
94
62
0
11 Sep 2019
Dual-attention Focused Module for Weakly Supervised Object Localization
Dual-attention Focused Module for Weakly Supervised Object Localization
Yukun Zhou
Zailiang Chen
Hai-lan Shen
Qing Liu
Rongchang Zhao
Yixiong Liang
WSOL
57
4
0
11 Sep 2019
Select and Attend: Towards Controllable Content Selection in Text
  Generation
Select and Attend: Towards Controllable Content Selection in Text Generation
Xiaoyu Shen
Jun Suzuki
Kentaro Inui
Hui Su
Dietrich Klakow
Satoshi Sekine
76
29
0
10 Sep 2019
Compositional Generalization in Image Captioning
Compositional Generalization in Image Captioning
Mitja Nikolaus
Mostafa Abdou
Matthew Lamm
Rahul Aralikatte
Desmond Elliott
CoGe
98
49
0
10 Sep 2019
FDA: Feature Disruptive Attack
FDA: Feature Disruptive Attack
Aditya Ganeshan
S. VivekB.
R. Venkatesh Babu
AAML
124
105
0
10 Sep 2019
Multimodal Attention Branch Network for Perspective-Free Sentence
  Generation
Multimodal Attention Branch Network for Perspective-Free Sentence Generation
A. Magassouba
K. Sugiura
Hisashi Kawai
45
17
0
10 Sep 2019
Neural Naturalist: Generating Fine-Grained Image Comparisons
Neural Naturalist: Generating Fine-Grained Image Comparisons
Maxwell Forbes
Christine Kaeser-Chen
Piyush Sharma
Serge J. Belongie
VLM
141
58
0
09 Sep 2019
Hierarchy Parsing for Image Captioning
Hierarchy Parsing for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
VLM
96
166
0
09 Sep 2019
Picture What you Read
Picture What you Read
I. Gallo
Shah Nawaz
Alessandro Calefati
Riccardo La Grassa
Nicola Landro
DiffM
66
0
0
09 Sep 2019
Improving Neural Question Generation using World Knowledge
Improving Neural Question Generation using World Knowledge
D. Gupta
Kaheer Suleman
Mahmoud Adada
Andrew McNamara
Justin Harris
MedIm
82
7
0
09 Sep 2019
Transfer Reward Learning for Policy Gradient-Based Text Generation
Transfer Reward Learning for Policy Gradient-Based Text Generation
James OÑeill
Danushka Bollegala
25
1
0
09 Sep 2019
AtLoc: Attention Guided Camera Localization
AtLoc: Attention Guided Camera Localization
Bing Wang
Changhao Chen
Chris Xiaoxuan Lu
Peijun Zhao
A. Trigoni
Andrew Markham
102
158
0
08 Sep 2019
Aspect-based Sentiment Classification with Aspect-specific Graph
  Convolutional Networks
Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks
Chen Zhang
Qiuchi Li
D. Song
GNN
66
445
0
08 Sep 2019
Conditional Text Generation for Harmonious Human-Machine Interaction
Conditional Text Generation for Harmonious Human-Machine Interaction
Bin Guo
Hao Wang
Yasan Ding
Wei Wu
Shaoyang Hao
Yueqi Sun
Zhiwen Yu
103
4
0
08 Sep 2019
Look and Modify: Modification Networks for Image Captioning
Look and Modify: Modification Networks for Image Captioning
Fawaz Sammani
Mahmoud Elsayed
52
22
0
07 Sep 2019
What can computational models learn from human selective attention? A
  review from an audiovisual crossmodal perspective
What can computational models learn from human selective attention? A review from an audiovisual crossmodal perspective
Di Fu
C. Weber
Guochun Yang
Matthias Kerzel
Weizhi Nan
Pablo V. A. Barros
Haiyan Wu
Xun Liu
S. Wermter
35
0
0
05 Sep 2019
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
Wei Wei
Ling Cheng
Xian-Ling Mao
Guangyou Zhou
Feida Zhu
DiffM
79
19
0
05 Sep 2019
Semantic-Aware Scene Recognition
Semantic-Aware Scene Recognition
Alejandro López-Cifuentes
Marcos Escudero-Viñolo
Jesús Bescós
Álvaro García-Martín
86
106
0
05 Sep 2019
A Better Way to Attend: Attention with Trees for Video Question
  Answering
A Better Way to Attend: Attention with Trees for Video Question Answering
Hongyang Xue
Wenqing Chu
Zhou Zhao
Deng Cai
62
33
0
05 Sep 2019
Image Captioning with Very Scarce Supervised Data: Adversarial
  Semi-Supervised Learning Approach
Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
In So Kweon
SSLVLM
89
56
0
05 Sep 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic
  Labels Improve Image Captioning and Visual Question Answering
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Soravit Changpinyo
Bo Pang
Piyush Sharma
Radu Soricut
ObjD
65
20
0
04 Sep 2019
Do Cross Modal Systems Leverage Semantic Relationships?
Do Cross Modal Systems Leverage Semantic Relationships?
Shah Nawaz
Muhammad Kamran Janjua
I. Gallo
Arif Mahmood
Alessandro Calefati
Faisal Shafait
56
8
0
03 Sep 2019
Encode, Tag, Realize: High-Precision Text Editing
Encode, Tag, Realize: High-Precision Text Editing
Eric Malmi
Sebastian Krause
S. Rothe
Daniil Mirylenka
Aliaksei Severyn
3DV
112
171
0
03 Sep 2019
A Geometry-Sensitive Approach for Photographic Style Classification
A Geometry-Sensitive Approach for Photographic Style Classification
Koustav Ghosal
Mukta Prasad
A. Smolic
GAN
61
6
0
03 Sep 2019
EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks
EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks
Pengfei Zhang
Jianru Xue
Cuiling Lan
Wenjun Zeng
Zhanning Gao
Nanning Zheng
72
85
0
03 Sep 2019
Story-oriented Image Selection and Placement
Story-oriented Image Selection and Placement
Sreyasi Nag Chowdhury
Simon Razniewski
Gerhard Weikum
27
1
0
02 Sep 2019
SumQE: a BERT-based Summary Quality Estimation Model
SumQE: a BERT-based Summary Quality Estimation Model
Stratos Xenouleas
Prodromos Malakasiotis
Marianna Apidianaki
Ion Androutsopoulos
71
37
0
02 Sep 2019
What You See is What You Get: Visual Pronoun Coreference Resolution in
  Dialogues
What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues
Xintong Yu
Hongming Zhang
Yangqiu Song
Yan Song
Changshui Zhang
42
28
0
01 Sep 2019
Phrase Grounding by Soft-Label Chain Conditional Random Field
Phrase Grounding by Soft-Label Chain Conditional Random Field
Jiacheng Liu
Julia Hockenmaier
50
10
0
01 Sep 2019
Humor Detection: A Transformer Gets the Last Laugh
Humor Detection: A Transformer Gets the Last Laugh
Orion Weller
Kevin Seppi
138
123
0
31 Aug 2019
A Semantics-Assisted Video Captioning Model Trained with Scheduled
  Sampling
A Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling
Haoran Chen
Ke Lin
A. Maye
Jianmin Li
Xiaoling Hu
64
48
0
31 Aug 2019
Rethinking Irregular Scene Text Recognition
Rethinking Irregular Scene Text Recognition
Shangbang Long
Yushuo Guan
Bingxuan Wang
Kaigui Bian
Cong Yao
67
8
0
30 Aug 2019
Reflective Decoding Network for Image Captioning
Reflective Decoding Network for Image Captioning
Lei Ke
Wenjie Pei
Ruiyu Li
Xiaoyong Shen
Yu-Wing Tai
ObjD
75
94
0
30 Aug 2019
Translating Math Formula Images to LaTeX Sequences Using Deep Neural
  Networks with Sequence-level Training
Translating Math Formula Images to LaTeX Sequences Using Deep Neural Networks with Sequence-level Training
Zelun Wang
Jyh-Charn S. Liu
29
7
0
29 Aug 2019
Aesthetic Image Captioning From Weakly-Labelled Photographs
Aesthetic Image Captioning From Weakly-Labelled Photographs
Koustav Ghosal
A. Rana
A. Smolic
67
25
0
29 Aug 2019
DFPENet-geology: A Deep Learning Framework for High Precision
  Recognition and Segmentation of Co-seismic Landslides
DFPENet-geology: A Deep Learning Framework for High Precision Recognition and Segmentation of Co-seismic Landslides
Qingsong Xu
Chaojun Ouyang
Tianhai Jiang
Xuanmei Fan
Duoxiang Cheng
AI4CE
46
13
0
28 Aug 2019
Image Captioning with Sparse Recurrent Neural Network
Image Captioning with Sparse Recurrent Neural Network
J. Tan
Chee Seng Chan
Joon Huang Chuah
VLM
56
6
0
28 Aug 2019
Fingerspelling recognition in the wild with iterative visual attention
Fingerspelling recognition in the wild with iterative visual attention
Bowen Shi
Aurora Martinez Del Rio
J. Keane
D. Brentari
G. Shakhnarovich
Karen Livescu
68
63
0
28 Aug 2019
Attention-based Dropout Layer for Weakly Supervised Object Localization
Attention-based Dropout Layer for Weakly Supervised Object Localization
Junsuk Choe
Hyunjung Shim
WSOL
155
368
0
27 Aug 2019
Previous
123...383940...697071
Next