ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Explainable Artificial Intelligence (XAI): Concepts, Taxonomies,
  Opportunities and Challenges toward Responsible AI
Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI
Alejandro Barredo Arrieta
Natalia Díaz Rodríguez
Javier Del Ser
Adrien Bennetot
Siham Tabik
...
S. Gil-Lopez
Daniel Molina
Richard Benjamins
Raja Chatila
Francisco Herrera
XAI
351
6,391
0
22 Oct 2019
Weakly-Supervised Completion Moment Detection using Temporal Attention
Weakly-Supervised Completion Moment Detection using Temporal Attention
Farnoosh Heidarivincheh
Majid Mirmehdi
Dima Damen
40
9
0
22 Oct 2019
Fixed Pattern Noise Reduction for Infrared Images Based on Cascade
  Residual Attention CNN
Fixed Pattern Noise Reduction for Infrared Images Based on Cascade Residual Attention CNN
Juntao Guan
R. Lai
Ai Xiong
Zesheng Liu
Lin Gu
53
75
0
22 Oct 2019
Drivers Drowsiness Detection using Condition-Adaptive Representation
  Learning Framework
Drivers Drowsiness Detection using Condition-Adaptive Representation Learning Framework
Jongmin Yu
Sangwook Park
Sangwook Lee
M. Jeon
MedIm
83
86
0
22 Oct 2019
Multi-Resolution Weak Supervision for Sequential Data
Multi-Resolution Weak Supervision for Sequential Data
Frederic Sala
P. Varma
Jason Alan Fries
Daniel Y. Fu
Shiori Sagawa
...
A. Ramamoorthy
K. Xiao
Kayvon Fatahalian
J. Priest
Christopher Ré
NoLa
166
30
0
21 Oct 2019
A Survey and Taxonomy of Adversarial Neural Networks for Text-to-Image
  Synthesis
A Survey and Taxonomy of Adversarial Neural Networks for Text-to-Image Synthesis
Jorge Agnese
Jonathan Herrera
Haicheng Tao
Xingquan Zhu
EGVM
95
103
0
21 Oct 2019
Attention Enriched Deep Learning Model for Breast Tumor Segmentation in
  Ultrasound Images
Attention Enriched Deep Learning Model for Breast Tumor Segmentation in Ultrasound Images
Aleksandar Vakanski
Min Xian
Phoebe E. Freer
90
144
0
20 Oct 2019
Endowing Deep 3D Models with Rotation Invariance Based on Principal
  Component Analysis
Endowing Deep 3D Models with Rotation Invariance Based on Principal Component Analysis
Zelin Xiao
Hongxin Lin
Renjie Li
Hongyang Chao
Shengyong Ding
46
31
0
20 Oct 2019
Learning to Answer Subjective, Specific Product-Related Queries using
  Customer Reviews by Adversarial Domain Adaptation
Learning to Answer Subjective, Specific Product-Related Queries using Customer Reviews by Adversarial Domain Adaptation
Manirupa Das
Zhen Wang
Evan Jaffe
Madhuja Chattopadhyay
Eric Fosler-Lussier
R. Ramnath
AAML
80
2
0
18 Oct 2019
Cross Attention Network for Few-shot Classification
Cross Attention Network for Few-shot Classification
Rui Hou
Hong Chang
Bingpeng Ma
Shiguang Shan
Xilin Chen
282
647
0
17 Oct 2019
Exploring Overall Contextual Information for Image Captioning in
  Human-Like Cognitive Style
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style
Hongwei Ge
Zehang Yan
Kai Zhang
Mingde Zhao
Liang Sun
59
25
0
15 Oct 2019
Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating
  Referee
Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee
Shuangjie Xu
Feng Xu
Yu Cheng
Pan Zhou
35
2
0
14 Oct 2019
Dynamic Attention Networks for Task Oriented Grounding
Dynamic Attention Networks for Task Oriented Grounding
S. Dasgupta
Badri N. Patro
Vinay P. Namboodiri
86
1
0
14 Oct 2019
Snow avalanche segmentation in SAR images with Fully Convolutional
  Neural Networks
Snow avalanche segmentation in SAR images with Fully Convolutional Neural Networks
F. Bianchi
J. Grahn
M. Eckerstorfer
E. Malnes
H. Vickers
40
48
0
11 Oct 2019
Finding Interpretable Concept Spaces in Node Embeddings using Knowledge
  Bases
Finding Interpretable Concept Spaces in Node Embeddings using Knowledge Bases
Maximilian Idahl
Megha Khosla
Avishek Anand
33
10
0
11 Oct 2019
Multi-modal Deep Analysis for Multimedia
Multi-modal Deep Analysis for Multimedia
Wenwu Zhu
Xin Eric Wang
Hongzhi Li
76
43
0
11 Oct 2019
Referring Expression Object Segmentation with Caption-Aware Consistency
Referring Expression Object Segmentation with Caption-Aware Consistency
Yi-Wen Chen
Yi-Hsuan Tsai
Tiantian Wang
Yen-Yu Lin
Ming-Hsuan Yang
EgoV
71
87
0
10 Oct 2019
Semantic-aware Image Deblurring
Semantic-aware Image Deblurring
Fuhai Chen
Rongrong Ji
Chengpeng Dai
Xiaoshuai Sun
Chia-Wen Lin
Jiayi Ji
Baochang Zhang
Feiyue Huang
Liujuan Cao
BDLVLM
113
6
0
09 Oct 2019
Improved Res2Net model for Person re-identification
Improved Res2Net model for Person re-identification
Zongjing Cao
H. Lee
116
2
0
08 Oct 2019
Modulated Self-attention Convolutional Network for VQA
Modulated Self-attention Convolutional Network for VQA
Jean-Benoit Delbrouck
Antoine Maiorca
Nathan Hubens
Stéphane Dupont
29
1
0
08 Oct 2019
Graph Few-shot Learning via Knowledge Transfer
Graph Few-shot Learning via Knowledge Transfer
Huaxiu Yao
Chuxu Zhang
Ying Wei
Meng Jiang
Suhang Wang
Junzhou Huang
Nitesh Chawla
Z. Li
135
168
0
07 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic
  Explainability
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
164
29
0
07 Oct 2019
Adversarial reconstruction for Multi-modal Machine Translation
Adversarial reconstruction for Multi-modal Machine Translation
Jean-Benoit Delbrouck
Stéphane Dupont
GAN
158
2
0
07 Oct 2019
On Leveraging the Visual Modality for Neural Machine Translation
On Leveraging the Visual Modality for Neural Machine Translation
Vikas Raunak
Sang Keun Choe
Quanyang Lu
Yi Xu
Florian Metze
38
11
0
07 Oct 2019
Compositional Generalization for Primitive Substitutions
Compositional Generalization for Primitive Substitutions
Yuanpeng Li
Liang Zhao
Jianyu Wang
Joel Hestness
77
87
0
07 Oct 2019
MASTER: Multi-Aspect Non-local Network for Scene Text Recognition
MASTER: Multi-Aspect Non-local Network for Scene Text Recognition
Ning Lu
Wenwen Yu
Xianbiao Qi
Yihao Chen
Ping Gong
Rong Xiao
Xiang Bai
70
158
0
07 Oct 2019
Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention
  and Spatial Memory
Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention and Spatial Memory
A. Vasudevan
Ahmed K. Farahat
Chetan Gupta
LM&Ro
81
2
0
04 Oct 2019
Graph Analysis and Graph Pooling in the Spatial Domain
Graph Analysis and Graph Pooling in the Spatial Domain
M. Rahmani
Maria Liakata
GNN
61
3
0
03 Oct 2019
Residual Attention Graph Convolutional Network for Geometric 3D Scene
  Classification
Residual Attention Graph Convolutional Network for Geometric 3D Scene Classification
Albert Mosella-Montoro
Javier Ruiz-Hidalgo
3DPC
128
8
0
30 Sep 2019
DeepUSPS: Deep Robust Unsupervised Saliency Prediction With
  Self-Supervision
DeepUSPS: Deep Robust Unsupervised Saliency Prediction With Self-Supervision
D. Nguyen
Maximilian Dax
Chaithanya Kumar Mummadi
Thi Phuong Nhung Ngo
T. Nguyen
Zhongyu Lou
Thomas Brox
114
70
0
28 Sep 2019
The Detection of Distributional Discrepancy for Text Generation
The Detection of Distributional Discrepancy for Text Generation
Xingyuan Chen
Ping Cai
Peng Jin
Haokun Du
Hongjun Wang
Xingyu Dai
Jiajun Chen
43
0
0
28 Sep 2019
Imitation Learning Based on Bilateral Control for Human-Robot
  Cooperation
Imitation Learning Based on Bilateral Control for Human-Robot Cooperation
Ayumu Sasagawa
K. Fujimoto
S. Sakaino
T. Tsuji
53
2
0
28 Sep 2019
Learning Category Correlations for Multi-label Image Recognition with
  Graph Networks
Learning Category Correlations for Multi-label Image Recognition with Graph Networks
Qing Li
Xiaojiang Peng
Yu Qiao
Qiang Peng
51
22
0
28 Sep 2019
Interpreting Undesirable Pixels for Image Classification on Black-Box
  Models
Interpreting Undesirable Pixels for Image Classification on Black-Box Models
Sin-Han Kang
Hong G Jung
Seong-Whan Lee
FAtt
60
3
0
27 Sep 2019
Video-Based Convolutional Attention for Person Re-Identification
Video-Based Convolutional Attention for Person Re-Identification
Marco Zamprogno
Marco Passon
N. Martinel
G. Serra
G. Lancioni
C. Micheloni
C. Tasso
G. Foresti
130
1
0
26 Sep 2019
Multi-grained Attention Networks for Single Image Super-Resolution
Multi-grained Attention Networks for Single Image Super-Resolution
Huapeng Wu
Zhengxia Zou
Jie Gui
W. Zeng
Jieping Ye
Jun Zhang
Hongyi Liu
Zhihui Wei
SupR
56
60
0
26 Sep 2019
Gated Channel Transformation for Visual Recognition
Gated Channel Transformation for Visual Recognition
Zongxin Yang
Linchao Zhu
Yu Wu
Yezhou Yang
ViT
67
212
0
25 Sep 2019
Attention Interpretability Across NLP Tasks
Attention Interpretability Across NLP Tasks
Shikhar Vashishth
Shyam Upadhyay
Gaurav Singh Tomar
Manaal Faruqui
XAIMILM
97
176
0
24 Sep 2019
Improving Noise Robustness In Speaker Identification Using A Two-Stage
  Attention Model
Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model
Yanpei Shi
Qiang Huang
Thomas Hain
105
1
0
24 Sep 2019
Accept Synthetic Objects as Real: End-to-End Training of Attentive Deep
  Visuomotor Policies for Manipulation in Clutter
Accept Synthetic Objects as Real: End-to-End Training of Attentive Deep Visuomotor Policies for Manipulation in Clutter
P. Abolghasemi
Ladislau Bölöni
OffRL
85
10
0
24 Sep 2019
Paying Attention to Function Words
Paying Attention to Function Words
Shane Steinert-Threlkeld
31
3
0
24 Sep 2019
Where to Look Next: Unsupervised Active Visual Exploration on 360°
  Input
Where to Look Next: Unsupervised Active Visual Exploration on 360° Input
Soroush Seifi
Tinne Tuytelaars
70
10
0
23 Sep 2019
Learning Visual Relation Priors for Image-Text Matching and Image
  Captioning with Neural Scene Graph Generators
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
67
37
0
22 Sep 2019
NeuroVectorizer: End-to-End Vectorization with Deep Reinforcement
  Learning
NeuroVectorizer: End-to-End Vectorization with Deep Reinforcement Learning
Ameer Haj-Ali
Nesreen Ahmed
Theodore L. Willke
Sophia Shao
Krste Asanović
Ion Stoica
92
101
0
20 Sep 2019
Goal-Embedded Dual Hierarchical Model for Task-Oriented Dialogue
  Generation
Goal-Embedded Dual Hierarchical Model for Task-Oriented Dialogue Generation
Yi-An Lai
Arshit Gupta
Yi Zhang
49
1
0
19 Sep 2019
Adaptively Aligned Image Captioning via Adaptive Attention Time
Adaptively Aligned Image Captioning via Adaptive Attention Time
Lun Huang
Wenmin Wang
Yaxian Xia
Jie Chen
83
63
0
19 Sep 2019
RUN through the Streets: A New Dataset and Baseline Models for Realistic
  Urban Navigation
RUN through the Streets: A New Dataset and Baseline Models for Realistic Urban Navigation
Tzuf Paz-Argaman
Reut Tsarfaty
70
20
0
19 Sep 2019
Large-scale representation learning from visually grounded untranscribed
  speech
Large-scale representation learning from visually grounded untranscribed speech
Gabriel Ilharco
Yuan Zhang
Jason Baldridge
SSL
87
61
0
19 Sep 2019
ContCap: A scalable framework for continual image captioning
ContCap: A scalable framework for continual image captioning
Giang Nguyen
Tae Joon Jun
T. Tran
Tolcha Yalew
Daeyoung Kim
VLMCLL
73
10
0
19 Sep 2019
Pose-aware Multi-level Feature Network for Human Object Interaction
  Detection
Pose-aware Multi-level Feature Network for Human Object Interaction Detection
Bo Wan
Desen Zhou
Yongfei Liu
Rongjie Li
Xuming He
76
200
0
18 Sep 2019
Previous
123...373839...697071
Next