ResearchTrend.AI
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDL AI4TS
97
116
0
20 Feb 2020
Stroke Constrained Attention Network for Online Handwritten Mathematical Expression Recognition
Jiaming Wang
Jun Du
Jianshu Zhang
74
24
0
20 Feb 2020
A Convolutional Baseline for Person Re-Identification Using Vision and Language Descriptions
Ammarah Farooq
Muhammad Awais
F. Yan
J. Kittler
A. Akbari
S. S. Khalid
119
8
0
20 Feb 2020
Deep Fusion of Local and Non-Local Features for Precision Landslide Recognition
Qing Zhu
Lin Chen
Han Hu
Binzhi Xu
Yeting Zhang
Haifeng Li
18
10
0
20 Feb 2020
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
Tianlang Chen
Jiebo Luo
72
69
0
20 Feb 2020
When Radiology Report Generation Meets Knowledge Graph
Yixiao Zhang
Xiaosong Wang
Ziyue Xu
Qihang Yu
Alan Yuille
Daguang Xu
MedIm
90
305
0
19 Feb 2020
CAUSE: Learning Granger Causality from Event Sequences using Attribution Methods
W. Zhang
Thomas Kobber Panum
S. Jha
P. Chalasani
David Page
CML AI4TS
83
49
0
18 Feb 2020
LocoGAN -- Locally Convolutional GAN
Lukasz Struski
Szymon Knop
Jacek Tabor
Wiktor Daniec
Przemysław Spurek
GAN
47
10
0
18 Feb 2020
MAST: A Memory-Augmented Self-supervised Tracker
Zihang Lai
Erika Lu
Weidi Xie
VOS
120
186
0
18 Feb 2020
Neural Attentive Multiview Machines
Oren Barkan
Ori Katz
Noam Koenigstein
HAI
61
18
0
18 Feb 2020
Text Classification with Lexicon from PreAttention Mechanism
Qingbiao Li
Chunhua Wu
K. Zheng
VLM
41
0
0
18 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM AI4TS AI4CE
122
140
0
18 Feb 2020
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework
C. Sur
124
7
0
16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)
C. Sur
56
17
0
15 Feb 2020
Sparse and Structured Visual Attention
Pedro Henrique Martins
S. Becker
Zita Marinho
Michael Arens
81
8
0
13 Feb 2020
SpotNet: Self-Attention Multi-Task Network for Object Detection
Hughes Perreault
Guillaume-Alexandre Bilodeau
Nicolas Saunier
Maguelonne Héritier
177
44
0
13 Feb 2020
Deep Learning for Source Code Modeling and Generation: Models, Applications and Challenges
T. H. Le
Hao Chen
Muhammad Ali Babar
VLM
147
155
0
13 Feb 2020
HAN-ECG: An Interpretable Atrial Fibrillation Detection Model Using Hierarchical Attention Networks
Sajad Mousavi
Fatemeh Afghah
U. Rajendra Acharya
55
97
0
12 Feb 2020
Vision-based Fight Detection from Surveillance Cameras
Seymanur Akti
G. A. Tataroglu
H. K. Ekenel
51
78
0
11 Feb 2020
What Changed Your Mind: The Roles of Dynamic Topics and Discourse in Argumentation Process
Jichuan Zeng
Jing Li
Yulan He
Cuiyun Gao
Michael R. Lyu
Irwin King
68
16
0
10 Feb 2020
Blank Language Models
T. Shen
Victor Quach
Regina Barzilay
Tommi Jaakkola
291
73
0
08 Feb 2020
Attentive Group Equivariant Convolutional Networks
David W. Romero
Erik J. Bekkers
Jakub M. Tomczak
Mark Hoogendoorn
117
91
0
07 Feb 2020
Multimodal Matching Transformer for Live Commenting
Chaoqun Duan
Lei Cui
Shuming Ma
Furu Wei
Conghui Zhu
Tiejun Zhao
33
12
0
07 Feb 2020
Exploiting Temporal Coherence for Multi-modal Video Categorization
Palash Goyal
Saurabh Sahu
Shalini Ghosh
Chul Lee
38
1
0
07 Feb 2020
The Costs and Benefits of Goal-Directed Attention in Deep Convolutional Neural Networks
Xiaoliang Luo
Brett D. Roads
Bradley C. Love
49
18
0
06 Feb 2020
GIM: Gaussian Isolation Machines
Guy Amit
Ishai Rosenberg
Mosh Levy
Ron Bitton
A. Shabtai
Yuval Elovici
55
0
0
06 Feb 2020
Lossless Attention in Convolutional Networks for Facial Expression Recognition in the Wild
Chuan Wang
R. Hu
Min Hu
Jiang-Dong Liu
Ting-fei Ren
Shan He
Ming Jiang
Jing Miao
CVBM
38
5
0
31 Jan 2020
Teaching Machines to Converse
Jiwei Li
98
4
0
31 Jan 2020
Convolutional Hierarchical Attention Network for Query-Focused Video Summarization
Shuwen Xiao
Zhou Zhao
Zijian Zhang
Ziyu Guan
Deng Cai
82
48
0
31 Jan 2020
Dual Convolutional LSTM Network for Referring Image Segmentation
Linwei Ye
Zhi Liu
Yang Wang
84
46
0
30 Jan 2020
Evaluating the Progress of Deep Learning for Visual Relational Concepts
Sebastian Stabinger
Peer David
J. Piater
A. Rodríguez-Sánchez
86
19
0
29 Jan 2020
aiTPR: Attribute Interaction-Tensor Product Representation for Image Caption
C. Sur
47
8
0
27 Jan 2020
Uncertainty based Class Activation Maps for Visual Question Answering
Badri N. Patro
Mayank Lunayach
Vinay P. Namboodiri
FAtt UQCV
44
1
0
23 Jan 2020
Deep Bayesian Network for Visual Question Generation
Badri N. Patro
V. Kurmi
Sandeep Kumar
Vinay P. Namboodiri
BDL
52
18
0
23 Jan 2020
Robust Explanations for Visual Question Answering
Badri N. Patro
Shivansh Patel
Vinay P. Namboodiri
OOD AAML
73
19
0
23 Jan 2020
Visual Summary of Value-level Feature Attribution in Prediction Classes with Recurrent Neural Networks
Chuan-Chi Wang
Xumeng Wang
K. Ma
FAtt HAI
42
1
0
23 Jan 2020
Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models
M. Farazi
Salman H. Khan
Nick Barnes
81
18
0
20 Jan 2020
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Ruiyi Zhang
Changyou Chen
Zhe Gan
Zheng Wen
Wenlin Wang
Lawrence Carin
81
7
0
20 Jan 2020
Human-Aware Motion Deblurring
Ziyi Shen
Wenguan Wang
Xiankai Lu
Jianbing Shen
Haibin Ling
Tingfa Xu
Ling Shao
3DH
116
290
0
19 Jan 2020
Text-to-Image Generation with Attention Based Recurrent Neural Networks
Tehseen Zia
Shahan Arif
Shakeeb Murtaza
M. A. Ullah
35
7
0
18 Jan 2020
Multi-step Joint-Modality Attention Network for Scene-Aware Dialogue System
Yun-Wei Chu
Kuan-Yen Lin
Chao-Chun Hsu
Lun-Wei Ku
140
22
0
17 Jan 2020
Multimodal Story Generation on Plural Images
Jing Jiang
DiffM
18
0
0
16 Jan 2020
Delving Deeper into the Decoder for Video Captioning
Haoran Chen
Jianmin Li
Xiaolin Hu
73
35
0
16 Jan 2020
A "Network Pruning Network" Approach to Deep Model Compression
Vinay Kumar Verma
Pravendra Singh
Vinay P. Namboodiri
Piyush Rai
3DPC VLM
58
8
0
15 Jan 2020
Show, Recall, and Tell: Image Captioning with Recall Mechanism
Li Wang
Zechen Bai
Yonghua Zhang
Hongtao Lu
79
67
0
15 Jan 2020
Visual Storytelling via Predicting Anchor Word Embeddings in the Stories
Bowen Zhang
Hexiang Hu
Fei Sha
46
6
0
13 Jan 2020
In Defense of Grid Features for Visual Question Answering
Huaizu Jiang
Ishan Misra
Marcus Rohrbach
Erik Learned-Miller
Xinlei Chen
OOD ObjD
105
320
0
10 Jan 2020
Visual Question Answering on 360° Images
Shih-Han Chou
Wei-Lun Chao
Wei-Sheng Lai
Min Sun
Ming-Hsuan Yang
59
22
0
10 Jan 2020
On Interpretability of Artificial Neural Networks: A Survey
Fenglei Fan
Jinjun Xiong
Mengzhou Li
Ge Wang
AAML AI4CE
96
318
0
08 Jan 2020
Res3ATN -- Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos
Naina Dhingra
A. Kunz
3DPC SLR
82
36
0
04 Jan 2020