ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Image-to-Image Retrieval by Learning Similarity between Scene Graphs
Image-to-Image Retrieval by Learning Similarity between Scene Graphs
Sangwoong Yoon
Woo-Young Kang
Sungwook Jeon
SeongEun Lee
C. Han
Jonghun Park
Eun-Sol Kim
3DH
96
45
0
29 Dec 2020
Coarse to Fine: Multi-label Image Classification with Global/Local
  Attention
Coarse to Fine: Multi-label Image Classification with Global/Local Attention
Fan Lyu
Fuyuan Hu
Victor S. Sheng
Zhengtian Wu
Qiming Fu
Baochuan Fu
36
6
0
26 Dec 2020
REM-Net: Recursive Erasure Memory Network for Commonsense Evidence
  Refinement
REM-Net: Recursive Erasure Memory Network for Commonsense Evidence Refinement
Yinya Huang
Meng Fang
Xunlin Zhan
Qingxing Cao
Xiaodan Liang
Liang Lin
64
9
0
24 Dec 2020
LCEval: Learned Composite Metric for Caption Evaluation
LCEval: Learned Composite Metric for Caption Evaluation
Naeha Sharif
Lyndon White
Bennamoun
Wei Liu
Syed Afaq Ali Shah
51
8
0
24 Dec 2020
SubICap: Towards Subword-informed Image Captioning
SubICap: Towards Subword-informed Image Captioning
Naeha Sharif
Bennamoun
Wei Liu
Syed Afaq Ali Shah
47
2
0
24 Dec 2020
ConvMath: A Convolutional Sequence Network for Mathematical Expression
  Recognition
ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition
Zuoyu Yan
Xiaode Zhang
Liangcai Gao
Ke Yuan
Zhi Tang
63
17
0
23 Dec 2020
A Survey on Visual Transformer
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
237
2,294
0
23 Dec 2020
Image to Bengali Caption Generation Using Deep CNN and Bidirectional
  Gated Recurrent Unit
Image to Bengali Caption Generation Using Deep CNN and Bidirectional Gated Recurrent Unit
Albay Faruk
Hasan Al Faraby
M. M. Azad
Md. Riduyan Fedous
Md. Kishor Morol
49
18
0
22 Dec 2020
FcaNet: Frequency Channel Attention Networks
FcaNet: Frequency Channel Attention Networks
Zequn Qin
Pengyi Zhang
Leilei Gan
Xi Li
122
715
0
22 Dec 2020
Human Action Recognition from Various Data Modalities: A Review
Human Action Recognition from Various Data Modalities: A Review
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
MU
184
536
0
22 Dec 2020
Neural Methods for Effective, Efficient, and Exposure-Aware Information
  Retrieval
Neural Methods for Effective, Efficient, and Exposure-Aware Information Retrieval
Bhaskar Mitra
83
6
0
21 Dec 2020
Knowledge as Invariance -- History and Perspectives of
  Knowledge-augmented Machine Learning
Knowledge as Invariance -- History and Perspectives of Knowledge-augmented Machine Learning
A. Sagel
Amit Sahu
Stefan Matthes
H. Pfeifer
Tianming Qiu
Harald Ruess
Hao Shen
Julian Wormann
61
3
0
21 Dec 2020
Improving unsupervised anomaly localization by applying multi-scale
  memories to autoencoders
Improving unsupervised anomaly localization by applying multi-scale memories to autoencoders
Yifei Yang
Shibing Xiang
Ruixiang Zhang
57
9
0
21 Dec 2020
ShineOn: Illuminating Design Choices for Practical Video-based Virtual
  Clothing Try-on
ShineOn: Illuminating Design Choices for Practical Video-based Virtual Clothing Try-on
Gaurav Kuppa
Andrew Jong
Vera Liu
Ziwei Liu
Teng-Sheng Moh
CVBM
76
21
0
18 Dec 2020
Attention-based Image Upsampling
Attention-based Image Upsampling
Souvik Kundu
Hesham Mostafa
S. N. Sridhar
Sairam Sundaresan
SupR
37
11
0
17 Dec 2020
Transformer Interpretability Beyond Attention Visualization
Transformer Interpretability Beyond Attention Visualization
Hila Chefer
Shir Gur
Lior Wolf
152
681
0
17 Dec 2020
XAI-P-T: A Brief Review of Explainable Artificial Intelligence from
  Practice to Theory
XAI-P-T: A Brief Review of Explainable Artificial Intelligence from Practice to Theory
Nazanin Fouladgar
Kary Främling
XAI
39
4
0
17 Dec 2020
AutoCaption: Image Captioning with Neural Architecture Search
AutoCaption: Image Captioning with Neural Architecture Search
Xinxin Zhu
Weining Wang
Longteng Guo
Jing Liu
102
9
0
16 Dec 2020
AIST: An Interpretable Attention-based Deep Learning Model for Crime
  Prediction
AIST: An Interpretable Attention-based Deep Learning Model for Crime Prediction
Yeasir Rayhan
T. Hashem
43
24
0
16 Dec 2020
Two-Stage Copy-Move Forgery Detection with Self Deep Matching and
  Proposal SuperGlue
Two-Stage Copy-Move Forgery Detection with Self Deep Matching and Proposal SuperGlue
Yaqi Liu
Chao Xia
Xiaobin Zhu
Shengwei Xu
3DPC
52
57
0
16 Dec 2020
Intrinsic Image Captioning Evaluation
Intrinsic Image Captioning Evaluation
Chao Zeng
Sam Kwong
59
1
0
14 Dec 2020
TDAF: Top-Down Attention Framework for Vision Tasks
TDAF: Top-Down Attention Framework for Vision Tasks
Bo Pang
Yizhuo Li
Jiefeng Li
Muchen Li
Hanwen Cao
Cewu Lu
91
10
0
14 Dec 2020
Learning Contextual Causality from Time-consecutive Images
Learning Contextual Causality from Time-consecutive Images
Hongming Zhang
Yintong Huo
Xinran Zhao
Yangqiu Song
Dan Roth
CML
61
6
0
13 Dec 2020
Demystifying Deep Neural Networks Through Interpretation: A Survey
Demystifying Deep Neural Networks Through Interpretation: A Survey
Giang Dao
Minwoo Lee
FaMLFAtt
66
1
0
13 Dec 2020
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision
  and Language Research in Turkish
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish
Begum Citamak
Ozan Caglayan
Menekse Kuyu
Erkut Erdem
Aykut Erdem
Pranava Madhyastha
Lucia Specia
86
9
0
13 Dec 2020
Improving Image Captioning by Leveraging Intra- and Inter-layer Global
  Representation in Transformer Network
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Jiayi Ji
Yunpeng Luo
Xiaoshuai Sun
Fuhai Chen
Gen Luo
Yongjian Wu
Yue Gao
Rongrong Ji
ViT
113
178
0
13 Dec 2020
Dependency Decomposition and a Reject Option for Explainable Models
Dependency Decomposition and a Reject Option for Explainable Models
Jan Kronenberger
Anselm Haselhoff
FAttAAML
75
8
0
11 Dec 2020
Uncertainty-Aware Deep Calibrated Salient Object Detection
Uncertainty-Aware Deep Calibrated Salient Object Detection
Jing Zhang
Yuchao Dai
Xin Yu
Mehrtash Harandi
Nick Barnes
Leonid Sigal
UQCVEDL
79
6
0
10 Dec 2020
Look Before you Speak: Visually Contextualized Utterances
Look Before you Speak: Visually Contextualized Utterances
Paul Hongsuck Seo
Arsha Nagrani
Cordelia Schmid
101
67
0
10 Dec 2020
Image Captioning with Context-Aware Auxiliary Guidance
Image Captioning with Context-Aware Auxiliary Guidance
Zeliang Song
Xiaofei Zhou
Zhendong Mao
Jianlong Tan
88
31
0
10 Dec 2020
On the Binding Problem in Artificial Neural Networks
On the Binding Problem in Artificial Neural Networks
Klaus Greff
Sjoerd van Steenkiste
Jürgen Schmidhuber
OCL
316
267
0
09 Dec 2020
End-to-end Handwritten Paragraph Text Recognition Using a Vertical
  Attention Network
End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network
Denis Coquenet
Clément Chatelain
Thierry Paquet
AI4TS
81
82
0
07 Dec 2020
Robust Image Captioning
Robust Image Captioning
Daniel Yarnell
Xian Wang
56
0
0
06 Dec 2020
A Survey on Deep Learning for Human Mobility
A Survey on Deep Learning for Human Mobility
Massimiliano Luca
Gianni Barlacchi
Bruno Lepri
Luca Pappalardo
HAI
77
223
0
04 Dec 2020
Understanding Attention: In Minds and Machines
Understanding Attention: In Minds and Machines
S. P. Sawant
Shruti Singh
42
1
0
04 Dec 2020
Spatial-Temporal Alignment Network for Action Recognition and Detection
Spatial-Temporal Alignment Network for Action Recognition and Detection
Junwei Liang
Liangliang Cao
Xuehan Xiong
Ting Yu
Alexander G. Hauptmann
3DPC
70
9
0
04 Dec 2020
Understanding Guided Image Captioning Performance across Domains
Understanding Guided Image Captioning Performance across Domains
Edwin G. Ng
Bo Pang
P. Sharma
Radu Soricut
129
25
0
04 Dec 2020
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Dave Zhenyu Chen
A. Gholami
Matthias Nießner
Angel X. Chang
3DPC
192
176
0
03 Dec 2020
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling
Jing Su
Qingyun Dai
Frank Guerin
Mian Zhou
79
24
0
03 Dec 2020
Lookahead optimizer improves the performance of Convolutional
  Autoencoders for reconstruction of natural images
Lookahead optimizer improves the performance of Convolutional Autoencoders for reconstruction of natural images
Sayan Nag
DRL
31
2
0
03 Dec 2020
Generating Descriptions for Sequential Images with Local-Object
  Attention and Global Semantic Context Modelling
Generating Descriptions for Sequential Images with Local-Object Attention and Global Semantic Context Modelling
Jing Su
Chenghua Lin
Mian Zhou
Qingyun Dai
Haoyu Lv
41
2
0
02 Dec 2020
Learning Spatial Attention for Face Super-Resolution
Learning Spatial Attention for Face Super-Resolution
Chaofeng Chen
Dihong Gong
Hao Wang
Zhifeng Li
Kwan-Yee K. Wong
CVBMSupR3DH
97
164
0
02 Dec 2020
Inductive Biases for Deep Learning of Higher-Level Cognition
Inductive Biases for Deep Learning of Higher-Level Cognition
Anirudh Goyal
Yoshua Bengio
AI4CE
122
366
0
30 Nov 2020
Language-Driven Region Pointer Advancement for Controllable Image
  Captioning
Language-Driven Region Pointer Advancement for Controllable Image Captioning
Annika Lindh
R. Ross
John D. Kelleher
48
14
0
30 Nov 2020
Blind signal decomposition of various word embeddings based on join and
  individual variance explained
Blind signal decomposition of various word embeddings based on join and individual variance explained
Yikai Wang
Weijian Li
35
0
0
30 Nov 2020
An Investigation of Language Model Interpretability via Sentence Editing
An Investigation of Language Model Interpretability via Sentence Editing
Samuel Stevens
Yu-Chuan Su
LRM
48
9
0
28 Nov 2020
FFCI: A Framework for Interpretable Automatic Evaluation of
  Summarization
FFCI: A Framework for Interpretable Automatic Evaluation of Summarization
Fajri Koto
Timothy Baldwin
Jey Han Lau
HILM
113
37
0
27 Nov 2020
Joint Extraction of Entity and Relation with Information Redundancy
  Elimination
Joint Extraction of Entity and Relation with Information Redundancy Elimination
Yuan-Chung Shen
Jungang Han
46
1
0
27 Nov 2020
Reflective-Net: Learning from Explanations
Reflective-Net: Learning from Explanations
Johannes Schneider
Michalis Vlachos
FAttOffRLLRM
128
17
0
27 Nov 2020
Deep Metric Learning-based Image Retrieval System for Chest Radiograph
  and its Clinical Applications in COVID-19
Deep Metric Learning-based Image Retrieval System for Chest Radiograph and its Clinical Applications in COVID-19
Aoxiao Zhong
Xiang Li
Dufan Wu
Hui Ren
Kyungsang Kim
...
W. Chung
Ning Guo
I. Dayan
Mannudeep K. Kalra
Quanzheng Li
71
66
0
26 Nov 2020
Previous
123...252627...697071
Next