ResearchTrend.AI

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,512 papers shown
Title
Isomer: Transfer enhanced Dual-Channel Heterogeneous Dependency Attention Network for Aspect-based Sentiment Classification
Yukun Cao
Yijia Tang
Ziyue Wei
Chengkun Jin
Zeyu Miao
Yixin Fang
Haizhou Du
Feifei Xu
27
0
0
21 Nov 2021
AGA-GAN: Attribute Guided Attention Generative Adversarial Network with U-Net for Face Hallucination
Abhishek Srivastava
S. Chanda
Umapada Pal
GAN
CVBM
35
10
0
20 Nov 2021
Combined Scaling for Zero-shot Transfer Learning
Hieu H. Pham
Zihang Dai
Golnaz Ghiasi
Kenji Kawaguchi
Hanxiao Liu
...
Yi-Ting Chen
Minh-Thang Luong
Yonghui Wu
Mingxing Tan
Quoc V. Le
VLM
17
194
0
19 Nov 2021
ClipCap: CLIP Prefix for Image Captioning
Ron Mokady
Amir Hertz
Amit H. Bermano
CLIP
VLM
28
660
0
18 Nov 2021
Image-specific Convolutional Kernel Modulation for Single Image Super-resolution
Yuanfei Huang
Jie Li
Yanting Hu
Xinbo Gao
Huan Huang
SupR
29
0
0
16 Nov 2021
Attention Mechanisms in Computer Vision: A Survey
Meng-Hao Guo
Tianhan Xu
Jiangjiang Liu
Zheng-Ning Liu
Peng-Tao Jiang
Tai-Jiang Mu
Song-Hai Zhang
Ralph Robert Martin
Ming-Ming Cheng
Shimin Hu
19
1,644
0
15 Nov 2021
Fingerprint Presentation Attack Detection by Channel-wise Feature Denoising
Feng Liu
Zhe Kong
Haozhe Liu
Wentian Zhang
Linlin Shen
AAML
38
24
0
15 Nov 2021
A Probabilistic Hard Attention Model For Sequentially Observed Scenes
Samrudhdhi B. Rangrej
James J. Clark
24
12
0
15 Nov 2021
Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks
Arulkumar Subramaniam
Jayesh Vaidya
Muhammed Ameen
Athira M. Nambiar
Anurag Mittal
30
7
0
14 Nov 2021
Where to Look: A Unified Attention Model for Visual Recognition with Reinforcement Learning
Gang Chen
22
3
0
13 Nov 2021
Yaw-Guided Imitation Learning for Autonomous Driving in Urban Environments
Yandong Liu
Chengzhong Xu
Hui Kong
21
0
0
11 Nov 2021
Learning to ignore: rethinking attention in CNNs
Firas Laakom
K. Chumachenko
Jenni Raitoharju
Alexandros Iosifidis
Moncef Gabbouj
68
7
0
10 Nov 2021
Explaining Face Presentation Attack Detection Using Natural Language
H. Mirzaalian
Mohamed E. Hussein
L. Spinoulas
Jonathan May
Wael AbdAlmageed
CVBM
FAtt
AAML
36
5
0
08 Nov 2021
Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation
Fenglin Liu
Chenyu You
Xian Wu
Shen Ge
Sheng Wang
Xu Sun
MedIm
81
92
0
08 Nov 2021
"How Does It Detect A Malicious App?" Explaining the Predictions of AI-based Android Malware Detector
Zhi Lu
V. Thing
AAML
24
4
0
06 Nov 2021
The Curious Layperson: Fine-Grained Image Recognition without Expert Labels
Subhabrata Choudhury
Iro Laina
Christian Rupprecht
Andrea Vedaldi
VLM
38
9
0
05 Nov 2021
An Entropy-guided Reinforced Partial Convolutional Network for Zero-Shot Learning
Yun Yvonna Li
Zhe Liu
L. Yao
Xianzhi Wang
Julian McAuley
Xiaojun Chang
37
21
0
03 Nov 2021
A Simple Approach to Image Tilt Correction with Self-Attention MobileNet for Smartphones
Siddhant Garg
D. Mohanty
S. Thota
Sukumar Moharana
ViT
24
2
0
31 Oct 2021
Attacking Video Recognition Models with Bullet-Screen Comments
Kai-xiang Chen
Zhipeng Wei
Jingjing Chen
Zuxuan Wu
Yu-Gang Jiang
AAML
34
22
0
29 Oct 2021
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition
Masahiro Mitsuhara
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
27
1
0
29 Oct 2021
Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces
Kirill Struminsky
Artyom Gadetsky
D. Rakitin
Danil Karpushkin
Dmitry Vetrov
BDL
32
9
0
28 Oct 2021
Discovering Non-monotonic Autoregressive Orderings with Variational Inference
Xuanlin Li
Brandon Trabucco
Dongmin Park
Michael Luo
S. Shen
Trevor Darrell
Yang Gao
27
12
0
27 Oct 2021
Understanding Interlocking Dynamics of Cooperative Rationalization
Mo Yu
Yang Zhang
Shiyu Chang
Tommi Jaakkola
32
41
0
26 Oct 2021
BioIE: Biomedical Information Extraction with Multi-head Attention Enhanced Graph Convolutional Network
Jialun Wu
Yang Liu
Zeyu Gao
Tieliang Gong
Chunbao Wang
Chen Li
29
16
0
26 Oct 2021
Transferring Domain-Agnostic Knowledge in Video Question Answering
Tianran Wu
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
Haruo Takemura
25
8
0
26 Oct 2021
Alignment Attention by Matching Key and Query Distributions
Shujian Zhang
Xinjie Fan
Huangjie Zheng
Korawat Tanwisuth
Mingyuan Zhou
OOD
40
10
0
25 Oct 2021
Simple Dialogue System with AUDITED
Eugenio Clerico
Piotr Koniusz
21
2
0
22 Oct 2021
Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer Aggregation
Jingyu Zhao
Yanwen Fang
Guodong Li
27
23
0
22 Oct 2021
Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning
Yang Yang
Haoran Wei
Hengshu Zhu
Dianhai Yu
Hui Xiong
Jian Yang
SSL
14
33
0
22 Oct 2021
SciCap: Generating Captions for Scientific Figures
Ting-Yao Hsu
C. Lee Giles
Ting-Hao 'Kenneth' Huang
27
85
0
22 Oct 2021
MHAttnSurv: Multi-Head Attention for Survival Prediction Using Whole-Slide Pathology Images
Shuai Jiang
A. Suriawinata
Saeed Hassanpour
24
26
0
22 Oct 2021
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation
Khoa T. Vo
Kevin Hyekang Joo
Kashu Yamazaki
Sang Truong
Kris Kitani
Minh-Triet Tran
Ngan Le
EgoV
64
17
0
21 Oct 2021
Video and Text Matching with Conditioned Embeddings
Ameen Ali
Idan Schwartz
Tamir Hazan
Lior Wolf
97
13
0
21 Oct 2021
Self-Supervision and Spatial-Sequential Attention Based Loss for Multi-Person Pose Estimation
Haiyang Liu
Dingli Luo
Songlin Du
T. Ikenaga
3DH
38
0
0
20 Oct 2021
A Self-Explainable Stylish Image Captioning Framework via Multi-References
Chengxi Li
Brent Harrison
26
0
0
20 Oct 2021
Inductive Biases and Variable Creation in Self-Attention Mechanisms
Benjamin L. Edelman
Surbhi Goel
Sham Kakade
Cyril Zhang
27
118
0
19 Oct 2021
Compositional Attention: Disentangling Search and Retrieval
Sarthak Mittal
Sharath Chandra Raparthy
Irina Rish
Yoshua Bengio
Guillaume Lajoie
22
20
0
18 Oct 2021
Deep Transfer Learning & Beyond: Transformer Language Models in Information Systems Research
Ross Gruetzemacher
D. Paradice
35
30
0
18 Oct 2021
Visual-aware Attention Dual-stream Decoder for Video Captioning
Zhixin Sun
Xian Zhong
Shuqin Chen
Lin Li
Luo Zhong
36
3
0
16 Oct 2021
Multimodal Dialogue Response Generation
Qingfeng Sun
Yujing Wang
Can Xu
Kai Zheng
Yaming Yang
Huang Hu
Fei Xu
Jessica Zhang
Xiubo Geng
Daxin Jiang
26
43
0
16 Oct 2021
Self-Annotated Training for Controllable Image Captioning
Zhangzi Zhu
Tianlei Wang
Hong Qu
32
2
0
16 Oct 2021
Guiding Visual Question Generation
Nihir Vedd
Zixu Wang
Marek Rei
Yishu Miao
Lucia Specia
89
23
0
15 Oct 2021
Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information
Zhongjie Ye
Helin Wang
Dongchao Yang
Yuexian Zou
40
27
0
12 Oct 2021
Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos
Zongmeng Zhang
Xianjing Han
Xuemeng Song
Yan Yan
Liqiang Nie
41
36
0
12 Oct 2021
Topic Scene Graph Generation by Attention Distillation from Caption
Wenbin Wang
R. Wang
X. Chen
DiffM
30
14
0
12 Oct 2021
Reason induced visual attention for explainable autonomous driving
Sikai Chen
Jiqian Dong
Runjia Du
Yujie Li
Samuel Labi
34
1
0
11 Oct 2021
Semi-Autoregressive Image Captioning
Xu Yan
Zhengcong Fei
Zekang Li
Shuhui Wang
Qingming Huang
Qi Tian
35
23
0
11 Oct 2021
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
Yangguang Li
Feng Liang
Lichen Zhao
Yufeng Cui
Wanli Ouyang
Jing Shao
F. Yu
Junjie Yan
VLM
CLIP
50
448
0
11 Oct 2021
Recurrent Attention Models with Object-centric Capsule Representation for Multi-object Recognition
Hossein Adeli
Seoyoung Ahn
G. Zelinsky
OCL
23
3
0
11 Oct 2021
Accessible Visualization via Natural Language Descriptions: A Four-Level Model of Semantic Content
Alan Lundgard
Arvind Satyanarayan
25
128
0
08 Oct 2021