Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,512 papers shown
Title
Isomer: Transfer enhanced Dual-Channel Heterogeneous Dependency Attention Network for Aspect-based Sentiment Classification
Yukun Cao
Yijia Tang
Ziyue Wei
Chengkun Jin
Zeyu Miao
Yixin Fang
Haizhou Du
Feifei Xu
27
0
0
21 Nov 2021
AGA-GAN: Attribute Guided Attention Generative Adversarial Network with U-Net for Face Hallucination
Abhishek Srivastava
S. Chanda
Umapada Pal
GAN
CVBM
35
10
0
20 Nov 2021
Combined Scaling for Zero-shot Transfer Learning
Hieu H. Pham
Zihang Dai
Golnaz Ghiasi
Kenji Kawaguchi
Hanxiao Liu
...
Yi-Ting Chen
Minh-Thang Luong
Yonghui Wu
Mingxing Tan
Quoc V. Le
VLM
17
194
0
19 Nov 2021
ClipCap: CLIP Prefix for Image Captioning
Ron Mokady
Amir Hertz
Amit H. Bermano
CLIP
VLM
28
660
0
18 Nov 2021
Image-specific Convolutional Kernel Modulation for Single Image Super-resolution
Yuanfei Huang
Jie Li
Yanting Hu
Xinbo Gao
Huan Huang
SupR
29
0
0
16 Nov 2021
Attention Mechanisms in Computer Vision: A Survey
Meng-Hao Guo
Tianhan Xu
Jiangjiang Liu
Zheng-Ning Liu
Peng-Tao Jiang
Tai-Jiang Mu
Song-Hai Zhang
Ralph Robert Martin
Ming-Ming Cheng
Shimin Hu
19
1,644
0
15 Nov 2021
Fingerprint Presentation Attack Detection by Channel-wise Feature Denoising
Feng Liu
Zhe Kong
Haozhe Liu
Wentian Zhang
Linlin Shen
AAML
38
24
0
15 Nov 2021
A Probabilistic Hard Attention Model For Sequentially Observed Scenes
Samrudhdhi B. Rangrej
James J. Clark
24
12
0
15 Nov 2021
Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks
Arulkumar Subramaniam
Jayesh Vaidya
Muhammed Ameen
Athira M. Nambiar
Anurag Mittal
30
7
0
14 Nov 2021
Where to Look: A Unified Attention Model for Visual Recognition with Reinforcement Learning
Gang Chen
22
3
0
13 Nov 2021
Yaw-Guided Imitation Learning for Autonomous Driving in Urban Environments
Yandong Liu
Chengzhong Xu
Hui Kong
21
0
0
11 Nov 2021
Learning to ignore: rethinking attention in CNNs
Firas Laakom
K. Chumachenko
Jenni Raitoharju
Alexandros Iosifidis
Moncef Gabbouj
68
7
0
10 Nov 2021
Explaining Face Presentation Attack Detection Using Natural Language
H. Mirzaalian
Mohamed E. Hussein
L. Spinoulas
Jonathan May
Wael AbdAlmageed
CVBM
FAtt
AAML
36
5
0
08 Nov 2021
Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation
Fenglin Liu
Chenyu You
Xian Wu
Shen Ge
Sheng Wang
Xu Sun
MedIm
81
92
0
08 Nov 2021
"How Does It Detect A Malicious App?" Explaining the Predictions of AI-based Android Malware Detector
Zhi Lu
V. Thing
AAML
24
4
0
06 Nov 2021
The Curious Layperson: Fine-Grained Image Recognition without Expert Labels
Subhabrata Choudhury
Iro Laina
Christian Rupprecht
Andrea Vedaldi
VLM
38
9
0
05 Nov 2021
An Entropy-guided Reinforced Partial Convolutional Network for Zero-Shot Learning
Yun Yvonna Li
Zhe Liu
L. Yao
Xianzhi Wang
Julian McAuley
Xiaojun Chang
37
21
0
03 Nov 2021
A Simple Approach to Image Tilt Correction with Self-Attention MobileNet for Smartphones
Siddhant Garg
D. Mohanty
S. Thota
Sukumar Moharana
ViT
24
2
0
31 Oct 2021
Attacking Video Recognition Models with Bullet-Screen Comments
Kai-xiang Chen
Zhipeng Wei
Jingjing Chen
Zuxuan Wu
Yu-Gang Jiang
AAML
34
22
0
29 Oct 2021
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition
Masahiro Mitsuhara
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
27
1
0
29 Oct 2021
Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces
Kirill Struminsky
Artyom Gadetsky
D. Rakitin
Danil Karpushkin
Dmitry Vetrov
BDL
32
9
0
28 Oct 2021
Discovering Non-monotonic Autoregressive Orderings with Variational Inference
Xuanlin Li
Brandon Trabucco
Dongmin Park
Michael Luo
S. Shen
Trevor Darrell
Yang Gao
27
12
0
27 Oct 2021
Understanding Interlocking Dynamics of Cooperative Rationalization
Mo Yu
Yang Zhang
Shiyu Chang
Tommi Jaakkola
32
41
0
26 Oct 2021
BioIE: Biomedical Information Extraction with Multi-head Attention Enhanced Graph Convolutional Network
Jialun Wu
Yang Liu
Zeyu Gao
Tieliang Gong
Chunbao Wang
Chen Li
29
16
0
26 Oct 2021
Transferring Domain-Agnostic Knowledge in Video Question Answering
Tianran Wu
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
Haruo Takemura
25
8
0
26 Oct 2021
Alignment Attention by Matching Key and Query Distributions
Shujian Zhang
Xinjie Fan
Huangjie Zheng
Korawat Tanwisuth
Mingyuan Zhou
OOD
40
10
0
25 Oct 2021
Simple Dialogue System with AUDITED
Eugenio Clerico
Piotr Koniusz
21
2
0
22 Oct 2021
Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer Aggregation
Jingyu Zhao
Yanwen Fang
Guodong Li
27
23
0
22 Oct 2021
Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning
Yang Yang
Haoran Wei
Hengshu Zhu
Dianhai Yu
Hui Xiong
Jian Yang
SSL
14
33
0
22 Oct 2021
SciCap: Generating Captions for Scientific Figures
Ting-Yao Hsu
C. Lee Giles
Ting-Hao 'Kenneth' Huang
27
85
0
22 Oct 2021
MHAttnSurv: Multi-Head Attention for Survival Prediction Using Whole-Slide Pathology Images
Shuai Jiang
A. Suriawinata
Saeed Hassanpour
24
26
0
22 Oct 2021
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation
Khoa T. Vo
Kevin Hyekang Joo
Kashu Yamazaki
Sang Truong
Kris Kitani
Minh-Triet Tran
Ngan Le
EgoV
64
17
0
21 Oct 2021
Video and Text Matching with Conditioned Embeddings
Ameen Ali
Idan Schwartz
Tamir Hazan
Lior Wolf
97
13
0
21 Oct 2021
Self-Supervision and Spatial-Sequential Attention Based Loss for Multi-Person Pose Estimation
Haiyang Liu
Dingli Luo
Songlin Du
T. Ikenaga
3DH
38
0
0
20 Oct 2021
A Self-Explainable Stylish Image Captioning Framework via Multi-References
Chengxi Li
Brent Harrison
26
0
0
20 Oct 2021
Inductive Biases and Variable Creation in Self-Attention Mechanisms
Benjamin L. Edelman
Surbhi Goel
Sham Kakade
Cyril Zhang
27
118
0
19 Oct 2021
Compositional Attention: Disentangling Search and Retrieval
Sarthak Mittal
Sharath Chandra Raparthy
Irina Rish
Yoshua Bengio
Guillaume Lajoie
22
20
0
18 Oct 2021
Deep Transfer Learning & Beyond: Transformer Language Models in Information Systems Research
Ross Gruetzemacher
D. Paradice
35
30
0
18 Oct 2021
Visual-aware Attention Dual-stream Decoder for Video Captioning
Zhixin Sun
Xian Zhong
Shuqin Chen
Lin Li
Luo Zhong
36
3
0
16 Oct 2021
Multimodal Dialogue Response Generation
Qingfeng Sun
Yujing Wang
Can Xu
Kai Zheng
Yaming Yang
Huang Hu
Fei Xu
Jessica Zhang
Xiubo Geng
Daxin Jiang
26
43
0
16 Oct 2021
Self-Annotated Training for Controllable Image Captioning
Zhangzi Zhu
Tianlei Wang
Hong Qu
32
2
0
16 Oct 2021
Guiding Visual Question Generation
Nihir Vedd
Zixu Wang
Marek Rei
Yishu Miao
Lucia Specia
89
23
0
15 Oct 2021
Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information
Zhongjie Ye
Helin Wang
Dongchao Yang
Yuexian Zou
40
27
0
12 Oct 2021
Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos
Zongmeng Zhang
Xianjing Han
Xuemeng Song
Yan Yan
Liqiang Nie
41
36
0
12 Oct 2021
Topic Scene Graph Generation by Attention Distillation from Caption
Wenbin Wang
R. Wang
X. Chen
DiffM
30
14
0
12 Oct 2021
Reason induced visual attention for explainable autonomous driving
Sikai Chen
Jiqian Dong
Runjia Du
Yujie Li
Samuel Labi
34
1
0
11 Oct 2021
Semi-Autoregressive Image Captioning
Xu Yan
Zhengcong Fei
Zekang Li
Shuhui Wang
Qingming Huang
Qi Tian
35
23
0
11 Oct 2021
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
Yangguang Li
Feng Liang
Lichen Zhao
Yufeng Cui
Wanli Ouyang
Jing Shao
F. Yu
Junjie Yan
VLM
CLIP
50
448
0
11 Oct 2021
Recurrent Attention Models with Object-centric Capsule Representation for Multi-object Recognition
Hossein Adeli
Seoyoung Ahn
G. Zelinsky
OCL
23
3
0
11 Oct 2021
Accessible Visualization via Natural Language Descriptions: A Four-Level Model of Semantic Content
Alan Lundgard
Arvind Satyanarayan
25
128
0
08 Oct 2021
Previous
1
2
3
...
17
18
19
...
69
70
71
Next