1502.03044
Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,510 papers shown
Title
Toward Explainable AI for Regression Models
S. Letzgus
Patrick Wagner
Jonas Lederer
Wojciech Samek
Klaus-Robert Müller
G. Montavon
XAI
36
63
0
21 Dec 2021
Continual Learning with Knowledge Transfer for Sentiment Classification
Zixuan Ke
Bing-Quan Liu
Hao Wang
Lei Shu
CLL
21
31
0
18 Dec 2021
Inherently Explainable Reinforcement Learning in Natural Language
Xiangyu Peng
Mark O. Riedl
Prithviraj Ammanabrolu
LRM
19
20
0
16 Dec 2021
Positional Encoding Augmented GAN for the Assessment of Wind Flow for Pedestrian Comfort in Urban Areas
Henrik Hoiness
Kristoffer Gjerde
L. Oggiano
K. E. Giljarhus
M. Ruocco
DiffM
AI4CE
21
5
0
15 Dec 2021
Towards Controllable Agent in MOBA Games with Generative Modeling
Shubao Zhang
42
0
0
15 Dec 2021
Minimization of Stochastic First-order Oracle Complexity of Adaptive Methods for Nonconvex Optimization
Hideaki Iiduka
20
0
0
14 Dec 2021
Hybrid Graph Neural Networks for Few-Shot Learning
Tianyuan Yu
Sen He
Yi-Zhe Song
Tao Xiang
30
61
0
13 Dec 2021
PartGlot: Learning Shape Part Segmentation from Language Reference Games
Juil Koo
Ian Huang
Panos Achlioptas
Leonidas J. Guibas
Minhyuk Sung
3DPC
42
30
0
13 Dec 2021
Towards More Efficient Insertion Transformer with Fractional Positional Encoding
Zhisong Zhang
Yizhe Zhang
W. Dolan
49
0
0
12 Dec 2021
Neural Attention Models in Deep Learning: Survey and Taxonomy
Alana de Santana Correia
Esther Colombini
MLAU
21
17
0
11 Dec 2021
Quality-Aware Multimodal Biometric Recognition
Sobhan Soleymani
Ali Dabouei
Fariborz Taherkhani
Seyed Mehdi Iranmanesh
J. Dawson
Nasser M. Nasrabadi
CVBM
32
3
0
10 Dec 2021
VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling
Yang Li
Gang Li
Xin Zhou
Mostafa Dehghani
A. Gritsenko
MLLM
45
35
0
10 Dec 2021
Injecting Semantic Concepts into End-to-End Image Captioning
Zhiyuan Fang
Jianfeng Wang
Xiaowei Hu
Lin Liang
Zhe Gan
Lijuan Wang
Yezhou Yang
Zicheng Liu
ViT
VLM
32
86
0
09 Dec 2021
Self-Supervised Image-to-Text and Text-to-Image Synthesis
Anindya Sundar Das
S. Saha
SSL
21
5
0
09 Dec 2021
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
Jiaqi Tang
Zhaoyang Liu
Chao Qian
Wayne Wu
Limin Wang
17
17
0
09 Dec 2021
Trajectory-Constrained Deep Latent Visual Attention for Improved Local Planning in Presence of Heterogeneous Terrain
Stefan Wapnick
Travis Manderson
David Meger
Gregory Dudek
36
5
0
09 Dec 2021
Relating Blindsight and AI: A Review
Joshua Bensemann
Qiming Bao
Gaël Gendron
Tim Hartill
Michael Witbrock
16
2
0
09 Dec 2021
Forecasting Brain Activity Based on Models of Spatio-Temporal Brain Dynamics: A Comparison of Graph Neural Network Architectures
S. Wein
Alina Schüller
A. Tomé
W. Malloni
M. Greenlee
E. Lang
AI4CE
43
14
0
08 Dec 2021
BA-Net: Bridge Attention for Deep Convolutional Neural Networks
Yue Zhao
Junzhou Chen
Zirui Zhang
Ronghui Zhang
29
17
0
08 Dec 2021
Active Sensing for Communications by Learning
Foad Sohrabi
Tao Jiang
Wei Cui
Wei Yu
20
54
0
08 Dec 2021
CMA-CLIP: Cross-Modality Attention CLIP for Image-Text Classification
Huidong Liu
Shaoyuan Xu
Jinmiao Fu
Yang Liu
Ning Xie
Chien Wang
Bryan Wang
Yi Sun
CLIP
VLM
32
27
0
07 Dec 2021
Protecting Intellectual Property of Language Generation APIs with Lexical Watermark
Xuanli He
Qiongkai Xu
Lingjuan Lyu
Fangzhao Wu
Chenguang Wang
WaLM
180
96
0
05 Dec 2021
Explainable Deep Learning in Healthcare: A Methodological Survey from an Attribution View
Di Jin
Elena Sergeeva
W. Weng
Geeticka Chauhan
Peter Szolovits
OOD
56
56
0
05 Dec 2021
VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts
Longtian Qiu
Renrui Zhang
Ziyu Guo
Wei Zhang
Zilu Guo
Ziyao Zeng
Guangnan Zhang
VLM
CLIP
30
45
0
04 Dec 2021
BAANet: Learning Bi-directional Adaptive Attention Gates for Multispectral Pedestrian Detection
Xiaoxiao Yang
Yeqian Qiang
Huijie Zhu
Chunxiang Wang
Ming Yang
24
33
0
04 Dec 2021
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Qirui Wu
Matthias Nießner
Angel X. Chang
23
29
0
02 Dec 2021
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
Yongming Rao
Wenliang Zhao
Guangyi Chen
Yansong Tang
Zheng Zhu
Guan Huang
Jie Zhou
Jiwen Lu
VLM
CLIP
94
556
0
02 Dec 2021
SCNet: A Generalized Attention-based Model for Crack Fault Segmentation
Hrishikesh Sharma
Pandaba Pradhan
P. Balamuralidhar
33
6
0
02 Dec 2021
Attention based Occlusion Removal for Hybrid Telepresence Systems
Surabhi Gupta
Ashwath Shetty
Avinash Sharma
CVBM
3DH
30
2
0
02 Dec 2021
N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras
Junho Kim
Jaehyeok Bae
Gang-Ryeong Park
Dongsu Zhang
Y. Kim
ObjD
32
84
0
02 Dec 2021
Consensus Graph Representation Learning for Better Grounded Image Captioning
Wenqiao Zhang
Haochen Shi
Siliang Tang
Jun Xiao
Qiang Yu
Yueting Zhuang
17
54
0
02 Dec 2021
Object-Centric Unsupervised Image Captioning
Zihang Meng
David Yang
Xuefei Cao
Ashish Shah
Ser-Nam Lim
OCL
VLM
27
11
0
02 Dec 2021
Visual-Semantic Transformer for Scene Text Recognition
Xin Tang
Yongquan Lai
Ying Liu
Yuanyuan Fu
Rui Fang
ViT
33
8
0
02 Dec 2021
Transformer-based Network for RGB-D Saliency Detection
Yue Wang
Xu Jia
Lu Zhang
Yuke Li
J. Elder
Huchuan Lu
ViT
24
5
0
01 Dec 2021
Weakly-Supervised Video Object Grounding via Causal Intervention
Wei Wang
Junyu Gao
Changsheng Xu
CML
32
20
0
01 Dec 2021
Dyadic Human Motion Prediction
Isinsu Katircioglu
C. Georgantas
Mathieu Salzmann
Pascal Fua
29
11
0
01 Dec 2021
ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds
Georg Bökman
Fredrik Kahl
Axel Flinth
3DPC
31
19
0
30 Nov 2021
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
35
45
0
29 Nov 2021
LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering
Jingjing Jiang
Zi-yi Liu
N. Zheng
30
13
0
29 Nov 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
36
193
0
29 Nov 2021
TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNs
Shantanu Jaiswal
Basura Fernando
Cheston Tan
ViT
42
14
0
26 Nov 2021
ContIG: Self-supervised Multimodal Contrastive Learning for Medical Imaging with Genetics
Aiham Taleb
Matthias Kirchler
Remo Monti
C. Lippert
SSL
MedIm
36
54
0
26 Nov 2021
Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets
Marcella Cornia
Lorenzo Baraldi
G. Fiameni
Rita Cucchiara
22
12
0
24 Nov 2021
Efficient Anomaly Detection Using Self-Supervised Multi-Cue Tasks
Loic Jezequel
Ngoc-Son Vu
Jean Beaudet
A. Histace
34
19
0
24 Nov 2021
Reinforcement Learning based Path Exploration for Sequential Explainable Recommendation
Yicong Li
Hongxu Chen
Yile Li
Lin Li
Philip S. Yu
Guandong Xu
26
15
0
24 Nov 2021
A General Divergence Modeling Strategy for Salient Object Detection
Xinyu Tian
Jing Zhang
Yuchao Dai
35
0
0
23 Nov 2021
Hierarchical Text Classification As Sub-Hierarchy Sequence Generation
Sanghun Im
Gibaeg Kim
Heung-Seon Oh
Seong-Mok Jo
Donghwan Kim
BDL
65
4
0
22 Nov 2021
Local-Selective Feature Distillation for Single Image Super-Resolution
Seonguk Park
Nojun Kwak
24
9
0
22 Nov 2021
Isomer: Transfer enhanced Dual-Channel Heterogeneous Dependency Attention Network for Aspect-based Sentiment Classification
Yukun Cao
Yijia Tang
Ziyue Wei
Chengkun Jin
Zeyu Miao
Yixin Fang
Haizhou Du
Feifei Xu
27
0
0
21 Nov 2021
AGA-GAN: Attribute Guided Attention Generative Adversarial Network with U-Net for Face Hallucination
Abhishek Srivastava
S. Chanda
Umapada Pal
GAN
CVBM
35
10
0
20 Nov 2021