Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning Models
Jiamei Sun
Sebastian Lapuschkin
Wojciech Samek
Alexander Binder
FAtt
108
30
0
04 Jan 2020
Zero-Shot Reinforcement Learning with Deep Attention Convolutional Neural Networks
Sahika Genc
S. Mallya
S. Bodapati
Tao Sun
Yunzhe Tao
57
6
0
02 Jan 2020
A Deep Learning Approach to Diagnosing Multiple Sclerosis from Smartphone Data
Patrick Schwab
W. Karlen
74
26
0
02 Jan 2020
Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation
Xinjie Fan
Yizhe Zhang
Zhendong Wang
Mingyuan Zhou
BDL
72
4
0
31 Dec 2019
Learning Selective Sensor Fusion for States Estimation
Changhao Chen
Stefano Rosa
Chris Xiaoxuan Lu
Bing Wang
Niki Trigoni
Andrew Markham
67
20
0
30 Dec 2019
Generative Memorize-Then-Recall framework for low bit-rate Surveillance Video Compression
Yaojun Wu
Tianyu He
Zhibo Chen
45
0
0
30 Dec 2019
Explainable Deep Relational Networks for Predicting Compound-Protein Affinities and Contacts
Mostafa Karimi
Di Wu
Zhangyang Wang
Yang Shen
89
47
0
29 Dec 2019
Deep neural network models for computational histopathology: A survey
C. Srinidhi
Ozan Ciga
Anne L. Martel
AI4CE
187
584
0
28 Dec 2019
Visual Agreement Regularized Training for Multi-Modal Machine Translation
Pengcheng Yang
Boxing Chen
Pei Zhang
Xu Sun
154
31
0
27 Dec 2019
Vision and Language: from Visual Perception to Content Creation
Tao Mei
Wei Zhang
Ting Yao
VLM
79
8
0
26 Dec 2019
Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection
Guangxiang Zhao
Junyang Lin
Zhiyuan Zhang
Xuancheng Ren
Qi Su
Xu Sun
83
113
0
25 Dec 2019
A Multimodal Target-Source Classifier with Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects
A. Magassouba
K. Sugiura
Hisashi Kawai
83
9
0
23 Dec 2019
Recurrent Hierarchical Topic-Guided RNN for Language Generation
D. Guo
Bo Chen
Ruiying Lu
Mingyuan Zhou
BDL
LRM
80
8
0
21 Dec 2019
Candidate Fusion: Integrating Language Modelling into a Sequence-to-Sequence Handwritten Word Recognition Architecture
Lei Kang
Pau Riba
M. Villegas
Alicia Fornés
Marçal Rusiñol
68
31
0
21 Dec 2019
Questions to Guide the Future of Artificial Intelligence Research
J. Ott
24
3
0
21 Dec 2019
Triple Generative Adversarial Networks
Chongxuan Li
Kun Xu
Jiashuo Liu
Jun Zhu
Bo Zhang
GAN
88
43
0
20 Dec 2019
Deep Exemplar Networks for VQA and VQG
Badri N. Patro
Vinay P. Namboodiri
41
4
0
19 Dec 2019
Optimization for deep learning: theory and algorithms
Ruoyu Sun
ODL
137
169
0
19 Dec 2019
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Dave Zhenyu Chen
Angel X. Chang
Matthias Nießner
3DPC
115
379
0
18 Dec 2019
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog
Feilong Chen
Fandong Meng
Jiaming Xu
Peng Li
Bo Xu
Jie Zhou
97
34
0
18 Dec 2019
Meshed-Memory Transformer for Image Captioning
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
126
890
0
17 Dec 2019
Water Supply Prediction Based on Initialized Attention Residual Network
Yu Long
Jingcheng Wang
Jingyi Wang
31
1
0
17 Dec 2019
Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
86
432
0
15 Dec 2019
Fast Image Caption Generation with Position Alignment
Z. Fei
77
38
0
13 Dec 2019
Small Object Detection using Context and Attention
Jeong-Seon Lim
Marcella Astrid
Hyungjin Yoon
Seung-Ik Lee
ObjD
59
216
0
13 Dec 2019
L3DOC: Lifelong 3D Object Classification
Yuyang Liu
Yang Cong
Gan Sun
3DPC
45
2
0
12 Dec 2019
Multimodal Self-Supervised Learning for Medical Image Analysis
Aiham Taleb
C. Lippert
T. Klein
Moin Nabi
SSL
99
98
0
11 Dec 2019
Fine-grained Classification of Rowing teams
M.J.A. van Wezel
L. J. Hamburger
Y. Napolean
119
1
0
11 Dec 2019
Multimodal Generative Models for Compositional Representation Learning
Mike Wu
Noah D. Goodman
GAN
DRL
97
17
0
11 Dec 2019
A Feasible Framework for Arbitrary-Shaped Scene Text Recognition
Jinjin Zhang
Wei Wang
Di Huang
Qingjie Liu
Yunhong Wang
58
4
0
10 Dec 2019
A Real-time Global Inference Network for One-stage Referring Expression Comprehension
Yiyi Zhou
Rongrong Ji
Gen Luo
Xiaoshuai Sun
Jinsong Su
Xinghao Ding
Chia-Wen Lin
Q. Tian
ObjD
94
64
0
07 Dec 2019
Connecting Vision and Language with Localized Narratives
Jordi Pont-Tuset
J. Uijlings
Soravit Changpinyo
Radu Soricut
V. Ferrari
ObjD
155
252
0
06 Dec 2019
Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation
Hongwei Yi
Zizhuang Wei
Mingyu Ding
Runze Zhang
Yisong Chen
Guoping Wang
Yu-Wing Tai
3DPC
3DV
115
112
0
06 Dec 2019
HABNet: Machine Learning, Remote Sensing Based Detection and Prediction of Harmful Algal Blooms
P. Hill
A. Kumar
M. Temimi
D. R. Bull
30
4
0
04 Dec 2019
Towards Robust Image Classification Using Sequential Attention Models
Daniel Zoran
Mike Chrzanowski
Po-Sen Huang
Sven Gowal
Alex Mott
Pushmeet Kohli
AAML
79
61
0
04 Dec 2019
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
150
332
0
04 Dec 2019
Better Understanding Hierarchical Visual Relationship for Image Caption
Z. Fei
45
0
0
04 Dec 2019
IENet: Interacting Embranchment One Stage Anchor Free Detector for Orientation Aerial Object Detection
Youtian Lin
Pengming Feng
Jian Guan
Wenwu Wang
Jonathon Chambers
ObjD
83
86
0
02 Dec 2019
Not All Attention Is Needed: Gated Attention Network for Sequence Data
Lanqing Xue
Xiaopeng Li
N. Zhang
69
32
0
01 Dec 2019
Assessing the Robustness of Visual Question Answering Models
Jia-Hong Huang
Modar Alfadly
Guohao Li
Marcel Worring
AAML
OOD
105
24
0
30 Nov 2019
ST-GRAT: A Novel Spatio-temporal Graph Attention Network for Accurately Forecasting Dynamically Changing Road Speed
Cheonbok Park
Chunggi Lee
Hyojin Bahng
Taeyun Won
Kihwan Kim
Seungmin Jin
Sungahn Ko
Jaegul Choo
GNN
AI4TS
83
33
0
29 Nov 2019
Self-attention with Functional Time Representation Learning
Da Xu
Chuanwei Ruan
Sushant Kumar
Evren Körpeoglu
Kannan Achan
AI4TS
87
118
0
28 Nov 2019
Multimodal Machine Translation through Visuals and Speech
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
106
77
0
28 Nov 2019
Multimodal Attention Networks for Low-Level Vision-and-Language Navigation
Federico Landi
Lorenzo Baraldi
Marcella Cornia
M. Corsini
Rita Cucchiara
LM&Ro
96
29
0
27 Nov 2019
Towards Precise End-to-end Weakly Supervised Object Detection Network
Ke Yang
Dongsheng Li
Y. Dou
WSOD
78
130
0
27 Nov 2019
Analysis of Explainers of Black Box Deep Neural Networks for Computer Vision: A Survey
Vanessa Buhrmester
David Münch
Michael Arens
MLAU
FaML
XAI
AAML
142
369
0
27 Nov 2019
Distraction-Aware Feature Learning for Human Attribute Recognition via Coarse-to-Fine Attention Mechanism
Mingda Wu
Di Huang
Yuanfang Guo
Yunhong Wang
CVBM
81
31
0
26 Nov 2019
RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
Qingyong Hu
Bo Yang
Linhai Xie
Stefano Rosa
Yulan Guo
Zhihua Wang
A. Trigoni
Andrew Markham
3DPC
146
1,522
0
25 Nov 2019
Event Recognition with Automatic Album Detection based on Sequential Processing, Neural Attention and Image Captioning
Andrey V. Savchenko
39
1
0
25 Nov 2019
Multi-Agent Game Abstraction via Graph Attention Neural Network
Y. Liu
Weixun Wang
Yujing Hu
Jianye Hao
Xingguo Chen
Yang Gao
80
245
0
25 Nov 2019
Previous
1
2
3
...
35
36
37
...
69
70
71
Next