1502.03044
Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Learning to Guide Decoding for Image Captioning
Wenhao Jiang
Lin Ma
Xinpeng Chen
Hanwang Zhang
Wen Liu
88
69
0
03 Apr 2018
Generating Diverse and Accurate Visual Captions by Comparative Adversarial Learning
Dianqi Li
Qiuyuan Huang
Xiaodong He
Lei Zhang
Ming-Ting Sun
96
50
0
03 Apr 2018
End-to-End Dense Video Captioning with Masked Transformer
Luowei Zhou
Yingbo Zhou
Jason J. Corso
R. Socher
Caiming Xiong
103
531
0
03 Apr 2018
Attentional Multilabel Learning over Graphs: A Message Passing Approach
Kien Do
T. Tran
Thin Nguyen
Svetha Venkatesh
61
17
0
01 Apr 2018
Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning
Jingwen Wang
Wenhao Jiang
Lin Ma
Wen Liu
Yong-mei Xu
94
209
0
31 Mar 2018
Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning
Sandeep Subramanian
Adam Trischler
Yoshua Bengio
C. Pal
SSL
105
330
0
30 Mar 2018
Guide Me: Interacting with Deep Networks
Christian Rupprecht
Iro Laina
Nassir Navab
Gregory Hager
Federico Tombari
HAI
73
38
0
30 Mar 2018
Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Xinpeng Chen
Lin Ma
Wenhao Jiang
Jian Yao
Wen Liu
88
92
0
30 Mar 2018
Fine-Grained Attention Mechanism for Neural Machine Translation
Heeyoul Choi
Kyunghyun Cho
Yoshua Bengio
87
175
0
30 Mar 2018
Unsupervised Textual Grounding: Linking Words to Image Concepts
Raymond A. Yeh
Minh Do
Alex Schwing
54
40
0
29 Mar 2018
Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering
Unnat Jain
Svetlana Lazebnik
Alex Schwing
MLLM
81
82
0
29 Mar 2018
Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective
Jing Zhang
Tong Zhang
Yuchao Dai
Mehrtash Harandi
Leonid Sigal
77
184
0
29 Mar 2018
Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks
Agrim Gupta
Justin Johnson
Li Fei-Fei
Silvio Savarese
Alexandre Alahi
GAN
192
1,935
0
29 Mar 2018
Referring Relationships
Ranjay Krishna
Ines Chami
Michael S. Bernstein
Li Fei-Fei
102
95
0
28 Mar 2018
Dual Attention Matching Network for Context-Aware Feature Sequence based Person Re-Identification
Jianlou Si
Honggang Zhang
Chun-Guang Li
Jason Kuen
Xiangfei Kong
Alex C. Kot
G. Wang
72
466
0
27 Mar 2018
Diversity Regularized Spatiotemporal Attention for Video-based Person Re-identification
Shuang Li
Sławomir Bąk
Peter Carr
Xiaogang Wang
154
342
0
27 Mar 2018
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
245
436
0
27 Mar 2018
MOrdReD: Memory-based Ordinal Regression Deep Neural Networks for Time Series Forecasting
Bernardo Pérez Orozco
G. Abbati
Stephen J. Roberts
OOD
AI4TS
38
14
0
26 Mar 2018
Connectionist Recommendation in the Wild: On the utility and scrutability of neural networks for personalized course guidance
Z. Pardos
Zihao Fan
Weijie Jiang
HAI
58
75
0
26 Mar 2018
code2vec: Learning Distributed Representations of Code
Uri Alon
Meital Zilberstein
Omer Levy
Eran Yahav
92
1,190
0
26 Mar 2018
StarMap for Category-Agnostic Keypoint and Viewpoint Estimation
Xingyi Zhou
Arjun Karpur
Linjie Luo
Qi-Xing Huang
3DPC
3DV
54
88
0
25 Mar 2018
Audio-Visual Event Localization in Unconstrained Videos
Yapeng Tian
Jing Shi
Bochen Li
Zhiyao Duan
Chenliang Xu
127
442
0
23 Mar 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Hongsheng Li
Jing Shao
Dapeng Chen
Xiaogang Wang
95
133
0
22 Mar 2018
Stacked Cross Attention for Image-Text Matching
Kuang-Huei Lee
Xi Chen
G. Hua
Houdong Hu
Xiaodong He
122
1,163
0
21 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning
Lijun Li
Boqing Gong
71
56
0
21 Mar 2018
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation
Xin Eric Wang
Wenhan Xiong
Hongmin Wang
William Yang Wang
85
202
0
21 Mar 2018
Attention on Attention: Architectures for Visual Question Answering (VQA)
Jasdeep Singh
Vincent Ying
Alex Nutkiewicz
60
26
0
21 Mar 2018
Learning Robotic Assembly from CAD
G. Thomas
Melissa Chien
Aviv Tamar
J. A. Ojea
Pieter Abbeel
73
151
0
20 Mar 2018
Dynamic Filtering with Large Sampling Field for ConvNets
Jialin Wu
Daiyang Li
Yu Yang
Minh Nguyen
Xiangyang Ji
51
9
0
20 Mar 2018
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li
Qingyi Tao
Shafiq Joty
Jianfei Cai
Jiebo Luo
107
109
0
20 Mar 2018
GaAN: Gated Attention Networks for Learning on Large and Spatiotemporal Graphs
Jiani Zhang
Xingjian Shi
Junyuan Xie
Hao Ma
Irwin King
Dit-Yan Yeung
GNN
122
573
0
20 Mar 2018
Attention-based Temporal Weighted Convolutional Neural Network for Action Recognition
J. Zang
Le Wang
Zi-yi Liu
Qilin Zhang
Zhenxing Niu
G. Hua
N. Zheng
54
72
0
19 Mar 2018
Attention-GAN for Object Transfiguration in Wild Images
Xinyuan Chen
Chang Xu
Xiaokang Yang
Dacheng Tao
82
177
0
19 Mar 2018
Learning Unsupervised Visual Grounding Through Semantic Self-Supervision
Syed Ashar Javed
Shreyas Saxena
Vineet Gandhi
SSL
76
25
0
17 Mar 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
71
29
0
16 Mar 2018
A Dataset and Architecture for Visual Reasoning with a Working Memory
G. R. Yang
Igor Ganichev
Xiao-Jing Wang
Jonathon Shlens
David Sussillo
71
55
0
16 Mar 2018
Aggregated Sparse Attention for Steering Angle Prediction
Sen He
D. Kangin
Yang Mi
N. Pugeault
LLMSV
59
5
0
15 Mar 2018
Unpaired Image Captioning by Language Pivoting
Jiuxiang Gu
Shafiq Joty
Jianfei Cai
G. Wang
96
83
0
14 Mar 2018
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
David Mascharka
Philip Tran
Ryan Soklaski
Arjun Majumdar
131
207
0
14 Mar 2018
Feature Selective Small Object Detection via Knowledge-based Recurrent Attentive Neural Network
Kai Yi
Zhiqiang Jian
Shi-tao Chen
N. Zheng
ObjD
55
6
0
13 Mar 2018
Recurrent Neural Network Attention Mechanisms for Interpretable System Log Anomaly Detection
Andy Brown
Aaron Tuor
Brian Hutchinson
Nicole Nichols
44
174
0
13 Mar 2018
Discriminability objective for training descriptive captions
Ruotian Luo
Brian L. Price
Scott D. Cohen
Gregory Shakhnarovich
140
203
0
12 Mar 2018
Learning to Localize Sound Source in Visual Scenes
Arda Senocak
Tae-Hyun Oh
Junsik Kim
Ming-Hsuan Yang
In So Kweon
SSL
96
346
0
10 Mar 2018
Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions
Albert Gatt
Marc Tanti
A. Muscat
Patrizia Paggio
R. Farrugia
Claudia Borg
K. Camilleri
M. Rosner
Lonneke van der Plas
CVBM
64
25
0
10 Mar 2018
Attention-based Graph Neural Network for Semi-supervised Learning
K. K. Thekumparampil
Chong-Jun Wang
Sewoong Oh
Li Li
GNN
100
335
0
10 Mar 2018
Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey
Muzammal Naseer
Salman H Khan
Fatih Porikli
3DPC
3DV
78
101
0
09 Mar 2018
Decoupled Spatial Neural Attention for Weakly Supervised Semantic Segmentation
Tianyi Zhang
Guosheng Lin
Jianfei Cai
T. Shen
Chunhua Shen
Alex C. Kot
64
77
0
07 Mar 2018
Multi-level Attention Model for Weakly Supervised Audio Classification
Changsong Yu
Karim Barsim
Qiuqiang Kong
Binh Yang
56
83
0
06 Mar 2018
Totally Looks Like - How Humans Compare, Compared to Machines
Amir Rosenfeld
M. Solbach
John K. Tsotsos
3DH
94
29
0
05 Mar 2018
Less Is More: Picking Informative Frames for Video Captioning
Yangyu Chen
Shuhui Wang
Wentao Zhang
Qingming Huang
85
201
0
05 Mar 2018