Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Detection and Description of Change in Visual Streams
Davis Gilton
Ruotian Luo
Rebecca Willett
Gregory Shakhnarovich
AI4TS
52
4
0
27 Mar 2020
Neural encoding and interpretation for high-level visual cortices based on fMRI using image caption features
Kai Qiao
Chi Zhang
Jian Chen
Linyuan Wang
Li Tong
Bin Yan
26
3
0
26 Mar 2020
Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning models
Pranav Agarwal
Alejandro Betancourt
V. Panagiotou
Natalia Díaz Rodríguez
EGVM
82
10
0
26 Mar 2020
Learning Compact Reward for Image Captioning
Nannan Li
Zhenzhong Chen
66
3
0
24 Mar 2020
TRACER: A Framework for Facilitating Accurate and Interpretable Analytics for High Stakes Applications
Kaiping Zheng
Shaofeng Cai
H. Chua
Wei Wang
K. Ngiam
Beng Chin Ooi
AI4TS
65
26
0
24 Mar 2020
Attention-Based Self-Supervised Feature Learning for Security Data
I-Ta Lee
Manish Marwah
M. Arlitt
SSL
64
2
0
24 Mar 2020
Toward Tag-free Aspect Based Sentiment Analysis: A Multiple Attention Network Approach
Yao Qiang
X. Li
D. Zhu
66
16
0
22 Mar 2020
Ensembles of Deep Neural Networks for Action Recognition in Still Images
S. Mohammadi
Sina Ghofrani Majelan
S. B. Shokouhi
33
17
0
22 Mar 2020
SAC: Accelerating and Structuring Self-Attention via Sparse Adaptive Connection
Xiaoya Li
Yuxian Meng
Mingxin Zhou
Qinghong Han
Leilei Gan
Jiwei Li
88
20
0
22 Mar 2020
Video-based Person Re-Identification using Gated Convolutional Recurrent Neural Networks
Yang Feng
Yu Wang
Jiebo Luo
48
0
0
21 Mar 2020
Explainable Object-induced Action Decision for Autonomous Vehicles
Yiran Xu
Xiaoyin Yang
Lihang Gong
Hsuan-Chu Lin
Tz-Ying Wu
Yunsheng Li
Nuno Vasconcelos
80
113
0
20 Mar 2020
Fine-grained Species Recognition with Privileged Pooling: Better Sample Efficiency Through Supervised Attention
Andrés C. Rodríguez
Stefano Dáronco
Konrad Schindler
Jan Dirk Wegner
48
4
0
20 Mar 2020
Exchangeable Input Representations for Reinforcement Learning
John Mern
Dorsa Sadigh
Mykel J. Kochenderfer
109
4
0
19 Mar 2020
Generating new concepts with hybrid neuro-symbolic models
Reuben Feinman
Brenden M. Lake
BDL
128
13
0
19 Mar 2020
Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Longteng Guo
Jing Liu
Xinxin Zhu
Peng Yao
Shichen Lu
Hanqing Lu
ViT
208
192
0
19 Mar 2020
PIC: Permutation Invariant Convolution for Recognizing Long-range Activities
Noureldien Hussein
E. Gavves
A. Smeulders
VLM
79
13
0
18 Mar 2020
Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation
Huiyu Wang
Yukun Zhu
Bradley Green
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
3DPC
155
676
0
17 Mar 2020
Active Perception and Representation for Robotic Manipulation
Youssef Y. Zaky
Gaurav Paruthi
B. Tripp
James Bergstra
86
16
0
15 Mar 2020
A Neural Architecture for Detecting Confusion in Eye-tracking Data
Shane D. V. Sims
Cristina Conati
23
2
0
13 Mar 2020
Efficient Content-Based Sparse Attention with Routing Transformers
Aurko Roy
M. Saffar
Ashish Vaswani
David Grangier
MoE
479
607
0
12 Mar 2020
SOS: Selective Objective Switch for Rapid Immunofluorescence Whole Slide Image Classification
Sam Maksoud
Kun-li Zhao
Peter Hobson
A. Jennings
Brian C. Lovell
66
30
0
11 Mar 2020
Visual Grounding in Video for Unsupervised Word Translation
Gunnar Sigurdsson
Jean-Baptiste Alayrac
Aida Nematzadeh
Lucas Smaira
Mateusz Malinowski
João Carreira
Phil Blunsom
Andrew Zisserman
VGen
117
50
0
11 Mar 2020
PBRnet: Pyramidal Bounding Box Refinement to Improve Object Localization Accuracy
Li Xiao
Yufan Luo
Chunlong Luo
Lianhe Zhao
Quanshui Fu
Guoqing Yang
Anpeng Huang
Yi Zhao
ObjD
79
4
0
10 Mar 2020
Causal Interpretability for Machine Learning -- Problems, Methods and Evaluation
Raha Moraffah
Mansooreh Karami
Ruocheng Guo
A. Raglin
Huan Liu
CML
ELM
XAI
98
221
0
09 Mar 2020
Deconfounded Image Captioning: A Causal Retrospect
Xu Yang
Hanwang Zhang
Jianfei Cai
CML
79
127
0
09 Mar 2020
Better Captioning with Sequence-Level Exploration
Jia Chen
Qin Jin
79
12
0
08 Mar 2020
OVC-Net: Object-Oriented Video Captioning with Temporal Graph and Detail Enhancement
Fangyi Zhu
Lei Li
Zhanyu Ma
Guang Chen
Jun Guo
49
1
0
08 Mar 2020
Adaptive Offline Quintuplet Loss for Image-Text Matching
Tianlang Chen
Jiajun Deng
Jiebo Luo
237
70
0
07 Mar 2020
Trends and Advancements in Deep Neural Network Communication
Felix Sattler
Thomas Wiegand
Wojciech Samek
GNN
72
9
0
06 Mar 2020
Captioning Images with Novel Objects via Online Vocabulary Expansion
Mikihiro Tanaka
Tatsuya Harada
3DV
83
2
0
06 Mar 2020
Diverse and Admissible Trajectory Forecasting through Multimodal Context Understanding
Seonguk Park
Gyubok Lee
Manoj Bhat
Jimin Seo
Minseok Kang
Jonathan M Francis
Ashwin R. Jadhav
Paul Pu Liang
Louis-Philippe Morency
224
120
0
06 Mar 2020
Show, Edit and Tell: A Framework for Editing Image Captions
Fawaz Sammani
Luke Melas-Kyriazi
KELM
DiffM
108
59
0
06 Mar 2020
Understanding Contexts Inside Robot and Human Manipulation Tasks through a Vision-Language Model and Ontology System in a Video Stream
Chen Jiang
Masood Dehghan
Martin Jägersand
LM&Ro
71
9
0
02 Mar 2020
A Question-Centric Model for Visual Question Answering in Medical Imaging
Minh H. Vu
Tommy Löfstedt
T. Nyholm
Raphael Sznitman
MedIm
81
61
0
02 Mar 2020
Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs
Shizhe Chen
Qin Jin
Peng Wang
Qi Wu
DiffM
152
219
0
01 Mar 2020
Grounded and Controllable Image Completion by Incorporating Lexical Semantics
Shengyu Zhang
Tan Jiang
Qinghao Huang
Ziqi Tan
Zhou Zhao
Siliang Tang
Jin Yu
Hongxia Yang
Yi Yang
Leilei Gan
35
1
0
29 Feb 2020
Exploring and Distilling Cross-Modal Information for Image Captioning
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Kai Lei
Xu Sun
ViT
90
52
0
28 Feb 2020
MANet: Multimodal Attention Network based Point- View fusion for 3D Shape Recognition
Yaxin Zhao
Jichao Jiao
Tangkun Zhang
3DPC
80
7
0
28 Feb 2020
Visual Commonsense R-CNN
Tan Wang
Jianqiang Huang
Hanwang Zhang
Qianru Sun
SSL
ObjD
CML
86
252
0
27 Feb 2020
CLARA: Clinical Report Auto-completion
Siddharth Biswal
Cao Xiao
Lucas Glass
M. P. M. Brandon Westover
Jimeng Sun
79
28
0
26 Feb 2020
Recognizing Handwritten Mathematical Expressions as LaTex Sequences Using a Multiscale Robust Neural Network
Hongyu Wang
Guangcun Shan
91
7
0
26 Feb 2020
Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units
Zhanzhan Cheng
Yunlu Xu
Mingjian Cheng
Yu Qiao
Shiliang Pu
Yi Niu
Leilei Gan
44
8
0
26 Feb 2020
Sparse Sinkhorn Attention
Yi Tay
Dara Bahri
Liu Yang
Donald Metzler
Da-Cheng Juan
107
341
0
26 Feb 2020
Dual Graph Representation Learning
Huiling Zhu
Xin Luo
Hankui Zhuo
GNN
34
0
0
25 Feb 2020
Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval
A. Bhunia
Yongxin Yang
Timothy M. Hospedales
Tao Xiang
Yi-Zhe Song
169
104
0
24 Feb 2020
See, Attend and Brake: An Attention-based Saliency Map Prediction Model for End-to-End Driving
Ekrem Aksoy
A. Yazıcı
Mahmut Kasap
86
13
0
24 Feb 2020
A
3
^3
3
: Accelerating Attention Mechanisms in Neural Networks with Approximation
Tae Jun Ham
Sungjun Jung
Seonghak Kim
Young H. Oh
Yeonhong Park
...
Jung-Hun Park
Sanghee Lee
Kyoung Park
Jae W. Lee
D. Jeong
99
221
0
22 Feb 2020
Image to Language Understanding: Captioning approach
M. Seshadri
Malavika Srikanth
Mikhail Belov
33
1
0
21 Feb 2020
Memory-Based Graph Networks
Amir Hosein Khas Ahmadi
Kaveh Hassani
Parsa Moradi
Leo Lee
Q. Morris
GNN
163
91
0
21 Feb 2020
AutoFoley: Artificial Synthesis of Synchronized Sound Tracks for Silent Videos with Deep Learning
Sanchita Ghose
John J. Prevost
VGen
76
47
0
21 Feb 2020
Previous
1
2
3
...
33
34
35
...
69
70
71
Next