Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,520 papers shown
Title
Position Focused Attention Network for Image-Text Matching
Yaxiong Wang
Hao-Hsiang Yang
Xueming Qian
Lin Ma
Jing Lu
Biao Li
Xin Fan
58
172
0
23 Jul 2019
MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models
Boyuan Pan
Yazheng Yang
Hao Li
Zhou Zhao
Yueting Zhuang
Deng Cai
Xiaofei He
64
18
0
23 Jul 2019
Compact Global Descriptor for Neural Networks
Xiangyu He
Ke Cheng
Qiang Chen
Qinghao Hu
Peisong Wang
Jian Cheng
96
8
0
23 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
143
136
0
22 Jul 2019
Deep Learning for Time Series Forecasting: The Electric Load Case
Alberto Gasparin
S. Lukovic
Cesare Alippi
AI4TS
80
231
0
22 Jul 2019
Automatic Radiology Report Generation based on Multi-view Image Fusion and Medical Concept Enrichment
Jianbo Yuan
Haofu Liao
R. Luo
Jiebo Luo
MedIm
88
196
0
22 Jul 2019
A study on the Interpretability of Neural Retrieval Models using DeepSHAP
Zeon Trevor Fernando
Jaspreet Singh
Avishek Anand
FAtt
AAML
65
68
0
15 Jul 2019
Extracting Interpretable Physical Parameters from Spatiotemporal Systems using Unsupervised Learning
Peter Y. Lu
Samuel Kim
Marin Soljacic
AI4CE
67
60
0
13 Jul 2019
A Survey of Deep Learning-based Object Detection
L. Jiao
Fan Zhang
Fang Liu
Shuyuan Yang
Lingling Li
Zhixi Feng
Rong Qu
ObjD
131
974
0
11 Jul 2019
Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions
Yulei Niu
Hanwang Zhang
Zhiwu Lu
Shih-Fu Chang
ObjD
BDL
101
26
0
08 Jul 2019
Informative Visual Storytelling with Cross-modal Rules
Jiacheng Li
Haizhou Shi
Siliang Tang
Leilei Gan
Yueting Zhuang
52
24
0
07 Jul 2019
EPNAS: Efficient Progressive Neural Architecture Search
Yanqi Zhou
Peng Wang
Sercan O. Arik
Haonan Yu
Syed Zawad
Feng Yan
G. Diamos
47
5
0
07 Jul 2019
Graph Representation Learning via Hard and Channel-Wise Attention Networks
Hongyang Gao
Shuiwang Ji
GNN
70
57
0
05 Jul 2019
Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters
Federico Landi
Lorenzo Baraldi
M. Corsini
Rita Cucchiara
LM&Ro
98
27
0
05 Jul 2019
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks
V. Kosaraju
Amir Sadeghian
Roberto Martín-Martín
Ian Reid
S. Hamid Rezatofighi
Silvio Savarese
95
613
0
04 Jul 2019
ACNe: Attentive Context Normalization for Robust Permutation-Equivariant Learning
Weiwei Sun
Wei Jiang
Eduard Trulls
Andrea Tagliasacchi
K. M. Yi
3DPC
90
20
0
04 Jul 2019
Learning Blended, Precise Semantic Program Embeddings
Ke Wang
Z. Su
NAI
62
27
0
03 Jul 2019
Neural Image Captioning
E. Tan
Lakshay Sharma
VLM
55
3
0
02 Jul 2019
Generative Models for Automatic Chemical Design
Daniel Schwalbe-Koda
Rafael Gómez-Bombarelli
MedIm
AI4CE
89
81
0
02 Jul 2019
Augmenting Self-attention with Persistent Memory
Sainbayar Sukhbaatar
Edouard Grave
Guillaume Lample
Hervé Jégou
Armand Joulin
RALM
KELM
77
139
0
02 Jul 2019
Kite: Automatic speech recognition for unmanned aerial vehicles
Dan Oneaţă
H. Cucu
40
13
0
02 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
70
112
0
02 Jul 2019
Inter and Intra Document Attention for Depression Risk Assessment
Diego Maupomé
Marc Queudot
Marie-Jean Meurs
14
7
0
30 Jun 2019
Machine Reading Comprehension: a Literature Review
Xin Zhang
An Yang
Sujian Li
Yizhong Wang
91
33
0
30 Jun 2019
A Neural Attention Model for Adaptive Learning of Social Friends' Preferences
Dimitrios Rafailidis
Gerhard Weiss
GNN
FedML
50
2
0
29 Jun 2019
A Deep Decoder Structure Based on WordEmbedding Regression for An Encoder-Decoder Based Model for Image Captioning
A. Asadi
Reza Safabakhsh
26
3
0
26 Jun 2019
Creating A Neural Pedagogical Agent by Jointly Learning to Review and Assess
Youngnam Lee
Youngduck Choi
Junghyun Cho
Alexander R. Fabbri
Hyunbin Loh
Chanyou Hwang
Yongku Lee
Sang-Wook Kim
Dragomir R. Radev
35
18
0
26 Jun 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu
Jun Yu
Yuhao Cui
Dacheng Tao
Q. Tian
150
811
0
25 Jun 2019
Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation
Maximilian Spliethover
Jonas Klaff
Hendrik Heuer
54
10
0
24 Jun 2019
CORAL8: Concurrent Object Regression for Area Localization in Medical Image Panels
Sam Maksoud
Arnold Wiliem
Kun-li Zhao
Teng Zhang
Lin Wu
Brian C. Lovell
MedIm
51
11
0
24 Jun 2019
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments
K. Niu
Y. Huang
Wanli Ouyang
Liang Wang
58
144
0
23 Jun 2019
Sequence Generation: From Both Sides to the Middle
Long Zhou
Jiajun Zhang
Chengqing Zong
Heng Yu
76
22
0
23 Jun 2019
Informative Image Captioning with External Sources of Information
Sanqiang Zhao
Piyush Sharma
Tomer Levinboim
Radu Soricut
65
46
0
20 Jun 2019
Understanding More about Human and Machine Attention in Deep Neural Networks
Qiuxia Lai
Salman Khan
Wenguan Wang
Jianbing Shen
Hanqiu Sun
Ling Shao
HAI
XAI
52
7
0
20 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors
G. Lambard
Ekaterina Gracheva
71
21
0
20 Jun 2019
A simple and effective postprocessing method for image classification
Yan Liu
Yun Li
Yunhao Yuan
Jipeng Qiang
21
1
0
19 Jun 2019
VizADS-B: Analyzing Sequences of ADS-B Images Using Explainable Convolutional LSTM Encoder-Decoder to Detect Cyber Attacks
Sefi Akerman
Edan Habler
A. Shabtai
82
18
0
19 Jun 2019
Distilling Translations with Visual Awareness
Julia Ive
Pranava Madhyastha
Lucia Specia
VLM
154
76
0
18 Jun 2019
Expressing Visual Relationships via Language
Hao Tan
Franck Dernoncourt
Zhe Lin
Trung Bui
Joey Tianyi Zhou
93
68
0
18 Jun 2019
Attention Guided Graph Convolutional Networks for Relation Extraction
Zhijiang Guo
Yan Zhang
Wei Lu
GNN
97
413
0
18 Jun 2019
ASAC: Active Sensing using Actor-Critic models
Chang Jo Kim
James Jordon
M. Schaar
CML
63
16
0
16 Jun 2019
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding
Jian Zheng
S. Krishnamurthy
Ruxin Chen
Min-Hung Chen
Zhenhao Ge
Xiaohua Li
85
4
0
16 Jun 2019
Generating Diverse and Informative Natural Language Fashion Feedback
Gil Sadeh
L. Fritz
Gabi Shalev
Eduard Oks
58
5
0
15 Jun 2019
Connecting Touch and Vision via Cross-Modal Prediction
Yunzhu Li
Jun-Yan Zhu
Russ Tedrake
Antonio Torralba
80
139
0
14 Jun 2019
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
175
476
0
14 Jun 2019
Multigrid Neural Memory
T. Huynh
Michael Maire
Matthew R. Walter
64
10
0
13 Jun 2019
Stand-Alone Self-Attention in Vision Models
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLM
SLR
ViT
193
1,218
0
13 Jun 2019
Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training
William Harvey
Michael Teng
Frank Wood
50
4
0
13 Jun 2019
Vispi: Automatic Visual Perception and Interpretation of Chest X-rays
X. Li
Rui Cao
D. Zhu
79
20
0
12 Jun 2019
Pay Attention to Convolution Filters: Towards Fast and Accurate Fine-Grained Transfer Learning
Xiangxi Mo
Ruizhe Cheng
Tianyi Fang
35
3
0
12 Jun 2019
Previous
1
2
3
...
40
41
42
...
69
70
71
Next