arXiv:1502.03044
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,515 papers shown
Title
3M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model
Chengxi Li
Brent Harrison
66
6
0
20 Mar 2021
Local Interpretations for Explainable Natural Language Processing: A Survey
Siwen Luo
Hamish Ivison
S. Han
Josiah Poon
MILM
85
49
0
20 Mar 2021
Let Your Heart Speak in its Mother Tongue: Multilingual Captioning of Cardiac Signals
Dani Kiyasseh
T. Zhu
David Clifton
50
0
0
19 Mar 2021
Decoupled Spatial Temporal Graphs for Generic Visual Grounding
Qi Feng
Yunchao Wei
Mingming Cheng
Yi Yang
32
5
0
18 Mar 2021
Which to Match? Selecting Consistent GT-Proposal Assignment for Pedestrian Detection
Yan Luo
Chongyang Zhang
Muming Zhao
Hao Zhou
Jun Sun
27
0
0
18 Mar 2021
Set-to-Sequence Methods in Machine Learning: a Review
Mateusz Jurewicz
Leon Derczynski
BDL
32
9
0
17 Mar 2021
CACTUS: Detecting and Resolving Conflicts in Objective Functions
Subhajit Das
Alex Endert
27
0
0
13 Mar 2021
Dual Attention-in-Attention Model for Joint Rain Streak and Raindrop Removal
Kaihao Zhang
Dongxu Li
Wenhan Luo
Wenqi Ren
41
74
0
12 Mar 2021
Full Page Handwriting Recognition via Image to Sequence Extraction
Sumeet S. Singh
Sergey Karayev
27
53
0
11 Mar 2021
Iterative Shrinking for Referring Expression Grounding Using Deep Reinforcement Learning
Mingjie Sun
Jimin Xiao
Eng Gee Lim
ObjD
37
34
0
09 Mar 2021
Analysis of Convolutional Decoder for Image Caption Generation
Sulabh Katiyar
S. Borgohain
26
0
0
08 Mar 2021
Lipschitz Normalization for Self-Attention Layers with Application to Graph Neural Networks
George Dasoulas
Kevin Scaman
Aladin Virmaux
GNN
27
40
0
08 Mar 2021
Relationship-based Neural Baby Talk
Fan Fu
Tingting Xie
Ioannis Patras
Sepehr Jalali
25
0
0
08 Mar 2021
Contextual Dropout: An Efficient Sample-Dependent Dropout Module
Xinjie Fan
Shujian Zhang
Korawat Tanwisuth
Xiaoning Qian
Mingyuan Zhou
OOD
BDL
UQCV
50
27
0
06 Mar 2021
Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision
Andrew Shin
Masato Ishii
T. Narihira
62
38
0
06 Mar 2021
Causal Attention for Vision-Language Tasks
Xu Yang
Hanwang Zhang
Guojun Qi
Jianfei Cai
CML
36
152
0
05 Mar 2021
Enhanced 3D Human Pose Estimation from Videos by using Attention-Based Neural Network with Dilated Convolutions
Ruixu Liu
Ju Shen
He Wang
Chong Chen
S. Cheung
V. Asari
3DH
30
30
0
04 Mar 2021
Coordinate Attention for Efficient Mobile Network Design
Qibin Hou
Daquan Zhou
Jiashi Feng
39
2,977
0
04 Mar 2021
End-to-end acoustic modelling for phone recognition of young readers
Lucile Gelin
Morgane Daniel
J. Pinquier
Thomas Pellegrini
23
13
0
04 Mar 2021
Video Sentiment Analysis with Bimodal Information-augmented Multi-Head Attention
Ting-Wei Wu
Jun-jie Peng
Wenqiang Zhang
Huiran Zhang
Chuan Ma
Yansong Huang
42
85
0
03 Mar 2021
Dual Reinforcement-Based Specification Generation for Image De-Rendering
Ramakanth Pasunuru
David B. Rosenberg
Gideon Mann
Joey Tianyi Zhou
30
0
0
02 Mar 2021
Deep Learning Based Decision Support for Medicine -- A Case Study on Skin Cancer Diagnosis
Adriano Lucieri
Andreas Dengel
Sheraz Ahmed
54
7
0
02 Mar 2021
Listening to the city, attentively: A Spatio-Temporal Attention Boosted Autoencoder for the Short-Term Flow Prediction Problem
Stefano Fiorini
Michele Ciavotta
A. Maurino
16
9
0
01 Mar 2021
Generalization Through Hand-Eye Coordination: An Action Space for Learning Spatially-Invariant Visuomotor Control
Chen Wang
Rui Wang
Ajay Mandlekar
Li Fei-Fei
Silvio Savarese
Danfei Xu
29
29
0
28 Feb 2021
A Universal Model for Cross Modality Mapping by Relational Reasoning
Zun Li
Congyan Lang
Liqian Liang
Tao Wang
Songhe Feng
Jun Wu
Yidong Li
35
2
0
26 Feb 2021
Benchmarking and Survey of Explanation Methods for Black Box Models
F. Bodria
F. Giannotti
Riccardo Guidotti
Francesca Naretto
D. Pedreschi
S. Rinzivillo
XAI
55
224
0
25 Feb 2021
Retrieval Augmentation for Deep Neural Networks
R. Ramos
Patrícia Pereira
Helena Moniz
Joao Paulo Carvalho
Bruno Martins
VLM
24
0
0
25 Feb 2021
Multichannel LSTM-CNN for Telugu Technical Domain Identification
Sunil Gundapu
R. Mamidi
23
7
0
24 Feb 2021
Characterization and recognition of handwritten digits using Julia
Md Asifuzzaman Jishan
M. Alam
A. Islam
I. R. Mazumder
K. Mahmud
A. K. Azad
24
0
0
24 Feb 2021
Enhanced Modality Transition for Image Captioning
Ziwei Wang
Yadan Luo
Zi Huang
19
0
0
23 Feb 2021
Comparative evaluation of CNN architectures for Image Caption Generation
Sulabh Katiyar
S. Borgohain
23
24
0
23 Feb 2021
Model-Attentive Ensemble Learning for Sequence Modeling
Victor D. Bourgin
Ioana Bica
M. Schaar
AI4TS
25
0
0
23 Feb 2021
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation
Sulabh Katiyar
S. Borgohain
VLM
36
14
0
22 Feb 2021
A Hierarchical Conditional Random Field-based Attention Mechanism Approach for Gastric Histopathology Image Classification
Yixin Li
Xinran Wu
Chen Li
Changhao Sun
M. Rahaman
Hao Chen
Yudong Yao
Xiaoyan Li
Yong Zhang
Tao Jiang
42
25
0
21 Feb 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
34
220
0
20 Feb 2021
Hard-Attention for Scalable Image Classification
Athanasios Papadopoulos
Pawel Korus
N. Memon
73
25
0
20 Feb 2021
Progressive Transformer-Based Generation of Radiology Reports
Farhad Nooralahzadeh
Nicolas Andres Perez Gonzalez
T. Frauenfelder
Koji Fujimoto
Michael Krauthammer
ViT
MedIm
34
86
0
19 Feb 2021
Trends in Vehicle Re-identification Past, Present, and Future: A Comprehensive Review
Zakria
Jianhua Deng
Muhammad Saddam Khokhar
Muhammad Umar Aftab
Jingye Cai
Rajesh Kumar
Jay Kumar
43
33
0
19 Feb 2021
I Want This Product but Different : Multimodal Retrieval with Synthetic Query Expansion
Ivona Tautkute
Tomasz Trzciński
39
4
0
17 Feb 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
284
180
0
17 Feb 2021
A Context-Enhanced De-identification System
Kahyun Lee
M. Kayaalp
Sam Henry
Özlem Uzuner
52
3
0
17 Feb 2021
Learning Intra-Batch Connections for Deep Metric Learning
Jenny Seidenschwarz
Ismail Elezi
Laura Leal-Taixé
FedML
24
52
0
15 Feb 2021
A Gated Fusion Network for Dynamic Saliency Prediction
Aysun Kocak
Erkut Erdem
Aykut Erdem
28
7
0
15 Feb 2021
Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
VLM
35
18
0
14 Feb 2021
Image Captioning using Multiple Transformers for Self-Attention Mechanism
Farrukh Olimov
Shikha Dubey
Labina Shrestha
Tran Trung Tin
M. Jeon
ViT
34
2
0
14 Feb 2021
DeepRA: Predicting Joint Damage From Radiographs Using CNN with Attention
Neelam Chaturvedi
29
10
0
13 Feb 2021
InsNet: An Efficient, Flexible, and Performant Insertion-based Text Generation Model
Sidi Lu
Tao Meng
Nanyun Peng
41
12
0
12 Feb 2021
The Role of the Input in Natural Language Video Description
S. Cascianelli
G. Costante
Alessandro Devo
Thomas Alessandro Ciarfuglia
P. Valigi
M. L. Fravolini
29
5
0
09 Feb 2021
In Defense of Scene Graphs for Image Captioning
Kien Nguyen
Subarna Tripathi
Bang Du
T. Guha
Truong Thao Nguyen
39
43
0
09 Feb 2021
Dynamic Neural Networks: A Survey
Yizeng Han
Gao Huang
Shiji Song
Le Yang
Honghui Wang
Yulin Wang
3DH
AI4TS
AI4CE
31
634
0
09 Feb 2021