ResearchTrend.AI
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Modality Shifting Attention Network for Multi-modal Video Question Answering
Junyeong Kim
Minuk Ma
T. Pham
Kyungsu Kim
Chang D. Yoo
91
72
0
04 Jul 2020
A Few-Shot Sequential Approach for Object Counting
Negin Sokhandan
Pegah Kamousi
Alejandro Posada
Eniola Alese
Negar Rostamzadeh
63
3
0
03 Jul 2020
Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition
Bin-Bin Gao
Hong-Yu Zhou
71
115
0
03 Jul 2020
Synergistic saliency and depth prediction for RGB-D saliency detection
Yue Wang
Yuke Li
J. Elder
Huchuan Lu
Runmin Wu
Lu Zhang
MDE
106
8
0
03 Jul 2020
Balanced Symmetric Cross Entropy for Large Scale Imbalanced and Noisy Data
Feifei Huang
Jie Li
Xuelin Zhu
25
10
0
03 Jul 2020
Modality-Agnostic Attention Fusion for visual search with text feedback
Eric Dodds
Jack Culpepper
Simão Herdade
Yang Zhang
K. Boakye
EgoV
109
74
0
30 Jun 2020
AdaSGD: Bridging the gap between SGD and Adam
Jiaxuan Wang
Jenna Wiens
77
10
0
30 Jun 2020
Vehicle Attribute Recognition by Appearance: Computer Vision Methods for Vehicle Type, Make and Model Classification
Xingyang Ni
H. Huttunen
CVBM
43
20
0
29 Jun 2020
Self-Attention Networks for Intent Detection
Sevinj Yolchuyeva
Géza Németh
Bálint Gyires-Tóth
27
13
0
28 Jun 2020
Listen carefully and tell: an audio captioning system based on residual learning and gammatone audio representation
Sergi Perez-Castanos
Javier Naranjo-Alcazar
P. Zuccarello
M. Cobos
70
11
0
27 Jun 2020
Modeling Long-Term and Short-Term Interests with Parallel Attentions for Session-based Recommendation
Jing Zhu
Yanan Xu
Yanmin Zhu
HAI
20
11
0
27 Jun 2020
ULSAM: Ultra-Lightweight Subspace Attention Module for Compact Convolutional Neural Networks
Rajat Saini
N. Jha
B. K. Das
Sparsh Mittal
C. Krishna Mohan
71
83
0
26 Jun 2020
Graph Optimal Transport for Cross-Domain Alignment
Liqun Chen
Zhe Gan
Yu Cheng
Linjie Li
Lawrence Carin
Jingjing Liu
OT
129
152
0
26 Jun 2020
Self-Segregating and Coordinated-Segregating Transformer for Focused Deep Multi-Modular Network for Visual Question Answering
C. Sur
30
9
0
25 Jun 2020
AReLU: Attention-based Rectified Linear Unit
Dengsheng Chen
Jun Li
Kai Xu
87
20
0
24 Jun 2020
Differentiable Window for Dynamic Local Attention
Thanh-Tung Nguyen
Xuan-Phi Nguyen
Shafiq Joty
Xiaoli Li
56
13
0
24 Jun 2020
Robot Object Retrieval with Contextual Natural Language Queries
Thao Nguyen
N. Gopalan
Roma Patel
Matt Corsaro
Ellie Pavlick
Stefanie Tellex
LM&Ro
81
53
0
23 Jun 2020
Neural Cellular Automata Manifold
Alejandro Hernandez Ruiz
Armand Vilalta
Francesc Moreno-Noguer
57
9
0
22 Jun 2020
Improving Image Captioning with Better Use of Captions
Zhan Shi
Xu Zhou
Xipeng Qiu
Xiao-Dan Zhu
68
128
0
21 Jun 2020
Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation
Shiyang Yan
Yang Hua
N. Robertson
OffRL
52
0
0
21 Jun 2020
A3T-GCN: Attention Temporal Graph Convolutional Network for Traffic Forecasting
Jiawei Zhu
Yujiao Song
Ling Zhao
Haifeng Li
AI4TS
72
278
0
20 Jun 2020
Predicting Temporal Sets with Deep Neural Networks
Le Yu
Leilei Sun
Bowen Du
Chuanren Liu
Hui Xiong
Weifeng Lv
89
45
0
20 Jun 2020
Concatenated Attention Neural Network for Image Restoration
Ying-jie Tian
Yiqi Wang
LinRui Yang
Zhiquan Qi
57
11
0
19 Jun 2020
Adversarial Attacks for Multi-view Deep Models
Xuli Sun
Shiliang Sun
AAML
39
0
0
19 Jun 2020
Hyperparameter Analysis for Image Captioning
Amish Patel
Aravind Varier
73
2
0
19 Jun 2020
Automated Radiological Report Generation For Chest X-Rays With Weakly-Supervised End-to-End Deep Learning
Shuai Zhang
Xiaoyan Xin
Yang Wang
Yachong Guo
Q. Hao
Xianfeng Yang
Jun Wang
Jian Zhang
Bing Zhang
Wei Wang
MedIm
43
1
0
18 Jun 2020
Category-Specific CNN for Visual-aware CTR Prediction at JD.com
Hu Liu
Jing Lu
Hao Yang
Xiwei Zhao
Sulong Xu
...
Zehua Zhang
Wenjie Niu
Xiaokun Zhu
Yongjun Bao
Weipeng P. Yan
71
32
0
18 Jun 2020
XRayGAN: Consistency-preserving Generation of X-ray Images from Radiology Reports
Xingyi Yang
Nandiraju Gireesh
Eric Xing
P. Xie
MedIm
51
3
0
17 Jun 2020
Visual Attention for Musical Instrument Recognition
Karn N. Watcharasupat
Siddharth Gururani
Alexander Lerch
49
3
0
17 Jun 2020
Cross-Correlated Attention Networks for Person Re-Identification
Jieming Zhou
S. Roy
Pengfei Fang
Mehrtash Harandi
L. Petersson
56
16
0
17 Jun 2020
A generalizable saliency map-based interpretation of model outcome
Shailja Thakur
S. Fischmeister
AAML, FAtt, MILM
41
2
0
16 Jun 2020
Visualization for Histopathology Images using Graph Convolutional Neural Networks
M. Sureka
Abhijeet Patil
Deepak Anand
A. Sethi
FAtt, GNN, MedIm
68
36
0
16 Jun 2020
Unsupervised Pansharpening Based on Self-Attention Mechanism
Ying Qu
Razieh Kaviani Baghbaderani
Hairong Qi
C. Kwan
81
69
0
16 Jun 2020
Global Feature Aggregation for Accident Anticipation
Mishal Fatima
Muhammad Umar Karim Khan
C. Kyung
83
19
0
16 Jun 2020
SD-RSIC: Summarization Driven Deep Remote Sensing Image Captioning
Gencer Sumbul
Sonali Nayak
Begüm Demir
63
77
0
15 Jun 2020
ORD: Object Relationship Discovery for Visual Dialogue Generation
Ziwei Wang
Zi Huang
Yadan Luo
Huimin Lu
57
4
0
15 Jun 2020
Mitigating Gender Bias in Captioning Systems
Ruixiang Tang
Mengnan Du
Yuening Li
Zirui Liu
Na Zou
Helen Zhou
FaML
142
66
0
15 Jun 2020
AMENet: Attentive Maps Encoder Network for Trajectory Prediction
Hao Cheng
Wentong Liao
M. Yang
Bodo Rosenhahn
Monika Sester
90
46
0
15 Jun 2020
Towards Robust Pattern Recognition: A Review
Xu-Yao Zhang
Cheng-Lin Liu
C. Suen
OOD, HAI
73
110
0
12 Jun 2020
Incorporating User Micro-behaviors and Item Knowledge into Multi-task Learning for Session-based Recommendation
Wenjing Meng
Deqing Yang
Yanghua Xiao
70
110
0
12 Jun 2020
RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams
Vasiliki Kougia
John Pavlopoulos
P. Papapetrou
Max Gordon
50
0
0
11 Jun 2020
Dance Revolution: Long-Term Dance Generation with Music via Curriculum Learning
Ruozi Huang
Huang Hu
Wei Wu
Kei Sawada
Mi Zhang
Daxin Jiang
125
122
0
11 Jun 2020
Report from the NSF Future Directions Workshop, Toward User-Oriented Agents: Research Directions and Challenges
M. Eskénazi
Tiancheng Zhao
LLMAG, AI4TS, AI4CE
93
9
0
10 Jun 2020
MultiResolution Attention Extractor for Small Object Detection
Fan Zhang
L. Jiao
Lingling Li
Fang Liu
Xu Liu
ObjD
49
11
0
10 Jun 2020
Toward Building Safer Smart Homes for the People with Disabilities
Shahinur Alam
M. Mahmud
M. Yeasin
30
4
0
10 Jun 2020
Why Attentions May Not Be Interpretable?
Bing Bai
Jian Liang
Guanhua Zhang
Hao Li
Kun Bai
Fei Wang
FAtt
100
61
0
10 Jun 2020
Cost-effective Interactive Attention Learning with Neural Attention Processes
Jay Heo
Junhyeong Park
Hyewon Jeong
Kwang Joon Kim
Juho Lee
Eunho Yang
Sung Ju Hwang
50
8
0
09 Jun 2020
Physically constrained short-term vehicle trajectory forecasting with naive semantic maps
Albert Dulian
J. Murray
33
0
0
09 Jun 2020
Text Detection and Recognition in the Wild: A Review
Z. Raisi
Mohamed A. Naiel
Paul Fieguth
Steven Wardell
John S. Zelek
90
35
0
08 Jun 2020
FMA-ETA: Estimating Travel Time Entirely Based on FFN With Attention
Yiwen Sun
Yulu Wang
Kun Fu
Zheng Wang
Ziang Yan
Changshui Zhang
Jieping Ye
AI4TS
46
16
0
07 Jun 2020