ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Memory-augmented Dense Predictive Coding for Video Representation
  Learning
Memory-augmented Dense Predictive Coding for Video Representation Learning
Tengda Han
Weidi Xie
Andrew Zisserman
SSL
126
242
0
03 Aug 2020
AUTSL: A Large Scale Multi-modal Turkish Sign Language Dataset and
  Baseline Methods
AUTSL: A Large Scale Multi-modal Turkish Sign Language Dataset and Baseline Methods
Ozge Mercanoglu Sincan
H. Keles
SLR
77
173
0
03 Aug 2020
Efficient Urdu Caption Generation using Attention based LSTM
Efficient Urdu Caption Generation using Attention based LSTM
Inaam Ilahi
Hafiz Muhammad Abdullah Zia
Ahtazaz Ehsan
Rauf Tabassam
Armaghan Ahmed
VLM
67
3
0
02 Aug 2020
A review of deep learning in medical imaging: Imaging traits, technology
  trends, case studies with progress highlights, and future promises
A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises
S. Kevin Zhou
H. Greenspan
Christos Davatzikos
James S. Duncan
Bram van Ginneken
A. Madabhushi
Jerry L. Prince
Daniel Rueckert
Ronald M. Summers
220
650
0
02 Aug 2020
SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space
Liu Yang
VLM
60
5
0
02 Aug 2020
Improving Skeleton-based Action Recognitionwith Robust Spatial and
  Temporal Features
Improving Skeleton-based Action Recognitionwith Robust Spatial and Temporal Features
Zeshi Yang
KangKang Yin
3DPC
65
3
0
01 Aug 2020
Actor-Action Video Classification CSC 249/449 Spring 2020 Challenge
  Report
Actor-Action Video Classification CSC 249/449 Spring 2020 Challenge Report
Jing Shi
Zhiheng Li
Haitian Zheng
Yihang Xu
Tianyou Xiao
...
R. Magnotti
A. Sexton
Jeet Thaker
Oscar Su
Chenliang Xu
47
1
0
01 Aug 2020
Learning to Rank for Active Learning: A Listwise Approach
Learning to Rank for Active Learning: A Listwise Approach
Minghan Li
Xialei Liu
Joost van de Weijer
Bogdan Raducanu
91
22
0
31 Jul 2020
Neural Language Generation: Formulation, Methods, and Evaluation
Neural Language Generation: Formulation, Methods, and Evaluation
Cristina Garbacea
Qiaozhu Mei
165
30
0
31 Jul 2020
Foveation for Segmentation of Ultra-High Resolution Images
Foveation for Segmentation of Ultra-High Resolution Images
Chen Jin
Ryutaro Tanno
Moucheng Xu
T. Mertzanidou
Daniel C. Alexander
AI4TS
53
4
0
29 Jul 2020
Enriching Video Captions With Contextual Text
Enriching Video Captions With Contextual Text
Philipp Rimle
Pelin Dogan
Markus Gross
59
3
0
29 Jul 2020
Improving Recurrent Neural Network Responsiveness to Acute Clinical
  Events
Improving Recurrent Neural Network Responsiveness to Acute Clinical Events
D. Ledbetter
Eugene Laksana
M. Aczon
R. Wetzel
OOD
32
3
0
28 Jul 2020
AiR: Attention with Reasoning Capability
AiR: Attention with Reasoning Capability
Shi Chen
Ming Jiang
Jinhui Yang
Qi Zhao
LRM
56
36
0
28 Jul 2020
Chest X-ray Report Generation through Fine-Grained Label Learning
Chest X-ray Report Generation through Fine-Grained Label Learning
Tanveer Syeda-Mahmood
Ken C. L. Wong
Yaniv Gur
Joy T. Wu
A. Jadhav
...
A. Pillai
Arjun Sharma
A. Syed
Orest Boyko
Mehdi Moradi
95
47
0
27 Jul 2020
RANDOM MASK: Towards Robust Convolutional Neural Networks
RANDOM MASK: Towards Robust Convolutional Neural Networks
Tiange Luo
Tianle Cai
Mengxiao Zhang
Siyu Chen
Liwei Wang
AAMLOOD
100
17
0
27 Jul 2020
Contrastive Visual-Linguistic Pretraining
Contrastive Visual-Linguistic Pretraining
Lei Shi
Kai Shuang
Shijie Geng
Peng Su
Zhengkai Jiang
Peng Gao
Zuohui Fu
Gerard de Melo
Sen Su
VLMSSLCLIP
105
29
0
26 Jul 2020
Dynamically Extracting Outcome-Specific Problem Lists from Clinical
  Notes with Guided Multi-Headed Attention
Dynamically Extracting Outcome-Specific Problem Lists from Clinical Notes with Guided Multi-Headed Attention
Justin Lovelace
N. Hurley
A. Haimovich
B. Mortazavi
69
4
0
25 Jul 2020
Deep Inverse Reinforcement Learning for Structural Evolution of Small
  Molecules
Deep Inverse Reinforcement Learning for Structural Evolution of Small Molecules
Brighter Agyemang
Wei-Ping Wu
Daniel Addo
Michael Y. Kpiebaareh
Ebenezer Nanor
C. R. Haruna
38
7
0
24 Jul 2020
Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object
  Detection
Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object Detection
Xianyu Chen
Ming Jiang
Qi Zhao
ObjD
42
14
0
23 Jul 2020
HCMS at SemEval-2020 Task 9: A Neural Approach to Sentiment Analysis for
  Code-Mixed Texts
HCMS at SemEval-2020 Task 9: A Neural Approach to Sentiment Analysis for Code-Mixed Texts
Aditya Srivastava
V. H. Vardhan
77
5
0
23 Jul 2020
Comprehensive Image Captioning via Scene Graph Decomposition
Comprehensive Image Captioning via Scene Graph Decomposition
Yiwu Zhong
Liwei Wang
Jianshu Chen
Dong Yu
Yin Li
137
128
0
23 Jul 2020
Integrating Image Captioning with Rule-based Entity Masking
Integrating Image Captioning with Rule-based Entity Masking
Aditya Mogadala
Xiaoyu Shen
Dietrich Klakow
34
7
0
22 Jul 2020
Attend and Segment: Attention Guided Active Semantic Segmentation
Attend and Segment: Attention Guided Active Semantic Segmentation
Soroush Seifi
Tinne Tuytelaars
71
13
0
22 Jul 2020
BAKSA at SemEval-2020 Task 9: Bolstering CNN with Self-Attention for
  Sentiment Analysis of Code Mixed Text
BAKSA at SemEval-2020 Task 9: Bolstering CNN with Self-Attention for Sentiment Analysis of Code Mixed Text
Ayush Kumar
Harsh Agarwal
Keshav Bansal
Ashutosh Modi
35
12
0
21 Jul 2020
Fine-Grained Image Captioning with Global-Local Discriminative Objective
Fine-Grained Image Captioning with Global-Local Discriminative Objective
Jie Wu
Tianshui Chen
Hefeng Wu
Zhi Yang
Guangchun Luo
Liang Lin
70
59
0
21 Jul 2020
A Generic Visualization Approach for Convolutional Neural Networks
A Generic Visualization Approach for Convolutional Neural Networks
Ahmed Taha
Xitong Yang
Abhinav Shrivastava
L. Davis
49
8
0
19 Jul 2020
Length-Controllable Image Captioning
Length-Controllable Image Captioning
Chaorui Deng
Ning Ding
Mingkui Tan
Qi Wu
VLM
81
57
0
19 Jul 2020
Understanding Spatial Relations through Multiple Modalities
Understanding Spatial Relations through Multiple Modalities
Soham Dan
Hangfeng He
Dan Roth
36
6
0
19 Jul 2020
Deep Learning Based Brain Tumor Segmentation: A Survey
Deep Learning Based Brain Tumor Segmentation: A Survey
Zhihua Liu
Lei Tong
Zheheng Jiang
Long Chen
Feixiang Zhou
Qianni Zhang
Xiangrong Zhang
Ling Li
Huiyu Zhou
3DV
110
238
0
18 Jul 2020
Volumetric Transformer Networks
Volumetric Transformer Networks
Seungryong Kim
Sabine Süsstrunk
Mathieu Salzmann
ViT
107
5
0
18 Jul 2020
Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person
  Re-Identification
Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification
Mang Ye
Jianbing Shen
David J. Crandall
Ling Shao
Jiebo Luo
93
324
0
18 Jul 2020
Kronecker Attention Networks
Kronecker Attention Networks
Hongyang Gao
Zhengyang Wang
Shuiwang Ji
55
33
0
16 Jul 2020
Active Visual Information Gathering for Vision-Language Navigation
Active Visual Information Gathering for Vision-Language Navigation
Hanqing Wang
Wenguan Wang
Tianmin Shu
Wei Liang
Jianbing Shen
145
73
0
15 Jul 2020
RobustScanner: Dynamically Enhancing Positional Clues for Robust Text
  Recognition
RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition
Xiaoyu Yue
Zhanghui Kuang
Chenhao Lin
Hongbin Sun
Wayne Zhang
94
162
0
15 Jul 2020
Explore and Explain: Self-supervised Navigation and Recounting
Explore and Explain: Self-supervised Navigation and Recounting
Roberto Bigazzi
Federico Landi
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
EgoVLM&Ro
78
17
0
14 Jul 2020
Compare and Reweight: Distinctive Image Captioning Using Similar Images
  Sets
Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
70
45
0
14 Jul 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image
  Captioning
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning
Riccardo Del Chiaro
Bartlomiej Twardowski
Andrew D. Bagdanov
Joost van de Weijer
CLLVLM
79
41
0
13 Jul 2020
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual
  Sequence Generation
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation
Aditya Mogadala
Marius Mosbach
Dietrich Klakow
VLM
370
0
0
12 Jul 2020
Applying recent advances in Visual Question Answering to Record Linkage
Applying recent advances in Visual Question Answering to Record Linkage
Marko Smilevski
22
0
0
12 Jul 2020
Image Captioning with Compositional Neural Module Networks
Image Captioning with Compositional Neural Module Networks
Junjiao Tian
Jean Oh
44
11
0
10 Jul 2020
Attention or memory? Neurointerpretable agents in space and time
Attention or memory? Neurointerpretable agents in space and time
Lennart Bramlage
A. Cortese
53
1
0
09 Jul 2020
Fast Transformers with Clustered Attention
Fast Transformers with Clustered Attention
Apoorv Vyas
Angelos Katharopoulos
Franccois Fleuret
96
156
0
09 Jul 2020
Graph-Based Continual Learning
Graph-Based Continual Learning
Binh Tang
David S. Matteson
BDLCLL
76
37
0
09 Jul 2020
Learning to Reweight with Deep Interactions
Learning to Reweight with Deep Interactions
Yang Fan
Yingce Xia
Lijun Wu
Shufang Xie
Weiqing Liu
Jiang Bian
Tao Qin
Xiang-Yang Li
81
9
0
09 Jul 2020
PathGAN: Local Path Planning with Attentive Generative Adversarial
  Networks
PathGAN: Local Path Planning with Attentive Generative Adversarial Networks
Dooseop Choi
Seung-Jun Han
Kyoung‐Wook Min
Jeongdan Choi
GAN
59
5
0
08 Jul 2020
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal
  Shuffled Transformers
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers
Shijie Geng
Peng Gao
Moitreya Chatterjee
Chiori Hori
Jonathan Le Roux
Yongfeng Zhang
Hongsheng Li
A. Cherian
101
11
0
08 Jul 2020
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent
  Experts
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent Experts
Marzi Heidari
M. Ghatee
A. Nickabadi
Arash Pourhasan Nezhad
DiffMMoE
84
1
0
07 Jul 2020
RGBT Salient Object Detection: A Large-scale Dataset and Benchmark
RGBT Salient Object Detection: A Large-scale Dataset and Benchmark
Zhengzheng Tu
Yan Ma
Zhun Li
Chenglong Li
Jieming Xu
Yongtao Liu
3DV
84
165
0
07 Jul 2020
EDSL: An Encoder-Decoder Architecture with Symbol-Level Features for
  Printed Mathematical Expression Recognition
EDSL: An Encoder-Decoder Architecture with Symbol-Level Features for Printed Mathematical Expression Recognition
Yingnan Fu
Tingting Liu
Ming Gao
Aoying Zhou
100
7
0
06 Jul 2020
Automatically Generating Codes from Graphical Screenshots Based on Deep
  Autocoder
Automatically Generating Codes from Graphical Screenshots Based on Deep Autocoder
Xiaoling Huang
Feng Liao
144
0
0
05 Jul 2020
Previous
123...293031...697071
Next