ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
When Point Process Meets RNNs: Predicting Fine-Grained User Interests with Mutual Behavioral Infectivity
Tongbing Chen
Lin Wu
Yang Wang
Jun Zhang
Hongxu Chen
Xue Li
AI4TS
43
0
0
14 Oct 2017
Convolutional Attention-based Seq2Seq Neural Network for End-to-End ASR
Convolutional Attention-based Seq2Seq Neural Network for End-to-End ASR
D. Lim
37
2
0
12 Oct 2017
Lung Cancer Screening Using Adaptive Memory-Augmented Recurrent Networks
Lung Cancer Screening Using Adaptive Memory-Augmented Recurrent Networks
Aryan Mobiny
S. Moulik
H. Nguyen
53
13
0
11 Oct 2017
Deep learning in remote sensing: a review
Deep learning in remote sensing: a review
Xiaoxiang Zhu
D. Tuia
Lichao Mou
Gui-Song Xia
Liangpei Zhang
Feng Xu
Friedrich Fraundorfer
102
1,627
0
11 Oct 2017
iVQA: Inverse Visual Question Answering
iVQA: Inverse Visual Question Answering
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
66
47
0
10 Oct 2017
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on
  Rough Terrain Challenge
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on Rough Terrain Challenge
Doo Re Song
Chuanyu Yang
C. McGreavy
Zhibin Li
167
30
0
08 Oct 2017
OSU Multimodal Machine Translation System Report
OSU Multimodal Machine Translation System Report
Mingbo Ma
Dapeng Li
Kai Zhao
Liang Huang
83
15
0
07 Oct 2017
Contrastive Learning for Image Captioning
Contrastive Learning for Image Captioning
Bo Dai
Dahua Lin
SSLVLM
105
194
0
06 Oct 2017
Semantic speech retrieval with a visually grounded model of
  untranscribed speech
Semantic speech retrieval with a visually grounded model of untranscribed speech
Herman Kamper
Gregory Shakhnarovich
Karen Livescu
73
53
0
05 Oct 2017
Person Re-Identification with Vision and Language
Person Re-Identification with Vision and Language
F. Yan
K. Mikolajczyk
J. Kittler
VLM
34
11
0
03 Oct 2017
Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken
  Language Understanding
Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding
Po-Chun Chen
Ta-Chung Chi
Shang-Yu Su
Yun-Nung Chen
63
28
0
30 Sep 2017
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
Xihui Liu
Haiyu Zhao
Maoqing Tian
Lu Sheng
Jing Shao
Shuai Yi
Junjie Yan
Xiaogang Wang
119
517
0
28 Sep 2017
Sentiment Classification with Word Attention based on Weakly Supervised
  Learning with a Convolutional Neural Network
Sentiment Classification with Word Attention based on Weakly Supervised Learning with a Convolutional Neural Network
Gichang Lee
Jaeyun Jeong
Seungwan Seo
Czangyeob Kim
Pilsung Kang
33
49
0
28 Sep 2017
Fooling Vision and Language Models Despite Localization and Attention
  Mechanism
Fooling Vision and Language Models Despite Localization and Attention Mechanism
Xiaojun Xu
Xinyun Chen
Chang-rui Liu
Anna Rohrbach
Trevor Darrell
Basel Alomair
AAML
106
41
0
25 Sep 2017
Muon Trigger for Mobile Phones
Muon Trigger for Mobile Phones
M. Borisyak
Mikhail (Misha) Usvyatsov
M. Mulhearn
C. Shimmin
Andrey Ustyuzhanin
21
10
0
25 Sep 2017
The Consciousness Prior
The Consciousness Prior
Yoshua Bengio
DRLAI4CE
91
231
0
25 Sep 2017
Visual Reference Resolution using Attention Memory for Visual Dialog
Visual Reference Resolution using Attention Memory for Visual Dialog
Paul Hongsuck Seo
Andreas M. Lehrmann
Bohyung Han
Leonid Sigal
107
123
0
23 Sep 2017
Learning to Detect Violent Videos using Convolutional Long Short-Term
  Memory
Learning to Detect Violent Videos using Convolutional Long Short-Term Memory
Swathikiran Sudhakaran
Oswald Lanz
67
217
0
19 Sep 2017
Learning to update Auto-associative Memory in Recurrent Neural Networks
  for Improving Sequence Memorization
Learning to update Auto-associative Memory in Recurrent Neural Networks for Improving Sequence Memorization
Wei Zhang
Bowen Zhou
65
13
0
19 Sep 2017
Where to Focus: Deep Attention-based Spatially Recurrent Bilinear
  Networks for Fine-Grained Visual Recognition
Where to Focus: Deep Attention-based Spatially Recurrent Bilinear Networks for Fine-Grained Visual Recognition
Lin Wu
Yang Wang
53
9
0
18 Sep 2017
Deep Graph Attention Model
Deep Graph Attention Model
J. B. Lee
Ryan Rossi
Xiangnan Kong
GNN
44
13
0
15 Sep 2017
Learning Functional Causal Models with Generative Neural Networks
Learning Functional Causal Models with Generative Neural Networks
Hugo Jair Escalante
Sergio Escalera
Xavier Baro
Isabelle M Guyon
Umut Güçlü
Marcel van Gerven
CMLBDL
109
108
0
15 Sep 2017
Self-Guiding Multimodal LSTM - when we do not have a perfect training
  dataset for image captioning
Self-Guiding Multimodal LSTM - when we do not have a perfect training dataset for image captioning
Yang Xian
Yingli Tian
VLM
59
23
0
15 Sep 2017
Robustness Analysis of Visual QA Models by Basic Questions
Robustness Analysis of Visual QA Models by Basic Questions
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
C. Huck Yang
Guohao Li
OOD
65
24
0
14 Sep 2017
Natural Language Inference over Interaction Space
Natural Language Inference over Interaction Space
Yichen Gong
Heng Luo
Jian Zhang
115
265
0
13 Sep 2017
RRA: Recurrent Residual Attention for Sequence Learning
RRA: Recurrent Residual Attention for Sequence Learning
Cheng Wang
35
13
0
12 Sep 2017
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
Jiuxiang Gu
Jianfei Cai
G. Wang
Tsuhan Chen
126
181
0
11 Sep 2017
Steering Output Style and Topic in Neural Response Generation
Steering Output Style and Topic in Neural Response Generation
Di Wang
Nebojsa Jojic
Chris Brockett
Eric Nyberg
LLMSV
86
68
0
09 Sep 2017
Cross-Domain Image Retrieval with Attention Modeling
Cross-Domain Image Retrieval with Attention Modeling
Xin Ji
Wei Wang
Mei-juan Zhang
Yang Yang
109
82
0
06 Sep 2017
Interacting Attention-gated Recurrent Networks for Recommendation
Interacting Attention-gated Recurrent Networks for Recommendation
Wenjie Pei
Jie Yang
Zhu Sun
Jie Zhang
A. Bozzon
David Tax
GNNHAI
56
68
0
05 Sep 2017
Squeeze-and-Excitation Networks
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
506
26,777
0
05 Sep 2017
Unsupervised feature learning with discriminative encoder
Unsupervised feature learning with discriminative encoder
Gaurav Pandey
Ambedkar Dukkipati
SSL
60
6
0
03 Sep 2017
Learning to Look Around: Intelligently Exploring Unseen Environments for
  Unknown Tasks
Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown Tasks
Dinesh Jayaraman
Kristen Grauman
SSL
87
106
0
01 Sep 2017
Query-by-example Spoken Term Detection using Attention-based Multi-hop
  Networks
Query-by-example Spoken Term Detection using Attention-based Multi-hop Networks
Chia-Wei Ao
Hung-yi Lee
60
22
0
01 Sep 2017
Predicting Cardiovascular Risk Factors from Retinal Fundus Photographs
  using Deep Learning
Predicting Cardiovascular Risk Factors from Retinal Fundus Photographs using Deep Learning
Ryan Poplin
A. Varadarajan
Katy Blumer
Yun-Hui Liu
M. McConnell
G. Corrado
L. Peng
D. Webster
MedIm
87
1,344
0
31 Aug 2017
Video Captioning with Guidance of Multimodal Latent Topics
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen
Jia Chen
Qin Jin
Alexander G. Hauptmann
125
67
0
31 Aug 2017
Generating Video Descriptions with Topic Guidance
Generating Video Descriptions with Topic Guidance
Shizhe Chen
Jia Chen
Qin Jin
88
21
0
31 Aug 2017
Video Summarization with Attention-Based Encoder-Decoder Networks
Video Summarization with Attention-Based Encoder-Decoder Networks
Zhong Ji
Kailin Xiong
Yanwei Pang
Xuelong Li
94
308
0
31 Aug 2017
Action Classification and Highlighting in Videos
Action Classification and Highlighting in Videos
Atousa Torabi
Leonid Sigal
79
5
0
31 Aug 2017
End-to-end Learning for Short Text Expansion
End-to-end Learning for Short Text Expansion
Jian Tang
Yue Wang
Kai Zheng
Qiaozhu Mei
LLMAG
81
19
0
30 Aug 2017
Hierarchical Multi-scale Attention Networks for Action Recognition
Hierarchical Multi-scale Attention Networks for Action Recognition
Shiyang Yan
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
91
37
0
25 Aug 2017
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Victor Campos
Brendan Jou
Xavier Giró-i-Nieto
Jordi Torres
Shih-Fu Chang
95
220
0
22 Aug 2017
Twin Networks: Matching the Future for Sequence Generation
Twin Networks: Matching the Future for Sequence Generation
Dmitriy Serdyuk
Nan Rosemary Ke
Alessandro Sordoni
Adam Trischler
C. Pal
Yoshua Bengio
69
12
0
22 Aug 2017
PiCANet: Learning Pixel-wise Contextual Attention for Saliency Detection
PiCANet: Learning Pixel-wise Contextual Attention for Saliency Detection
Nian Liu
Junwei Han
Ming-Hsuan Yang
SSeg
76
6
0
21 Aug 2017
Attentive Semantic Video Generation using Captions
Attentive Semantic Video Generation using Captions
Tanya Marwah
Gaurav Mittal
V. Balasubramanian
94
72
0
20 Aug 2017
A Brief Survey of Deep Reinforcement Learning
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
185
2,830
0
19 Aug 2017
Image2song: Song Retrieval via Bridging Image Content and Lyric Words
Image2song: Song Retrieval via Bridging Image Content and Lyric Words
Xuelong Li
Di Hu
Xiaoqiang Lu
56
10
0
19 Aug 2017
Incorporating Copying Mechanism in Image Captioning for Learning Novel
  Objects
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
VLM
82
148
0
17 Aug 2017
Modality-specific Cross-modal Similarity Measurement with Recurrent
  Attention Network
Modality-specific Cross-modal Similarity Measurement with Recurrent Attention Network
Yuxin Peng
Jinwei Qi
Yuxin Yuan
48
124
0
16 Aug 2017
VQS: Linking Segmentations to Questions and Answers for Supervised
  Attention in VQA and Question-Focused Semantic Segmentation
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
109
127
0
15 Aug 2017
Previous
123...585960...697071
Next