ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Mimic and Fool: A Task Agnostic Adversarial Attack
Mimic and Fool: A Task Agnostic Adversarial Attack
Akshay Chaturvedi
Utpal Garain
AAML
57
27
0
11 Jun 2019
Relationship-Embedded Representation Learning for Grounding Referring
  Expressions
Relationship-Embedded Representation Learning for Grounding Referring Expressions
Sibei Yang
Guanbin Li
Yizhou Yu
ObjD
97
55
0
11 Jun 2019
Bag of Color Features For Color Constancy
Bag of Color Features For Color Constancy
Firas Laakom
Nikolaos Passalis
Jenni Raitoharju
Jarno Nikkanen
Anastasios Tefas
Alexandros Iosifidis
Moncef Gabbouj
46
33
0
11 Jun 2019
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Yale Song
M. Soleymani
84
247
0
11 Jun 2019
Improving Neural Language Modeling via Adversarial Training
Improving Neural Language Modeling via Adversarial Training
Dilin Wang
Chengyue Gong
Qiang Liu
AAML
124
119
0
10 Jun 2019
An Attention-based Recurrent Convolutional Network for Vehicle Taillight
  Recognition
An Attention-based Recurrent Convolutional Network for Vehicle Taillight Recognition
Kuan-Hui Lee
Takaaki Tagawa
Jia Pan
Adrien Gaidon
B. Douillard
ViT
37
15
0
09 Jun 2019
Attention-based Conditioning Methods for External Knowledge Integration
Attention-based Conditioning Methods for External Knowledge Integration
Katerina Margatina
Christos Baziotis
Alexandros Potamianos
51
30
0
09 Jun 2019
Attending to Discriminative Certainty for Domain Adaptation
Attending to Discriminative Certainty for Domain Adaptation
V. Kurmi
Shanu Kumar
Vinay P. Namboodiri
OOD
98
108
0
08 Jun 2019
Figure Captioning with Reasoning and Sequence-Level Training
Figure Captioning with Reasoning and Sequence-Level Training
Charles C. Chen
Ruiyi Zhang
Eunyee Koh
Sungchul Kim
Scott D. Cohen
Tong Yu
Ryan Rossi
Razvan Bunescu
AIMat
69
39
0
07 Jun 2019
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video
Zhenfang Chen
Lin Ma
Wenhan Luo
Kwan-Yee K. Wong
105
103
0
06 Jun 2019
Towards Interpretable Reinforcement Learning Using Attention Augmented
  Agents
Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Alex Mott
Daniel Zoran
Mike Chrzanowski
Daan Wierstra
Danilo Jimenez Rezende
74
192
0
06 Jun 2019
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via
  Question Answering
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering
Zhou Yu
D. Xu
Jun-chen Yu
Ting Yu
Zhou Zhao
Yueting Zhuang
Dacheng Tao
157
478
0
06 Jun 2019
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Zhengjun Zha
Daqing Liu
Hanwang Zhang
Yongdong Zhang
Feng Wu
66
122
0
06 Jun 2019
Neural Legal Judgment Prediction in English
Neural Legal Judgment Prediction in English
Ilias Chalkidis
Ion Androutsopoulos
Nikolaos Aletras
AILawELM
190
342
0
05 Jun 2019
Large-Scale Multi-Label Text Classification on EU Legislation
Large-Scale Multi-Label Text Classification on EU Legislation
Ilias Chalkidis
Manos Fergadiotis
Prodromos Malakasiotis
Ion Androutsopoulos
AILaw
66
217
0
05 Jun 2019
Machine Learning and System Identification for Estimation in Physical
  Systems
Machine Learning and System Identification for Estimation in Physical Systems
Fredrik Bagge Carlson
OOD
56
5
0
05 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences
KERMIT: Generative Insertion-Based Modeling for Sequences
William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
VLM
96
65
0
04 Jun 2019
Natural Vocabulary Emerges from Free-Form Annotations
Natural Vocabulary Emerges from Free-Form Annotations
Jordi Pont-Tuset
Michael Gygli
V. Ferrari
VLM
90
3
0
04 Jun 2019
Masked Non-Autoregressive Image Captioning
Masked Non-Autoregressive Image Captioning
Junlong Gao
Xi Meng
Shiqi Wang
Xia Li
Shanshe Wang
Siwei Ma
Wen Gao
80
39
0
03 Jun 2019
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic
  Attention for Neural TTS
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic Attention for Neural TTS
Mutian He
Yan Deng
Lei He
97
81
0
03 Jun 2019
Listening while Speaking and Visualizing: Improving ASR through
  Multimodal Chain
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain
Johanes Effendi
Andros Tjandra
S. Sakti
Satoshi Nakamura
68
3
0
03 Jun 2019
A Survey of Natural Language Generation Techniques with a Focus on
  Dialogue Systems - Past, Present and Future Directions
A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions
Sashank Santhanam
Samira Shaikh
3DV
84
52
0
02 Jun 2019
Unsupervised Bilingual Lexicon Induction from Mono-lingual Multimodal
  Data
Unsupervised Bilingual Lexicon Induction from Mono-lingual Multimodal Data
Shizhe Chen
Qin Jin
Alexander G. Hauptmann
SSL
46
9
0
02 Jun 2019
Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular
  Video
Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video
Jian Liu
Naveed Akhtar
Ajmal Mian
3DH
66
10
0
01 Jun 2019
Do Human Rationales Improve Machine Explanations?
Do Human Rationales Improve Machine Explanations?
Julia Strout
Ye Zhang
Raymond J. Mooney
89
58
0
31 May 2019
Audio Caption in a Car Setting with a Sentence-Level Loss
Audio Caption in a Car Setting with a Sentence-Level Loss
Xuenan Xu
Heinrich Dinkel
Mengyue Wu
Kai Yu
31
2
0
31 May 2019
Interactive-predictive neural multimodal systems
Interactive-predictive neural multimodal systems
Álvaro Peris
F. Casacuberta
KELMHAI
47
2
0
30 May 2019
Meta Dropout: Learning to Perturb Features for Generalization
Meta Dropout: Learning to Perturb Features for Generalization
Haebeom Lee
Taewook Nam
Eunho Yang
Sung Ju Hwang
OOD
68
3
0
30 May 2019
Adversarial Sub-sequence for Text Generation
Adversarial Sub-sequence for Text Generation
Xingyuan Chen
Yanzhe Li
Peng Jin
Jiuhua Zhang
Xinyu Dai
Jiajun Chen
Gang Song
GAN
55
5
0
30 May 2019
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language
  Feedback
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback
Hui Wu
Yupeng Gao
Xiaoxiao Guo
Ziad Al-Halah
Steven J. Rennie
Kristen Grauman
Rogerio Feris
EgoV
167
68
0
30 May 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
79
37
0
29 May 2019
Recurrent Existence Determination Through Policy Optimization
Recurrent Existence Determination Through Policy Optimization
Baoxiang Wang
47
1
0
29 May 2019
Semantic Fisher Scores for Task Transfer: Using Objects to Classify
  Scenes
Semantic Fisher Scores for Task Transfer: Using Objects to Classify Scenes
Mandar Dixit
Yunsheng Li
Nuno Vasconcelos
86
14
0
27 May 2019
Audio2Face: Generating Speech/Face Animation from Single Audio with
  Attention-Based Bidirectional LSTM Networks
Audio2Face: Generating Speech/Face Animation from Single Audio with Attention-Based Bidirectional LSTM Networks
Guanzhong Tian
Yi Yuan
Yang Liu
CVBM
86
45
0
27 May 2019
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient
  Models
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient Models
Linfeng Zhang
Zhanhong Tan
Jiebo Song
Jingwei Chen
Chenglong Bao
Kaisheng Ma
55
71
0
27 May 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing
  general artificial intelligence
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
148
122
0
27 May 2019
Transcribing Content from Structural Images with Spotlight Mechanism
Transcribing Content from Structural Images with Spotlight Mechanism
Yu Yin
Zhenya Huang
Enhong Chen
Qi Liu
Fuzheng Zhang
Xing Xie
Guoping Hu
48
22
0
27 May 2019
Extreme Multi-Label Legal Text Classification: A case study in EU
  Legislation
Extreme Multi-Label Legal Text Classification: A case study in EU Legislation
Ilias Chalkidis
Manos Fergadiotis
Prodromos Malakasiotis
Nikolaos Aletras
Ion Androutsopoulos
AILaw
86
75
0
26 May 2019
Simple and Effective Curriculum Pointer-Generator Networks for Reading
  Comprehension over Long Narratives
Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives
Yi Tay
Shuohang Wang
Anh Tuan Luu
Jie Fu
Minh C. Phan
Xingdi Yuan
J. Rao
S. Hui
Aston Zhang
118
110
0
26 May 2019
A Survey on Biomedical Image Captioning
A Survey on Biomedical Image Captioning
Vasiliki Kougia
John Pavlopoulos
Ion Androutsopoulos
MedIm
94
83
0
26 May 2019
Path Ranking with Attention to Type Hierarchies
Path Ranking with Attention to Type Hierarchies
Weiyu Liu
A. Daruna
Z. Kira
Sonia Chernova
AIMat
70
13
0
26 May 2019
DIANet: Dense-and-Implicit Attention Network
DIANet: Dense-and-Implicit Attention Network
Zhongzhan Huang
Senwei Liang
Mingfu Liang
Haizhao Yang
CVBM
82
57
0
25 May 2019
Bivariate Beta-LSTM
Bivariate Beta-LSTM
Kyungwoo Song
Joonho Jang
Seung-Jae Shin
Il-Chul Moon
51
6
0
25 May 2019
Pose-adaptive Hierarchical Attention Network for Facial Expression
  Recognition
Pose-adaptive Hierarchical Attention Network for Facial Expression Recognition
Yuanyuan Liu
Jiyao Peng
Jiabei Zeng
Shiguang Shan
CVBM
69
16
0
24 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire
  Evacuation Environment
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
78
70
0
23 May 2019
AttentionRNN: A Structured Spatial Attention Mechanism
AttentionRNN: A Structured Spatial Attention Mechanism
Siddhesh Khandelwal
Leonid Sigal
71
3
0
22 May 2019
What Would You Expect? Anticipating Egocentric Actions with
  Rolling-Unrolling LSTMs and Modality Attention
What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention
Antonino Furnari
G. Farinella
EgoV
141
175
0
22 May 2019
A Neural, Interactive-predictive System for Multimodal Sequence to
  Sequence Tasks
A Neural, Interactive-predictive System for Multimodal Sequence to Sequence Tasks
Álvaro Peris
F. Casacuberta
48
4
0
20 May 2019
Image Captioning based on Deep Learning Methods: A Survey
Image Captioning based on Deep Learning Methods: A Survey
Yiyu Wang
Jungang Xu
Yingfei Sun
Xianpei Han
VLM
44
7
0
20 May 2019
Less Memory, Faster Speed: Refining Self-Attention Module for Image
  Reconstruction
Less Memory, Faster Speed: Refining Self-Attention Module for Image Reconstruction
Zheng Wang
Jianwu Li
Ge Song
Tieling Li
28
2
0
20 May 2019
Previous
123...414243...697071
Next