Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Implications of Human Irrationality for Reinforcement Learning
Haiyang Chen
H. Chang
Andrew Howes
57
1
0
07 Jun 2020
Attention-Based Deep Learning Framework for Human Activity Recognition with User Adaptation
Davide Buffelli
Fabio Vandin
HAI
49
44
0
06 Jun 2020
MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level
Amar Shrestha
Krittaphat Pugdeethosapol
Haowen Fang
Qinru Qiu
ObjD
33
2
0
06 Jun 2020
Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation
Mingjie Li
Fuyu Wang
Xiaojun Chang
Xiaodan Liang
MedIm
93
107
0
06 Jun 2020
Audio Captioning using Gated Recurrent Units
Aysegül Özkaya Eren
M. Sert
74
10
0
05 Jun 2020
Pick-Object-Attack: Type-Specific Adversarial Attack for Object Detection
Omid Mohamad Nezami
Akshay Chaturvedi
Mark Dras
Utpal Garain
AAML ObjD
61
19
0
05 Jun 2020
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning
Alessandro Suglia
Ioannis Konstas
Andrea Vanzo
E. Bastianelli
Desmond Elliott
Stella Frank
Oliver Lemon
62
16
0
03 Jun 2020
Transforming Multi-Concept Attention into Video Summarization
Yen-Ting Liu
Yu-Jhe Li
Y. Wang
52
19
0
02 Jun 2020
Cascaded Text Generation with Markov Transformers
Yuntian Deng
Alexander M. Rush
53
13
0
01 Jun 2020
Artificial neural networks for neuroscientists: A primer
G. R. Yang
Xiao-Jing Wang
114
255
0
01 Jun 2020
Pedestrian Tracking with Gated Recurrent Units and Attention Mechanisms
Mahdi Elhousni
Xinming Huang
23
0
0
31 May 2020
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
91
75
0
31 May 2020
Explainable Artificial Intelligence: a Systematic Review
Giulia Vilone
Luca Longo
XAI
123
271
0
29 May 2020
Stereo Vision Based Single-Shot 6D Object Pose Estimation for Bin-Picking by a Robot Manipulator
Y. Nakano
44
5
0
28 May 2020
Network-to-Network Translation with Conditional Invertible Neural Networks
Robin Rombach
Patrick Esser
Bjorn Ommer
40
3
0
27 May 2020
TIME: Text and Image Mutual-Translation Adversarial Networks
Bingchen Liu
Kunpeng Song
Yizhe Zhu
Gerard de Melo
Ahmed Elgammal
65
32
0
27 May 2020
Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport
Kyle Swanson
L. Yu
Tao Lei
OT
67
37
0
27 May 2020
Visual Interest Prediction with Attentive Multi-Task Transfer Learning
Deepanway Ghosal
M. Kolekar
88
1
0
26 May 2020
Hyperspectral Image Classification with Attention Aided CNNs
Renlong Hang
Zhu Li
Qingshan Liu
Pedram Ghamisi
Shuvra S. Bhattacharyya
56
229
0
25 May 2020
Joint learning of interpretation and distillation
Jinchao Huang
Guofu Li
Zhicong Yan
Fucai Luo
Shenghong Li
FedML FAtt
19
1
0
24 May 2020
Attention-guided Context Feature Pyramid Network for Object Detection
Junxu Cao
Qi Chen
Jun Guo
Ruichao Shi
ObjD
98
89
0
23 May 2020
Focus Longer to See Better: Recursively Refined Attention for Fine-Grained Image Classification
Prateek Shroff
Tianlong Chen
Yunchao Wei
Zhangyang Wang
48
12
0
22 May 2020
Deep learning approaches for neural decoding: from CNNs to LSTMs and spikes to fMRI
J. Livezey
Joshua I. Glaser
AI4CE
111
9
0
19 May 2020
Toward Automated Classroom Observation: Multimodal Machine Learning to Estimate CLASS Positive Climate and Negative Climate
Anand Ramakrishnan
Brian Zylich
Erin Ottmar
Jennifer LoCasale-Crouch
Jacob Whitehill
48
27
0
19 May 2020
Human Sentence Processing: Recurrence or Attention?
Danny Merkx
S. Frank
53
96
0
19 May 2020
IMoJIE: Iterative Memory-Based Joint Open Information Extraction
Keshav Kolluru
Samarth Aggarwal
Vipul Rathore
Mausam
Soumen Chakrabarti
VLM
80
72
0
17 May 2020
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding
Fenglin Liu
Xuancheng Ren
Guangxiang Zhao
Chenyu You
Xuewei Ma
Xian Wu
Xu Sun
107
2
0
16 May 2020
Visual Relationship Detection using Scene Graphs: A Survey
Aniket Agarwal
Ayush Mangal
Vipul
GNN
72
21
0
16 May 2020
AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition
Afroz Ahamad
Ankit Anand
Pranesh Bhargava
38
23
0
16 May 2020
Ventral-Dorsal Neural Networks: Object Detection via Selective Attention
M. K. Ebrahimpour
Jiayun Li
Yen-Yun Yu
Jackson Reesee
Azadeh Moghtaderi
Ming-Hsuan Yang
D. Noelle
ObjD
67
20
0
15 May 2020
WW-Nets: Dual Neural Networks for Object Detection
M. K. Ebrahimpour
J. Falandays
S. Spevack
Ming-Hsuan Yang
D. Noelle
48
4
0
15 May 2020
A Novel Fusion of Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech
Shahin Amiriparian
Pawel Winokurow
Vincent Karas
Sandra Ottl
Maurice Gerczuk
Björn W. Schuller
53
6
0
15 May 2020
ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language
Zhe Wang
Zhiyuan Fang
Jun Wang
Yezhou Yang
110
159
0
15 May 2020
Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions
Xiaochuang Han
Byron C. Wallace
Yulia Tsvetkov
MILM FAtt AAML TDI
107
175
0
14 May 2020
Memory Controlled Sequential Self Attention for Sound Recognition
Arjun Pankajakshan
Helen L. Bear
Vinod Subramanian
Emmanouil Benetos
13
2
0
13 May 2020
A Deep Learning Approach for Automatic Detection of Fake News
Tanik Saikh
Arkadipta De
Asif Ekbal
P. Bhattacharyya
59
34
0
11 May 2020
Attentional Bottleneck: Towards an Interpretable Deep Driving Network
Jinkyu Kim
Mayank Bansal
99
13
0
08 May 2020
Synergistic Learning of Lung Lobe Segmentation and Hierarchical Multi-Instance Classification for Automated Severity Assessment of COVID-19 in CT Images
Kelei He
Wei Zhao
Xingzhi Xie
Wen Ji
Mingxia Liu
...
F. Shi
Yang Gao
Jun Liu
Junfeng Zhang
Dinggang Shen
83
98
0
08 May 2020
Text Synopsis Generation for Egocentric Videos
Aidean Sharghi
N. Lobo
M. Shah
DiffM EgoV
18
1
0
08 May 2020
Spatio-Temporal Event Segmentation and Localization for Wildlife Extended Videos
R. Mounir
R. Gula
J. Theuerkauf
Sudeep Sarkar
45
0
0
05 May 2020
Mind the Gap: On Bridging the Semantic Gap between Machine Learning and Information Security
Michael R. Smith
Nicholas T. Johnson
J. Ingram
A. Carbajal
Ramyaa
Evelyn Domschot
Christopher C. Lamb
Stephen J Verzi
W. Kegelmeyer
AAML
47
4
0
04 May 2020
Depth-2 Neural Networks Under a Data-Poisoning Attack
Sayar Karmakar
Anirbit Mukherjee
Ramchandran Muthukumar
72
7
0
04 May 2020
Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video
Antonino Furnari
G. Farinella
EgoV
73
141
0
04 May 2020
Distributional Discrepancy: A Metric for Unconditional Text Generation
Ping Cai
Xingyuan Chen
Peng Jin
Hongjun Wang
Tianrui Li
36
6
0
04 May 2020
A New Data Normalization Method to Improve Dialogue Generation by Minimizing Long Tail Effect
Zhiqiang Zhan
Zifeng Hou
Yang Zhang
29
0
0
04 May 2020
Quantifying Attention Flow in Transformers
Samira Abnar
Willem H. Zuidema
229
808
0
02 May 2020
Multi-Dimensional Gender Bias Classification
Emily Dinan
Angela Fan
Ledell Yu Wu
Jason Weston
Douwe Kiela
Adina Williams
FaML
91
124
0
01 May 2020
Multi-View Self-Attention for Interpretable Drug-Target Interaction Prediction
Brighter Agyemang
Wei-Ping Wu
Michael Y. Kpiebaareh
Zhihua Lei
Ebenezer Nanor
Lei Chen
51
29
0
01 May 2020
Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage
Ashish V. Thapliyal
Radu Soricut
49
12
0
01 May 2020
Hide-and-Seek: A Template for Explainable AI
Thanos Tagaris
A. Stafylopatis
33
6
0
30 Apr 2020