ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Black-Box Attacks against RNN based Malware Detection Algorithms
Black-Box Attacks against RNN based Malware Detection Algorithms
Weiwei Hu
Ying Tan
66
150
0
23 May 2017
Local Monotonic Attention Mechanism for End-to-End Speech and Language
  Processing
Local Monotonic Attention Mechanism for End-to-End Speech and Language Processing
Andros Tjandra
S. Sakti
Satoshi Nakamura
49
6
0
23 May 2017
pix2code: Generating Code from a Graphical User Interface Screenshot
pix2code: Generating Code from a Graphical User Interface Screenshot
Tony Beltramelli
78
277
0
22 May 2017
A Regularized Framework for Sparse and Structured Neural Attention
A Regularized Framework for Sparse and Structured Neural Attention
Vlad Niculae
Mathieu Blondel
94
100
0
22 May 2017
Learning Convolutional Text Representations for Visual Question
  Answering
Learning Convolutional Text Representations for Visual Question Answering
Zhengyang Wang
Shuiwang Ji
FAtt
71
15
0
18 May 2017
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement
  Learning
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
Nat Dilokthanakul
Christos Kaplanis
Nick Pawlowski
Murray Shanahan
87
92
0
18 May 2017
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
H. Ben-younes
Rémi Cadène
Matthieu Cord
Nicolas Thome
176
584
0
18 May 2017
Learning Hard Alignments with Variational Inference
Learning Hard Alignments with Variational Inference
Dieterich Lawson
Chung-Cheng Chiu
George Tucker
Colin Raffel
Kevin Swersky
Navdeep Jaitly
DRL
62
29
0
16 May 2017
Detecting Statistical Interactions from Neural Network Weights
Detecting Statistical Interactions from Neural Network Weights
Michael Tsang
Dehua Cheng
Yan Liu
99
193
0
14 May 2017
Survey of Visual Question Answering: Datasets and Techniques
Survey of Visual Question Answering: Datasets and Techniques
A. Gupta
57
38
0
10 May 2017
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for
  Reading Comprehension
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
304
2,695
0
09 May 2017
CHAM: action recognition using convolutional hierarchical attention
  model
CHAM: action recognition using convolutional hierarchical attention model
Shiyang Yan
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
63
8
0
09 May 2017
You said that?
You said that?
Joon Son Chung
A. Jamaludin
Andrew Zisserman
CVBM
77
260
0
08 May 2017
Image Annotation using Multi-Layer Sparse Coding
Image Annotation using Multi-Layer Sparse Coding
Amara Tariq
H. Foroosh
31
2
0
06 May 2017
Motion Prediction Under Multimodality with Conditional Stochastic
  Networks
Motion Prediction Under Multimodality with Conditional Stochastic Networks
Katerina Fragkiadaki
Jonathan Huang
Alexander A. Alemi
Sudheendra Vijayanarasimhan
Susanna Ricco
Rahul Sukthankar
3DH
98
25
0
05 May 2017
Recurrent Soft Attention Model for Common Object Recognition
Recurrent Soft Attention Model for Common Object Recognition
Liliang Ren
23
1
0
04 May 2017
Weakly-supervised Visual Grounding of Phrases with Linguistic Structures
Weakly-supervised Visual Grounding of Phrases with Linguistic Structures
Fanyi Xiao
Leonid Sigal
Yong Jae Lee
87
139
0
03 May 2017
Show, Adapt and Tell: Adversarial Training of Cross-domain Image
  Captioner
Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner
Tseng-Hung Chen
Yuan-Hong Liao
Ching-Yao Chuang
W. Hsu
Jianlong Fu
Min Sun
105
142
0
02 May 2017
Dense-Captioning Events in Videos
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
208
1,259
0
02 May 2017
Speech-Based Visual Question Answering
Speech-Based Visual Question Answering
Ted Zhang
Dengxin Dai
Tinne Tuytelaars
Marie-Francine Moens
Luc Van Gool
85
25
0
01 May 2017
Tree-Structured Neural Machine for Linguistics-Aware Sentence Generation
Tree-Structured Neural Machine for Linguistics-Aware Sentence Generation
Ganbin Zhou
Ping Luo
Rongyu Cao
Yijun Xiao
Fen Lin
Bo Chen
Qing He
63
4
0
30 Apr 2017
Learning to Ask: Neural Question Generation for Reading Comprehension
Learning to Ask: Neural Question Generation for Reading Comprehension
Xinya Du
Junru Shao
Claire Cardie
3DV
163
664
0
29 Apr 2017
Mapping Instructions and Visual Observations to Actions with
  Reinforcement Learning
Mapping Instructions and Visual Observations to Actions with Reinforcement Learning
Dipendra Kumar Misra
John Langford
Yoav Artzi
86
247
0
28 Apr 2017
Learning Structured Natural Language Representations for Semantic
  Parsing
Learning Structured Natural Language Representations for Semantic Parsing
Jianpeng Cheng
Siva Reddy
V. Saraswat
Mirella Lapata
NAI
157
76
0
27 Apr 2017
Paying Attention to Descriptions Generated by Image Captioning Models
Paying Attention to Descriptions Generated by Image Captioning Models
Hamed R. Tavakoli
Rakshith Shetty
Ali Borji
Jorma T. Laaksonen
80
79
0
24 Apr 2017
Lexically Constrained Decoding for Sequence Generation Using Grid Beam
  Search
Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search
Chris Hokamp
Qun Liu
86
379
0
24 Apr 2017
Being Negative but Constructively: Lessons Learnt from Creating Better
  Visual Question Answering Datasets
Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets
Wei-Lun Chao
Hexiang Hu
Fei Sha
95
37
0
24 Apr 2017
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition
Yufei Wang
Zhe Lin
Xiaohui Shen
Scott D. Cohen
G. Cottrell
89
106
0
23 Apr 2017
Differentiable Scheduled Sampling for Credit Assignment
Differentiable Scheduled Sampling for Credit Assignment
Kartik Goyal
Chris Dyer
Taylor Berg-Kirkpatrick
92
40
0
23 Apr 2017
Residual Attention Network for Image Classification
Residual Attention Network for Image Classification
Fei Wang
Mengqing Jiang
Chao Qian
Shuo Yang
Cheng Li
Honggang Zhang
Xiaogang Wang
Xiaoou Tang
128
3,320
0
23 Apr 2017
Attention Strategies for Multi-Source Sequence-to-Sequence Learning
Attention Strategies for Multi-Source Sequence-to-Sequence Learning
Jindrich Libovický
Jindřich Helcl
AIMat
87
183
0
21 Apr 2017
Attend to You: Personalized Image Captioning with Context Sequence
  Memory Networks
Attend to You: Personalized Image Captioning with Context Sequence Memory Networks
C. C. Park
Byeongchang Kim
Gunhee Kim
76
173
0
21 Apr 2017
Solar Power Plant Detection on Multi-Spectral Satellite Imagery using
  Weakly-Supervised CNN with Feedback Features and m-PCNN Fusion
Solar Power Plant Detection on Multi-Spectral Satellite Imagery using Weakly-Supervised CNN with Feedback Features and m-PCNN Fusion
Nevrez Imamoglu
Motoki Kimura
H. Miyamoto
A. Fujita
Ryosuke Nakamura
45
12
0
21 Apr 2017
Call Attention to Rumors: Deep Attention Based Recurrent Neural Networks
  for Early Rumor Detection
Call Attention to Rumors: Deep Attention Based Recurrent Neural Networks for Early Rumor Detection
Tong Chen
Lin Wu
Xue Li
Jun Zhang
Hongzhi Yin
Yang Wang
86
277
0
20 Apr 2017
Accurate Single Stage Detector Using Recurrent Rolling Convolution
Accurate Single Stage Detector Using Recurrent Rolling Convolution
Jimmy S. J. Ren
Xiaohao Chen
Jianbo Liu
Wenxiu Sun
Jiahao Pang
Qiong Yan
Yu-Wing Tai
Li Xu
ObjD
77
281
0
19 Apr 2017
Beating Atari with Natural Language Guided Reinforcement Learning
Beating Atari with Natural Language Guided Reinforcement Learning
Russell Kaplan
Chris Sauer
A. Sosa
LM&Ro
86
69
0
18 Apr 2017
Learning Character-level Compositionality with Visual Features
Learning Character-level Compositionality with Visual Features
Frederick Liu
Han Lu
Chieh Lo
Graham Neubig
CoGe
88
64
0
17 Apr 2017
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal
  Attentions
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions
Amir Mazaheri
Dong Zhang
M. Shah
56
12
0
15 Apr 2017
Integrating Scene Text and Visual Appearance for Fine-Grained Image
  Classification
Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification
X. Bai
Mingkun Yang
Pengyuan Lyu
Yongchao Xu
Jiebo Luo
98
75
0
15 Apr 2017
Neural Extractive Summarization with Side Information
Neural Extractive Summarization with Side Information
Shashi Narayan
Nikos Papasarantopoulos
Shay B. Cohen
Mirella Lapata
157
74
0
14 Apr 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Y. Jang
Yale Song
Youngjae Yu
Youngjin Kim
Gunhee Kim
95
562
0
14 Apr 2017
DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting
  Agents
DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents
Namhoon Lee
Wongun Choi
Paul Vernaza
Chris Choy
Philip Torr
Manmohan Chandraker
AI4TS
110
996
0
14 Apr 2017
Get To The Point: Summarization with Pointer-Generator Networks
Get To The Point: Summarization with Pointer-Generator Networks
A. See
Peter J. Liu
Christopher D. Manning
3DPC
511
4,033
0
14 Apr 2017
Spatial Memory for Context Reasoning in Object Detection
Spatial Memory for Context Reasoning in Object Detection
Xinlei Chen
Abhinav Gupta
ObjD
106
166
0
13 Apr 2017
Room for improvement in automatic image description: an error analysis
Room for improvement in automatic image description: an error analysis
Emiel van Miltenburg
Desmond Elliott
3DV
70
12
0
13 Apr 2017
Discriminative Bimodal Networks for Visual Localization and Detection
  with Natural Language Queries
Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries
Y. Zhang
Luyao Yuan
Yijie Guo
Zhiyuan He
I-An Huang
Honglak Lee
ObjD
92
57
0
12 Apr 2017
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Zhou Ren
Xiaoyu Wang
Ning Zhang
Xutao Lv
Li Li
65
324
0
12 Apr 2017
Creativity: Generating Diverse Questions using Variational Autoencoders
Creativity: Generating Diverse Questions using Variational Autoencoders
Unnat Jain
Ziyu Zhang
Alex Schwing
79
152
0
11 Apr 2017
Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question
  Answering
Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering
V. Kazemi
Ali Elqursh
OOD
89
185
0
11 Apr 2017
Pay Attention to Those Sets! Learning Quantification from Images
Pay Attention to Those Sets! Learning Quantification from Images
Ionut-Teodor Sorodoc
Sandro Pezzelle
Aurélie Herbelot
Mariella Dimiccoli
Raffaella Bernardi
42
0
0
10 Apr 2017
Previous
123...616263...697071
Next