ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXivPDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,510 papers shown
Title
Domain Adaptation for Neural Networks by Parameter Augmentation
Domain Adaptation for Neural Networks by Parameter Augmentation
Yusuke Watanabe
Kazuma Hashimoto
Yoshimasa Tsuruoka
OOD
24
6
0
01 Jul 2016
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes
Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes
Çağlar Gülçehre
A. Chandar
Kyunghyun Cho
Yoshua Bengio
20
64
0
30 Jun 2016
"Show me the cup": Reference with Continuous Representations
"Show me the cup": Reference with Continuous Representations
Gemma Boleda
Sebastian Padó
Marco Baroni
26
3
0
28 Jun 2016
Diversified Visual Attention Networks for Fine-Grained Object
  Classification
Diversified Visual Attention Networks for Fine-Grained Object Classification
Bo Zhao
Xiao-Jun Wu
Jiashi Feng
Qiang Peng
Shuicheng Yan
19
365
0
28 Jun 2016
Sequence-Level Knowledge Distillation
Sequence-Level Knowledge Distillation
Yoon Kim
Alexander M. Rush
47
1,101
0
25 Jun 2016
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation
  Tasks
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks
Jindrich Libovický
Jindřich Helcl
Marek Tlustý
Pavel Pecina
Ondrej Bojar
11
67
0
23 Jun 2016
LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in
  Recurrent Neural Networks
LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks
Hendrik Strobelt
Sebastian Gehrmann
Hanspeter Pfister
Alexander M. Rush
HAI
34
83
0
23 Jun 2016
Tagger: Deep Unsupervised Perceptual Grouping
Tagger: Deep Unsupervised Perceptual Grouping
Klaus Greff
Antti Rasmus
Mathias Berglund
T. Hao
Jürgen Schmidhuber
Harri Valpola
OCL
32
161
0
21 Jun 2016
Question Relevance in VQA: Identifying Non-Visual And False-Premise
  Questions
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
Arijit Ray
Gordon A. Christie
Joey Tianyi Zhou
Dhruv Batra
Devi Parikh
27
56
0
21 Jun 2016
Drawing and Recognizing Chinese Characters with Recurrent Neural Network
Drawing and Recognizing Chinese Characters with Recurrent Neural Network
Xu-Yao Zhang
Fei Yin
Yanming Zhang
Cheng-Lin Liu
Yoshua Bengio
50
320
0
21 Jun 2016
Using Visual Analytics to Interpret Predictive Machine Learning Models
Using Visual Analytics to Interpret Predictive Machine Learning Models
Josua Krause
Adam Perer
E. Bertini
HAI
36
65
0
17 Jun 2016
FVQA: Fact-based Visual Question Answering
FVQA: Fact-based Visual Question Answering
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
CoGe
39
455
0
17 Jun 2016
Model-Agnostic Interpretability of Machine Learning
Model-Agnostic Interpretability of Machine Learning
Marco Tulio Ribeiro
Sameer Singh
Carlos Guestrin
FAtt
FaML
31
833
0
16 Jun 2016
A Correlational Encoder Decoder Architecture for Pivot Based Sequence
  Generation
A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation
Amrita Saha
Mitesh M. Khapra
A. Chandar
Janarthanan Rajendran
Kyunghyun Cho
22
18
0
15 Jun 2016
Unsupervised Learning of Predictors from Unpaired Input-Output Samples
Unsupervised Learning of Predictors from Unpaired Input-Output Samples
Jianshu Chen
Po-Sen Huang
Xiaodong He
Jianfeng Gao
Li Deng
OOD
SSL
29
8
0
15 Jun 2016
Bidirectional Long-Short Term Memory for Video Description
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
39
60
0
15 Jun 2016
Watch What You Just Said: Image Captioning with Text-Conditional
  Attention
Watch What You Just Said: Image Captioning with Text-Conditional Attention
Luowei Zhou
Chenliang Xu
Parker A. Koch
Jason J. Corso
VLM
22
44
0
15 Jun 2016
End-to-End Comparative Attention Networks for Person Re-identification
End-to-End Comparative Attention Networks for Person Re-identification
Hao Liu
Jiashi Feng
Meibin Qi
Jianguo Jiang
Shuicheng Yan
25
575
0
14 Jun 2016
Rationalizing Neural Predictions
Rationalizing Neural Predictions
Tao Lei
Regina Barzilay
Tommi Jaakkola
59
805
0
13 Jun 2016
Training Recurrent Answering Units with Joint Loss Minimization for VQA
Training Recurrent Answering Units with Joint Loss Minimization for VQA
Hyeonwoo Noh
Bohyung Han
32
71
0
12 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and
  Delexicalized Data
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data
Shikhar Sharma
Jing He
Kaheer Suleman
Hannes Schulz
Philip Bachman
24
29
0
11 Jun 2016
Human Attention in Visual Question Answering: Do Humans and Deep
  Networks Look at the Same Regions?
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
Abhishek Das
Harsh Agrawal
C. L. Zitnick
Devi Parikh
Dhruv Batra
32
465
0
11 Jun 2016
Conditional Generation and Snapshot Learning in Neural Dialogue Systems
Conditional Generation and Snapshot Learning in Neural Dialogue Systems
Tsung-Hsien Wen
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Pei-hao Su
Stefan Ultes
David Vandyke
S. Young
25
78
0
10 Jun 2016
Sequence-to-Sequence Learning as Beam-Search Optimization
Sequence-to-Sequence Learning as Beam-Search Optimization
Sam Wiseman
Alexander M. Rush
44
590
0
09 Jun 2016
Progressive Attention Networks for Visual Attribute Prediction
Progressive Attention Networks for Visual Attribute Prediction
Paul Hongsuck Seo
Zhe-nan Lin
Scott D. Cohen
Xiaohui Shen
Bohyung Han
24
41
0
08 Jun 2016
SE3-Nets: Learning Rigid Body Motion using Deep Neural Networks
SE3-Nets: Learning Rigid Body Motion using Deep Neural Networks
Arunkumar Byravan
Dieter Fox
3DPC
16
268
0
08 Jun 2016
Iterative Alternating Neural Attention for Machine Reading
Iterative Alternating Neural Attention for Machine Reading
Alessandro Sordoni
Philip Bachman
Adam Trischler
Yoshua Bengio
CLL
AIMat
32
118
0
07 Jun 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
167
1,465
0
06 Jun 2016
Attention Correctness in Neural Image Captioning
Attention Correctness in Neural Image Captioning
Chenxi Liu
Junhua Mao
Fei Sha
Alan Yuille
3DV
38
220
0
31 May 2016
End-to-End Instance Segmentation with Recurrent Attention
End-to-End Instance Segmentation with Recurrent Attention
Mengye Ren
R. Zemel
SSeg
30
61
0
30 May 2016
Does Multimodality Help Human and Machine for Translation and Image
  Captioning?
Does Multimodality Help Human and Machine for Translation and Image Captioning?
Ozan Caglayan
Walid Aransa
Yaxing Wang
Marc Masana
Mercedes García-Martínez
Fethi Bougares
Loïc Barrault
Joost van de Weijer
30
85
0
30 May 2016
Video Summarization with Long Short-term Memory
Video Summarization with Long Short-term Memory
Ke Zhang
Wei-Lun Chao
Fei Sha
Kristen Grauman
38
682
0
26 May 2016
Review Networks for Caption Generation
Review Networks for Caption Generation
Zhilin Yang
Ye Yuan
Yuexin Wu
Ruslan Salakhutdinov
William W. Cohen
3DV
40
85
0
25 May 2016
BattRAE: Bidimensional Attention-Based Recursive Autoencoders for
  Learning Bilingual Phrase Embeddings
BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings
Biao Zhang
Deyi Xiong
Jinsong Su
11
20
0
25 May 2016
Localizing by Describing: Attribute-Guided Attention Localization for
  Fine-Grained Recognition
Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition
Xiao-Chang Liu
Jiang Wang
Shilei Wen
Errui Ding
Yuanqing Lin
13
76
0
20 May 2016
Generative Adversarial Text to Image Synthesis
Generative Adversarial Text to Image Synthesis
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
GAN
58
3,127
0
17 May 2016
Learning Deep Representations of Fine-grained Visual Descriptions
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCL
VLM
176
840
0
17 May 2016
Movie Description
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
32
354
0
12 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
24
101
0
09 May 2016
Chained Predictions Using Convolutional Neural Networks
Chained Predictions Using Convolutional Neural Networks
Georgia Gkioxari
Alexander Toshev
Navdeep Jaitly
BDL
32
190
0
08 May 2016
DeepPicker: a Deep Learning Approach for Fully Automated Particle
  Picking in Cryo-EM
DeepPicker: a Deep Learning Approach for Fully Automated Particle Picking in Cryo-EM
Feng Wang
Huichao Gong
Gaochao liu
Meijing Li
Chuangye Yan
Tian Xia
Xueming Li
Jianyang Zeng
36
168
0
06 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking
Leveraging Visual Question Answering for Image-Caption Ranking
Xiaoyu Lin
Devi Parikh
CoGe
22
83
0
04 May 2016
Multi30K: Multilingual English-German Image Descriptions
Multi30K: Multilingual English-German Image Descriptions
Desmond Elliott
Stella Frank
K. Simaán
Lucia Specia
VLM
27
581
0
02 May 2016
Look-ahead before you leap: end-to-end active recognition by forecasting
  the effect of motion
Look-ahead before you leap: end-to-end active recognition by forecasting the effect of motion
Dinesh Jayaraman
Kristen Grauman
17
91
0
30 Apr 2016
Joint Line Segmentation and Transcription for End-to-End Handwritten
  Paragraph Recognition
Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition
Théodore Bluche
AI4TS
20
189
0
28 Apr 2016
Dialog-based Language Learning
Dialog-based Language Learning
Jason Weston
LLMAG
24
108
0
20 Apr 2016
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length
  Image Tagging
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length Image Tagging
Jiren Jin
Hideki Nakayama
3DV
VLM
30
69
0
18 Apr 2016
Parallelizing Word2Vec in Shared and Distributed Memory
Parallelizing Word2Vec in Shared and Distributed Memory
Shihao Ji
N. Satish
Sheng Li
Pradeep Dubey
VLM
MoE
19
72
0
15 Apr 2016
Learning Visual Storylines with Skipping Recurrent Neural Networks
Learning Visual Storylines with Skipping Recurrent Neural Networks
Gunnar A. Sigurdsson
Xinlei Chen
Abhinav Gupta
29
38
0
14 Apr 2016
Filling in the details: Perceiving from low fidelity images
Filling in the details: Perceiving from low fidelity images
F. Wick
Michael L. Wick
M. Pomplun
3DH
11
1
0
14 Apr 2016
Previous
123...666768697071
Next