ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,520 papers shown
Title
Saliency-Guided Attention Network for Image-Sentence Matching
Saliency-Guided Attention Network for Image-Sentence Matching
Zhong Ji
Haoran Wang
Jiawei Han
Yanwei Pang
71
89
0
20 Apr 2019
Salient Object Detection in the Deep Learning Era: An In-Depth Survey
Salient Object Detection in the Deep Learning Era: An In-Depth Survey
Wenguan Wang
Qiuxia Lai
Huazhu Fu
Jianbing Shen
Haibin Ling
Ruigang Yang
108
617
0
19 Apr 2019
Emergence of Compositional Language with Deep Generational Transmission
Emergence of Compositional Language with Deep Generational Transmission
Michael Cogswell
Jiasen Lu
Stefan Lee
Devi Parikh
Dhruv Batra
117
49
0
19 Apr 2019
Attentive Single-Tasking of Multiple Tasks
Attentive Single-Tasking of Multiple Tasks
Kevis-Kokitsi Maninis
Ilija Radosavovic
Iasonas Kokkinos
204
251
0
18 Apr 2019
Learning to Collocate Neural Modules for Image Captioning
Learning to Collocate Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Jianfei Cai
71
78
0
18 Apr 2019
DeepNovoV2: Better de novo peptide sequencing with deep learning
DeepNovoV2: Better de novo peptide sequencing with deep learning
Rui Qiao
Ngoc Hieu Tran
L. Xin
B. Shan
Ming Li
A. Ghodsi
37
17
0
17 Apr 2019
Aggregation Cross-Entropy for Sequence Recognition
Aggregation Cross-Entropy for Sequence Recognition
Zecheng Xie
Yaoxiong Huang
Yuanzhi Zhu
Lianwen Jin
Yuliang Liu
Lele Xie
95
92
0
17 Apr 2019
BS-Nets: An End-to-End Framework For Band Selection of Hyperspectral
  Image
BS-Nets: An End-to-End Framework For Band Selection of Hyperspectral Image
Yaoming Cai
Xiaobo Liu
Z. Cai
49
192
0
17 Apr 2019
CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing
CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing
Xin Jin
Cuiling Lan
Wenjun Zeng
Zhizheng Zhang
Zhibo Chen
74
7
0
17 Apr 2019
Explainability in Human-Agent Systems
Explainability in Human-Agent Systems
A. Rosenfeld
A. Richardson
XAI
86
207
0
17 Apr 2019
Neural Message Passing for Multi-Label Classification
Neural Message Passing for Multi-Label Classification
Jack Lanchantin
Arshdeep Sekhon
Yanjun Qi
66
38
0
17 Apr 2019
Real Image Denoising with Feature Attention
Real Image Denoising with Feature Attention
Saeed Anwar
Nick Barnes
110
513
0
16 Apr 2019
Latent Code and Text-based Generative Adversarial Networks for Soft-text
  Generation
Latent Code and Text-based Generative Adversarial Networks for Soft-text Generation
Md. Akmal Haidar
Mehdi Rezagholizadeh
Alan Do-Omri
Ahmad Rashid
GAN
68
15
0
15 Apr 2019
Self-critical n-step Training for Image Captioning
Self-critical n-step Training for Image Captioning
Junlong Gao
Shiqi Wang
Shanshe Wang
Siwei Ma
Wen Gao
97
55
0
15 Apr 2019
An Empirical Investigation of Global and Local Normalization for
  Recurrent Neural Sequence Models Using a Continuous Relaxation to Beam Search
An Empirical Investigation of Global and Local Normalization for Recurrent Neural Sequence Models Using a Continuous Relaxation to Beam Search
Kartik Goyal
Chris Dyer
Taylor Berg-Kirkpatrick
65
16
0
15 Apr 2019
IIT (BHU) Varanasi at MSR-SRST 2018: A Language Model Based Approach for
  Natural Language Generation
IIT (BHU) Varanasi at MSR-SRST 2018: A Language Model Based Approach for Natural Language Generation
Shreyansh Singh
Avi Chawla
Ayush Sharma
Anil Kumar Singh
21
3
0
12 Apr 2019
Factor Graph Attention
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
Alex Schwing
132
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
Alex Schwing
Tamir Hazan
89
71
0
11 Apr 2019
An Empirical Study of Spatial Attention Mechanisms in Deep Networks
An Empirical Study of Spatial Attention Mechanisms in Deep Networks
Xizhou Zhu
Dazhi Cheng
Zheng Zhang
Stephen Lin
Jifeng Dai
94
420
0
11 Apr 2019
FTGAN: A Fully-trained Generative Adversarial Networks for Text to Face
  Generation
FTGAN: A Fully-trained Generative Adversarial Networks for Text to Face Generation
Xiang Chen
Lingbo Qing
Xiaohai He
Xiaodong Luo
Yining Xu
GANCVBM
65
34
0
11 Apr 2019
Reasoning Visual Dialogs with Structural and Partial Observations
Reasoning Visual Dialogs with Structural and Partial Observations
Zilong Zheng
Wenguan Wang
Siyuan Qi
Song-Chun Zhu
128
117
0
11 Apr 2019
UniVSE: Robust Visual Semantic Embeddings via Structured Semantic
  Representations
UniVSE: Robust Visual Semantic Embeddings via Structured Semantic Representations
Hao Wu
Jiayuan Mao
Yufeng Zhang
Yuning Jiang
Lei Li
Weiwei Sun
Wei-Ying Ma
33
8
0
11 Apr 2019
Knowledge Squeezed Adversarial Network Compression
Knowledge Squeezed Adversarial Network Compression
Changyong Shu
Li Peng
Xie Yuan
Yanyun Qu
Longquan Dai
Lizhuang Ma
GAN
75
11
0
10 Apr 2019
Identifying Sub-Phenotypes of Acute Kidney Injury using Structured and
  Unstructured Electronic Health Record Data with Memory Networks
Identifying Sub-Phenotypes of Acute Kidney Injury using Structured and Unstructured Electronic Health Record Data with Memory Networks
Zhenxing Xu
Jingyuan Chou
Xi Sheryl Zhang
Yuan Luo
T. Isakova
...
Richard C. Kiefer
J. Pacheco
Luke Rasmussen
Jyotishman Pathak
Fei Wang
74
54
0
10 Apr 2019
Context-Aware Embeddings for Automatic Art Analysis
Context-Aware Embeddings for Automatic Art Analysis
Noa Garcia
B. Renoust
Yuta Nakashima
47
52
0
10 Apr 2019
Cross-Modal Self-Attention Network for Referring Image Segmentation
Cross-Modal Self-Attention Network for Referring Image Segmentation
Linwei Ye
Mrigank Rochan
Zhi Liu
Yang Wang
EgoV
87
478
0
09 Apr 2019
Attention-based Multi-instance Neural Network for Medical Diagnosis from
  Incomplete and Low Quality Data
Attention-based Multi-instance Neural Network for Medical Diagnosis from Incomplete and Low Quality Data
Zeyuan Wang
Josiah Poon
Shiding Sun
S. Poon
93
26
0
09 Apr 2019
Giving Attention to the Unexpected: Using Prosody Innovations in
  Disfluency Detection
Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection
Vicky Zayats
Mari Ostendorf
57
30
0
08 Apr 2019
L2AE-D: Learning to Aggregate Embeddings for Few-shot Learning with
  Meta-level Dropout
L2AE-D: Learning to Aggregate Embeddings for Few-shot Learning with Meta-level Dropout
Heda Song
M. Torres
Ender Ozcan
I. Triguero
57
8
0
08 Apr 2019
Streamlined Dense Video Captioning
Streamlined Dense Video Captioning
Jonghwan Mun
L. Yang
Zhou Ren
N. Xu
Bohyung Han
94
144
0
08 Apr 2019
SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for
  Unsupervised Abstractive Sentence Compression
SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression
Christos Baziotis
Ion Androutsopoulos
Ioannis Konstas
Alexandros Potamianos
76
83
0
07 Apr 2019
Learning to Learn Relation for Important People Detection in Still
  Images
Learning to Learn Relation for Important People Detection in Still Images
Wei-Hong Li
Fa-Ting Hong
Weishi Zheng
3DPC3DH
57
27
0
07 Apr 2019
Doodle to Search: Practical Zero-Shot Sketch-based Image Retrieval
Doodle to Search: Practical Zero-Shot Sketch-based Image Retrieval
S. Dey
Pau Riba
Anjan Dutta
Josep Llados
Yi-Zhe Song
94
181
0
06 Apr 2019
Modeling Point Clouds with Self-Attention and Gumbel Subset Sampling
Modeling Point Clouds with Self-Attention and Gumbel Subset Sampling
Jiancheng Yang
Qiang Zhang
Bingbing Ni
Linguo Li
Jinxian Liu
Mengdie Zhou
Qi Tian
3DPC
95
382
0
06 Apr 2019
Attention Distillation for Learning Video Representations
Attention Distillation for Learning Video Representations
Miao Liu
Xin Chen
Yun C. Zhang
Yin Li
James M. Rehg
66
2
0
05 Apr 2019
Information Aggregation for Multi-Head Attention with
  Routing-by-Agreement
Information Aggregation for Multi-Head Attention with Routing-by-Agreement
Jian Li
Baosong Yang
Zi-Yi Dou
Xing Wang
Michael R. Lyu
Zhaopeng Tu
82
46
0
05 Apr 2019
Relation-Aware Global Attention for Person Re-identification
Relation-Aware Global Attention for Person Re-identification
Zhizheng Zhang
Cuiling Lan
Wenjun Zeng
Xin Jin
Zhibo Chen
3DPC
118
485
0
05 Apr 2019
Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval
Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval
Yadan Luo
Ziwei Wang
Zi Huang
Yang Yang
Huimin Lu
44
7
0
05 Apr 2019
An Attentive Survey of Attention Models
An Attentive Survey of Attention Models
S. Chaudhari
Varun Mithal
Gungor Polatkan
R. Ramanath
200
666
0
05 Apr 2019
Clinically Accurate Chest X-Ray Report Generation
Clinically Accurate Chest X-Ray Report Generation
Guanxiong Liu
T. Hsu
Matthew B. A. McDermott
Willie Boag
W. Weng
Peter Szolovits
Marzyeh Ghassemi
MedIm
134
279
0
04 Apr 2019
End-to-End Video Captioning
End-to-End Video Captioning
Silvio Olivastri
Gurkirt Singh
Fabio Cuzzolin
70
18
0
04 Apr 2019
A Simple Joint Model for Improved Contextual Neural Lemmatization
A Simple Joint Model for Improved Contextual Neural Lemmatization
Chaitanya Malaviya
Shijie Wu
Ryan Cotterell
99
28
0
04 Apr 2019
Revisiting Visual Grounding
Revisiting Visual Grounding
E. Conser
Kennedy Hahn
Chandler M. Watson
Melanie Mitchell
49
5
0
03 Apr 2019
Medical device surveillance with electronic health records
Medical device surveillance with electronic health records
A. Callahan
Jason Alan Fries
Christopher Ré
J. Huddleston
N. Giori
Scott L. Delp
N. Shah
79
54
0
03 Apr 2019
Good News, Everyone! Context driven entity-aware captioning for news
  images
Good News, Everyone! Context driven entity-aware captioning for news images
Ali Furkan Biten
Lluís Gómez
Marçal Rusiñol
Dimosthenis Karatzas
89
141
0
02 Apr 2019
Aiding Intra-Text Representations with Visual Context for Multimodal
  Named Entity Recognition
Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition
Omer Arshad
I. Gallo
Shah Nawaz
Alessandro Calefati
44
43
0
02 Apr 2019
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents
Christian Rupprecht
Cyril Ibrahim
C. Pal
96
32
0
02 Apr 2019
Learning Good Representation via Continuous Attention
Learning Good Representation via Continuous Attention
Liang Zhao
Wenyuan Xu
27
0
0
29 Mar 2019
Counting with Focus for Free
Counting with Focus for Free
Zenglin Shi
Pascal Mettes
Cees G. M. Snoek
3DV3DPC
84
109
0
28 Mar 2019
Describing like humans: on diversity in image captioning
Describing like humans: on diversity in image captioning
Qingzhong Wang
Antoni B. Chan
104
99
0
28 Mar 2019
Previous
123...434445...697071
Next