ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4555
  4. Cited By
Show and Tell: A Neural Image Caption Generator

Show and Tell: A Neural Image Caption Generator

17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
    3DV
ArXivPDFHTML

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown
Title
Meaning guided video captioning
Meaning guided video captioning
Rushi J. Babariya
Toru Tamaki
30
3
0
12 Dec 2019
Connecting Vision and Language with Localized Narratives
Connecting Vision and Language with Localized Narratives
Jordi Pont-Tuset
J. Uijlings
Soravit Changpinyo
Radu Soricut
V. Ferrari
ObjD
36
244
0
06 Dec 2019
Synchronous Transformers for End-to-End Speech Recognition
Synchronous Transformers for End-to-End Speech Recognition
Zhengkun Tian
Jiangyan Yi
Ye Bai
J. Tao
Shuai Zhang
Zhengqi Wen
27
72
0
06 Dec 2019
Siamese Natural Language Tracker: Tracking by Natural Language
  Descriptions with Siamese Trackers
Siamese Natural Language Tracker: Tracking by Natural Language Descriptions with Siamese Trackers
Qi Feng
Vitaly Ablavsky
Qinxun Bai
Stan Sclaroff
39
17
0
04 Dec 2019
Better Understanding Hierarchical Visual Relationship for Image Caption
Better Understanding Hierarchical Visual Relationship for Image Caption
Z. Fei
31
0
0
04 Dec 2019
Assessing the Robustness of Visual Question Answering Models
Assessing the Robustness of Visual Question Answering Models
Jia-Hong Huang
Modar Alfadly
Guohao Li
Marcel Worring
AAML
OOD
28
23
0
30 Nov 2019
Multimodal Machine Translation through Visuals and Speech
Multimodal Machine Translation through Visuals and Speech
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
63
74
0
28 Nov 2019
Multimodal Attention Networks for Low-Level Vision-and-Language
  Navigation
Multimodal Attention Networks for Low-Level Vision-and-Language Navigation
Federico Landi
Lorenzo Baraldi
Marcella Cornia
M. Corsini
Rita Cucchiara
LM&Ro
16
27
0
27 Nov 2019
CRUR: Coupled-Recurrent Unit for Unification, Conceptualization and
  Context Capture for Language Representation -- A Generalization of Bi
  Directional LSTM
CRUR: Coupled-Recurrent Unit for Unification, Conceptualization and Context Capture for Language Representation -- A Generalization of Bi Directional LSTM
C. Sur
BDL
14
6
0
22 Nov 2019
Injecting Prior Knowledge into Image Caption Generation
Injecting Prior Knowledge into Image Caption Generation
A. Goel
Basura Fernando
Thanh-Son Nguyen
Hakan Bilen
23
0
0
22 Nov 2019
Orderless Recurrent Models for Multi-label Classification
Orderless Recurrent Models for Multi-label Classification
V. O. Yazici
Abel Gonzalez-Garcia
Arnau Ramisa
Bartlomiej Twardowski
Joost van de Weijer
SSL
19
92
0
22 Nov 2019
Characterizing the impact of using features extracted from pre-trained
  models on the quality of video captioning sequence-to-sequence models
Characterizing the impact of using features extracted from pre-trained models on the quality of video captioning sequence-to-sequence models
Menatallh Hammad
May Hammad
Mohamed Elshenawy
24
2
0
22 Nov 2019
Continual adaptation for efficient machine communication
Continual adaptation for efficient machine communication
Robert D. Hawkins
Minae Kwon
Dorsa Sadigh
Noah D. Goodman
CLL
32
33
0
22 Nov 2019
Reinforcing an Image Caption Generator Using Off-Line Human Feedback
Reinforcing an Image Caption Generator Using Off-Line Human Feedback
Paul Hongsuck Seo
Piyush Sharma
Tomer Levinboim
Bohyung Han
Radu Soricut
OffRL
29
22
0
21 Nov 2019
Inspect Transfer Learning Architecture with Dilated Convolution
Inspect Transfer Learning Architecture with Dilated Convolution
Syeda Noor Jaha Azim
Md. Aminur Rab Ratul
24
0
0
20 Nov 2019
Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA
Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA
Badri N. Patro
Anupriy
Vinay P. Namboodiri
AAML
FAtt
48
26
0
19 Nov 2019
Conditionally Learn to Pay Attention for Sequential Visual Task
Conditionally Learn to Pay Attention for Sequential Visual Task
Jun He
Quan-Jie Cao
Lei Zhang
32
0
0
11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion,
  and Applications
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
40
326
0
10 Nov 2019
Distilling Knowledge Learned in BERT for Text Generation
Distilling Knowledge Learned in BERT for Text Generation
Yen-Chun Chen
Zhe Gan
Yu Cheng
Jingzhou Liu
Jingjing Liu
28
28
0
10 Nov 2019
On Architectures for Including Visual Information in Neural Language
  Models for Image Description
On Architectures for Including Visual Information in Neural Language Models for Image Description
Marc Tanti
Albert Gatt
K. Camilleri
VLM
30
2
0
09 Nov 2019
Early Predictions for Medical Crowdfunding: A Deep Learning Approach
  Using Diverse Inputs
Early Predictions for Medical Crowdfunding: A Deep Learning Approach Using Diverse Inputs
Tong Wang
Fujie Jin
Y. Hu
Yuan Cheng
OOD
30
5
0
09 Nov 2019
Are we asking the right questions in MovieQA?
Are we asking the right questions in MovieQA?
Bhavan A. Jasani
Rohit Girdhar
Deva Ramanan
19
15
0
08 Nov 2019
Boosting LSTM Performance Through Dynamic Precision Selection
Boosting LSTM Performance Through Dynamic Precision Selection
Franyell Silfa
J. Arnau
Antonio González
MQ
21
5
0
07 Nov 2019
Dancing to Music
Dancing to Music
Hsin-Ying Lee
Xiaodong Yang
Xuan Li
Ting-Chun Wang
Yu-Ding Lu
Ming-Hsuan Yang
Jan Kautz
27
15
0
05 Nov 2019
SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural
  Network
SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural Network
R. Yazdani
Olatunji Ruwase
Minjia Zhang
Yuxiong He
J. Arnau
Antonio González
38
4
0
04 Nov 2019
Predicting the Politics of an Image Using Webly Supervised Data
Predicting the Politics of an Image Using Webly Supervised Data
Christopher Thomas
Adriana Kovashka
SSL
26
21
0
31 Oct 2019
Hidden State Guidance: Improving Image Captioning using An Image
  Conditioned Autoencoder
Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder
Jialin Wu
Raymond J. Mooney
26
0
0
31 Oct 2019
XL-Editor: Post-editing Sentences with XLNet
XL-Editor: Post-editing Sentences with XLNet
Yong-Siang Shih
Wei-Cheng Chang
Yiming Yang
KELM
28
11
0
19 Oct 2019
Exploring Overall Contextual Information for Image Captioning in
  Human-Like Cognitive Style
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style
Hongwei Ge
Zehang Yan
Kai Zhang
Mingde Zhao
Liang Sun
30
24
0
15 Oct 2019
Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating
  Referee
Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee
Shuangjie Xu
Feng Xu
Yu Cheng
Pan Zhou
21
2
0
14 Oct 2019
Dynamic Attention Networks for Task Oriented Grounding
Dynamic Attention Networks for Task Oriented Grounding
S. Dasgupta
Badri N. Patro
Vinay P. Namboodiri
36
1
0
14 Oct 2019
Granular Multimodal Attention Networks for Visual Dialog
Granular Multimodal Attention Networks for Visual Dialog
Badri N. Patro
Shivansh Patel
Vinay P. Namboodiri
33
1
0
13 Oct 2019
Referring Expression Object Segmentation with Caption-Aware Consistency
Referring Expression Object Segmentation with Caption-Aware Consistency
Yi-Wen Chen
Yi-Hsuan Tsai
Tiantian Wang
Yen-Yu Lin
Ming-Hsuan Yang
EgoV
17
87
0
10 Oct 2019
Text-to-Image Synthesis Based on Machine Generated Captions
Text-to-Image Synthesis Based on Machine Generated Captions
Marco Menardi
Alex Falcon
Saida S. Mohamed
Lorenzo Seidenari
G. Serra
A. Bimbo
C. Tasso
33
0
0
09 Oct 2019
Semantic-aware Image Deblurring
Semantic-aware Image Deblurring
Fuhai Chen
Rongrong Ji
Chengpeng Dai
Xiaoshuai Sun
Chia-Wen Lin
Jiayi Ji
Baochang Zhang
Feiyue Huang
Liujuan Cao
BDL
VLM
25
6
0
09 Oct 2019
Prose for a Painting
Prose for a Painting
Prerna Kashyap
Samrat Phatale
Iddo Drori
9
3
0
08 Oct 2019
Score-CAM: Score-Weighted Visual Explanations for Convolutional Neural
  Networks
Score-CAM: Score-Weighted Visual Explanations for Convolutional Neural Networks
Mehdi Neshat
Zifan Wang
Bradley Alexander
Fan Yang
Zijian Zhang
Sirui Ding
Markus Wagner
Xia Hu
FAtt
45
1,053
0
03 Oct 2019
A Hierarchical Approach for Visual Storytelling Using Image Description
A Hierarchical Approach for Visual Storytelling Using Image Description
Md Sultan al Nahian
Tasmia Tasrin
Sagar Gandhi
Ryan Gaines
Brent Harrison
19
11
0
26 Sep 2019
Learning Visual Relation Priors for Image-Text Matching and Image
  Captioning with Neural Scene Graph Generators
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
35
37
0
22 Sep 2019
Towards Explainable Neural-Symbolic Visual Reasoning
Towards Explainable Neural-Symbolic Visual Reasoning
Adrien Bennetot
J. Laurent
Raja Chatila
Natalia Díaz Rodríguez
XAI
30
1
0
19 Sep 2019
Adaptively Aligned Image Captioning via Adaptive Attention Time
Adaptively Aligned Image Captioning via Adaptive Attention Time
Lun Huang
Wenmin Wang
Yaxian Xia
Jie Chen
8
61
0
19 Sep 2019
Large-scale representation learning from visually grounded untranscribed
  speech
Large-scale representation learning from visually grounded untranscribed speech
Gabriel Ilharco
Yuan Zhang
Jason Baldridge
SSL
27
60
0
19 Sep 2019
ContCap: A scalable framework for continual image captioning
ContCap: A scalable framework for continual image captioning
Giang Nguyen
Tae Joon Jun
T. Tran
Tolcha Yalew
Daeyoung Kim
VLM
CLL
24
10
0
19 Sep 2019
BSDAR: Beam Search Decoding with Attention Reward in Neural Keyphrase
  Generation
BSDAR: Beam Search Decoding with Attention Reward in Neural Keyphrase Generation
Iftitahu Ni'mah
Vlado Menkovski
Mykola Pechenizkiy
30
2
0
17 Sep 2019
Semantic Relatedness Based Re-ranker for Text Spotting
Semantic Relatedness Based Re-ranker for Text Spotting
Ahmed Sabir
Francesc Moreno-Noguer
Lluís Padró
VLM
30
5
0
17 Sep 2019
Controllable Length Control Neural Encoder-Decoder via Reinforcement
  Learning
Controllable Length Control Neural Encoder-Decoder via Reinforcement Learning
Junyi Bian
Baojun Lin
Kecheng Zhang
Zhaohui Yan
H. Tang
Yonghe Zhang
22
5
0
17 Sep 2019
Inverse Visual Question Answering with Multi-Level Attentions
Inverse Visual Question Answering with Multi-Level Attentions
Yaser Alwatter
Yuhong Guo
BDL
24
1
0
17 Sep 2019
ChOracle: A Unified Statistical Framework for Churn Prediction
ChOracle: A Unified Statistical Framework for Churn Prediction
Ali Khodadadi
Seyed Abbas Hosseini
Ehsan Pajouheshgar
Farnam Mansouri
Hamid R. Rabiee
AI4TS
16
12
0
15 Sep 2019
Understanding LSTM -- a tutorial into Long Short-Term Memory Recurrent
  Neural Networks
Understanding LSTM -- a tutorial into Long Short-Term Memory Recurrent Neural Networks
R. C. Staudemeyer
Eric Rothstein Morris
17
479
0
12 Sep 2019
Speculative Beam Search for Simultaneous Translation
Speculative Beam Search for Simultaneous Translation
Renjie Zheng
Mingbo Ma
Baigong Zheng
Liang Huang
45
24
0
12 Sep 2019
Previous
123...192021...394041
Next