Show and Tell: A Neural Image Caption Generator

17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
    3DV

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,022 papers shown
Learning Deep Structure-Preserving Image-Text Embeddings
Liwei Wang
Yin Li
Svetlana Lazebnik
35
780
0
19 Nov 2015
Reducing Overfitting in Deep Networks by Decorrelating Representations
Michael Cogswell
Faruk Ahmed
Ross B. Girshick
C. L. Zitnick
Dhruv Batra
23
411
0
19 Nov 2015
ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering
Kan Chen
Jiang Wang
Liang-Chieh Chen
Haoyuan Gao
Wenyuan Xu
Ram Nevatia
22
287
0
18 Nov 2015
Learning Articulated Motion Models from Visual and Lingual Signals
Zhengyang Wu
Joey Tianyi Zhou
Matthew R. Walter
22
0
0
17 Nov 2015
Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data
Lisa Anne Hendricks
Subhashini Venugopalan
Marcus Rohrbach
Raymond J. Mooney
Kate Saenko
Trevor Darrell
CoGe
16
284
0
17 Nov 2015
Recurrent Neural Networks Hardware Implementation on FPGA
Andre Xian Ming Chang
B. Martini
Eugenio Culurciello
11
126
0
17 Nov 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
Huijuan Xu
Kate Saenko
24
760
0
17 Nov 2015
How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary?
Ferenc Huszár
OOD
DiffM
GAN
17
296
0
16 Nov 2015
Yin and Yang: Balancing and Answering Binary Visual Questions
Peng Zhang
Yash Goyal
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
19
349
0
16 Nov 2015
Sherlock: Scalable Fact Learning in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
19
26
0
16 Nov 2015
A Neural Transducer
Navdeep Jaitly
David Sussillo
Quoc V. Le
Oriol Vinyals
Ilya Sutskever
Samy Bengio
AI4TS
14
48
0
16 Nov 2015
Neural Programmer: Inducing Latent Programs with Gradient Descent
Arvind Neelakantan
Quoc V. Le
Ilya Sutskever
ODL
27
260
0
16 Nov 2015
Uncovering Temporal Context for Video Question and Answering
Linchao Zhu
Zhongwen Xu
Yi Yang
Alexander G. Hauptmann
BDL
19
44
0
15 Nov 2015
Oracle performance for visual captioning
L. Yao
Nicolas Ballas
Kyunghyun Cho
John R. Smith
Yoshua Bengio
VLM
31
8
0
14 Nov 2015
Symbol Grounding Association in Multimodal Sequences with Missing Elements
Federico Raue
Andreas Dengel
Thomas Breuel
Marcus Liwicki
11
1
0
13 Nov 2015
Sequence to Sequence Learning for Optical Character Recognition
D. Sahu
Mohak Sukhwani
14
13
0
13 Nov 2015
Natural Language Object Retrieval
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
ObjD
32
551
0
13 Nov 2015
Action Recognition using Visual Attention
Shikhar Sharma
Ryan Kiros
Ruslan Salakhutdinov
24
666
0
12 Nov 2015
Improving performance of recurrent neural network with relu nonlinearity
S. Talathi
Aniket A. Vartak
ODL
16
87
0
12 Nov 2015
Grounding of Textual Phrases in Images by Reconstruction
Anna Rohrbach
Marcus Rohrbach
Ronghang Hu
Trevor Darrell
Bernt Schiele
18
494
0
12 Nov 2015
Deep Multimodal Semantic Embeddings for Speech and Images
David Harwath
James R. Glass
10
155
0
11 Nov 2015
Generative Concatenative Nets Jointly Learn to Write and Classify Reviews
Zachary Chase Lipton
Sharad Vikram
Julian McAuley
BDL
25
32
0
11 Nov 2015
Learning to Diagnose with LSTM Recurrent Neural Networks
Zachary Chase Lipton
David C. Kale
Charles Elkan
R. Wetzel
14
1,095
0
11 Nov 2015
Visual7W: Grounded Question Answering in Images
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
44
870
0
11 Nov 2015
From Images to Sentences through Scene Description Graphs using Commonsense Reasoning and Knowledge
Somak Aditya
Yezhou Yang
Chitta Baral
Cornelia Fermuller
Yiannis Aloimonos
3DV
11
69
0
10 Nov 2015
Generating Images from Captions with Attention
Elman Mansimov
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
VLM
40
449
0
09 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana-Maria Camburu
Alan Yuille
Kevin Patrick Murphy
ObjD
31
1,310
0
07 Nov 2015
Stacked Attention Networks for Image Question Answering
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
BDL
36
1,867
0
07 Nov 2015
Semi-supervised Sequence Learning
Andrew M. Dai
Quoc V. Le
SSL
13
1,230
0
04 Nov 2015
Top-down Tree Long Short-Term Memory Networks
Xingxing Zhang
Liang Lu
Mirella Lapata
AIMat
17
101
0
31 Oct 2015
Generating Text with Deep Reinforcement Learning
Hongyu Guo
AIMat
17
50
0
30 Oct 2015
Deep Recurrent Regression for Facial Landmark Detection
Hanjiang Lai
Shengtao Xiao
Yan Pan
Zhen Cui
Jiashi Feng
Chunyan Xu
Jian Yin
Shuicheng Yan
3DV
CVBM
31
57
0
30 Oct 2015
Learning Multi-Domain Convolutional Neural Networks for Visual Tracking
Hyeonseob Nam
Bohyung Han
49
2,464
0
27 Oct 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
Haonan Yu
Jiang Wang
Zhiheng Huang
Yi Yang
Wenyuan Xu
42
560
0
26 Oct 2015
Phenotyping of Clinical Time Series with LSTM Recurrent Neural Networks
Zachary Chase Lipton
David C. Kale
R. Wetzel
22
38
0
26 Oct 2015
Multilingual Image Description with Neural Sequence Models
Desmond Elliott
Stella Frank
Eva Hasler
VLM
22
75
0
15 Oct 2015
Resolving References to Objects in Photographs using the Words-As-Classifiers Model
David Schlangen
Sina Zarrieß
C. Kennington
23
48
0
07 Oct 2015
SentiCap: Generating Image Descriptions with Sentiments
A. Mathews
Lexing Xie
Xuming He
23
221
0
06 Oct 2015
Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing
Hamid Izadinia
Fereshteh Sadeghi
S. Divvala
Yejin Choi
Ali Farhadi
VLM
12
20
0
27 Sep 2015
Guiding Long-Short Term Memory for Image Caption Generation
Xu Jia
E. Gavves
Basura Fernando
Tinne Tuytelaars
VLM
22
101
0
16 Sep 2015
Deep Learning Applied to Image and Text Matching
Afroze Ibrahim Baqapuri
VLM
16
0
0
14 Sep 2015
Structured Prediction with Output Embeddings for Semantic Image Annotation
A. Quattoni
Arnau Ramisa
Pranava Swaroop Madhyastha
E. Simo-Serra
Francesc Moreno-Noguer
14
2
0
07 Sep 2015
Object Recognition from Short Videos for Robotic Perception
Ivan Bogun
A. Angelova
Navdeep Jaitly
12
8
0
04 Sep 2015
What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment
Hongyuan Mei
Joey Tianyi Zhou
Matthew R. Walter
22
288
0
02 Sep 2015
SentenceRacer: A Game with a Purpose for Image Sentence Annotation
Kenji Hata
Sherman Leung
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
26
0
0
27 Aug 2015
The SP theory of intelligence: distinctive features and advantages
J. G. Wolff
8
31
0
17 Aug 2015
Image Representations and New Domains in Neural Image Captioning
Jack Hessel
Nicolas Savva
Michael J. Wilber
VLM
22
16
0
09 Aug 2015
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
50
2,250
0
05 Aug 2015
Recurrent Network Models for Human Dynamics
Katerina Fragkiadaki
Sergey Levine
Panna Felsen
Jitendra Malik
28
30
0
02 Aug 2015
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
Iulian Serban
Alessandro Sordoni
Yoshua Bengio
Aaron Courville
Joelle Pineau
AILaw
16
1,747
0
17 Jul 2015