Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.4555
Cited By
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,022 papers shown
Title
Learning Deep Structure-Preserving Image-Text Embeddings
Liwei Wang
Yin Li
Svetlana Lazebnik
35
780
0
19 Nov 2015
Reducing Overfitting in Deep Networks by Decorrelating Representations
Michael Cogswell
Faruk Ahmed
Ross B. Girshick
C. L. Zitnick
Dhruv Batra
23
411
0
19 Nov 2015
ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering
Kan Chen
Jiang Wang
Liang-Chieh Chen
Haoyuan Gao
Wenyuan Xu
Ram Nevatia
22
287
0
18 Nov 2015
Learning Articulated Motion Models from Visual and Lingual Signals
Zhengyang Wu
Joey Tianyi Zhou
Matthew R. Walter
22
0
0
17 Nov 2015
Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data
Lisa Anne Hendricks
Subhashini Venugopalan
Marcus Rohrbach
Raymond J. Mooney
Kate Saenko
Trevor Darrell
CoGe
16
284
0
17 Nov 2015
Recurrent Neural Networks Hardware Implementation on FPGA
Andre Xian Ming Chang
B. Martini
Eugenio Culurciello
11
126
0
17 Nov 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
Huijuan Xu
Kate Saenko
24
760
0
17 Nov 2015
How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary?
Ferenc Huszár
OOD
DiffM
GAN
17
296
0
16 Nov 2015
Yin and Yang: Balancing and Answering Binary Visual Questions
Peng Zhang
Yash Goyal
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
19
349
0
16 Nov 2015
Sherlock: Scalable Fact Learning in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
19
26
0
16 Nov 2015
A Neural Transducer
Navdeep Jaitly
David Sussillo
Quoc V. Le
Oriol Vinyals
Ilya Sutskever
Samy Bengio
AI4TS
14
48
0
16 Nov 2015
Neural Programmer: Inducing Latent Programs with Gradient Descent
Arvind Neelakantan
Quoc V. Le
Ilya Sutskever
ODL
27
260
0
16 Nov 2015
Uncovering Temporal Context for Video Question and Answering
Linchao Zhu
Zhongwen Xu
Yi Yang
Alexander G. Hauptmann
BDL
19
44
0
15 Nov 2015
Oracle performance for visual captioning
L. Yao
Nicolas Ballas
Kyunghyun Cho
John R. Smith
Yoshua Bengio
VLM
31
8
0
14 Nov 2015
Symbol Grounding Association in Multimodal Sequences with Missing Elements
Federico Raue
Andreas Dengel
Thomas Breuel
Marcus Liwicki
11
1
0
13 Nov 2015
Sequence to Sequence Learning for Optical Character Recognition
D. Sahu
Mohak Sukhwani
14
13
0
13 Nov 2015
Natural Language Object Retrieval
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
ObjD
32
551
0
13 Nov 2015
Action Recognition using Visual Attention
Shikhar Sharma
Ryan Kiros
Ruslan Salakhutdinov
24
666
0
12 Nov 2015
Improving performance of recurrent neural network with relu nonlinearity
S. Talathi
Aniket A. Vartak
ODL
16
87
0
12 Nov 2015
Grounding of Textual Phrases in Images by Reconstruction
Anna Rohrbach
Marcus Rohrbach
Ronghang Hu
Trevor Darrell
Bernt Schiele
18
494
0
12 Nov 2015
Deep Multimodal Semantic Embeddings for Speech and Images
David Harwath
James R. Glass
10
155
0
11 Nov 2015
Generative Concatenative Nets Jointly Learn to Write and Classify Reviews
Zachary Chase Lipton
Sharad Vikram
Julian McAuley
BDL
25
32
0
11 Nov 2015
Learning to Diagnose with LSTM Recurrent Neural Networks
Zachary Chase Lipton
David C. Kale
Charles Elkan
R. Wetzel
14
1,095
0
11 Nov 2015
Visual7W: Grounded Question Answering in Images
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
44
870
0
11 Nov 2015
From Images to Sentences through Scene Description Graphs using Commonsense Reasoning and Knowledge
Somak Aditya
Yezhou Yang
Chitta Baral
Cornelia Fermuller
Yiannis Aloimonos
3DV
11
69
0
10 Nov 2015
Generating Images from Captions with Attention
Elman Mansimov
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
VLM
40
449
0
09 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana-Maria Camburu
Alan Yuille
Kevin Patrick Murphy
ObjD
31
1,310
0
07 Nov 2015
Stacked Attention Networks for Image Question Answering
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
BDL
36
1,867
0
07 Nov 2015
Semi-supervised Sequence Learning
Andrew M. Dai
Quoc V. Le
SSL
13
1,230
0
04 Nov 2015
Top-down Tree Long Short-Term Memory Networks
Xingxing Zhang
Liang Lu
Mirella Lapata
AIMat
17
101
0
31 Oct 2015
Generating Text with Deep Reinforcement Learning
Hongyu Guo
AIMat
17
50
0
30 Oct 2015
Deep Recurrent Regression for Facial Landmark Detection
Hanjiang Lai
Shengtao Xiao
Yan Pan
Zhen Cui
Jiashi Feng
Chunyan Xu
Jian Yin
Shuicheng Yan
3DV
CVBM
31
57
0
30 Oct 2015
Learning Multi-Domain Convolutional Neural Networks for Visual Tracking
Hyeonseob Nam
Bohyung Han
49
2,464
0
27 Oct 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
Haonan Yu
Jiang Wang
Zhiheng Huang
Yi Yang
Wenyuan Xu
42
560
0
26 Oct 2015
Phenotyping of Clinical Time Series with LSTM Recurrent Neural Networks
Zachary Chase Lipton
David C. Kale
R. Wetzel
22
38
0
26 Oct 2015
Multilingual Image Description with Neural Sequence Models
Desmond Elliott
Stella Frank
Eva Hasler
VLM
22
75
0
15 Oct 2015
Resolving References to Objects in Photographs using the Words-As-Classifiers Model
David Schlangen
Sina Zarrieß
C. Kennington
23
48
0
07 Oct 2015
SentiCap: Generating Image Descriptions with Sentiments
A. Mathews
Lexing Xie
Xuming He
23
221
0
06 Oct 2015
Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing
Hamid Izadinia
Fereshteh Sadeghi
S. Divvala
Yejin Choi
Ali Farhadi
VLM
12
20
0
27 Sep 2015
Guiding Long-Short Term Memory for Image Caption Generation
Xu Jia
E. Gavves
Basura Fernando
Tinne Tuytelaars
VLM
22
101
0
16 Sep 2015
Deep Learning Applied to Image and Text Matching
Afroze Ibrahim Baqapuri
VLM
16
0
0
14 Sep 2015
Structured Prediction with Output Embeddings for Semantic Image Annotation
A. Quattoni
Arnau Ramisa
Pranava Swaroop Madhyastha
E. Simo-Serra
Francesc Moreno-Noguer
14
2
0
07 Sep 2015
Object Recognition from Short Videos for Robotic Perception
Ivan Bogun
A. Angelova
Navdeep Jaitly
12
8
0
04 Sep 2015
What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment
Hongyuan Mei
Joey Tianyi Zhou
Matthew R. Walter
22
288
0
02 Sep 2015
SentenceRacer: A Game with a Purpose for Image Sentence Annotation
Kenji Hata
Sherman Leung
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
26
0
0
27 Aug 2015
The SP theory of intelligence: distinctive features and advantages
J. G. Wolff
8
31
0
17 Aug 2015
Image Representations and New Domains in Neural Image Captioning
Jack Hessel
Nicolas Savva
Michael J. Wilber
VLM
22
16
0
09 Aug 2015
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
50
2,250
0
05 Aug 2015
Recurrent Network Models for Human Dynamics
Katerina Fragkiadaki
Sergey Levine
Panna Felsen
Jitendra Malik
28
30
0
02 Aug 2015
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
Iulian Serban
Alessandro Sordoni
Yoshua Bengio
Aaron Courville
Joelle Pineau
AILaw
16
1,747
0
17 Jul 2015
Previous
1
2
3
...
38
39
40
41
Next