Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.4555
Cited By
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,022 papers shown
Title
Watch What You Just Said: Image Captioning with Text-Conditional Attention
Luowei Zhou
Chenliang Xu
Parker A. Koch
Jason J. Corso
VLM
22
44
0
15 Jun 2016
Conditional Generative Moment-Matching Networks
Yong Ren
J. Li
Yucen Luo
Jun Zhu
GAN
11
61
0
14 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data
Shikhar Sharma
Jing He
Kaheer Suleman
Hannes Schulz
Philip Bachman
16
29
0
11 Jun 2016
Length bias in Encoder Decoder Models and a Case for Global Conditioning
Pavel Sountsov
Sunita Sarawagi
AI4CE
19
42
0
10 Jun 2016
Automated Image Captioning for Rapid Prototyping and Resource Constrained Environments
Karan Sharma
Arun C. S. Kumar
S. Bhandarkar
12
0
0
04 Jun 2016
Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network
Yu Liu
Jianlong Fu
Tao Mei
C. Chen
13
4
0
02 Jun 2016
Attention Correctness in Neural Image Captioning
Chenxi Liu
Junhua Mao
Fei Sha
Alan Yuille
3DV
30
220
0
31 May 2016
Parametric Exponential Linear Unit for Deep Convolutional Neural Networks
Ludovic Trottier
Philippe Giguère
B. Chaib-draa
41
199
0
30 May 2016
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey
Hirokatsu Kataoka
Yudai Miyashita
Tomoaki K. Yamabe
Soma Shirakabe
Shin-ichi Sato
...
Kaori Abe
Takaaki Imanari
Naomichi Kobayashi
Shinichiro Morita
Akio Nakamura
24
2
0
26 May 2016
Review Networks for Caption Generation
Zhilin Yang
Ye Yuan
Yuexin Wu
Ruslan Salakhutdinov
William W. Cohen
3DV
32
85
0
25 May 2016
Structured Prediction Theory Based on Factor Graph Complexity
Corinna Cortes
M. Mohri
Vitaly Kuznetsov
Scott Yang
OOD
14
55
0
20 May 2016
Stereotyping and Bias in the Flickr30K Dataset
Emiel van Miltenburg
21
90
0
19 May 2016
Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Andrew Shin
Katsunori Ohnishi
Tatsuya Harada
12
31
0
18 May 2016
Generative Adversarial Text to Image Synthesis
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
GAN
22
3,125
0
17 May 2016
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCL
VLM
170
840
0
17 May 2016
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
32
353
0
12 May 2016
Sensorimotor Input as a Language Generalisation Tool: A Neurorobotics Model for Generation and Generalisation of Noun-Verb Combinations with Sensorimotor Inputs
Junpei Zhong
M. Peniak
Jun Tani
T. Ogata
Angelo Cangelosi
11
23
0
11 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
16
101
0
09 May 2016
Chained Predictions Using Convolutional Neural Networks
Georgia Gkioxari
Alexander Toshev
Navdeep Jaitly
BDL
24
190
0
08 May 2016
Improving Image Captioning by Concept-based Sentence Reranking
Xirong Li
Qin Jin
17
5
0
03 May 2016
Multi30K: Multilingual English-German Image Descriptions
Desmond Elliott
Stella Frank
K. Simaán
Lucia Specia
VLM
27
580
0
02 May 2016
Compression Artifacts Removal Using Convolutional Neural Networks
P. Svoboda
Michal Hradiš
David Barina
P. Zemčík
24
144
0
02 May 2016
Attributes for Improved Attributes: A Multi-Task Network for Attribute Classification
Emily M. Hand
Rama Chellappa
CVBM
22
36
0
25 Apr 2016
Chinese Song Iambics Generation with Neural Attention-based Model
Qixin Wang
Tianyi Luo
Dong Wang
Chao Xing
6
78
0
21 Apr 2016
Sketching and Neural Networks
Amit Daniely
N. Lazić
Y. Singer
Kunal Talwar
4
10
0
19 Apr 2016
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length Image Tagging
Jiren Jin
Hideki Nakayama
3DV
VLM
24
69
0
18 Apr 2016
CNN-RNN: A Unified Framework for Multi-label Image Classification
Jiang Wang
Yi Yang
Junhua Mao
Zhiheng Huang
Chang Huang
Wenyuan Xu
SSL
34
1,162
0
15 Apr 2016
Learning Visual Storylines with Skipping Recurrent Neural Networks
Gunnar A. Sigurdsson
Xinlei Chen
Abhinav Gupta
26
38
0
14 Apr 2016
Visual Storytelling
Ting-Hao 'Kenneth' Huang
Huang
Francis Ferraro
N. Mostafazadeh
Ishan Misra
...
C. L. Zitnick
Devi Parikh
Lucy Vanderwende
Michel Galley
Margaret Mitchell
VGen
22
464
0
13 Apr 2016
Online Multi-Target Tracking Using Recurrent Neural Networks
Anton Milan
S. Hamid Rezatofighi
A. Dick
Ian Reid
Konrad Schindler
VOT
9
515
0
13 Apr 2016
Video Description using Bidirectional Recurrent Neural Networks
Álvaro Peris
Marc Bolaños
Petia Radeva
F. Casacuberta
20
33
0
12 Apr 2016
Attributes as Semantic Units between Natural Language and Visual Recognition
Marcus Rohrbach
VLM
16
3
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
25
270
0
10 Apr 2016
Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes
Gordon A. Christie
A. Laddha
Aishwarya Agrawal
Stanislaw Antol
Yash Goyal
K. Kochersberger
Dhruv Batra
20
30
0
07 Apr 2016
Sentence Level Recurrent Topic Model: Letting Topics Speak for Themselves
Fei Tian
Bin Gao
Di He
Tie-Yan Liu
LRM
BDL
12
24
0
07 Apr 2016
Optimizing Performance of Recurrent Neural Networks on GPUs
J. Appleyard
Tomás Kociský
Phil Blunsom
25
91
0
07 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan
Lisa Anne Hendricks
Raymond J. Mooney
Kate Saenko
VLM
28
117
0
06 Apr 2016
Image Captioning with Deep Bidirectional LSTMs
Cheng Wang
Haojin Yang
Christian Bartz
Christoph Meinel
VLM
10
278
0
04 Apr 2016
Automatic Annotation of Structured Facts in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
13
9
0
02 Apr 2016
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings
Spandana Gella
Mirella Lapata
Frank Keller
CoGe
27
52
0
30 Mar 2016
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning
Andrew Shin
Masataka Yamaguchi
Katsunori Ohnishi
Tatsuya Harada
47
8
0
30 Mar 2016
Rich Image Captioning in the Wild
Kenneth Tran
Xiaodong He
Lei Zhang
Jian Sun
Cornelia Carapcea
Chris Thrasher
Chris Buehler
Chris Sienkiewicz
VLM
19
123
0
30 Mar 2016
Multi-Cue Zero-Shot Learning with Strong Supervision
Zeynep Akata
Mateusz Malinowski
Mario Fritz
Bernt Schiele
37
148
0
29 Mar 2016
Generating Visual Explanations
Lisa Anne Hendricks
Zeynep Akata
Marcus Rohrbach
Jeff Donahue
Bernt Schiele
Trevor Darrell
VLM
FAtt
41
618
0
28 Mar 2016
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation
Hoo-Chang Shin
Kirk Roberts
Le Lu
Dina Demner-Fushman
Jianhua Yao
Ronald M. Summers
18
347
0
28 Mar 2016
Do You See What I Mean? Visual Resolution of Linguistic Ambiguities
Yevgeni Berzak
Andrei Barbu
Daniel Harari
Boris Katz
S. Ullman
13
34
0
26 Mar 2016
Unsupervised Category Discovery via Looped Deep Pseudo-Task Optimization Using a Large Scale Radiology Image Database
Xiaosong Wang
Le Lu
Hoo-Chang Shin
Lauren Kim
Isabella Nogues
Jianhua Yao
Ronald M. Summers
22
16
0
25 Mar 2016
Neural Text Generation from Structured Data with Application to the Biography Domain
R. Lebret
David Grangier
Michael Auli
19
45
0
24 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
29
105
0
23 Mar 2016
"What happens if..." Learning to Predict the Effect of Forces in Images
Roozbeh Mottaghi
Mohammad Rastegari
Abhinav Gupta
Ali Farhadi
OOD
31
123
0
17 Mar 2016
Previous
1
2
3
...
36
37
38
39
40
41
Next