1411.4555
Cited By
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,023 papers shown
Title
Situation Recognition with Graph Neural Networks
Ruiyu Li
Makarand Tapaswi
Renjie Liao
Jiaya Jia
R. Urtasun
Sanja Fidler
GNN
24
131
0
14 Aug 2017
Recurrent Filter Learning for Visual Tracking
Tianyu Yang
Antoni B. Chan
VOT
27
84
0
13 Aug 2017
Early Stage Malware Prediction Using Recurrent Neural Networks
Matilda Rhode
Pete Burnap
K. Jones
AAML
22
253
0
11 Aug 2017
TandemNet: Distilling Knowledge from Medical Images Using Diagnostic Reports as Optional Semantic References
Zizhao Zhang
Pingjun Chen
Manish Sapkota
Ling Yang
MedIm
18
67
0
10 Aug 2017
Hierarchically-Attentive RNN for Album Summarization and Storytelling
Licheng Yu
Joey Tianyi Zhou
Tamara L. Berg
41
66
0
09 Aug 2017
Learning to Disambiguate by Asking Discriminative Questions
Yining Li
Chen Huang
Xiaoou Tang
Chen Change Loy
18
22
0
09 Aug 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge
Damien Teney
Peter Anderson
Xiaodong He
Anton Van Den Hengel
50
381
0
09 Aug 2017
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
35
2,824
0
09 Aug 2017
Weakly Supervised Image Annotation and Segmentation with Objects and Attributes
Zhiyuan Shi
Yongxin Yang
Timothy M. Hospedales
Tao Xiang
21
46
0
08 Aug 2017
What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator?
Marc Tanti
Albert Gatt
K. Camilleri
24
56
0
07 Aug 2017
Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection
Pingping Zhang
D. Wang
Huchuan Lu
Hongyu Wang
Xiang Ruan
16
734
0
07 Aug 2017
Identity-Aware Textual-Visual Matching with Latent Co-attention
Shuang Li
Tong Xiao
Hongsheng Li
Wei Yang
Xiaogang Wang
22
227
0
07 Aug 2017
PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN
Hanwang Zhang
Zawlin Kyaw
Jinyang Yu
Shih-Fu Chang
22
141
0
07 Aug 2017
Learning to Infer Graphics Programs from Hand-Drawn Images
Kevin Ellis
Daniel E. Ritchie
Armando Solar-Lezama
J. Tenenbaum
NAI
13
226
0
30 Jul 2017
Graph Classification with 2D Convolutional Neural Networks
A. Tixier
Giannis Nikolentzos
Polykarpos Meladianos
Michalis Vazirgiannis
GNN
15
23
0
29 Jul 2017
Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints
Jieyu Zhao
Tianlu Wang
Mark Yatskar
Vicente Ordonez
Kai-Wei Chang
FaML
32
964
0
29 Jul 2017
Deep Co-Space: Sample Mining Across Feature Transformation for Semi-Supervised Learning
Ziliang Chen
Keze Wang
Tianlin Li
Pai Peng
E. Izquierdo
Liang Lin
34
9
0
28 Jul 2017
TensorLayer: A Versatile Library for Efficient Deep Learning Development
Hao Dong
A. Supratak
Luo Mai
Fangde Liu
A. Oehmichen
Simiao Yu
Yike Guo
59
114
0
26 Jul 2017
Deep Interactive Region Segmentation and Captioning
Ali Sharifi Boroujerdi
M. Khanian
M. Breuß
24
7
0
26 Jul 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
61
4,184
0
25 Jul 2017
Image Pivoting for Learning Multilingual Multimodal Representations
Spandana Gella
Rico Sennrich
Frank Keller
Mirella Lapata
SSL
38
78
0
24 Jul 2017
OBJ2TEXT: Generating Visually Descriptive Language from Object Layouts
Xuwang Yin
Vicente Ordonez
VLM
40
55
0
22 Jul 2017
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey
Hirokatsu Kataoka
Soma Shirakabe
Yun He
S. Ueta
Teppei Suzuki
...
Ryousuke Takasawa
Masataka Fuchida
Yudai Miyashita
Kazushige Okayasu
Yuta Matsuzaki
30
1
0
20 Jul 2017
Learning Visually Grounded Sentence Representations
Douwe Kiela
Alexis Conneau
Allan Jabri
Maximilian Nickel
SSL
29
69
0
19 Jul 2017
Grounding Spatio-Semantic Referring Expressions for Human-Robot Interaction
Mohit Shridhar
David Hsu
ObjD
27
20
0
18 Jul 2017
Auto-Conditioned Recurrent Networks for Extended Complex Human Motion Synthesis
Zimo Li
Yi Zhou
Shuangjiu Xiao
C. He
Zeng Huang
Hao Li
3DH
27
47
0
17 Jul 2017
Knowledge-Guided Recurrent Neural Network Learning for Task-Oriented Action Prediction
Liang Lin
Lili Huang
Tianshui Chen
Yukang Gan
Hui Cheng
20
16
0
15 Jul 2017
Large-scale Video Classification guided by Batch Normalized LSTM Translator
Jae Hyeon Yoo
VLM
20
11
0
13 Jul 2017
Deep Fisher Discriminant Learning for Mobile Hand Gesture Recognition
Chunyu Xie
Ce Li
Baochang Zhang
Chong Chen
Jungong Han
HAI
27
64
0
12 Jul 2017
Automatic Understanding of Image and Video Advertisements
Zaeem Hussain
Ruotong Wang
Xiaozhong Zhang
Keren Ye
Christopher Thomas
Zuha Agha
Nathan Ong
Adriana Kovashka
DiffM
22
161
0
10 Jul 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network
Zizhao Zhang
Yuanpu Xie
Fuyong Xing
M. McGough
Ling Yang
MedIm
21
301
0
08 Jul 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
AI4TS
81
6,792
0
04 Jul 2017
Where to Play: Retrieval of Video Segments using Natural-Language Queries
Sangkuk Lee
Daesik Kim
Myunggi Lee
Jihye Hwang
Nojun Kwak
38
3
0
02 Jul 2017
Automated Audio Captioning with Recurrent Neural Networks
K. Drossos
Sharath Adavanne
Tuomas Virtanen
25
128
0
30 Jun 2017
Actor-Critic Sequence Training for Image Captioning
Li Zhang
Flood Sung
Feng Liu
Tao Xiang
S. Gong
Yongxin Yang
Timothy M. Hospedales
24
111
0
29 Jun 2017
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention
Marcella Cornia
Lorenzo Baraldi
Giuseppe Serra
Rita Cucchiara
33
79
0
26 Jun 2017
Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates
Jun Liu
Amir Shahroudy
Dong Xu
Alex C. Kot
G. Wang
25
452
0
26 Jun 2017
Neural-based Natural Language Generation in Dialogue using RNN Encoder-Decoder with Semantic Aggregation
Van-Khanh Tran
Le-Minh Nguyen
31
33
0
21 Jun 2017
Using Artificial Tokens to Control Languages for Multilingual Image Caption Generation
Satoshi Tsutsui
David J. Crandall
19
19
0
20 Jun 2017
An online sequence-to-sequence model for noisy speech recognition
Chung-Cheng Chiu
Dieterich Lawson
Yuping Luo
George Tucker
Kevin Swersky
Ilya Sutskever
Navdeep Jaitly
19
7
0
16 Jun 2017
Deep Learning Methods for Efficient Large Scale Video Labeling
Miha Škalič
M. Pekalski
Xin Pan
VLM
18
17
0
14 Jun 2017
Evaluating Personal Assistants on Mobile devices
Julia Kiseleva
Maarten de Rijke
11
21
0
14 Jun 2017
SEARNN: Training RNNs with Global-Local Losses
Rémi Leblond
Jean-Baptiste Alayrac
A. Osokin
Simon Lacoste-Julien
27
52
0
14 Jun 2017
Teaching Compositionality to CNNs
Austin Stone
Hua-Yan Wang
Michael Stark
Yi Liu
D. Phoenix
Dileep George
CoGe
16
54
0
14 Jun 2017
Image Captioning with Object Detection and Localization
Zhongliang Yang
Yujin Zhang
S. Rehman
Yongfeng Huang
ObjD
VLM
30
47
0
08 Jun 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
Jiasen Lu
A. Kannan
Jianwei Yang
Devi Parikh
Dhruv Batra
BDL
38
136
0
05 Jun 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Jingkuan Song
Zhao Guo
Lianli Gao
Wu Liu
Dongxiang Zhang
Heng Tao Shen
48
166
0
05 Jun 2017
Order embeddings and character-level convolutions for multimodal alignment
Jonatas Wehrmann
Anderson Mattjie
Rodrigo C. Barros
28
27
0
03 Jun 2017
See, Hear, and Read: Deep Aligned Representations
Y. Aytar
Carl Vondrick
Antonio Torralba
VLM
AI4TS
12
136
0
03 Jun 2017
Natural Language Generation for Spoken Dialogue System using RNN Encoder-Decoder Networks
Van-Khanh Tran
Le-Minh Nguyen
41
41
0
01 Jun 2017