1411.4555
Cited By
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,023 papers shown
Title
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
36
20
0
07 Dec 2018
Recursive Visual Attention in Visual Dialog
Yulei Niu
Hanwang Zhang
Manli Zhang
Jianhong Zhang
Zhiwu Lu
Ji-Rong Wen
28
118
0
06 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
35
693
0
06 Dec 2018
CompILE: Compositional Imitation Learning and Execution
Thomas Kipf
Yujia Li
H. Dai
V. Zambaldi
Alvaro Sanchez-Gonzalez
Edward Grefenstette
Pushmeet Kohli
Peter W. Battaglia
VLM
33
13
0
04 Dec 2018
A Survey on Semantic Parsing
Aishwarya Kamath
Rajarshi Das
22
117
0
03 Dec 2018
Lightweight and Efficient Image Super-Resolution with Block State-based Recursive Network
Sanghyun Son
Jun-Hyuk Kim
Manri Cheon
Jong-Seok Lee
SupR
16
10
0
30 Nov 2018
Generating Easy-to-Understand Referring Expressions for Target Identifications
Mikihiro Tanaka
Takayuki Itamochi
Kenichi Narioka
Ikuro Sato
Yoshitaka Ushiku
Tatsuya Harada
24
1
0
29 Nov 2018
Towards Task Understanding in Visual Settings
Sebastin Santy
W. Zulfikar
Rishabh Mehrotra
Emine Yilmaz
37
1
0
28 Nov 2018
Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network
Peng Lu
Hangyu Lin
Yanwei Fu
S. Gong
Yu-Gang Jiang
Xiangyang Xue
3DPC
27
5
0
28 Nov 2018
Unsupervised Multi-modal Neural Machine Translation
Yuanhang Su
Kai Fan
Nguyen Bach
C.-C. Jay Kuo
Fei Huang
33
59
0
28 Nov 2018
Eliminating Exposure Bias and Loss-Evaluation Mismatch in Multiple Object Tracking
Andrii Maksai
Pascal Fua
VOT
22
17
0
27 Nov 2018
Unsupervised Image Captioning
Yang Feng
Lin Ma
Wei Liu
Jiebo Luo
VLM
SSL
24
201
0
27 Nov 2018
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
DiffM
34
175
0
26 Nov 2018
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
Xin Eric Wang
Qiuyuan Huang
Asli Celikyilmaz
Jianfeng Gao
Dinghan Shen
Yuan-fang Wang
William Yang Wang
Lei Zhang
LM&Ro
SSL
40
530
0
25 Nov 2018
Senti-Attend: Image Captioning using Sentiment and Attention
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
VLM
24
15
0
24 Nov 2018
Connecting the Dots Between MLE and RL for Sequence Prediction
Bowen Tan
Zhiting Hu
Zichao Yang
Ruslan Salakhutdinov
Eric Xing
28
24
0
24 Nov 2018
Towards Robust Neural Networks with Lipschitz Continuity
Muhammad Usama
D. Chang
OOD
11
9
0
22 Nov 2018
Scene Graph Generation via Conditional Random Fields
Weilin Cong
Wenjie Wang
Wang-Chien Lee
GNN
27
22
0
20 Nov 2018
Explicit Bias Discovery in Visual Question Answering Models
Varun Manjunatha
Nirat Saini
L. Davis
CML
FAtt
19
92
0
19 Nov 2018
Intention Oriented Image Captions with Guiding Objects
Yue Zheng
Yali Li
Shengjin Wang
27
55
0
19 Nov 2018
Performance Estimation of Synthesis Flows cross Technologies using LSTMs and Transfer Learning
Cunxi Yu
Wang Zhou
AI4TS
14
0
0
14 Nov 2018
Predicting the time-evolution of multi-physics systems with sequence-to-sequence models
K. Humbird
J. Peterson
R. McClarren
AI4CE
17
3
0
14 Nov 2018
Image Captioning Based on a Hierarchical Attention Mechanism and Policy Gradient Optimization
Shiyang Yan
Yuan Xie
F. Wu
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
14
5
0
13 Nov 2018
Learning to Skip Ineffectual Recurrent Computations in LSTMs
A. Ardakani
Zhengyun Ji
W. Gross
13
16
0
09 Nov 2018
TrafficPredict: Trajectory Prediction for Heterogeneous Traffic-Agents
Sungwon Kim
Sang-gil Lee
Sibo Zhang
Ruigang Yang
Jaehyeon Kim
Tianyi Zhou
38
416
0
06 Nov 2018
Image Chat: Engaging Grounded Conversations
Kurt Shuster
Samuel Humeau
Antoine Bordes
Jason Weston
23
115
0
02 Nov 2018
Sequence Generation with Guider Network
Ruiyi Zhang
Changyou Chen
Zhe Gan
Wenlin Wang
Liqun Chen
Dinghan Shen
Guoyin Wang
Lawrence Carin
3DV
16
4
0
02 Nov 2018
Learning Beam Search Policies via Imitation Learning
Renato M. P. Negrinho
Matthew R. Gormley
Geoffrey J. Gordon
41
27
0
01 Nov 2018
Dial2Desc: End-to-end Dialogue Description Generation
Haojie Pan
Junpei Zhou
Zhou Zhao
Yan Liu
Deng Cai
Min Yang
VLM
18
14
0
01 Nov 2018
Gated Hierarchical Attention for Image Captioning
Qingzhong Wang
Antoni B. Chan
24
18
0
30 Oct 2018
Discrimination-aware Channel Pruning for Deep Neural Networks
Zhuangwei Zhuang
Mingkui Tan
Bohan Zhuang
Jing Liu
Yong Guo
Qingyao Wu
Junzhou Huang
Jin-Hui Zhu
25
595
0
28 Oct 2018
Middle-Out Decoding
Shikib Mehri
Leonid Sigal
24
22
0
28 Oct 2018
Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features
S. Palazzo
C. Spampinato
I. Kavasidis
D. Giordano
Joseph Schmidt
M. Shah
132
111
0
25 Oct 2018
Tackling Sequence to Sequence Mapping Problems with Neural Networks
Lei Yu
AIMat
28
2
0
25 Oct 2018
Learning with Interpretable Structure from Gated RNN
Bo-Jian Hou
Zhi-Hua Zhou
AI4CE
21
69
0
25 Oct 2018
Engaging Image Captioning Via Personality
Kurt Shuster
Samuel Humeau
Hexiang Hu
Antoine Bordes
Jason Weston
40
149
0
25 Oct 2018
DropFilter: Dropout for Convolutions
Zhengsu Chen
9
4
0
23 Oct 2018
A Neural Compositional Paradigm for Image Captioning
Bo Dai
Sanja Fidler
Dahua Lin
CoGe
29
41
0
23 Oct 2018
A Knowledge-Grounded Multimodal Search-Based Conversational Agent
Shubham Agarwal
Ondrej Dusek
Ioannis Konstas
Verena Rieser
31
22
0
20 Oct 2018
Improved Techniques for GAN based Facial Inpainting
A. Lahiri
A. Jain
Divyasri Nadendla
P. Biswas
CVBM
27
8
0
20 Oct 2018
Cross-Modal and Hierarchical Modeling of Video and Text
Bowen Zhang
Hexiang Hu
Fei Sha
BDL
AI4TS
23
188
0
16 Oct 2018
Trellis Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
25
145
0
15 Oct 2018
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images
Javier Marín
Aritro Biswas
Ferda Ofli
Nick Hynes
Amaia Salvador
Y. Aytar
Ingmar Weber
Antonio Torralba
16
320
0
14 Oct 2018
Learning to Globally Edit Images with Textual Description
Hai Wang
Jason D. Williams
Sin-Han Kang
DiffM
27
18
0
13 Oct 2018
Mode Normalization
Lucas Deecke
Iain Murray
Hakan Bilen
OOD
29
33
0
12 Oct 2018
Image Captioning as Neural Machine Translation Task in SOCKEYE
Loris Bazzani
Tobias Domhan
Felix Hieber
VLM
19
2
0
09 Oct 2018
Light-Weight RefineNet for Real-Time Semantic Segmentation
Vladimir Nekrasov
Chunhua Shen
Ian Reid
SSeg
VLM
24
147
0
08 Oct 2018
h-detach: Modifying the LSTM Gradient Towards Better Optimization
Devansh Arpit
Bhargav Kanuparthi
Giancarlo Kerg
Nan Rosemary Ke
Ioannis Mitliagkas
Yoshua Bengio
33
32
0
06 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
45
761
0
06 Oct 2018
Image-to-Video Person Re-Identification by Reusing Cross-modal Embeddings
Zhongwei Xie
Lin Li
Xian Zhong
Luo Zhong
19
2
0
04 Oct 2018