Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.4555
Cited By
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,023 papers shown
Title
Utilizing Every Image Object for Semi-supervised Phrase Grounding
Haidong Zhu
Arka Sadhu
Zhao-Heng Zheng
Ram Nevatia
ObjD
25
7
0
05 Nov 2020
Multi-layer Feature Aggregation for Deep Scene Parsing Models
Litao Yu
Yongsheng Gao
Jun Zhou
Jian Zhang
Qiang Wu
SSeg
52
1
0
04 Nov 2020
Attention Beam: An Image Captioning Approach
Anubhav Shrimal
Tanmoy Chakraborty
3DV
13
2
0
03 Nov 2020
Parameter Efficient Deep Neural Networks with Bilinear Projections
Litao Yu
Yongsheng Gao
Jun Zhou
Jian Zhang
21
1
0
03 Nov 2020
Dual Attention on Pyramid Feature Maps for Image Captioning
Litao Yu
Jian Zhang
Qiang Wu
24
47
0
02 Nov 2020
Diverse Image Captioning with Context-Object Split Latent Spaces
Shweta Mahajan
Stefan Roth
19
41
0
02 Nov 2020
Boost Image Captioning with Knowledge Reasoning
Feicheng Huang
Zhixin Li
Haiyang Wei
Canlong Zhang
Huifang Ma
17
25
0
02 Nov 2020
Multimodal Continuous Emotion Recognition using Deep Multi-Task Learning with Correlation Loss
Berkay Köprü
E. Erzin
CVBM
19
5
0
02 Nov 2020
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation
Jia-Hong Huang
Chao-Han Huck Yang
Fangyu Liu
Meng Tian
Yi-Chieh Liu
...
Kang Wang
Hiromasa Morikawa
Hernghua Chang
Jesper N. Tegnér
M. Worring
MedIm
14
47
0
01 Nov 2020
Personalized Multimodal Feedback Generation in Education
Haochen Liu
Zitao Liu
Zhongqin Wu
Jiliang Tang
29
9
0
31 Oct 2020
Generating Radiology Reports via Memory-driven Transformer
Zhihong Chen
Yan Song
Tsung-Hui Chang
Xiang Wan
MedIm
30
461
0
30 Oct 2020
Fusion Models for Improved Visual Captioning
M. Kalimuthu
Aditya Mogadala
Marius Mosbach
Dietrich Klakow
VLM
26
0
0
28 Oct 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions
Radhika Dua
Sai Srinivas Kancheti
V. Balasubramanian
LRM
43
22
0
24 Oct 2020
Show and Speak: Directly Synthesize Spoken Description of Images
Xinsheng Wang
Siyuan Feng
Jihua Zhu
M. Hasegawa-Johnson
O. Scharenborg
26
4
0
23 Oct 2020
Learning Dual Semantic Relations with Graph Attention for Image-Text Matching
Keyu Wen
Xiaodong Gu
Qingrong Cheng
27
95
0
22 Oct 2020
A Survey on Deep Learning and Explainability for Automatic Report Generation from Medical Images
Pablo Messina
Pablo Pino
Denis Parra
Alvaro Soto
Cecilia Besa
S. Uribe
Marcelo andía
C. Tejos
Claudia Prieto
Daniel Capurro
MedIm
36
62
0
20 Oct 2020
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation
Yasuhide Miura
Yuhao Zhang
Emily Bao Tsai
C. Langlotz
Dan Jurafsky
MedIm
162
157
0
20 Oct 2020
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
30
6
0
19 Oct 2020
Collaborative Training of GANs in Continuous and Discrete Spaces for Text Generation
Yanghoon Kim
Seungpil Won
Seunghyun Yoon
Kyomin Jung
12
5
0
16 Oct 2020
TextMage: The Automated Bangla Caption Generator Based On Deep Learning
Abrar Hasin Kamal
Md Asifuzzaman Jishan
N. Mansoor
VLM
8
17
0
15 Oct 2020
MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding
Qinxin Wang
Hao Tan
Sheng Shen
Michael W. Mahoney
Z. Yao
ObjD
52
11
0
12 Oct 2020
Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification
Yulin Wang
Kangchen Lv
Rui Huang
Shiji Song
Le Yang
Gao Huang
3DH
16
148
0
11 Oct 2020
Boosted EfficientNet: Detection of Lymph Node Metastases in Breast Cancer Using Convolutional Neural Network
Jun Wang
Qianying Liu
Haotian Xie
Zhaogang Yang
Hefeng Zhou
MedIm
19
77
0
10 Oct 2020
Block-term Tensor Neural Networks
Jinmian Ye
Guangxi Li
Di Chen
Haiqin Yang
Shandian Zhe
Zenglin Xu
29
30
0
10 Oct 2020
HydroDeep -- A Knowledge Guided Deep Neural Network for Geo-Spatiotemporal Data Analysis
Aishwarya Sarkar
Jien Zhang
Chaoqun Lu
Ali Jannesari
AI4CE
8
4
0
09 Oct 2020
Dense Relational Image Captioning via Multi-task Triple-Stream Networks
Dong-Jin Kim
Tae-Hyun Oh
Jinsoo Choi
In So Kweon
34
27
0
08 Oct 2020
Visual News: Benchmark and Challenges in News Image Captioning
Fuxiao Liu
Yinghan Wang
Tianlu Wang
Vicente Ordonez
VLM
24
111
0
08 Oct 2020
Toward Stance-based Personas for Opinionated Dialogues
Thomas Scialom
Serra Sinem Tekiroğlu
Jacopo Staiano
Marco Guerini
20
9
0
07 Oct 2020
BAAAN: Backdoor Attacks Against Autoencoder and GAN-Based Machine Learning Models
A. Salem
Yannick Sautter
Michael Backes
Mathias Humbert
Yang Zhang
AAML
SILM
AI4CE
25
39
0
06 Oct 2020
Fine-Grained Grounding for Multimodal Speech Recognition
Tejas Srinivasan
Ramon Sanabria
Florian Metze
Desmond Elliott
25
11
0
05 Oct 2020
Viable Threat on News Reading: Generating Biased News Using Natural Language Models
Saurabh Gupta
H. Nguyen
Junichi Yamagishi
Isao Echizen
23
3
0
05 Oct 2020
A Novel Actor Dual-Critic Model for Remote Sensing Image Captioning
Ruchika Chavhan
Biplab Banerjee
Xiaoxiang Zhu
S. Chaudhuri
16
8
0
05 Oct 2020
Attention Guided Semantic Relationship Parsing for Visual Question Answering
M. Farazi
Salman Khan
Nick Barnes
19
2
0
05 Oct 2020
UNISON: Unpaired Cross-lingual Image Captioning
Jiahui Gao
Yi Zhou
Philip L. H. Yu
Chenyu You
Jiuxiang Gu
18
16
0
03 Oct 2020
Multi-Modal Open-Domain Dialogue
Kurt Shuster
Eric Michael Smith
Da Ju
Jason Weston
AI4CE
41
42
0
02 Oct 2020
Improving Auto-Augment via Augmentation-Wise Weight Sharing
Keyu Tian
Chen Lin
Ming Sun
Luping Zhou
Junjie Yan
Wanli Ouyang
26
48
0
30 Sep 2020
Teacher-Critical Training Strategies for Image Captioning
Yiqing Huang
Jiansheng Chen
VLM
29
8
0
30 Sep 2020
Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning
Xiangxi Shi
Xu Yang
Jiuxiang Gu
Chenyu You
Jianfei Cai
21
52
0
30 Sep 2020
Spatial Attention as an Interface for Image Captioning Models
P. Sadler
28
0
0
29 Sep 2020
Neural Twins Talk
Zanyar Zohourianshahzadi
Jugal Kalita
17
1
0
26 Sep 2020
Generative Imagination Elevates Machine Translation
Quanyu Long
Mingxuan Wang
Lei Li
35
35
0
21 Sep 2020
Commands 4 Autonomous Vehicles (C4AV) Workshop Summary
Thierry Deruyttere
Simon Vandenhende
Dusan Grujicic
Yu Liu
Luc Van Gool
Matthew Blaschko
Tinne Tuytelaars
Marie-Francine Moens
30
6
0
18 Sep 2020
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
44
79
0
17 Sep 2020
Global-aware Beam Search for Neural Abstractive Summarization
Ye Ma
Zixun Lan
Lu Zong
Kaizhu Huang
28
12
0
15 Sep 2020
Learning semantic Image attributes using Image recognition and knowledge graph embeddings
Ashutosh Tiwari
Sandeep Varma
14
3
0
12 Sep 2020
Understanding the Role of Individual Units in a Deep Neural Network
David Bau
Jun-Yan Zhu
Hendrik Strobelt
Àgata Lapedriza
Bolei Zhou
Antonio Torralba
GAN
25
437
0
10 Sep 2020
Online trajectory recovery from offline handwritten Japanese kanji characters
Hung Tuan Nguyen
Tsubasa Nakamura
C. Nguyen
M. Nakagawa
25
14
0
09 Sep 2020
Towards Unique and Informative Captioning of Images
Zeyu Wang
Berthy Feng
Karthik Narasimhan
Olga Russakovsky
25
37
0
08 Sep 2020
KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition
Soohwan Kim
Seyoung Bae
Cheolhwang Won
VLM
22
5
0
07 Sep 2020
An Efficient Technique for Image Captioning using Deep Neural Network
Borneel Bikash Phukan
Amiya Ranjan Panda
VLM
19
8
0
05 Sep 2020
Previous
1
2
3
...
15
16
17
...
39
40
41
Next