Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.4555
Cited By
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,023 papers shown
Title
VirTex: Learning Visual Representations from Textual Annotations
Karan Desai
Justin Johnson
SSL
VLM
30
433
0
11 Jun 2020
RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams
Vasiliki Kougia
John Pavlopoulos
P. Papapetrou
Max Gordon
32
0
0
11 Jun 2020
Toward Building Safer Smart Homes for the People with Disabilities
Shahinur Alam
M. Mahmud
M. Yeasin
16
4
0
10 Jun 2020
Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation
Mingjie Li
Fuyu Wang
Xiaojun Chang
Xiaodan Liang
MedIm
34
101
0
06 Jun 2020
Pick-Object-Attack: Type-Specific Adversarial Attack for Object Detection
Omid Mohamad Nezami
Akshay Chaturvedi
Mark Dras
Utpal Garain
AAML
ObjD
26
19
0
05 Jun 2020
An embedded system for the automated generation of labeled plant images to enable machine learning applications in agriculture
Michael A. Beck
Chen-Yi Liu
C. Bidinosti
C. Henry
Cara M. Godee
Manisha Ajmani
VLM
19
21
0
01 Jun 2020
JPD-SE: High-Level Semantics for Joint Perception-Distortion Enhancement in Image Compression
Shiyu Duan
Huaijin Chen
Liang Feng
32
5
0
24 May 2020
PruneNet: Channel Pruning via Global Importance
A. Khetan
Zohar Karnin
26
11
0
22 May 2020
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding
Fenglin Liu
Xuancheng Ren
Guangxiang Zhao
Chenyu You
Xuewei Ma
Xian Wu
Xu Sun
45
2
0
16 May 2020
Flight Time Prediction for Fuel Loading Decisions with a Deep Learning Approach
Xinting Zhu
Lishuai Li
11
32
0
12 May 2020
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters
Wei Zhang
Quan Chen
Kaihua Fu
Ningxin Zheng
Zhiyi Huang
Jingwen Leng
Chao Li
Wenli Zheng
Minyi Guo
27
3
0
05 May 2020
Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context
Xinyi Zheng
Doug Burdick
Lucian Popa
Xu Zhong
N. Wang
LMTD
35
142
0
01 May 2020
Computing the Testing Error without a Testing Set
C. Corneanu
Meysam Madadi
Sergio Escalera
Aleix M. Martinez
AAML
10
69
0
01 May 2020
Towards Embodied Scene Description
Sinan Tan
Huaping Liu
Di Guo
Xinyu Zhang
F. Sun
LM&Ro
10
9
0
30 Apr 2020
memeBot: Towards Automatic Image Meme Generation
Aadhavan Sadasivam
K. Gunasekar
H. Davulcu
Yezhou Yang
14
9
0
30 Apr 2020
Explainable Deep Learning: A Field Guide for the Uninitiated
Gabrielle Ras
Ning Xie
Marcel van Gerven
Derek Doran
AAML
XAI
55
371
0
30 Apr 2020
Pragmatic Issue-Sensitive Image Captioning
Allen Nie
Reuben Cohn-Gordon
Christopher Potts
20
24
0
29 Apr 2020
Image Captioning through Image Transformer
Sen He
Wentong Liao
Hamed R. Tavakoli
M. Yang
Bodo Rosenhahn
N. Pugeault
ViT
41
91
0
29 Apr 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
28
26
0
28 Apr 2020
Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports
Baoyu Jing
Zeya Wang
Eric Xing
22
139
0
26 Apr 2020
Detective: An Attentive Recurrent Model for Sparse Object Detection
A. Kechaou
Manuel Martínez
Monica Haurilet
Rainer Stiefelhagen
ObjD
12
3
0
25 Apr 2020
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
J. S. Park
Chandra Bhagavatula
Roozbeh Mottaghi
Ali Farhadi
Yejin Choi
ReLM
LRM
27
6
0
22 Apr 2020
Textual Visual Semantic Dataset for Text Spotting
Ahmed Sabir
Francesc Moreno-Noguer
Lluís Padró
24
3
0
21 Apr 2020
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs
Shiyang Yan
Yang Hua
N. Robertson
19
7
0
21 Apr 2020
Transform and Tell: Entity-Aware News Image Captioning
Alasdair Tran
A. Mathews
Lexing Xie
VLM
28
95
0
17 Apr 2020
Context-Aware Group Captioning via Self-Attention and Contrastive Features
Zhuowan Li
Quan Hung Tran
Long Mai
Zhe Lin
Alan Yuille
VLM
14
44
0
07 Apr 2020
Character-level Japanese Text Generation with Attention Mechanism for Chest Radiography Diagnosis
Kenya Sakka
Kotaro Nakayama
Nisei Kimura
Taiki Inoue
Yusuke Iwasawa
Ryohei Yamaguchi
Yosimasa Kawazoe
K. Ohe
Y. Matsuo
14
2
0
06 Apr 2020
B-SCST: Bayesian Self-Critical Sequence Training for Image Captioning
Shashank Bujimalla
Mahesh Subedar
Omesh Tickoo
BDL
UQCV
25
10
0
06 Apr 2020
Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation
Xingyuan Chen
Ping Cai
Peng Jin
Hongjun Wang
Xingyu Dai
Jiajun Chen
26
2
0
05 Apr 2020
Open Domain Dialogue Generation with Latent Images
Ze Yang
Wei Wu
Huang Hu
Can Xu
Wei Wang
Zhoujun Li
30
29
0
04 Apr 2020
PaStaNet: Toward Human Activity Knowledge Engine
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Shiyi Wang
Haoshu Fang
Ze Ma
Mingyang Chen
Cewu Lu
28
151
0
02 Apr 2020
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers
Zhicheng Huang
Zhaoyang Zeng
Bei Liu
Dongmei Fu
Jianlong Fu
ViT
50
436
0
02 Apr 2020
Consistent Multiple Sequence Decoding
Bicheng Xu
Leonid Sigal
34
0
0
02 Apr 2020
More Grounded Image Captioning by Distilling Image-Text Matching Model
Yuanen Zhou
Meng Wang
Daqing Liu
Zhenzhen Hu
Hanwang Zhang
25
125
0
01 Apr 2020
X-Linear Attention Networks for Image Captioning
Yingwei Pan
Ting Yao
Yehao Li
Tao Mei
39
510
0
31 Mar 2020
Detection and Description of Change in Visual Streams
Davis Gilton
Ruotian Luo
Rebecca Willett
Gregory Shakhnarovich
AI4TS
18
4
0
27 Mar 2020
Grounded Situation Recognition
Sarah M Pratt
Mark Yatskar
Luca Weihs
Ali Farhadi
Aniruddha Kembhavi
30
112
0
26 Mar 2020
Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning models
Pranav Agarwal
Alejandro Betancourt
V. Panagiotou
Natalia Díaz Rodríguez
EGVM
14
10
0
26 Mar 2020
Learning Compact Reward for Image Captioning
Nannan Li
Zhenzhong Chen
23
3
0
24 Mar 2020
Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Longteng Guo
Jing Liu
Xinxin Zhu
Peng Yao
Shichen Lu
Hanqing Lu
ViT
135
189
0
19 Mar 2020
Fast Distance-based Anomaly Detection in Images Using an Inception-like Autoencoder
Natasa Sarafijanovic-Djukic
Jesse Davis
30
24
0
12 Mar 2020
"An Image is Worth a Thousand Features": Scalable Product Representations for In-Session Type-Ahead Personalization
Bingqing Yu
Jacopo Tagliabue
C. Greco
Federico Bianchi
66
10
0
11 Mar 2020
Visual Grounding in Video for Unsupervised Word Translation
Gunnar Sigurdsson
Jean-Baptiste Alayrac
Aida Nematzadeh
Lucas Smaira
Mateusz Malinowski
João Carreira
Phil Blunsom
Andrew Zisserman
VGen
29
49
0
11 Mar 2020
Deconfounded Image Captioning: A Causal Retrospect
Xu Yang
Hanwang Zhang
Jianfei Cai
CML
18
119
0
09 Mar 2020
Better Captioning with Sequence-Level Exploration
Jia Chen
Qin Jin
37
12
0
08 Mar 2020
Investigating the Decoders of Maximum Likelihood Sequence Models: A Look-ahead Approach
Yu-Siang Wang
Yen-Ling Kuo
Boris Katz
31
3
0
08 Mar 2020
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning
Elad Amrani
Rami Ben-Ari
Daniel Rotman
A. Bronstein
27
121
0
06 Mar 2020
Show, Edit and Tell: A Framework for Editing Image Captions
Fawaz Sammani
Luke Melas-Kyriazi
KELM
DiffM
48
59
0
06 Mar 2020
Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs
Shizhe Chen
Qin Jin
Peng Wang
Qi Wu
DiffM
39
215
0
01 Mar 2020
Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning
Jieshan Chen
Chunyang Chen
Zhenchang Xing
Xiwei Xu
Liming Zhu
Guoqiang Li
Jinshui Wang
19
139
0
01 Mar 2020
Previous
1
2
3
...
17
18
19
...
39
40
41
Next