Show and Tell: A Neural Image Caption Generator

17 November 2014

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown

Title | Authors | Tags | Counts | Date
VirTex: Learning Visual Representations from Textual Annotations | Karan Desai, Justin Johnson | SSL, VLM | 30 433 0 | 11 Jun 2020
RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams | Vasiliki Kougia, John Pavlopoulos, P. Papapetrou, Max Gordon | - | 32 0 0 | 11 Jun 2020
Toward Building Safer Smart Homes for the People with Disabilities | Shahinur Alam, M. Mahmud, M. Yeasin | - | 16 4 0 | 10 Jun 2020
Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation | Mingjie Li, Fuyu Wang, Xiaojun Chang, Xiaodan Liang | MedIm | 34 101 0 | 06 Jun 2020
Pick-Object-Attack: Type-Specific Adversarial Attack for Object Detection | Omid Mohamad Nezami, Akshay Chaturvedi, Mark Dras, Utpal Garain | AAML, ObjD | 26 19 0 | 05 Jun 2020
An embedded system for the automated generation of labeled plant images to enable machine learning applications in agriculture | Michael A. Beck, Chen-Yi Liu, C. Bidinosti, C. Henry, Cara M. Godee, Manisha Ajmani | VLM | 19 21 0 | 01 Jun 2020
JPD-SE: High-Level Semantics for Joint Perception-Distortion Enhancement in Image Compression | Shiyu Duan, Huaijin Chen, Liang Feng | - | 32 5 0 | 24 May 2020
PruneNet: Channel Pruning via Global Importance | A. Khetan, Zohar Karnin | - | 26 11 0 | 22 May 2020
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding | Fenglin Liu, Xuancheng Ren, Guangxiang Zhao, Chenyu You, Xuewei Ma, Xian Wu, Xu Sun | - | 45 2 0 | 16 May 2020
Flight Time Prediction for Fuel Loading Decisions with a Deep Learning Approach | Xinting Zhu, Lishuai Li | - | 11 32 0 | 12 May 2020
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters | Wei Zhang, Quan Chen, Kaihua Fu, Ningxin Zheng, Zhiyi Huang, Jingwen Leng, Chao Li, Wenli Zheng, Minyi Guo | - | 27 3 0 | 05 May 2020
Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context | Xinyi Zheng, Doug Burdick, Lucian Popa, Xu Zhong, N. Wang | LMTD | 35 142 0 | 01 May 2020
Computing the Testing Error without a Testing Set | C. Corneanu, Meysam Madadi, Sergio Escalera, Aleix M. Martinez | AAML | 10 69 0 | 01 May 2020
Towards Embodied Scene Description | Sinan Tan, Huaping Liu, Di Guo, Xinyu Zhang, F. Sun | LM&Ro | 10 9 0 | 30 Apr 2020
memeBot: Towards Automatic Image Meme Generation | Aadhavan Sadasivam, K. Gunasekar, H. Davulcu, Yezhou Yang | - | 14 9 0 | 30 Apr 2020
Explainable Deep Learning: A Field Guide for the Uninitiated | Gabrielle Ras, Ning Xie, Marcel van Gerven, Derek Doran | AAML, XAI | 55 371 0 | 30 Apr 2020
Pragmatic Issue-Sensitive Image Captioning | Allen Nie, Reuben Cohn-Gordon, Christopher Potts | - | 20 24 0 | 29 Apr 2020
Image Captioning through Image Transformer | Sen He, Wentong Liao, Hamed R. Tavakoli, M. Yang, Bodo Rosenhahn, N. Pugeault | ViT | 41 91 0 | 29 Apr 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective | M. S. Saeed, Shah Nawaz, Pietro Morerio, Arif Mahmood, I. Gallo, Muhammad Haroon Yousaf, Alessio Del Bue | CVBM | 28 26 0 | 28 Apr 2020
Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports | Baoyu Jing, Zeya Wang, Eric Xing | - | 22 139 0 | 26 Apr 2020
Detective: An Attentive Recurrent Model for Sparse Object Detection | A. Kechaou, Manuel Martínez, Monica Haurilet, Rainer Stiefelhagen | ObjD | 12 3 0 | 25 Apr 2020
VisualCOMET: Reasoning about the Dynamic Context of a Still Image | J. S. Park, Chandra Bhagavatula, Roozbeh Mottaghi, Ali Farhadi, Yejin Choi | ReLM, LRM | 27 6 0 | 22 Apr 2020
Textual Visual Semantic Dataset for Text Spotting | Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró | - | 24 3 0 | 21 Apr 2020
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs | Shiyang Yan, Yang Hua, N. Robertson | - | 19 7 0 | 21 Apr 2020
Transform and Tell: Entity-Aware News Image Captioning | Alasdair Tran, A. Mathews, Lexing Xie | VLM | 28 95 0 | 17 Apr 2020
Context-Aware Group Captioning via Self-Attention and Contrastive Features | Zhuowan Li, Quan Hung Tran, Long Mai, Zhe Lin, Alan Yuille | VLM | 14 44 0 | 07 Apr 2020
Character-level Japanese Text Generation with Attention Mechanism for Chest Radiography Diagnosis | Kenya Sakka, Kotaro Nakayama, Nisei Kimura, Taiki Inoue, Yusuke Iwasawa, Ryohei Yamaguchi, Yosimasa Kawazoe, K. Ohe, Y. Matsuo | - | 14 2 0 | 06 Apr 2020
B-SCST: Bayesian Self-Critical Sequence Training for Image Captioning | Shashank Bujimalla, Mahesh Subedar, Omesh Tickoo | BDL, UQCV | 25 10 0 | 06 Apr 2020
Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation | Xingyuan Chen, Ping Cai, Peng Jin, Hongjun Wang, Xingyu Dai, Jiajun Chen | - | 26 2 0 | 05 Apr 2020
Open Domain Dialogue Generation with Latent Images | Ze Yang, Wei Wu, Huang Hu, Can Xu, Wei Wang, Zhoujun Li | - | 30 29 0 | 04 Apr 2020
PaStaNet: Toward Human Activity Knowledge Engine | Yong-Lu Li, Liang Xu, Xinpeng Liu, Xijie Huang, Yue Xu, Shiyi Wang, Haoshu Fang, Ze Ma, Mingyang Chen, Cewu Lu | - | 28 151 0 | 02 Apr 2020
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers | Zhicheng Huang, Zhaoyang Zeng, Bei Liu, Dongmei Fu, Jianlong Fu | ViT | 50 436 0 | 02 Apr 2020
Consistent Multiple Sequence Decoding | Bicheng Xu, Leonid Sigal | - | 34 0 0 | 02 Apr 2020
More Grounded Image Captioning by Distilling Image-Text Matching Model | Yuanen Zhou, Meng Wang, Daqing Liu, Zhenzhen Hu, Hanwang Zhang | - | 25 125 0 | 01 Apr 2020
X-Linear Attention Networks for Image Captioning | Yingwei Pan, Ting Yao, Yehao Li, Tao Mei | - | 39 510 0 | 31 Mar 2020
Detection and Description of Change in Visual Streams | Davis Gilton, Ruotian Luo, Rebecca Willett, Gregory Shakhnarovich | AI4TS | 18 4 0 | 27 Mar 2020
Grounded Situation Recognition | Sarah M Pratt, Mark Yatskar, Luca Weihs, Ali Farhadi, Aniruddha Kembhavi | - | 30 112 0 | 26 Mar 2020
Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning models | Pranav Agarwal, Alejandro Betancourt, V. Panagiotou, Natalia Díaz Rodríguez | EGVM | 14 10 0 | 26 Mar 2020
Learning Compact Reward for Image Captioning | Nannan Li, Zhenzhong Chen | - | 23 3 0 | 24 Mar 2020
Normalized and Geometry-Aware Self-Attention Network for Image Captioning | Longteng Guo, Jing Liu, Xinxin Zhu, Peng Yao, Shichen Lu, Hanqing Lu | ViT | 135 189 0 | 19 Mar 2020
Fast Distance-based Anomaly Detection in Images Using an Inception-like Autoencoder | Natasa Sarafijanovic-Djukic, Jesse Davis | - | 30 24 0 | 12 Mar 2020
"An Image is Worth a Thousand Features": Scalable Product Representations for In-Session Type-Ahead Personalization | Bingqing Yu, Jacopo Tagliabue, C. Greco, Federico Bianchi | - | 66 10 0 | 11 Mar 2020
Visual Grounding in Video for Unsupervised Word Translation | Gunnar Sigurdsson, Jean-Baptiste Alayrac, Aida Nematzadeh, Lucas Smaira, Mateusz Malinowski, João Carreira, Phil Blunsom, Andrew Zisserman | VGen | 29 49 0 | 11 Mar 2020
Deconfounded Image Captioning: A Causal Retrospect | Xu Yang, Hanwang Zhang, Jianfei Cai | CML | 18 119 0 | 09 Mar 2020
Better Captioning with Sequence-Level Exploration | Jia Chen, Qin Jin | - | 37 12 0 | 08 Mar 2020
Investigating the Decoders of Maximum Likelihood Sequence Models: A Look-ahead Approach | Yu-Siang Wang, Yen-Ling Kuo, Boris Katz | - | 31 3 0 | 08 Mar 2020
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning | Elad Amrani, Rami Ben-Ari, Daniel Rotman, A. Bronstein | - | 27 121 0 | 06 Mar 2020
Show, Edit and Tell: A Framework for Editing Image Captions | Fawaz Sammani, Luke Melas-Kyriazi | KELM, DiffM | 48 59 0 | 06 Mar 2020
Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs | Shizhe Chen, Qin Jin, Peng Wang, Qi Wu | DiffM | 39 215 0 | 01 Mar 2020
Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning | Jieshan Chen, Chunyang Chen, Zhenchang Xing, Xiwei Xu, Liming Zhu, Guoqiang Li, Jinshui Wang | - | 19 139 0 | 01 Mar 2020