Show and Tell: A Neural Image Caption Generator

17 November 2014

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown

Title
Utilizing Every Image Object for Semi-supervised Phrase Grounding Haidong Zhu Arka Sadhu Zhao-Heng Zheng Ram Nevatia ObjD 25 7 0 05 Nov 2020
Multi-layer Feature Aggregation for Deep Scene Parsing Models Litao Yu Yongsheng Gao Jun Zhou Jian Zhang Qiang Wu SSeg 52 1 0 04 Nov 2020
Attention Beam: An Image Captioning Approach Anubhav Shrimal Tanmoy Chakraborty 3DV 13 2 0 03 Nov 2020
Parameter Efficient Deep Neural Networks with Bilinear Projections Litao Yu Yongsheng Gao Jun Zhou Jian Zhang 21 1 0 03 Nov 2020
Dual Attention on Pyramid Feature Maps for Image Captioning Litao Yu Jian Zhang Qiang Wu 24 47 0 02 Nov 2020
Diverse Image Captioning with Context-Object Split Latent Spaces Shweta Mahajan Stefan Roth 19 41 0 02 Nov 2020
Boost Image Captioning with Knowledge Reasoning Feicheng Huang Zhixin Li Haiyang Wei Canlong Zhang Huifang Ma 17 25 0 02 Nov 2020
Multimodal Continuous Emotion Recognition using Deep Multi-Task Learning with Correlation Loss Berkay Köprü E. Erzin CVBM 19 5 0 02 Nov 2020
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation Jia-Hong Huang Chao-Han Huck Yang Fangyu Liu Meng Tian Yi-Chieh Liu ... Kang Wang Hiromasa Morikawa Hernghua Chang Jesper N. Tegnér M. Worring MedIm 14 47 0 01 Nov 2020
Personalized Multimodal Feedback Generation in Education Haochen Liu Zitao Liu Zhongqin Wu Jiliang Tang 29 9 0 31 Oct 2020
Generating Radiology Reports via Memory-driven Transformer Zhihong Chen Yan Song Tsung-Hui Chang Xiang Wan MedIm 30 461 0 30 Oct 2020
Fusion Models for Improved Visual Captioning M. Kalimuthu Aditya Mogadala Marius Mosbach Dietrich Klakow VLM 26 0 0 28 Oct 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions Radhika Dua Sai Srinivas Kancheti V. Balasubramanian LRM 43 22 0 24 Oct 2020
Show and Speak: Directly Synthesize Spoken Description of Images Xinsheng Wang Siyuan Feng Jihua Zhu M. Hasegawa-Johnson O. Scharenborg 26 4 0 23 Oct 2020
Learning Dual Semantic Relations with Graph Attention for Image-Text Matching Keyu Wen Xiaodong Gu Qingrong Cheng 27 95 0 22 Oct 2020
A Survey on Deep Learning and Explainability for Automatic Report Generation from Medical Images Pablo Messina Pablo Pino Denis Parra Alvaro Soto Cecilia Besa S. Uribe Marcelo andía C. Tejos Claudia Prieto Daniel Capurro MedIm 36 62 0 20 Oct 2020
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation Yasuhide Miura Yuhao Zhang Emily Bao Tsai C. Langlotz Dan Jurafsky MedIm 162 157 0 20 Oct 2020
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends Shagun Uppal Sarthak Bhagat Devamanyu Hazarika Navonil Majumdar Soujanya Poria Roger Zimmermann Amir Zadeh 30 6 0 19 Oct 2020
Collaborative Training of GANs in Continuous and Discrete Spaces for Text Generation Yanghoon Kim Seungpil Won Seunghyun Yoon Kyomin Jung 12 5 0 16 Oct 2020
TextMage: The Automated Bangla Caption Generator Based On Deep Learning Abrar Hasin Kamal Md Asifuzzaman Jishan N. Mansoor VLM 8 17 0 15 Oct 2020
MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding Qinxin Wang Hao Tan Sheng Shen Michael W. Mahoney Z. Yao ObjD 52 11 0 12 Oct 2020
Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification Yulin Wang Kangchen Lv Rui Huang Shiji Song Le Yang Gao Huang 3DH 16 148 0 11 Oct 2020
Boosted EfficientNet: Detection of Lymph Node Metastases in Breast Cancer Using Convolutional Neural Network Jun Wang Qianying Liu Haotian Xie Zhaogang Yang Hefeng Zhou MedIm 19 77 0 10 Oct 2020
Block-term Tensor Neural Networks Jinmian Ye Guangxi Li Di Chen Haiqin Yang Shandian Zhe Zenglin Xu 29 30 0 10 Oct 2020
HydroDeep -- A Knowledge Guided Deep Neural Network for Geo-Spatiotemporal Data Analysis Aishwarya Sarkar Jien Zhang Chaoqun Lu Ali Jannesari AI4CE 8 4 0 09 Oct 2020
Dense Relational Image Captioning via Multi-task Triple-Stream Networks Dong-Jin Kim Tae-Hyun Oh Jinsoo Choi In So Kweon 34 27 0 08 Oct 2020
Visual News: Benchmark and Challenges in News Image Captioning Fuxiao Liu Yinghan Wang Tianlu Wang Vicente Ordonez VLM 24 111 0 08 Oct 2020
Toward Stance-based Personas for Opinionated Dialogues Thomas Scialom Serra Sinem Tekiroğlu Jacopo Staiano Marco Guerini 20 9 0 07 Oct 2020
BAAAN: Backdoor Attacks Against Autoencoder and GAN-Based Machine Learning Models A. Salem Yannick Sautter Michael Backes Mathias Humbert Yang Zhang AAML SILM AI4CE 25 39 0 06 Oct 2020
Fine-Grained Grounding for Multimodal Speech Recognition Tejas Srinivasan Ramon Sanabria Florian Metze Desmond Elliott 25 11 0 05 Oct 2020
Viable Threat on News Reading: Generating Biased News Using Natural Language Models Saurabh Gupta H. Nguyen Junichi Yamagishi Isao Echizen 23 3 0 05 Oct 2020
A Novel Actor Dual-Critic Model for Remote Sensing Image Captioning Ruchika Chavhan Biplab Banerjee Xiaoxiang Zhu S. Chaudhuri 16 8 0 05 Oct 2020
Attention Guided Semantic Relationship Parsing for Visual Question Answering M. Farazi Salman Khan Nick Barnes 19 2 0 05 Oct 2020
UNISON: Unpaired Cross-lingual Image Captioning Jiahui Gao Yi Zhou Philip L. H. Yu Chenyu You Jiuxiang Gu 18 16 0 03 Oct 2020
Multi-Modal Open-Domain Dialogue Kurt Shuster Eric Michael Smith Da Ju Jason Weston AI4CE 41 42 0 02 Oct 2020
Improving Auto-Augment via Augmentation-Wise Weight Sharing Keyu Tian Chen Lin Ming Sun Luping Zhou Junjie Yan Wanli Ouyang 26 48 0 30 Sep 2020
Teacher-Critical Training Strategies for Image Captioning Yiqing Huang Jiansheng Chen VLM 29 8 0 30 Sep 2020
Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning Xiangxi Shi Xu Yang Jiuxiang Gu Chenyu You Jianfei Cai 21 52 0 30 Sep 2020
Spatial Attention as an Interface for Image Captioning Models P. Sadler 28 0 0 29 Sep 2020
Neural Twins Talk Zanyar Zohourianshahzadi Jugal Kalita 17 1 0 26 Sep 2020
Generative Imagination Elevates Machine Translation Quanyu Long Mingxuan Wang Lei Li 35 35 0 21 Sep 2020
Commands 4 Autonomous Vehicles (C4AV) Workshop Summary Thierry Deruyttere Simon Vandenhende Dusan Grujicic Yu Liu Luc Van Gool Matthew Blaschko Tinne Tuytelaars Marie-Francine Moens 30 6 0 18 Sep 2020
Review: Deep Learning in Electron Microscopy Jeffrey M. Ede 44 79 0 17 Sep 2020
Global-aware Beam Search for Neural Abstractive Summarization Ye Ma Zixun Lan Lu Zong Kaizhu Huang 28 12 0 15 Sep 2020
Learning semantic Image attributes using Image recognition and knowledge graph embeddings Ashutosh Tiwari Sandeep Varma 14 3 0 12 Sep 2020
Understanding the Role of Individual Units in a Deep Neural Network David Bau Jun-Yan Zhu Hendrik Strobelt Àgata Lapedriza Bolei Zhou Antonio Torralba GAN 25 437 0 10 Sep 2020
Online trajectory recovery from offline handwritten Japanese kanji characters Hung Tuan Nguyen Tsubasa Nakamura C. Nguyen M. Nakagawa 25 14 0 09 Sep 2020
Towards Unique and Informative Captioning of Images Zeyu Wang Berthy Feng Karthik Narasimhan Olga Russakovsky 25 37 0 08 Sep 2020
KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition Soohwan Kim Seyoung Bae Cheolhwang Won VLM 22 5 0 07 Sep 2020
An Efficient Technique for Image Captioning using Deep Neural Network Borneel Bikash Phukan Amiya Ranjan Panda VLM 19 8 0 05 Sep 2020