Show and Tell: A Neural Image Caption Generator

17 November 2014

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown

Title
Hierarchical Adaptable and Transferable Networks (HATN) for Driving Behavior Prediction Letian Wang Yeping Hu Liting Sun Wei Zhan Masayoshi Tomizuka Changliu Liu 21 16 0 01 Nov 2021
Latent Cognizance: What Machine Really Learns Pisit Nakjai J. Ponsawat Tatpong Katanyukul BDL 18 3 0 29 Oct 2021
Discovering Non-monotonic Autoregressive Orderings with Variational Inference Xuanlin Li Brandon Trabucco Dongmin Park Michael Luo S. Shen Trevor Darrell Yang Gao 27 12 0 27 Oct 2021
Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer Aggregation Jingyu Zhao Yanwen Fang Guodong Li 27 23 0 22 Oct 2021
Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning Yang Yang Haoran Wei Hengshu Zhu Dianhai Yu Hui Xiong Jian Yang SSL 14 33 0 22 Oct 2021
Adaptive Bridge between Training and Inference for Dialogue Haoran Xu Hainan Zhang Yanyan Zou Hongshen Chen Zhuoye Ding Yanyan Lan CVBM 6 8 0 22 Oct 2021
ASFormer: Transformer for Action Segmentation Fangqiu Yi Hongyu Wen Tingting Jiang ViT 79 174 0 16 Oct 2021
Self-Annotated Training for Controllable Image Captioning Zhangzi Zhu Tianlei Wang Hong Qu 29 2 0 16 Oct 2021
Guiding Visual Question Generation Nihir Vedd Zixu Wang Marek Rei Yishu Miao Lucia Specia 89 23 0 15 Oct 2021
Identification of Attack-Specific Signatures in Adversarial Examples Hossein Souri Pirazh Khorramshahi Chun Pong Lau Micah Goldblum Rama Chellappa AAML MLAU 48 4 0 13 Oct 2021
Topic Scene Graph Generation by Attention Distillation from Caption Wenbin Wang R. Wang X. Chen DiffM 30 14 0 12 Oct 2021
Semi-Autoregressive Image Captioning Xu Yan Zhengcong Fei Zekang Li Shuhui Wang Qingming Huang Qi Tian 35 23 0 11 Oct 2021
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm Yangguang Li Feng Liang Lichen Zhao Yufeng Cui Wanli Ouyang Jing Shao F. Yu Junjie Yan VLM CLIP 50 448 0 11 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization S. Gu Manfred Diaz Daniel Freeman Hiroki Furuta Seyed Kamyar Seyed Ghasemipour Anton Raichuk Byron David Erik Frey Erwin Coumans Olivier Bachem 44 14 0 10 Oct 2021
Accessible Visualization via Natural Language Descriptions: A Four-Level Model of Semantic Content Alan Lundgard Arvind Satyanarayan 25 128 0 08 Oct 2021
Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning Ali Furkan Biten L. G. I. Bigorda Dimosthenis Karatzas 102 57 0 04 Oct 2021
Learning Structural Representations for Recipe Generation and Food Retrieval Hao Wang Guosheng Lin Guosheng Lin Chunyan Miao 29 28 0 04 Oct 2021
Transfer Learning Approaches for Knowledge Discovery in Grid-based Geo-Spatiotemporal Data Aishwarya Sarkar Jien Zhang Chaoqun Lu Ali Jannesari AI4CE 33 2 0 02 Oct 2021
Geometry Attention Transformer with Position-aware LSTMs for Image Captioning Chi-Yin Wang Yulin Shen Luping Ji ViT 52 49 0 01 Oct 2021
A Review of Text Style Transfer using Deep Learning Martina Toshevska Sonja Gievska CLIP 48 43 0 30 Sep 2021
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks Amirali Boroumand Saugata Ghose Berkin Akin Ravi Narayanaswami Geraldo F. Oliveira Xiaoyu Ma Eric Shiu O. Mutlu 25 82 0 29 Sep 2021
Geometry-Entangled Visual Semantic Transformer for Image Captioning Ling Cheng Wei Wei Feida Zhu Yong Liu Chunyan Miao ViT 21 3 0 29 Sep 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation G. O. D. Santos Esther Luna Colombini Sandra Avila 47 30 0 28 Sep 2021
Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation An Yan Zexue He Xing Lu Jingfeng Du E. Chang Amilcare Gentili Julian McAuley Chun-Nan Hsu MedIm 88 64 0 25 Sep 2021
Scene Graph Generation for Better Image Captioning? Maximilian Mozes Martin Schmitt Vladimir Golkov Hinrich Schütze Daniel Cremers GNN 34 3 0 23 Sep 2021
Pix2seq: A Language Modeling Framework for Object Detection Ting-Li Chen Saurabh Saxena Lala Li David J. Fleet Geoffrey E. Hinton MLLM ViT VLM 244 344 0 22 Sep 2021
Caption Enriched Samples for Improving Hateful Memes Detection Efrat Blaier Itzik Malkiel Lior Wolf VLM 61 21 0 22 Sep 2021
Survey: Transformer based Video-Language Pre-training Ludan Ruan Qin Jin VLM ViT 72 44 0 21 Sep 2021
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning Shikha Dubey Farrukh Olimov M. Rafique Joonmo Kim M. Jeon ViT 36 37 0 16 Sep 2021
SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant Shahinur Alam 24 2 0 14 Sep 2021
Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation Zechen Bai Yuta Nakashima Noa Garcia 68 43 0 13 Sep 2021
DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval A. Zhu Zijie Wang Yifeng Li Xili Wan Jing Jin Tian Wang Fangqiang Hu G. Hua 95 162 0 12 Sep 2021
We went to look for meaning and all we got were these lousy representations: aspects of meaning representation for computational semantics Simon Dobnik R. Cooper Adam Ek Bill Noble Staffan Larsson N. Ilinykh Vladislav Maraev Vidya Somashekarappa 30 0 0 10 Sep 2021
Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention Katsuyuki Nakamura Hiroki Ohashi Mitsuhiro Okada EgoV 36 13 0 07 Sep 2021
LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation Mohammad Abuzar Shaikh Zhanghexuan Ji Dana Moukheiber Yan Shen S. Srihari Mingchen Gao VLM 22 1 0 04 Sep 2021
Working Memory Connections for LSTM Federico Landi Lorenzo Baraldi Marcella Cornia Rita Cucchiara KELM 29 158 0 31 Aug 2021
QACE: Asking Questions to Evaluate an Image Caption Hwanhee Lee Thomas Scialom Seunghyun Yoon Franck Dernoncourt Kyomin Jung CoGe 27 18 0 28 Aug 2021
$Automated Generation of Accurate \& Fluent Medical X-ray Reports$ Automated Generation of Accurate \& Fluent Medical X-ray Reports Hoang T.N. Nguyen Dong Nie Taivanbat Badamdorj Yujie Liu Yingying Zhu J. Truong Li Cheng MedIm LM&MA 27 40 0 27 Aug 2021
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments Muhammad Zubair Irshad Niluthpol Chowdhury Mithun Zachary Seymour Han-Pang Chiu S. Samarasekera Rakesh Kumar LM&Ro 26 49 0 26 Aug 2021
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning Guodun Li Yuchen Zhai Zehao Lin Yin Zhang 59 21 0 26 Aug 2021
Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training Yuqing Song Shizhe Chen Qin Jin Wei Luo Jun Xie Fei Huang 31 18 0 25 Aug 2021
Auto-Parsing Network for Image Captioning and Visual Question Answering Xu Yang Chongyang Gao Hanwang Zhang Jianfei Cai 24 35 0 24 Aug 2021
Group-based Distinctive Image Captioning with Memory Attention Jiuniu Wang Wenjia Xu Qingzhong Wang Antoni B. Chan 21 18 0 20 Aug 2021
Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning Guangyi Liu Yinghong Liao Fuyu Wang Bin Zhang Lu Zhang ... Xiang Wan Shaolin Li Zhen Li Shuixing Zhang Shuguang Cui 28 56 0 11 Aug 2021
Communicating Visualizations without Visuals: Investigation of Visualization Alternative Text for People with Visual Impairments C. Jung Shubham Mehta Atharva Kulkarni Yuhang Zhao Yea-Seul Kim 144 55 0 08 Aug 2021
Screen2Words: Automatic Mobile UI Summarization with Multimodal Learning Bryan Wang Gang Li Xin Zhou Zhourong Chen Tovi Grossman Yang Li 170 154 0 07 Aug 2021
Tiny Neural Models for Seq2Seq A. Kandoor 34 0 0 07 Aug 2021
Interpretable Visual Understanding with Cognitive Attention Network Xuejiao Tang Wenbin Zhang Yi Yu Kea Turner Tyler Derr Mengyu Wang Eirini Ntoutsi 52 12 0 06 Aug 2021
Neural Twins Talk & Alternative Calculations Zanyar Zohourianshahzadi Jugal Kalita 25 0 0 05 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning Xinzhi Dong Chengjiang Long Wenju Xu Chunxia Xiao ViT 83 66 0 05 Aug 2021