Exploring Nearest Neighbor Approaches for Image Captioning

17 May 2015

Papers citing "Exploring Nearest Neighbor Approaches for Image Captioning"

39 / 39 papers shown

Title
Explaining the Success of Nearest Neighbor Methods in Prediction George H. Chen Devavrat Shah OOD 67 145 0 21 Feb 2025
Borrowing Human Senses: Comment-Aware Self-Training for Social Media Multimodal Classification Chunpu Xu Jing Li VLM 26 5 0 27 Mar 2023
Diverse Image Captioning with Grounded Style Franz Klein Shweta Mahajan S. Roth 22 7 0 03 May 2022
FlowEval: A Consensus-Based Dialogue Evaluation Framework Using Segment Act Flows Jianqiao Zhao Yanyang Li Wanyu Du Yangfeng Ji Dong Yu M. Lyu Liwei Wang 25 4 0 14 Feb 2022
Cross-Modal Retrieval Augmentation for Multi-Modal Classification Shir Gur Natalia Neverova C. Stauffer Ser-Nam Lim Douwe Kiela A. Reiter 19 26 0 16 Apr 2021
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes Douwe Kiela Hamed Firooz Aravind Mohan Vedanuj Goswami Amanpreet Singh Pratik Ringshia Davide Testuggine 37 580 0 10 May 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC) C. Sur 25 16 0 15 Feb 2020
In Defense of Grid Features for Visual Question Answering Huaizu Jiang Ishan Misra Marcus Rohrbach Erik Learned-Miller Xinlei Chen OOD ObjD 23 318 0 10 Jan 2020
Going Beneath the Surface: Evaluating Image Captioning for Grammaticality, Truthfulness and Diversity Huiyuan Xie Tom Sherborne A. Kuhnle Ann A. Copestake DiffM 25 9 0 19 Dec 2019
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning J. Aneja Harsh Agrawal Dhruv Batra A. Schwing BDL VLM 23 66 0 22 Aug 2019
3G structure for image caption generation Aihong Yuan Xuelong Li Xiaoqiang Lu 15 34 0 21 Apr 2019
Clinically Accurate Chest X-Ray Report Generation Guanxiong Liu T. Hsu Matthew B. A. McDermott Willie Boag W. Weng Peter Szolovits Marzyeh Ghassemi MedIm 25 271 0 04 Apr 2019
Good News, Everyone! Context driven entity-aware captioning for news images Ali Furkan Biten Lluís Gómez Marçal Rusiñol Dimosthenis Karatzas 19 139 0 02 Apr 2019
What the Constant Velocity Model Can Teach Us About Pedestrian Motion Prediction Christoph Schöller Vincent Aravantinos F. Lay Alois C. Knoll 24 220 0 19 Mar 2019
From Recognition to Cognition: Visual Commonsense Reasoning Rowan Zellers Yonatan Bisk Ali Farhadi Yejin Choi LRM BDL OCL ReLM 44 866 0 27 Nov 2018
A Neural Compositional Paradigm for Image Captioning Bo Dai Sanja Fidler Dahua Lin CoGe 29 41 0 23 Oct 2018
Fast, Diverse and Accurate Image Captioning Guided By Part-of-Speech Aditya Deshpande J. Aneja Liwei Wang A. Schwing David A. Forsyth 25 146 0 31 May 2018
Visual Referring Expression Recognition: What Do Systems Actually Learn? Volkan Cirik Louis-Philippe Morency Taylor Berg-Kirkpatrick 31 63 0 30 May 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool Feng Liu Tao Xiang Timothy M. Hospedales Wankou Yang Changyin Sun 22 29 0 16 Mar 2018
Neural Aesthetic Image Reviewer Wenshan Wang Su Yang Weishan Zhang Jiulong Zhang 22 38 0 28 Feb 2018
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space Liwei Wang A. Schwing Svetlana Lazebnik CoGe 37 175 0 19 Nov 2017
Reasoning about Fine-grained Attribute Phrases using Reference Games Jong-Chyi Su Chenyun Wu Huaizu Jiang Subhransu Maji 34 16 0 29 Aug 2017
Negative Results in Computer Vision: A Perspective Ali Borji 17 36 0 11 May 2017
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation Albert Gatt E. Krahmer LM&MA ELM 27 810 0 29 Mar 2017
Context-aware Captions from Context-agnostic Supervision Ramakrishna Vedantam Samy Bengio Kevin Patrick Murphy Devi Parikh Gal Chechik 20 152 0 11 Jan 2017
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering Yash Goyal Tejas Khot D. Summers-Stay Dhruv Batra Devi Parikh CoGe 110 3,126 0 02 Dec 2016
Guided Open Vocabulary Image Captioning with Constrained Beam Search Peter Anderson Basura Fernando Mark Johnson Stephen Gould 21 232 0 02 Dec 2016
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge Oriol Vinyals Alexander Toshev Samy Bengio D. Erhan 19 850 0 21 Sep 2016
Measuring Machine Intelligence Through Visual Question Answering C. L. Zitnick Aishwarya Agrawal Stanislaw Antol Margaret Mitchell Dhruv Batra Devi Parikh 19 37 0 31 Aug 2016
SPICE: Semantic Propositional Image Caption Evaluation Peter Anderson Basura Fernando Mark Johnson Stephen Gould EGVM 36 1,884 0 29 Jul 2016
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions? Abhishek Das Harsh Agrawal C. L. Zitnick Devi Parikh Dhruv Batra 32 465 0 11 Jun 2016
Subjects and Their Objects: Localizing Interactees for a Person-Centric View of Importance Chao-Yeh Chen Kristen Grauman 32 9 0 17 Apr 2016
Image Captioning with Semantic Attention Quanzeng You Hailin Jin Zhaowen Wang Chen Fang Jiebo Luo VLM 52 1,652 0 12 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge Qi Wu Chunhua Shen Anton Van Den Hengel Peng Wang A. Dick 27 360 0 09 Mar 2016
Yin and Yang: Balancing and Answering Binary Visual Questions Peng Zhang Yash Goyal D. Summers-Stay Dhruv Batra Devi Parikh CoGe 22 349 0 16 Nov 2015
Sherlock: Scalable Fact Learning in Images Mohamed Elhoseiny Scott D. Cohen W. Chang Brian L. Price Ahmed Elgammal 19 26 0 16 Nov 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images Junhua Mao Xu Wei Yi Yang Jiang Wang Zhiheng Huang Alan Yuille 25 154 0 25 Apr 2015
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN) Junhua Mao Wenyuan Xu Yi Yang Jiang Wang Zhiheng Huang Alan Yuille VLM 65 1,235 0 20 Dec 2014
Long-term Recurrent Convolutional Networks for Visual Recognition and Description Jeff Donahue Lisa Anne Hendricks Marcus Rohrbach Subhashini Venugopalan S. Guadarrama Kate Saenko Trevor Darrell VLM 85 6,032 0 17 Nov 2014