Don't Just Listen, Use Your Imagination: Leveraging Visual Common Sense
for Non-Visual Tasks

Don't Just Listen, Use Your Imagination: Leveraging Visual Common Sense for Non-Visual Tasks

21 February 2015

Devi Parikh

Papers citing "Don't Just Listen, Use Your Imagination: Leveraging Visual Common Sense for Non-Visual Tasks"

17 / 17 papers shown

Title
Kiki or Bouba? Sound Symbolism in Vision-and-Language Models Morris Alper Hadar Averbuch-Elor 48 10 0 25 Oct 2023
Visual Question Answering on Image Sets Ankan Bansal Yuting Zhang Rama Chellappa CoGe 16 40 0 27 Aug 2020
A Review on Intelligent Object Perception Methods Combining Knowledge-based Reasoning and Machine Learning Filippos Gouidis Alexandros Vassiliades T. Patkos Antonis Argyros Nick Bassiliades Dimitris Plexousakis OCL 29 12 0 26 Dec 2019
Machine Common Sense Concept Paper David Gunning VLM LRM 21 39 0 17 Oct 2018
Seeing Voices and Hearing Faces: Cross-modal biometric matching Arsha Nagrani Samuel Albanie Andrew Zisserman CVBM 24 220 0 01 Apr 2018
Generative Models of Visually Grounded Imagination Ramakrishna Vedantam Ian S. Fischer Jonathan Huang Kevin Patrick Murphy 25 138 0 30 May 2017
Imagination improves Multimodal Translation Desmond Elliott Ákos Kádár 29 136 0 11 May 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering Y. Jang Yale Song Youngjae Yu Youngjin Kim Gunhee Kim 34 547 0 14 Apr 2017
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey Hirokatsu Kataoka Yudai Miyashita Tomoaki K. Yamabe Soma Shirakabe Shin-ichi Sato ... Kaori Abe Takaaki Imanari Naomichi Kobayashi Shinichiro Morita Akio Nakamura 24 2 0 26 May 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge Qi Wu Chunhua Shen Anton Van Den Hengel Peng Wang A. Dick 27 360 0 09 Mar 2016
We Are Humor Beings: Understanding and Predicting Visual Humor Arjun Chandrasekaran Ashwin K. Vijayakumar Stanislaw Antol Joey Tianyi Zhou Dhruv Batra C. L. Zitnick Devi Parikh 28 56 0 14 Dec 2015
Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes Satwik Kottur Ramakrishna Vedantam José M. F. Moura Devi Parikh VLM 38 85 0 22 Nov 2015
Yin and Yang: Balancing and Answering Binary Visual Questions Peng Zhang Yash Goyal D. Summers-Stay Dhruv Batra Devi Parikh CoGe 37 349 0 16 Nov 2015
Uncovering Temporal Context for Video Question and Answering Linchao Zhu Zhongwen Xu Yi Yang Alexander G. Hauptmann BDL 27 44 0 15 Nov 2015
Building a Large-scale Multimodal Knowledge Base System for Answering Visual Queries Yuke Zhu Ce Zhang Christopher Ré Li Fei-Fei 27 35 0 20 Jul 2015
Visual Madlibs: Fill in the blank Image Generation and Question Answering Licheng Yu Eunbyung Park Alexander C. Berg Tamara L. Berg VLM MLLM 32 97 0 31 May 2015
VQA: Visual Question Answering Aishwarya Agrawal Jiasen Lu Stanislaw Antol Margaret Mitchell C. L. Zitnick Dhruv Batra Devi Parikh CoGe 96 5,383 0 03 May 2015