1502.06108
Cited By
Don't Just Listen, Use Your Imagination: Leveraging Visual Common Sense for Non-Visual Tasks
21 February 2015
Xiao Lin
Devi Parikh
Papers citing
"Don't Just Listen, Use Your Imagination: Leveraging Visual Common Sense for Non-Visual Tasks"
17 / 17 papers shown
Kiki or Bouba? Sound Symbolism in Vision-and-Language Models
Morris Alper
Hadar Averbuch-Elor
48
10
0
25 Oct 2023
Visual Question Answering on Image Sets
Ankan Bansal
Yuting Zhang
Rama Chellappa
CoGe
16
40
0
27 Aug 2020
A Review on Intelligent Object Perception Methods Combining Knowledge-based Reasoning and Machine Learning
Filippos Gouidis
Alexandros Vassiliades
T. Patkos
Antonis Argyros
Nick Bassiliades
Dimitris Plexousakis
OCL
29
12
0
26 Dec 2019
Machine Common Sense Concept Paper
David Gunning
VLM
LRM
21
39
0
17 Oct 2018
Seeing Voices and Hearing Faces: Cross-modal biometric matching
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
CVBM
24
220
0
01 Apr 2018
Generative Models of Visually Grounded Imagination
Ramakrishna Vedantam
Ian S. Fischer
Jonathan Huang
Kevin Patrick Murphy
25
138
0
30 May 2017
Imagination improves Multimodal Translation
Desmond Elliott
Ákos Kádár
29
136
0
11 May 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Y. Jang
Yale Song
Youngjae Yu
Youngjin Kim
Gunhee Kim
34
547
0
14 Apr 2017
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey
Hirokatsu Kataoka
Yudai Miyashita
Tomoaki K. Yamabe
Soma Shirakabe
Shin-ichi Sato
...
Kaori Abe
Takaaki Imanari
Naomichi Kobayashi
Shinichiro Morita
Akio Nakamura
24
2
0
26 May 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
Qi Wu
Chunhua Shen
Anton Van Den Hengel
Peng Wang
A. Dick
27
360
0
09 Mar 2016
We Are Humor Beings: Understanding and Predicting Visual Humor
Arjun Chandrasekaran
Ashwin K. Vijayakumar
Stanislaw Antol
Mohit Bansal
Dhruv Batra
C. L. Zitnick
Devi Parikh
28
56
0
14 Dec 2015
Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes
Satwik Kottur
Ramakrishna Vedantam
José M. F. Moura
Devi Parikh
VLM
38
85
0
22 Nov 2015
Yin and Yang: Balancing and Answering Binary Visual Questions
Peng Zhang
Yash Goyal
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
37
349
0
16 Nov 2015
Uncovering Temporal Context for Video Question and Answering
Linchao Zhu
Zhongwen Xu
Yi Yang
Alexander G. Hauptmann
BDL
27
44
0
15 Nov 2015
Building a Large-scale Multimodal Knowledge Base System for Answering Visual Queries
Yuke Zhu
Ce Zhang
Christopher Ré
Li Fei-Fei
27
35
0
20 Jul 2015
Visual Madlibs: Fill in the blank Image Generation and Question Answering
Licheng Yu
Eunbyung Park
Alexander C. Berg
Tamara L. Berg
VLM
MLLM
32
97
0
31 May 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
96
5,383
0
03 May 2015