ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1410.0210
  4. Cited By
A Multi-World Approach to Question Answering about Real-World Scenes
  based on Uncertain Input

A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input

1 October 2014
Mateusz Malinowski
Mario Fritz
ArXivPDFHTML

Papers citing "A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input"

50 / 330 papers shown
Title
Learning Answer Embeddings for Visual Question Answering
Learning Answer Embeddings for Visual Question Answering
Hexiang Hu
Wei-Lun Chao
Fei Sha
21
33
0
10 Jun 2018
Focal Visual-Text Attention for Visual Question Answering
Focal Visual-Text Attention for Visual Question Answering
Junwei Liang
Lu Jiang
Liangliang Cao
Li Li
Alexander G. Hauptmann
33
110
0
05 Jun 2018
On the Flip Side: Identifying Counterexamples in Visual Question
  Answering
On the Flip Side: Identifying Counterexamples in Visual Question Answering
Gabriel Grand
Aron Szanto
Yoon Kim
Alexander Rush
21
0
0
03 Jun 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wen Liu
Syed Zulqarnain Gilani
Mubarak Shah
14
91
0
01 Jun 2018
Hyperbolic Attention Networks
Hyperbolic Attention Networks
Çağlar Gülçehre
Misha Denil
Mateusz Malinowski
Ali Razavi
Razvan Pascanu
...
Peter W. Battaglia
V. Bapst
David Raposo
Adam Santoro
Nando de Freitas
31
220
0
24 May 2018
Stories for Images-in-Sequence by using Visual and Narrative Components
Stories for Images-in-Sequence by using Visual and Narrative Components
Marko Smilevski
Ilija Lalkovski
Gjorgji Madjarov
19
19
0
15 May 2018
Solving Bongard Problems with a Visual Language and Pragmatic Reasoning
Solving Bongard Problems with a Visual Language and Pragmatic Reasoning
Stefan Depeweg
Constantin Rothkopf
Frank Jakel
LRM
19
42
0
12 Apr 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory
  Input
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
32
200
0
04 Apr 2018
Seeing Voices and Hearing Faces: Cross-modal biometric matching
Seeing Voices and Hearing Faces: Cross-modal biometric matching
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
CVBM
22
220
0
01 Apr 2018
Differential Attention for Visual Question Answering
Differential Attention for Visual Question Answering
Badri N. Patro
Vinay P. Namboodiri
AIMat
21
74
0
01 Apr 2018
Two can play this Game: Visual Dialog with Discriminative Question
  Generation and Answering
Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering
Unnat Jain
Svetlana Lazebnik
Alex Schwing
MLLM
37
81
0
29 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
41
240
0
29 Mar 2018
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis
  Tool
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
32
29
0
16 Mar 2018
A Dataset and Architecture for Visual Reasoning with a Working Memory
A Dataset and Architecture for Visual Reasoning with a Working Memory
G. R. Yang
Igor Ganichev
Xiao-Jing Wang
Jonathon Shlens
David Sussillo
14
54
0
16 Mar 2018
VizWiz Grand Challenge: Answering Visual Questions from Blind People
VizWiz Grand Challenge: Answering Visual Questions from Blind People
Danna Gurari
Qing Li
Abigale Stangl
Anhong Guo
Chi Lin
Kristen Grauman
Jiebo Luo
Jeffrey P. Bigham
CoGe
43
812
0
22 Feb 2018
Object-based reasoning in VQA
Object-based reasoning in VQA
Mikyas T. Desta
Larry Chen
Tomasz Kornuta
32
33
0
29 Jan 2018
DVQA: Understanding Data Visualizations via Question Answering
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle
Brian L. Price
Scott D. Cohen
Christopher Kanan
AIMat
33
365
0
24 Jan 2018
Structured Triplet Learning with POS-tag Guided Attention for Visual
  Question Answering
Structured Triplet Learning with POS-tag Guided Attention for Visual Question Answering
Zhe Wang
Xiaoyi Liu
Liangjian Chen
Limin Wang
Yu Qiao
Xiaohui Xie
Charless C. Fowlkes
29
14
0
24 Jan 2018
Visual Text Correction
Visual Text Correction
Amir Mazaheri
M. Shah
52
11
0
06 Jan 2018
Interpretable Counting for Visual Question Answering
Interpretable Counting for Visual Question Answering
Alexander R. Trott
Caiming Xiong
R. Socher
38
71
0
23 Dec 2017
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven
  Communication
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication
Jin-Hwa Kim
Nikita Kitaev
Xinlei Chen
Marcus Rohrbach
Byoung-Tak Zhang
Yuandong Tian
Dhruv Batra
Devi Parikh
DiffM
VGen
38
25
0
15 Dec 2017
IQA: Visual Question Answering in Interactive Environments
IQA: Visual Question Answering in Interactive Environments
Daniel Gordon
Aniruddha Kembhavi
Mohammad Rastegari
Joseph Redmon
Dieter Fox
Ali Farhadi
LM&Ro
25
387
0
09 Dec 2017
Learning by Asking Questions
Learning by Asking Questions
Ishan Misra
Ross B. Girshick
Rob Fergus
M. Hebert
Abhinav Gupta
Laurens van der Maaten
34
82
0
04 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual
  Question Answering
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
76
582
0
01 Dec 2017
A Novel Framework for Robustness Analysis of Visual QA Models
A Novel Framework for Robustness Analysis of Visual QA Models
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
Guohao Li
AAML
OOD
30
34
0
16 Nov 2017
Survey of Recent Advances in Visual Question Answering
Survey of Recent Advances in Visual Question Answering
Supriya Pandhre
Shagun Sodhani
10
14
0
24 Sep 2017
FiLM: Visual Reasoning with a General Conditioning Layer
FiLM: Visual Reasoning with a General Conditioning Layer
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
128
2,158
0
22 Sep 2017
Learning Functional Causal Models with Generative Neural Networks
Learning Functional Causal Models with Generative Neural Networks
Hugo Jair Escalante
Sergio Escalera
Xavier Baro
Isabelle M Guyon
Umut Güçlü
Marcel van Gerven
CML
BDL
22
107
0
15 Sep 2017
VQS: Linking Segmentations to Questions and Answers for Supervised
  Attention in VQA and Question-Focused Semantic Segmentation
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan
Yandong Li
Haoxiang Li
Chen Sun
Boqing Gong
27
126
0
15 Aug 2017
Beyond Bilinear: Generalized Multimodal Factorized High-order Pooling
  for Visual Question Answering
Beyond Bilinear: Generalized Multimodal Factorized High-order Pooling for Visual Question Answering
Zhou Yu
Jun-chen Yu
Chenchao Xiang
Jianping Fan
Dacheng Tao
31
459
0
10 Aug 2017
Structured Attentions for Visual Question Answering
Structured Attentions for Visual Question Answering
Chen Zhu
Yanpeng Zhao
Shuaiyi Huang
Kewei Tu
Yi Ma
FAtt
32
106
0
07 Aug 2017
Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for
  Visual Question Answering
Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering
Zhou Yu
Jun-chen Yu
Jianping Fan
Dacheng Tao
41
663
0
04 Aug 2017
Learning Visual Reasoning Without Strong Priors
Learning Visual Reasoning Without Strong Priors
Ethan Perez
H. D. Vries
Florian Strub
Vincent Dumoulin
Aaron Courville
OOD
NAI
34
62
0
10 Jul 2017
A simple neural network module for relational reasoning
A simple neural network module for relational reasoning
Adam Santoro
David Raposo
David Barrett
Mateusz Malinowski
Razvan Pascanu
Peter W. Battaglia
Timothy Lillicrap
GNN
NAI
37
1,605
0
05 Jun 2017
Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence
  Models for Fill-in-the-Blank Image Captioning
Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-in-the-Blank Image Captioning
Q. Sun
Stefan Lee
Dhruv Batra
BDL
33
43
0
24 May 2017
Survey of Visual Question Answering: Datasets and Techniques
Survey of Visual Question Answering: Datasets and Techniques
A. Gupta
21
38
0
10 May 2017
Inferring and Executing Programs for Visual Reasoning
Inferring and Executing Programs for Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Judy Hoffman
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
NAI
35
541
0
10 May 2017
FOIL it! Find One mismatch between Image and Language caption
FOIL it! Find One mismatch between Image and Language caption
Ravi Shekhar
Sandro Pezzelle
Yauhen Klimovich
Aurélie Herbelot
Moin Nabi
E. Sangineto
Raffaella Bernardi
25
137
0
03 May 2017
The Forgettable-Watcher Model for Video Question Answering
The Forgettable-Watcher Model for Video Question Answering
Hongyang Xue
Zhou Zhao
Deng Cai
21
9
0
03 May 2017
The Promise of Premise: Harnessing Question Premises in Visual Question
  Answering
The Promise of Premise: Harnessing Question Premises in Visual Question Answering
Aroma Mahendru
Viraj Prabhu
Akrit Mohapatra
Dhruv Batra
Stefan Lee
NAI
37
38
0
01 May 2017
Speech-Based Visual Question Answering
Speech-Based Visual Question Answering
Ted Zhang
Dengxin Dai
Tinne Tuytelaars
Marie-Francine Moens
Luc Van Gool
40
24
0
01 May 2017
C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0
  Dataset
C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset
Aishwarya Agrawal
Aniruddha Kembhavi
Dhruv Batra
Devi Parikh
CoGe
29
80
0
26 Apr 2017
Being Negative but Constructively: Lessons Learnt from Creating Better
  Visual Question Answering Datasets
Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets
Wei-Lun Chao
Hexiang Hu
Fei Sha
24
37
0
24 Apr 2017
Learning to Reason: End-to-End Module Networks for Visual Question
  Answering
Learning to Reason: End-to-End Module Networks for Visual Question Answering
Ronghang Hu
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Kate Saenko
KELM
GNN
ReLM
LRM
44
574
0
18 Apr 2017
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal
  Attentions
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions
Amir Mazaheri
Dong-Ming Zhang
M. Shah
17
12
0
15 Apr 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Y. Jang
Yale Song
Youngjae Yu
Youngjin Kim
Gunhee Kim
34
547
0
14 Apr 2017
Pay Attention to Those Sets! Learning Quantification from Images
Pay Attention to Those Sets! Learning Quantification from Images
Ionut-Teodor Sorodoc
Sandro Pezzelle
Aurélie Herbelot
Mariella Dimiccoli
Raffaella Bernardi
6
0
0
10 Apr 2017
It Takes Two to Tango: Towards Theory of AI's Mind
It Takes Two to Tango: Towards Theory of AI's Mind
Arjun Chandrasekaran
Deshraj Yadav
Prithvijit Chattopadhyay
Viraj Prabhu
Devi Parikh
41
54
0
03 Apr 2017
An Analysis of Visual Question Answering Algorithms
An Analysis of Visual Question Answering Algorithms
Kushal Kafle
Christopher Kanan
30
231
0
28 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement
  Learning
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
31
424
0
20 Mar 2017
Previous
1234567
Next