Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.03707
Cited By
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions
11 September 2018
M. Wagner
H. Basevi
Rakshith Shetty
Wenbin Li
Mateusz Malinowski
M. Fritz
A. Leonardis
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions"
31 / 31 papers shown
Title
What Makes a Maze Look Like a Maze?
Joy Hsu
Jiayuan Mao
J. Tenenbaum
Noah D. Goodman
Jiajun Wu
OCL
89
6
0
12 Sep 2024
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle
Brian L. Price
Scott D. Cohen
Christopher Kanan
AIMat
64
379
0
24 Jan 2018
Building Generalizable Agents with a Realistic and Rich 3D Environment
Yi Wu
Yuxin Wu
Georgia Gkioxari
Yuandong Tian
3DV
120
338
0
07 Jan 2018
Taking Visual Motion Prediction To New Heightfields
Sébastien Ehrhardt
Áron Monszpart
Niloy Mitra
Andrea Vedaldi
42
23
0
22 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
136
585
0
01 Dec 2017
Visual Interaction Networks
Nicholas Watters
Andrea Tacchetti
T. Weber
Razvan Pascanu
Peter W. Battaglia
Daniel Zoran
PINN
3DH
81
277
0
05 Jun 2017
A simple neural network module for relational reasoning
Adam Santoro
David Raposo
David Barrett
Mateusz Malinowski
Razvan Pascanu
Peter W. Battaglia
Timothy Lillicrap
GNN
NAI
141
1,610
0
05 Jun 2017
AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles
S. Shah
Debadeepta Dey
Chris Lovett
Ashish Kapoor
88
1,976
0
15 May 2017
Learning to Reason: End-to-End Module Networks for Visual Question Answering
Ronghang Hu
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Kate Saenko
KELM
GNN
ReLM
LRM
110
576
0
18 Apr 2017
DeepMind Lab
Charlie Beattie
Joel Z Leibo
Denis Teplyashin
Tom Ward
Marcus Wainwright
...
Stephen Gaffney
Helen King
Demis Hassabis
Shane Legg
Stig Petersen
50
241
0
12 Dec 2016
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE
OCL
PINN
GNN
494
1,405
0
01 Dec 2016
Long-Term Image Boundary Prediction
Apratim Bhattacharyya
Mateusz Malinowski
Bernt Schiele
Mario Fritz
54
12
0
27 Nov 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
278
1,466
0
06 Jun 2016
Unsupervised Learning for Physical Interaction through Video Prediction
Chelsea Finn
Ian Goodfellow
Sergey Levine
67
1,042
0
23 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
60
101
0
09 May 2016
ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning
Michal Kempka
Marek Wydmuch
Grzegorz Runc
Jakub Toczek
Wojciech Ja'skowski
58
695
0
06 May 2016
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts
Mohamed Omran
Sebastian Ramos
Timo Rehfeld
Markus Enzweiler
Rodrigo Benenson
Uwe Franke
Stefan Roth
Bernt Schiele
843
11,540
0
06 Apr 2016
To Fall Or Not To Fall: A Visual Approach to Physical Stability Prediction
Wenbin Li
Seyedmajid Azimi
A. Leonardis
Mario Fritz
44
70
0
31 Mar 2016
"What happens if..." Learning to Predict the Effect of Forces in Images
Roozbeh Mottaghi
Mohammad Rastegari
Abhinav Gupta
Ali Farhadi
OOD
60
123
0
17 Mar 2016
Learning Physical Intuition of Block Towers by Example
Adam Lerer
Sam Gross
Rob Fergus
PINN
71
298
0
03 Mar 2016
MovieQA: Understanding Stories in Movies through Question-Answering
Makarand Tapaswi
Yukun Zhu
Rainer Stiefelhagen
Antonio Torralba
R. Urtasun
Sanja Fidler
101
736
0
09 Dec 2015
Deep multi-scale video prediction beyond mean square error
Michaël Mathieu
Camille Couprie
Yann LeCun
GAN
122
1,880
0
17 Nov 2015
Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images
Roozbeh Mottaghi
Hessam Bagherinezhad
Mohammad Rastegari
Ali Farhadi
50
148
0
12 Nov 2015
Visual7W: Grounded Question Answering in Images
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
89
878
0
11 Nov 2015
Stacked Attention Networks for Image Question Answering
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
BDL
101
1,875
0
07 Nov 2015
Exploring Models and Data for Image Question Answering
Mengye Ren
Ryan Kiros
R. Zemel
80
713
0
08 May 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
162
5,421
0
03 May 2015
Benchmarking in Manipulation Research: The YCB Object and Model Set and Benchmarking Protocols
B. Çalli
Aaron Walsman
Arjun Singh
S. Srinivasa
Pieter Abbeel
A. Dollar
67
294
0
10 Feb 2015
Video (language) modeling: a baseline for generative models of natural videos
MarcÁurelio Ranzato
Arthur Szlam
Joan Bruna
Michaël Mathieu
R. Collobert
S. Chopra
VGen
91
471
0
20 Dec 2014
Towards a Visual Turing Challenge
Mateusz Malinowski
Mario Fritz
60
74
0
29 Oct 2014
A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input
Mateusz Malinowski
Mario Fritz
180
695
0
01 Oct 2014
1