ResearchTrend.AI
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions

11 September 2018
M. Wagner
H. Basevi
Rakshith Shetty
Wenbin Li
Mateusz Malinowski
M. Fritz
A. Leonardis

Papers citing "Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions"

31 / 31 papers shown
What Makes a Maze Look Like a Maze?
Joy Hsu
Jiayuan Mao
J. Tenenbaum
Noah D. Goodman
Jiajun Wu
OCL
89
6
0
12 Sep 2024
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle
Brian L. Price
Scott D. Cohen
Christopher Kanan
AIMat
64
379
0
24 Jan 2018
Building Generalizable Agents with a Realistic and Rich 3D Environment
Yi Wu
Yuxin Wu
Georgia Gkioxari
Yuandong Tian
3DV
120
338
0
07 Jan 2018
Taking Visual Motion Prediction To New Heightfields
Sébastien Ehrhardt
Áron Monszpart
Niloy Mitra
Andrea Vedaldi
42
23
0
22 Dec 2017
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal
Dhruv Batra
Devi Parikh
Aniruddha Kembhavi
OOD
136
585
0
01 Dec 2017
Visual Interaction Networks
Nicholas Watters
Andrea Tacchetti
T. Weber
Razvan Pascanu
Peter W. Battaglia
Daniel Zoran
PINN
3DH
81
277
0
05 Jun 2017
A simple neural network module for relational reasoning
Adam Santoro
David Raposo
David Barrett
Mateusz Malinowski
Razvan Pascanu
Peter W. Battaglia
Timothy Lillicrap
GNN
NAI
141
1,610
0
05 Jun 2017
AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles
S. Shah
Debadeepta Dey
Chris Lovett
Ashish Kapoor
88
1,976
0
15 May 2017
Learning to Reason: End-to-End Module Networks for Visual Question Answering
Ronghang Hu
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Kate Saenko
KELM
GNN
ReLM
LRM
110
576
0
18 Apr 2017
DeepMind Lab
Charlie Beattie
Joel Z Leibo
Denis Teplyashin
Tom Ward
Marcus Wainwright
...
Stephen Gaffney
Helen King
Demis Hassabis
Shane Legg
Stig Petersen
50
241
0
12 Dec 2016
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE
OCL
PINN
GNN
494
1,405
0
01 Dec 2016
Long-Term Image Boundary Prediction
Apratim Bhattacharyya
Mateusz Malinowski
Bernt Schiele
Mario Fritz
54
12
0
27 Nov 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
278
1,466
0
06 Jun 2016
Unsupervised Learning for Physical Interaction through Video Prediction
Chelsea Finn
Ian Goodfellow
Sergey Levine
67
1,042
0
23 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
60
101
0
09 May 2016
ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning
Michal Kempka
Marek Wydmuch
Grzegorz Runc
Jakub Toczek
Wojciech Jaśkowski
58
695
0
06 May 2016
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts
Mohamed Omran
Sebastian Ramos
Timo Rehfeld
Markus Enzweiler
Rodrigo Benenson
Uwe Franke
Stefan Roth
Bernt Schiele
843
11,540
0
06 Apr 2016
To Fall Or Not To Fall: A Visual Approach to Physical Stability Prediction
Wenbin Li
Seyedmajid Azimi
A. Leonardis
Mario Fritz
44
70
0
31 Mar 2016
"What happens if..." Learning to Predict the Effect of Forces in Images
Roozbeh Mottaghi
Mohammad Rastegari
Abhinav Gupta
Ali Farhadi
OOD
60
123
0
17 Mar 2016
Learning Physical Intuition of Block Towers by Example
Adam Lerer
Sam Gross
Rob Fergus
PINN
71
298
0
03 Mar 2016
MovieQA: Understanding Stories in Movies through Question-Answering
Makarand Tapaswi
Yukun Zhu
Rainer Stiefelhagen
Antonio Torralba
R. Urtasun
Sanja Fidler
101
736
0
09 Dec 2015
Deep multi-scale video prediction beyond mean square error
Michaël Mathieu
Camille Couprie
Yann LeCun
GAN
122
1,880
0
17 Nov 2015
Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images
Roozbeh Mottaghi
Hessam Bagherinezhad
Mohammad Rastegari
Ali Farhadi
50
148
0
12 Nov 2015
Visual7W: Grounded Question Answering in Images
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
89
878
0
11 Nov 2015
Stacked Attention Networks for Image Question Answering
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
BDL
101
1,875
0
07 Nov 2015
Exploring Models and Data for Image Question Answering
Mengye Ren
Ryan Kiros
R. Zemel
80
713
0
08 May 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
162
5,421
0
03 May 2015
Benchmarking in Manipulation Research: The YCB Object and Model Set and Benchmarking Protocols
B. Çalli
Aaron Walsman
Arjun Singh
S. Srinivasa
Pieter Abbeel
A. Dollar
67
294
0
10 Feb 2015
Video (language) modeling: a baseline for generative models of natural videos
Marc'Aurelio Ranzato
Arthur Szlam
Joan Bruna
Michaël Mathieu
R. Collobert
S. Chopra
VGen
91
471
0
20 Dec 2014
Towards a Visual Turing Challenge
Mateusz Malinowski
Mario Fritz
60
74
0
29 Oct 2014
A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input
Mateusz Malinowski
Mario Fritz
180
695
0
01 Oct 2014