Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1511.02799
Cited By
v1
v2
v3
v4 (latest)
Neural Module Networks
9 November 2015
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Module Networks"
34 / 634 papers shown
Title
ShapeWorld - A new test methodology for multimodal language understanding
A. Kuhnle
Ann A. Copestake
65
69
0
14 Apr 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Y. Jang
Yale Song
Youngjae Yu
Youngjin Kim
Gunhee Kim
93
562
0
14 Apr 2017
Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries
Y. Zhang
Luyao Yuan
Yijie Guo
Zhiyuan He
I-An Huang
Honglak Lee
ObjD
92
57
0
12 Apr 2017
Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering
V. Kazemi
Ali Elqursh
OOD
89
185
0
11 Apr 2017
Pay Attention to Those Sets! Learning Quantification from Images
Ionut-Teodor Sorodoc
Sandro Pezzelle
Aurélie Herbelot
Mariella Dimiccoli
Raffaella Bernardi
37
0
0
10 Apr 2017
Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks
Tanmay Gupta
Kevin J. Shih
Saurabh Singh
Derek Hoiem
110
26
0
02 Apr 2017
A Deep Compositional Framework for Human-like Language Acquisition in Virtual Environment
Haonan Yu
Haichao Zhang
Wenyuan Xu
LM&Ro
82
25
0
28 Mar 2017
Task-driven Visual Saliency and Attention-based Visual Question Answering
Yuetan Lin
Zhangyang Pang
Donghui Wang
Yueting Zhuang
61
26
0
22 Feb 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
354
2,394
0
20 Dec 2016
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
35
19
0
20 Dec 2016
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions
Peng Wang
Qi Wu
Chunhua Shen
Anton Van Den Hengel
OOD
90
86
0
16 Dec 2016
Text-guided Attention Model for Image Captioning
Jonghwan Mun
Minsu Cho
Bohyung Han
VLM
59
93
0
12 Dec 2016
Commonly Uncommon: Semantic Sparsity in Situation Recognition
Mark Yatskar
Vicente Ordonez
Luke Zettlemoyer
Ali Farhadi
VLM
73
42
0
03 Dec 2016
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
393
3,275
0
02 Dec 2016
Modeling Relationships in Referential Expressions with Compositional Modular Networks
Ronghang Hu
Marcus Rohrbach
Jacob Andreas
Trevor Darrell
Kate Saenko
84
407
0
30 Nov 2016
Attend in groups: a weakly-supervised deep learning framework for learning from web data
Bohan Zhuang
Lingqiao Liu
Yao Li
Chunhua Shen
Ian Reid
NoLa
69
89
0
30 Nov 2016
Dual Attention Networks for Multimodal Reasoning and Matching
Hyeonseob Nam
Jung-Woo Ha
Jeonghee Kim
131
670
0
02 Nov 2016
Visual Question Answering: Datasets, Algorithms, and Future Challenges
Kushal Kafle
Christopher Kanan
OOD
101
244
0
05 Oct 2016
Tutorial on Answering Questions about Images with Deep Learning
Mateusz Malinowski
Mario Fritz
VLM
62
3
0
04 Oct 2016
Graph-Structured Representations for Visual Question Answering
Damien Teney
Lingqiao Liu
Anton Van Den Hengel
GNN
NAI
123
422
0
19 Sep 2016
Learning to generalize to new compositions in image understanding
Yuval Atzmon
Jonathan Berant
Vahid Kezami
Amir Globerson
Gal Chechik
82
67
0
27 Aug 2016
Solving Visual Madlibs with Multiple Cues
Tatiana Tommasi
Arun Mallya
Bryan A. Plummer
Svetlana Lazebnik
Alexander C. Berg
Tamara L. Berg
85
18
0
11 Aug 2016
Visual Question Answering: A Survey of Methods and Datasets
Qi Wu
Damien Teney
Peng Wang
Chunhua Shen
A. Dick
Anton Van Den Hengel
111
418
0
20 Jul 2016
Revisiting Visual Question Answering Baselines
Allan Jabri
Armand Joulin
Laurens van der Maaten
OOD
67
83
0
27 Jun 2016
FVQA: Fact-based Visual Question Answering
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
CoGe
115
464
0
17 Jun 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
342
1,471
0
06 Jun 2016
Programming with a Differentiable Forth Interpreter
Matko Bosnjak
Tim Rocktaschel
Jason Naradowsky
Sebastian Riedel
92
150
0
21 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
106
101
0
09 May 2016
Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering
Arun Mallya
Svetlana Lazebnik
101
119
0
16 Apr 2016
Attributes as Semantic Units between Natural Language and Visual Recognition
Marcus Rohrbach
VLM
40
3
0
12 Apr 2016
Reasoning About Pragmatics with Neural Listeners and Speakers
Jacob Andreas
Dan Klein
ReLM
LRM
111
175
0
02 Apr 2016
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLM
EgoV
86
439
0
20 Mar 2016
Learning to Compose Neural Networks for Question Answering
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
NAI
KELM
BDL
CoGe
130
568
0
07 Jan 2016
Simple Baseline for Visual Question Answering
Bolei Zhou
Yuandong Tian
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
FAtt
108
324
0
07 Dec 2015
Previous
1
2
3
...
11
12
13