Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1612.06890
Cited By
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
20 December 2016
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning"
50 / 1,475 papers shown
Title
SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Jessica Yung
Rob Romijnders
Alexander Kolesnikov
Lucas Beyer
Josip Djolonga
N. Houlsby
Sylvain Gelly
Mario Lucic
Xiaohua Zhai
32
8
0
09 Apr 2021
CoCoNets: Continuous Contrastive 3D Scene Representations
Shamit Lal
Mihir Prabhudesai
Ishita Mediratta
Adam W. Harley
Katerina Fragkiadaki
SSL
3DH
3DPC
41
25
0
08 Apr 2021
How Transferable are Reasoning Patterns in VQA?
Corentin Kervadec
Theo Jaunet
G. Antipov
M. Baccouche
Romain Vuillemot
Christian Wolf
LRM
23
28
0
08 Apr 2021
Track, Check, Repeat: An EM Approach to Unsupervised Tracking
Adam W. Harley
Yiming Zuo
Jing Wen
Ayush Mangal
Shubhankar Potdar
Ritwick Chaudhry
Katerina Fragkiadaki
28
9
0
07 Apr 2021
Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
Corentin Dancette
Rémi Cadène
Damien Teney
Matthieu Cord
CML
33
76
0
07 Apr 2021
Where and What? Examining Interpretable Disentangled Representations
Xinqi Zhu
Chang Xu
Dacheng Tao
FAtt
DRL
61
38
0
07 Apr 2021
Decomposing 3D Scenes into Objects via Unsupervised Volume Segmentation
Karl Stelzner
Kristian Kersting
Adam R. Kosiorek
38
107
0
02 Apr 2021
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers
Tsu-Jui Fu
Xinze Wang
Scott T. Grafton
Miguel P. Eckstein
Wenjie Wang
32
9
0
02 Apr 2021
Unconstrained Scene Generation with Locally Conditioned Radiance Fields
Terrance Devries
Miguel Angel Bautista
Nitish Srivastava
Graham W. Taylor
J. Susskind
36
153
0
01 Apr 2021
Composable Augmentation Encoding for Video Representation Learning
Chen Sun
Arsha Nagrani
Yonglong Tian
Cordelia Schmid
SSL
AI4TS
37
17
0
01 Apr 2021
NeRF-VAE: A Geometry Aware 3D Scene Generative Model
Adam R. Kosiorek
Heiko Strathmann
Daniel Zoran
Pol Moreno
R. Schneider
Sovna Mokrá
Danilo Jimenez Rezende
DRL
37
139
0
01 Apr 2021
Multi-Class Multi-Instance Count Conditioned Adversarial Image Generation
Amrutha Saseendran
Kathrin Skubch
Margret Keuper
VLM
GAN
31
2
0
31 Mar 2021
Dual Contrastive Loss and Attention for GANs
Ning Yu
Guilin Liu
Aysegül Dündar
Andrew Tao
Bryan Catanzaro
Larry S. Davis
Mario Fritz
GAN
40
60
0
31 Mar 2021
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
Zhenfang Chen
Jiayuan Mao
Jiajun Wu
Kwan-Yee K. Wong
J. Tenenbaum
Chuang Gan
VGen
36
92
0
30 Mar 2021
AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning
Madeleine Grunde-McLaughlin
Ranjay Krishna
Maneesh Agrawala
CoGe
39
112
0
30 Mar 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
19
22
0
29 Mar 2021
ACRE: Abstract Causal REasoning Beyond Covariation
Chi Zhang
Baoxiong Jia
Mark Edmonds
Song-Chun Zhu
Yixin Zhu
CML
25
48
0
26 Mar 2021
Describing and Localizing Multiple Changes with Transformers
Yue Qiu
Shintaro Yamamoto
Kodai Nakashima
Ryota Suzuki
K. Iwata
Hirokatsu Kataoka
Y. Satoh
30
55
0
25 Mar 2021
Learning Part Segmentation through Unsupervised Domain Adaptation from Synthetic Vehicles
Qing Liu
Adam Kortylewski
Zhishuai Zhang
Zizhang Li
Mengqi Guo
Qihao Liu
Xiaoding Yuan
Jiteng Mu
Weichao Qiu
Alan Yuille
29
19
0
25 Mar 2021
Contrasting Contrastive Self-Supervised Representation Learning Pipelines
Klemen Kotar
Gabriel Ilharco
Ludwig Schmidt
Kiana Ehsani
Roozbeh Mottaghi
SSL
43
46
0
25 Mar 2021
How to Design Sample and Computationally Efficient VQA Models
Karan Samel
Zelin Zhao
Binghong Chen
Kuan-Chieh Wang
Haozheng Luo
Le Song
24
4
0
22 Mar 2021
Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges
Cynthia Rudin
Chaofan Chen
Zhi Chen
Haiyang Huang
Lesia Semenova
Chudi Zhong
FaML
AI4CE
LRM
61
655
0
20 Mar 2021
Hopper: Multi-hop Transformer for Spatiotemporal Reasoning
Honglu Zhou
Asim Kadav
Farley Lai
Alexandru Niculescu-Mizil
Martin Renqiang Min
Mubbasir Kapadia
H. Graf
LRM
51
18
0
19 Mar 2021
Set-to-Sequence Methods in Machine Learning: a Review
Mateusz Jurewicz
Leon Derczynski
BDL
27
9
0
17 Mar 2021
A Survey of Embodied AI: From Simulators to Research Tasks
Jiafei Duan
Samson Yu
Tangyao Li
Huaiyu Zhu
Cheston Tan
LM&Ro
33
274
0
08 Mar 2021
Rissanen Data Analysis: Examining Dataset Characteristics via Description Length
Ethan Perez
Douwe Kiela
Kyunghyun Cho
32
24
0
05 Mar 2021
Generating Images with Sparse Representations
C. Nash
Jacob Menick
Sander Dieleman
Peter W. Battaglia
33
201
0
05 Mar 2021
PHASE: PHysically-grounded Abstract Social Events for Machine Social Perception
Aviv Netanyahu
Tianmin Shu
Boris Katz
Andrei Barbu
J. Tenenbaum
28
37
0
02 Mar 2021
A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics
Qing Li
Siyuan Huang
Yining Hong
Yixin Zhu
Ying Nian Wu
Song-Chun Zhu
AIMat
24
6
0
02 Mar 2021
Generative Adversarial Transformers
Drew A. Hudson
C. L. Zitnick
ViT
25
179
0
01 Mar 2021
KANDINSKYPatterns -- An experimental exploration environment for Pattern Analysis and Machine Intelligence
Andreas Holzinger
Anna Saranti
Heimo Mueller
48
10
0
28 Feb 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
233
27,929
0
26 Feb 2021
Blocks World Revisited: The Effect of Self-Occlusion on Classification by Convolutional Neural Networks
M. Solbach
John K. Tsotsos
20
3
0
25 Feb 2021
AGENT: A Benchmark for Core Psychological Reasoning
Tianmin Shu
Abhishek Bhandwaldar
Chuang Gan
Kevin A. Smith
Shari Liu
Dan Gutfreund
E. Spelke
J. Tenenbaum
T. Ullman
34
66
0
24 Feb 2021
Weakly-supervised multi-class object localization using only object counts as labels
Kyle Mills
Isaac Tamblyn
14
1
0
23 Feb 2021
HALMA: Humanlike Abstraction Learning Meets Affordance in Rapid Problem Solving
Sirui Xie
Xiaojian Ma
Peiyu Yu
Yixin Zhu
Ying Nian Wu
Song-Chun Zhu
42
20
0
22 Feb 2021
Knowledge Hypergraph Embedding Meets Relational Algebra
Bahare Fatemi
Perouz Taslakian
David Vazquez
David Poole
24
11
0
18 Feb 2021
SLAKE: A Semantically-Labeled Knowledge-Enhanced Dataset for Medical Visual Question Answering
Bo Liu
Li-Ming Zhan
Li Xu
Lin Ma
Y. Yang
Xiao-Ming Wu
42
238
0
18 Feb 2021
Contrastive Learning Inverts the Data Generating Process
Roland S. Zimmermann
Yash Sharma
Steffen Schneider
Matthias Bethge
Wieland Brendel
SSL
240
211
0
17 Feb 2021
Predicting and Attending to Damaging Collisions for Placing Everyday Objects in Photo-Realistic Simulations
A. Magassouba
K. Sugiura
A. Nakayama
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
Hisashi Kawai
29
6
0
12 Feb 2021
Open World Compositional Zero-Shot Learning
Massimiliano Mancini
Muhammad Ferjad Naeem
Yongqin Xian
Zeynep Akata
CoGe
29
126
0
29 Jan 2021
Solving the Same-Different Task with Convolutional Neural Networks
Nicola Messina
Giuseppe Amato
F. Carrara
Claudio Gennaro
Fabrizio Falchi
AIMat
OOD
SSL
UQCV
35
17
0
22 Jan 2021
Using Shape to Categorize: Low-Shot Learning with an Explicit Shape Bias
Stefan Stojanov
Anh Thai
James M. Rehg
3DPC
24
41
0
18 Jan 2021
Kimera: from SLAM to Spatial Perception with 3D Dynamic Scene Graphs
Antoni Rosinol
Andrew Violette
Marcus Abate
Nathan Hughes
Yun Chang
Jingang Shi
Arjun Gupta
Luca Carlone
3DV
41
223
0
18 Jan 2021
Understanding in Artificial Intelligence
S. Maetschke
D. M. Iraola
Pieter Barnard
Elaheh Shafieibavani
Peter Zhong
Ying Xu
Antonio Jimeno Yepes
ELM
VLM
24
0
0
17 Jan 2021
Understanding the Role of Scene Graphs in Visual Question Answering
Vinay Damodaran
Sharanya Chakravarthy
Akshay Kumar
Anjana Umapathy
Teruko Mitamura
Yuta Nakashima
Noa Garcia
Chenhui Chu
GNN
45
32
0
14 Jan 2021
Explainability of deep vision-based autonomous driving systems: Review and challenges
Éloi Zablocki
H. Ben-younes
P. Pérez
Matthieu Cord
XAI
53
170
0
13 Jan 2021
Evaluating Disentanglement of Structured Representations
Raphaël Dang-Nhu
OCL
26
5
0
11 Jan 2021
Progressive Interpretation Synthesis: Interpreting Task Solving by Quantifying Previously Used and Unused Information
Zhengqi He
Taro Toyoizumi
19
1
0
08 Jan 2021
DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
Hung Le
Chinnadhurai Sankar
Seungwhan Moon
Ahmad Beirami
A. Geramifard
Satwik Kottur
VGen
48
18
0
01 Jan 2021
Previous
1
2
3
...
19
20
21
...
28
29
30
Next