CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

20 December 2016

Justin Johnson

B. Hariharan

Laurens van der Maaten

Li Fei-Fei

Papers citing "CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning"

50 / 1,475 papers shown

Language-Mediated, Object-Centric Representation Learning Ruocheng Wang Jiayuan Mao S. Gershman Jiajun Wu 16 12 0 31 Dec 2020
Spatial Reasoning from Natural Language Instructions for Robot Manipulation S. Gubbi Anirban Biswas Raviteja Upadrashta V. Srinivasan Partha P. Talukdar B. Amrutur LM&Ro LRM 50 29 0 26 Dec 2020
Object-Centric Diagnosis of Visual Reasoning Jianwei Yang Jiayuan Mao Jiajun Wu Devi Parikh David D. Cox J. Tenenbaum Chuang Gan OCL 27 16 0 21 Dec 2020
MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification Te-Lin Wu Shikhar Singh S. Paul Gully A. Burns Nanyun Peng 30 18 0 16 Dec 2020
Visually Grounding Language Instruction for History-Dependent Manipulation Hyemin Ahn Obin Kwon Kyungdo Kim Jaeyeon Jeong Howoong Jun Hongjung Lee Dongheui Lee Songhwai Oh LM&Ro 21 6 0 16 Dec 2020
Attention over learned object embeddings enables complex visual reasoning David Ding Felix Hill Adam Santoro Malcolm Reynolds M. Botvinick OCL 27 69 0 15 Dec 2020
WILDS: A Benchmark of in-the-Wild Distribution Shifts Pang Wei Koh Shiori Sagawa Henrik Marklund Sang Michael Xie Marvin Zhang ... A. Kundaje Emma Pierson Sergey Levine Chelsea Finn Percy Liang OOD 106 1,386 0 14 Dec 2020
Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation Embedding Qingxing Cao Bailin Li Xiaodan Liang Keze Wang Liang Lin 49 36 0 14 Dec 2020
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps Qi Zhu Chenyu Gao Peng Wang Qi Wu 33 54 0 09 Dec 2020
Intrinsically Motivated Compositional Language Emergence Rishi Hazra Sonu Dixit Sayambhu Sen 11 1 0 09 Dec 2020
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions Tayfun Ates Muhammed Samil Atesoglu Cagatay Yigit İlker Kesen Mert Kobaş Erkut Erdem Aykut Erdem T. Goksun Deniz Yuret 27 31 0 08 Dec 2020
Edited Media Understanding Frames: Reasoning About the Intent and Implications of Visual Misinformation Jeff Da Maxwell Forbes Rowan Zellers Anthony Zheng Jena D. Hwang Antoine Bosselut Yejin Choi DiffM 25 13 0 08 Dec 2020
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding Maryam Rahnemoonfar Tashnim Chowdhury Argho Sarkar D. Varshney M. Yari Robin Murphy 22 243 0 05 Dec 2020
WeaQA: Weak Supervision via Captions for Visual Question Answering Pratyay Banerjee Tejas Gokhale Yezhou Yang Chitta Baral 25 35 0 04 Dec 2020
Multi-Label Contrastive Learning for Abstract Visual Reasoning Mikołaj Małkiński Jacek Mańdziuk 8 40 0 03 Dec 2020
Attribute-Guided Adversarial Training for Robustness to Natural Perturbations Tejas Gokhale Rushil Anirudh B. Kailkhura Jayaraman J. Thiagarajan Chitta Baral Yezhou Yang AAML OOD 13 37 0 03 Dec 2020
Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D Ankit Goyal Kaiyu Yang Dawei Yang Jia Deng 30 41 0 03 Dec 2020
DERAIL: Diagnostic Environments for Reward And Imitation Learning Pedro Freire Adam Gleave Sam Toyer Stuart J. Russell OffRL 23 6 0 02 Dec 2020
Self-Supervised Real-to-Sim Scene Generation Aayush Prakash Shoubhik Debnath Jean-Francois Lafleche Eric Cameracci Gavriel State Stan Birchfield M. Law 37 26 0 30 Nov 2020
Self-Supervised Time Series Representation Learning by Inter-Intra Relational Reasoning Haoyi Fan Fengbin Zhang Yue Gao AI4TS 30 14 0 27 Nov 2020
Learning from Lexical Perturbations for Consistent Visual Question Answering Spencer Whitehead Hui Wu Yi R. Fung Heng Ji Rogerio Feris Kate Saenko 37 11 0 26 Nov 2020
Transformation Driven Visual Reasoning Xin Hong Yanyan Lan Liang Pang Jiafeng Guo Xueqi Cheng LRM 29 21 0 26 Nov 2020
Multimodal Learning for Hateful Memes Detection Yi Zhou Zhenhao Chen 24 56 0 25 Nov 2020
Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting with their Explanations Wolfgang Stammer P. Schramowski Kristian Kersting FAtt 14 107 0 25 Nov 2020
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields Michael Niemeyer Andreas Geiger OCL 100 954 0 24 Nov 2020
Interpretable Visual Reasoning via Induced Symbolic Space Zhonghao Wang Kai Wang Mo Yu Jinjun Xiong Wen-mei W. Hwu M. Hasegawa-Johnson Humphrey Shi LRM OCL 16 19 0 23 Nov 2020
Modular Action Concept Grounding in Semantic Video Prediction Wei Yu Wenxin Chen Songheng Yin S. Easterbrook Animesh Garg 14 13 0 23 Nov 2020
Using Text to Teach Image Retrieval Haoyu Dong Ze Wang Qiang Qiu Guillermo Sapiro 3DV 35 4 0 19 Nov 2020
Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation Gaurav Kumar Nayak Konda Reddy Mopuri Anirban Chakraborty 25 18 0 18 Nov 2020
Disentangling 3D Prototypical Networks For Few-Shot Concept Learning Mihir Prabhudesai Shamit Lal Darshan Patil H. Tung Adam W. Harley Katerina Fragkiadaki OCL 3DV 3DPC 24 20 0 06 Nov 2020
Reasoning Over History: Context Aware Visual Dialog Muhammad A. Shah Shikib Mehri Tejas Srinivasan 11 3 0 02 Nov 2020
3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations Mihir Prabhudesai Shamit Lal H. Tung Adam W. Harley Shubhankar Potdar Katerina Fragkiadaki 3DPC 20 2 0 30 Oct 2020
Loss re-scaling VQA: Revisiting the Language Prior Problem from a Class-imbalance View Yangyang Guo Liqiang Nie Zhiyong Cheng Q. Tian Min Zhang 19 69 0 30 Oct 2020
SIRI: Spatial Relation Induced Network For Spatial Description Resolution Peiyao Wang Weixin Luo Yanyu Xu Haojie Li Shugong Xu Jianyu Yang Shenghua Gao 19 0 0 27 Oct 2020
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering Aisha Urooj Khan Amir Mazaheri N. Lobo M. Shah 34 56 0 27 Oct 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions Radhika Dua Sai Srinivas Kancheti V. Balasubramanian LRM 43 22 0 24 Oct 2020
Generative Neurosymbolic Machines Jindong Jiang Sungjin Ahn BDL OCL 225 68 0 23 Oct 2020
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies Itai Gat Idan Schwartz Alex Schwing Tamir Hazan 60 90 0 21 Oct 2020
Knowledge Graph-based Question Answering with Electronic Health Records Junwoo Park Youngwoo Cho Haneol Lee Jaegul Choo Edward Choi 40 33 0 19 Oct 2020
Deep Ensembles for Low-Data Transfer Learning Basil Mustafa C. Riquelme J. Puigcerver André Susano Pinto Daniel Keysers N. Houlsby FedML OOD 27 22 0 14 Oct 2020
Improving Compositional Generalization in Semantic Parsing I. Oren Jonathan Herzig Nitish Gupta Matt Gardner Jonathan Berant 29 63 0 12 Oct 2020
COGS: A Compositional Generalization Challenge Based on Semantic Interpretation Najoung Kim Tal Linzen CoGe 13 274 0 12 Oct 2020
Interpretable Neural Computation for Real-World Compositional Visual Question Answering Ruixue Tang Chao Ma CoGe 19 2 0 10 Oct 2020
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors Cathrin Elich Martin R. Oswald Marc Pollefeys Joerg Stueckler OCL 3DPC 3DV 14 12 0 08 Oct 2020
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning Mohit Shridhar Xingdi Yuan Marc-Alexandre Côté Yonatan Bisk Adam Trischler Matthew J. Hausknecht LM&Ro LLMAG 38 400 0 08 Oct 2020
Learning to Recombine and Resample Data for Compositional Generalization Ekin Akyürek Afra Feyza Akyürek Jacob Andreas 29 79 0 08 Oct 2020
CURI: A Benchmark for Productive Concept Learning Under Uncertainty Ramakrishna Vedantam Arthur Szlam Maximilian Nickel Ari S. Morcos Brenden M. Lake UQLM LRM 32 26 0 06 Oct 2020
Pathological Visual Question Answering Xuehai He Zhuo Cai Wenlan Wei Yichen Zhang Luntian Mou Eric Xing P. Xie 80 24 0 06 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning Rodrigo Toro Icarte Toryn Q. Klassen Richard Valenzano Sheila A. McIlraith OffRL 49 216 0 06 Oct 2020
Meta-Learning of Structured Task Distributions in Humans and Machines Sreejan Kumar Ishita Dasgupta Jonathan Cohen Nathaniel D. Daw Thomas Griffiths OffRL 22 3 0 05 Oct 2020