ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.06890
  4. Cited By
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary
  Visual Reasoning

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

20 December 2016
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
    CoGe
ArXivPDFHTML

Papers citing "CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning"

50 / 1,475 papers shown
Title
Language-Mediated, Object-Centric Representation Learning
Language-Mediated, Object-Centric Representation Learning
Ruocheng Wang
Jiayuan Mao
S. Gershman
Jiajun Wu
16
12
0
31 Dec 2020
Spatial Reasoning from Natural Language Instructions for Robot
  Manipulation
Spatial Reasoning from Natural Language Instructions for Robot Manipulation
S. Gubbi
Anirban Biswas
Raviteja Upadrashta
V. Srinivasan
Partha P. Talukdar
B. Amrutur
LM&Ro
LRM
50
29
0
26 Dec 2020
Object-Centric Diagnosis of Visual Reasoning
Object-Centric Diagnosis of Visual Reasoning
Jianwei Yang
Jiayuan Mao
Jiajun Wu
Devi Parikh
David D. Cox
J. Tenenbaum
Chuang Gan
OCL
27
16
0
21 Dec 2020
MELINDA: A Multimodal Dataset for Biomedical Experiment Method
  Classification
MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification
Te-Lin Wu
Shikhar Singh
S. Paul
Gully A. Burns
Nanyun Peng
30
18
0
16 Dec 2020
Visually Grounding Language Instruction for History-Dependent
  Manipulation
Visually Grounding Language Instruction for History-Dependent Manipulation
Hyemin Ahn
Obin Kwon
Kyungdo Kim
Jaeyeon Jeong
Howoong Jun
Hongjung Lee
Dongheui Lee
Songhwai Oh
LM&Ro
21
6
0
16 Dec 2020
Attention over learned object embeddings enables complex visual
  reasoning
Attention over learned object embeddings enables complex visual reasoning
David Ding
Felix Hill
Adam Santoro
Malcolm Reynolds
M. Botvinick
OCL
27
69
0
15 Dec 2020
WILDS: A Benchmark of in-the-Wild Distribution Shifts
WILDS: A Benchmark of in-the-Wild Distribution Shifts
Pang Wei Koh
Shiori Sagawa
Henrik Marklund
Sang Michael Xie
Marvin Zhang
...
A. Kundaje
Emma Pierson
Sergey Levine
Chelsea Finn
Percy Liang
OOD
106
1,386
0
14 Dec 2020
Knowledge-Routed Visual Question Reasoning: Challenges for Deep
  Representation Embedding
Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation Embedding
Qingxing Cao
Bailin Li
Xiaodan Liang
Keze Wang
Liang Lin
49
36
0
14 Dec 2020
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
Qi Zhu
Chenyu Gao
Peng Wang
Qi Wu
33
54
0
09 Dec 2020
Intrinsically Motivated Compositional Language Emergence
Intrinsically Motivated Compositional Language Emergence
Rishi Hazra
Sonu Dixit
Sayambhu Sen
11
1
0
09 Dec 2020
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions
Tayfun Ates
Muhammed Samil Atesoglu
Cagatay Yigit
.Ilker Kesen
Mert Kobaş
Erkut Erdem
Aykut Erdem
T. Goksun
Deniz Yuret
27
31
0
08 Dec 2020
Edited Media Understanding Frames: Reasoning About the Intent and Implications of Visual Misinformation
Edited Media Understanding Frames: Reasoning About the Intent and Implications of Visual Misinformation
Jeff Da
Maxwell Forbes
Rowan Zellers
Anthony Zheng
Jena D. Hwang
Antoine Bosselut
Yejin Choi
DiffM
25
13
0
08 Dec 2020
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene
  Understanding
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding
Maryam Rahnemoonfar
Tashnim Chowdhury
Argho Sarkar
D. Varshney
M. Yari
Robin Murphy
22
243
0
05 Dec 2020
WeaQA: Weak Supervision via Captions for Visual Question Answering
WeaQA: Weak Supervision via Captions for Visual Question Answering
Pratyay Banerjee
Tejas Gokhale
Yezhou Yang
Chitta Baral
25
35
0
04 Dec 2020
Multi-Label Contrastive Learning for Abstract Visual Reasoning
Multi-Label Contrastive Learning for Abstract Visual Reasoning
Mikolaj Malkiñski
Jacek Mańdziuk
8
40
0
03 Dec 2020
Attribute-Guided Adversarial Training for Robustness to Natural
  Perturbations
Attribute-Guided Adversarial Training for Robustness to Natural Perturbations
Tejas Gokhale
Rushil Anirudh
B. Kailkhura
Jayaraman J. Thiagarajan
Chitta Baral
Yezhou Yang
AAML
OOD
13
37
0
03 Dec 2020
Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations
  in 3D
Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D
Ankit Goyal
Kaiyu Yang
Dawei Yang
Jia Deng
30
41
0
03 Dec 2020
DERAIL: Diagnostic Environments for Reward And Imitation Learning
DERAIL: Diagnostic Environments for Reward And Imitation Learning
Pedro Freire
Adam Gleave
Sam Toyer
Stuart J. Russell
OffRL
23
6
0
02 Dec 2020
Self-Supervised Real-to-Sim Scene Generation
Self-Supervised Real-to-Sim Scene Generation
Aayush Prakash
Shoubhik Debnath
Jean-Francois Lafleche
Eric Cameracci
Gavriel State
Stan Birchfield
M. Law
37
26
0
30 Nov 2020
Self-Supervised Time Series Representation Learning by Inter-Intra
  Relational Reasoning
Self-Supervised Time Series Representation Learning by Inter-Intra Relational Reasoning
Haoyi Fan
Fengbin Zhang
Yue Gao
AI4TS
30
14
0
27 Nov 2020
Learning from Lexical Perturbations for Consistent Visual Question
  Answering
Learning from Lexical Perturbations for Consistent Visual Question Answering
Spencer Whitehead
Hui Wu
Yi R. Fung
Heng Ji
Rogerio Feris
Kate Saenko
37
11
0
26 Nov 2020
Transformation Driven Visual Reasoning
Transformation Driven Visual Reasoning
Xin Hong
Yanyan Lan
Liang Pang
Jiafeng Guo
Xueqi Cheng
LRM
29
21
0
26 Nov 2020
Multimodal Learning for Hateful Memes Detection
Multimodal Learning for Hateful Memes Detection
Yi Zhou
Zhenhao Chen
24
56
0
25 Nov 2020
Right for the Right Concept: Revising Neuro-Symbolic Concepts by
  Interacting with their Explanations
Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting with their Explanations
Wolfgang Stammer
P. Schramowski
Kristian Kersting
FAtt
14
107
0
25 Nov 2020
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature
  Fields
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields
Michael Niemeyer
Andreas Geiger
OCL
100
954
0
24 Nov 2020
Interpretable Visual Reasoning via Induced Symbolic Space
Interpretable Visual Reasoning via Induced Symbolic Space
Zhonghao Wang
Kai Wang
Mo Yu
Jinjun Xiong
Wen-mei W. Hwu
M. Hasegawa-Johnson
Humphrey Shi
LRM
OCL
16
19
0
23 Nov 2020
Modular Action Concept Grounding in Semantic Video Prediction
Modular Action Concept Grounding in Semantic Video Prediction
Wei Yu
Wenxin Chen
Songheng Yin
S. Easterbrook
Animesh Garg
14
13
0
23 Nov 2020
Using Text to Teach Image Retrieval
Using Text to Teach Image Retrieval
Haoyu Dong
Ze Wang
Qiang Qiu
Guillermo Sapiro
3DV
35
4
0
19 Nov 2020
Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge
  Distillation
Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation
Gaurav Kumar Nayak
Konda Reddy Mopuri
Anirban Chakraborty
25
18
0
18 Nov 2020
Disentangling 3D Prototypical Networks For Few-Shot Concept Learning
Disentangling 3D Prototypical Networks For Few-Shot Concept Learning
Mihir Prabhudesai
Shamit Lal
Darshan Patil
H. Tung
Adam W. Harley
Katerina Fragkiadaki
OCL
3DV
3DPC
24
20
0
06 Nov 2020
Reasoning Over History: Context Aware Visual Dialog
Reasoning Over History: Context Aware Visual Dialog
Muhammad A. Shah
Shikib Mehri
Tejas Srinivasan
11
3
0
02 Nov 2020
3D Object Recognition By Corresponding and Quantizing Neural 3D Scene
  Representations
3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations
Mihir Prabhudesai
Shamit Lal
H. Tung
Adam W. Harley
Shubhankar Potdar
Katerina Fragkiadaki
3DPC
20
2
0
30 Oct 2020
Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a
  Class-imbalance View
Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a Class-imbalance View
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Q. Tian
Min Zhang
19
69
0
30 Oct 2020
SIRI: Spatial Relation Induced Network For Spatial Description
  Resolution
SIRI: Spatial Relation Induced Network For Spatial Description Resolution
Peiyao Wang
Weixin Luo
Yanyu Xu
Haojie Li
Shugong Xu
Jianyu Yang
Shenghua Gao
19
0
0
27 Oct 2020
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual
  Question Answering
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering
Aisha Urooj Khan
Amir Mazaheri
N. Lobo
M. Shah
34
56
0
27 Oct 2020
Beyond VQA: Generating Multi-word Answer and Rationale to Visual
  Questions
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions
Radhika Dua
Sai Srinivas Kancheti
V. Balasubramanian
LRM
43
22
0
24 Oct 2020
Generative Neurosymbolic Machines
Generative Neurosymbolic Machines
Jindong Jiang
Sungjin Ahn
BDL
OCL
225
68
0
23 Oct 2020
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing
  Functional Entropies
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies
Itai Gat
Idan Schwartz
Alex Schwing
Tamir Hazan
60
90
0
21 Oct 2020
Knowledge Graph-based Question Answering with Electronic Health Records
Knowledge Graph-based Question Answering with Electronic Health Records
Junwoo Park
Youngwoo Cho
Haneol Lee
Jaegul Choo
Edward Choi
40
33
0
19 Oct 2020
Deep Ensembles for Low-Data Transfer Learning
Deep Ensembles for Low-Data Transfer Learning
Basil Mustafa
C. Riquelme
J. Puigcerver
andAndré Susano Pinto
Daniel Keysers
N. Houlsby
FedML
OOD
27
22
0
14 Oct 2020
Improving Compositional Generalization in Semantic Parsing
Improving Compositional Generalization in Semantic Parsing
I. Oren
Jonathan Herzig
Nitish Gupta
Matt Gardner
Jonathan Berant
29
63
0
12 Oct 2020
COGS: A Compositional Generalization Challenge Based on Semantic
  Interpretation
COGS: A Compositional Generalization Challenge Based on Semantic Interpretation
Najoung Kim
Tal Linzen
CoGe
13
274
0
12 Oct 2020
Interpretable Neural Computation for Real-World Compositional Visual
  Question Answering
Interpretable Neural Computation for Real-World Compositional Visual Question Answering
Ruixue Tang
Chao Ma
CoGe
19
2
0
10 Oct 2020
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using
  Deep Shape Priors
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors
Cathrin Elich
Martin R. Oswald
Marc Pollefeys
Joerg Stueckler
OCL
3DPC
3DV
14
12
0
08 Oct 2020
ALFWorld: Aligning Text and Embodied Environments for Interactive
  Learning
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Mohit Shridhar
Xingdi Yuan
Marc-Alexandre Côté
Yonatan Bisk
Adam Trischler
Matthew J. Hausknecht
LM&Ro
LLMAG
38
400
0
08 Oct 2020
Learning to Recombine and Resample Data for Compositional Generalization
Learning to Recombine and Resample Data for Compositional Generalization
Ekin Akyürek
Afra Feyza Akyürek
Jacob Andreas
29
79
0
08 Oct 2020
CURI: A Benchmark for Productive Concept Learning Under Uncertainty
CURI: A Benchmark for Productive Concept Learning Under Uncertainty
Ramakrishna Vedantam
Arthur Szlam
Maximilian Nickel
Ari S. Morcos
Brenden M. Lake
UQLM
LRM
32
26
0
06 Oct 2020
Pathological Visual Question Answering
Pathological Visual Question Answering
Xuehai He
Zhuo Cai
Wenlan Wei
Yichen Zhang
Luntian Mou
Eric Xing
P. Xie
80
24
0
06 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement
  Learning
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
OffRL
49
216
0
06 Oct 2020
Meta-Learning of Structured Task Distributions in Humans and Machines
Meta-Learning of Structured Task Distributions in Humans and Machines
Sreejan Kumar
Ishita Dasgupta
Jonathan Cohen
Nathaniel D. Daw
Thomas Griffiths
OffRL
22
3
0
05 Oct 2020
Previous
123...202122...282930
Next