ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.06890
  4. Cited By
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary
  Visual Reasoning

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

20 December 2016
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
    CoGe
ArXivPDFHTML

Papers citing "CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning"

50 / 1,475 papers shown
Title
REMIND Your Neural Network to Prevent Catastrophic Forgetting
REMIND Your Neural Network to Prevent Catastrophic Forgetting
Tyler L. Hayes
Kushal Kafle
Robik Shrestha
Manoj Acharya
Christopher Kanan
CLL
31
295
0
06 Oct 2019
Few-Shot Abstract Visual Reasoning With Spectral Features
Few-Shot Abstract Visual Reasoning With Spectral Features
Tanner A. Bohn
Yining Hu
Charles X. Ling
VLM
17
3
0
04 Oct 2019
CLEVRER: CoLlision Events for Video REpresentation and Reasoning
CLEVRER: CoLlision Events for Video REpresentation and Reasoning
Kexin Yi
Yuta Saito
Yunzhu Li
Pushmeet Kohli
Jiajun Wu
Antonio Torralba
J. Tenenbaum
NAI
43
457
0
03 Oct 2019
Embodied Language Grounding with 3D Visual Feature Representations
Embodied Language Grounding with 3D Visual Feature Representations
Mihir Prabhudesai
H. Tung
Syed Ashar Javed
Maximilian Sieb
Adam W. Harley
Katerina Fragkiadaki
28
21
0
02 Oct 2019
A Large-scale Study of Representation Learning with the Visual Task
  Adaptation Benchmark
A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark
Xiaohua Zhai
J. Puigcerver
Alexander Kolesnikov
P. Ruyssen
C. Riquelme
...
Michael Tschannen
Marcin Michalski
Olivier Bousquet
Sylvain Gelly
N. Houlsby
SSL
30
426
0
01 Oct 2019
On Incorporating Semantic Prior Knowledge in Deep Learning Through
  Embedding-Space Constraints
On Incorporating Semantic Prior Knowledge in Deep Learning Through Embedding-Space Constraints
Damien Teney
Ehsan Abbasnejad
Anton Van Den Hengel
NAI
29
9
0
30 Sep 2019
Scaling data-driven robotics with reward sketching and batch
  reinforcement learning
Scaling data-driven robotics with reward sketching and batch reinforcement learning
Serkan Cabi
Sergio Gomez Colmenarejo
Alexander Novikov
Ksenia Konyushkova
Scott E. Reed
...
David Barker
Jonathan Scholz
Misha Denil
Nando de Freitas
Ziyun Wang
OffRL
28
29
0
26 Sep 2019
Synthetic Data for Deep Learning
Synthetic Data for Deep Learning
Sergey I. Nikolenko
51
349
0
25 Sep 2019
Question Answering is a Format; When is it Useful?
Question Answering is a Format; When is it Useful?
Matt Gardner
Jonathan Berant
Hannaneh Hajishirzi
Alon Talmor
Sewon Min
23
51
0
25 Sep 2019
Talk2Car: Taking Control of Your Self-Driving Car
Talk2Car: Taking Control of Your Self-Driving Car
Thierry Deruyttere
Simon Vandenhende
Dusan Grujicic
Luc Van Gool
Marie-Francine Moens
LM&Ro
31
124
0
24 Sep 2019
Explainable High-order Visual Question Reasoning: A New Benchmark and
  Knowledge-routed Network
Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
33
13
0
23 Sep 2019
Learning Visual Relation Priors for Image-Text Matching and Image
  Captioning with Neural Scene Graph Generators
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
30
37
0
22 Sep 2019
Learning Sparse Mixture of Experts for Visual Question Answering
Learning Sparse Mixture of Experts for Visual Question Answering
Vardaan Pahuja
Jie Fu
C. Pal
23
2
0
19 Sep 2019
Analyzing machine-learned representations: A natural language case study
Analyzing machine-learned representations: A natural language case study
Ishita Dasgupta
Demi Guo
S. Gershman
Noah D. Goodman
NAI
24
13
0
12 Sep 2019
Sunny and Dark Outside?! Improving Answer Consistency in VQA through
  Entailed Question Generation
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation
Arijit Ray
Karan Sikka
Ajay Divakaran
Stefan Lee
Giedrius Burachas
27
65
0
10 Sep 2019
Bayesian Relational Memory for Semantic Visual Navigation
Bayesian Relational Memory for Semantic Visual Navigation
Yi Wu
Yuxin Wu
Aviv Tamar
Stuart J. Russell
Georgia Gkioxari
Yuandong Tian
BDL
23
105
0
10 Sep 2019
Relationships from Entity Stream
Relationships from Entity Stream
Martin Andrews
Sam Witteveen
AI4TS
GNN
21
0
0
07 Sep 2019
PlotQA: Reasoning over Scientific Plots
PlotQA: Reasoning over Scientific Plots
Nitesh Methani
Pritha Ganguly
Mitesh M. Khapra
Pratyush Kumar
49
7
0
03 Sep 2019
Language Tasks and Language Games: On Methodology in Current Natural
  Language Processing Research
Language Tasks and Language Games: On Methodology in Current Natural Language Processing Research
David Schlangen
27
18
0
28 Aug 2019
Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual
  Contexts
Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual Contexts
Sandro Pezzelle
Raquel Fernández
VLM
19
18
0
27 Aug 2019
Visual Question Answering using Deep Learning: A Survey and Performance
  Analysis
Visual Question Answering using Deep Learning: A Survey and Performance Analysis
Yash Srivastava
Vaishnav Murali
S. Dubey
Snehasis Mukherjee
24
47
0
27 Aug 2019
Temporal Reasoning Graph for Activity Recognition
Temporal Reasoning Graph for Activity Recognition
Jingran Zhang
Fumin Shen
Xing Xu
Heng Tao Shen
39
60
0
27 Aug 2019
Don't paraphrase, detect! Rapid and Effective Data Collection for
  Semantic Parsing
Don't paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing
Jonathan Herzig
Jonathan Berant
21
40
0
26 Aug 2019
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
Weijie Su
Xizhou Zhu
Yue Cao
Bin Li
Lewei Lu
Furu Wei
Jifeng Dai
VLM
MLLM
SSL
85
1,651
0
22 Aug 2019
Compositionality decomposed: how do neural networks generalise?
Compositionality decomposed: how do neural networks generalise?
Dieuwke Hupkes
Verna Dankers
Mathijs Mul
Elia Bruni
CoGe
39
323
0
22 Aug 2019
What is needed for simple spatial language capabilities in VQA?
What is needed for simple spatial language capabilities in VQA?
A. Kuhnle
Ann A. Copestake
CoGe
23
1
0
17 Aug 2019
CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text
CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text
Koustuv Sinha
Shagun Sodhani
Jin Dong
Joelle Pineau
William L. Hamilton
32
201
0
16 Aug 2019
PHYRE: A New Benchmark for Physical Reasoning
PHYRE: A New Benchmark for Physical Reasoning
A. Bakhtin
Laurens van der Maaten
Justin Johnson
Laura Gustafson
Ross B. Girshick
LRM
24
122
0
15 Aug 2019
Fusion of Detected Objects in Text for Visual Question Answering
Fusion of Detected Objects in Text for Visual Question Answering
Chris Alberti
Jeffrey Ling
Michael Collins
David Reitter
17
173
0
14 Aug 2019
VideoNavQA: Bridging the Gap between Visual and Embodied Question
  Answering
VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering
Cătălina Cangea
Eugene Belilovsky
Pietro Lio
Aaron Courville
16
17
0
14 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language
  Interactions
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
27
38
0
12 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
25
82
0
10 Aug 2019
CRIC: A VQA Dataset for Compositional Reasoning on Vision and
  Commonsense
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense
Difei Gao
Ruiping Wang
Shiguang Shan
Xilin Chen
CoGe
LRM
20
27
0
08 Aug 2019
SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial
  Relation Recognition
SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition
Kaiyu Yang
Olga Russakovsky
Jia Deng
3DPC
28
60
0
07 Aug 2019
Logic could be learned from images
Logic could be learned from images
Q. Guo
Y. Qian
Xinyan Liang
Yanhong She
Deyu Li
Jiye Liang
NAI
25
4
0
06 Aug 2019
Answering Questions about Data Visualizations using Efficient Bimodal
  Fusion
Answering Questions about Data Visualizations using Efficient Bimodal Fusion
Kushal Kafle
Robik Shrestha
Brian L. Price
Scott D. Cohen
Christopher Kanan
25
58
0
05 Aug 2019
An Empirical Study of Batch Normalization and Group Normalization in
  Conditional Computation
An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation
Vincent Michalski
Vikram S. Voleti
Samira Ebrahimi Kahou
Anthony Ortiz
Pascal Vincent
C. Pal
Doina Precup
BDL
30
6
0
31 Jul 2019
Disentangled Relational Representations for Explaining and Learning from
  Demonstration
Disentangled Relational Representations for Explaining and Learning from Demonstration
Yordan V. Hristov
Daniel Angelov
Michael G. Burke
A. Lascarides
S. Ramamoorthy
DRL
16
24
0
31 Jul 2019
LEAF-QA: Locate, Encode & Attend for Figure Question Answering
LEAF-QA: Locate, Encode & Attend for Figure Question Answering
Ritwick Chaudhry
Sumit Shekhar
Utkarsh Gupta
Pranav Maneriker
Prann Bansal
Ajay Joshi
LMTD
18
85
0
30 Jul 2019
V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive
  Matrices
V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices
Damien Teney
Peng Wang
Jiewei Cao
Lingqiao Liu
Chunhua Shen
Anton Van Den Hengel
9
30
0
29 Jul 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question
  Answering
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
Cheng Zhang
Wei-Lun Chao
D. Xuan
23
50
0
28 Jul 2019
Domain-Specific Priors and Meta Learning for Few-Shot First-Person
  Action Recognition
Domain-Specific Priors and Meta Learning for Few-Shot First-Person Action Recognition
Huseyin Coskun
Zeeshan Zia
Bugra Tekin
Federica Bogo
Nassir Navab
Federico Tombari
H. Sawhney
20
27
0
22 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
25
133
0
22 Jul 2019
CraftAssist: A Framework for Dialogue-enabled Interactive Agents
CraftAssist: A Framework for Dialogue-enabled Interactive Agents
Jonathan Gray
Kavya Srinet
Yacine Jernite
Haonan Yu
Zhuoyuan Chen
Demi Guo
Siddharth Goyal
C. L. Zitnick
Arthur Szlam
41
39
0
19 Jul 2019
2nd Place Solution to the GQA Challenge 2019
2nd Place Solution to the GQA Challenge 2019
Shijie Geng
Ji Zhang
Hang Zhang
Ahmed Elgammal
Dimitris N. Metaxas
ReLM
16
5
0
16 Jul 2019
Composing Neural Learning and Symbolic Reasoning with an Application to
  Visual Discrimination
Composing Neural Learning and Symbolic Reasoning with an Application to Visual Discrimination
Adithya Murali
Atharva Sehgal
Paul Krogmeier
P. Madhusudan
11
3
0
12 Jul 2019
Vision-and-Dialog Navigation
Vision-and-Dialog Navigation
Jesse Thomason
Michael Murray
Maya Cakmak
Luke Zettlemoyer
LM&Ro
57
324
0
10 Jul 2019
Neural Reasoning, Fast and Slow, for Video Question Answering
Neural Reasoning, Fast and Slow, for Video Question Answering
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
14
14
0
10 Jul 2019
Learning by Abstraction: The Neural State Machine
Learning by Abstraction: The Neural State Machine
Drew A. Hudson
Christopher D. Manning
NAI
OCL
16
258
0
09 Jul 2019
ICDAR 2019 Competition on Scene Text Visual Question Answering
ICDAR 2019 Competition on Scene Text Visual Question Answering
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Lluís Gómez
Marçal Rusiñol
Minesh Mathew
C. V. Jawahar
Ernest Valveny
Dimosthenis Karatzas
15
75
0
30 Jun 2019
Previous
123...242526...282930
Next