A Benchmark for Compositional Visual Reasoning

11 June 2022
Aimen Zerroug
Mohit Vaishnav
Julien Colin
Sebastian Musslick
Thomas Serre
    OCL
    CoGe

Papers citing "A Benchmark for Compositional Visual Reasoning"

17 / 17 papers shown
NSFlow: An End-to-End FPGA Framework with Scalable Dataflow Architecture for Neuro-Symbolic AI
Hanchen Yang
Zishen Wan
Ritik Raj
Joongun Park
Ziwei Li
A. Samajdar
A. Raychowdhury
Tushar Krishna
26
0
0
27 Apr 2025
VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity
Jing Bi
Junjia Guo
Susan Liang
Guangyu Sun
Luchuan Song
...
Jinxi He
Jiarui Wu
A. Vosoughi
Chong Chen
Chenliang Xu
LRM
74
3
0
14 Mar 2025
GlyphPattern: An Abstract Pattern Recognition for Vision-Language Models
Zixuan Wu
Yoolim Kim
Carolyn Jane Anderson
VLM
31
0
0
12 Aug 2024
Take A Step Back: Rethinking the Two Stages in Visual Reasoning
Mingyu Zhang
Jiting Cai
Mingyu Liu
Yue Xu
Cewu Lu
Yong-Lu Li
LRM
39
5
0
29 Jul 2024
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
Michael A. Lepori
Alexa R. Tartaglini
Wai Keen Vong
Thomas Serre
Brenden M. Lake
Ellie Pavlick
42
2
0
22 Jun 2024
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
Xu Cao
Bolin Lai
Wenqian Ye
Yunsheng Ma
Joerg Heintz
Jintai Chen
Jianguo Cao
James M. Rehg
45
8
0
14 Jun 2024
Disentangling and Integrating Relational and Sensory Information in Transformer Architectures
Awni Altabaa
John Lafferty
37
3
0
26 May 2024
Image classification network enhancement methods based on knowledge injection
Yishuang Tian
Ning Wang
Liang Zhang
28
1
0
09 Jan 2024
Benchmarking Robustness of Text-Image Composed Retrieval
Shitong Sun
Jindong Gu
Shaogang Gong
CoGe
47
1
0
24 Nov 2023
Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models
Yeongbin Kim
Gautam Singh
Junyeong Park
Çağlar Gülçehre
Sungjin Ahn
OCL
VLM
50
1
0
15 Nov 2023
Emergent Communication for Rules Reasoning
Yuxuan Guo
Yifan Hao
Rui Zhang
Enshuai Zhou
Zidong Du
...
Shaohui Peng
Di Huang
Rui Chen
Qi Guo
Yunji Chen
LLMAG
LRM
AI4CE
26
0
0
08 Nov 2023
Semantic Composition in Visually Grounded Language Models
Rohan Pandey
CoGe
26
1
0
15 May 2023
Self-attention in Vision Transformers Performs Perceptual Grouping, Not Attention
Paria Mehrani
John K. Tsotsos
25
24
0
02 Mar 2023
Break It Down: Evidence for Structural Compositionality in Neural Networks
Michael A. Lepori
Thomas Serre
Ellie Pavlick
46
30
0
26 Jan 2023
When and why vision-language models behave like bags-of-words, and what to do about it?
Mert Yuksekgonul
Federico Bianchi
Pratyusha Kalluri
Dan Jurafsky
James Zou
VLM
CoGe
30
362
0
04 Oct 2022
ReaSCAN: Compositional Reasoning in Language Grounding
Zhengxuan Wu
Elisa Kreiss
Desmond C. Ong
Christopher Potts
CoGe
LRM
29
22
0
18 Sep 2021
Understanding the computational demands underlying visual reasoning
Mohit Vaishnav
Rémi Cadène
A. Alamia
Drew Linsley
Rufin VanRullen
Thomas Serre
GNN
CoGe
40
16
0
08 Aug 2021