MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models

26 May 2025
Anh Thai
Stefan Stojanov
Zixuan Huang
Bikram Boote
James M. Rehg
    VLM
Main: 12 Pages
8 Figures
Bibliography: 3 Pages
2 Tables
Abstract

This paper introduces MEBench, a novel benchmark for evaluating mutual exclusivity (ME) bias, a cognitive phenomenon observed in children during word learning. Unlike traditional ME tasks, MEBench further incorporates spatial reasoning to create more challenging and realistic evaluation settings. We assess the performance of state-of-the-art vision-language models (VLMs) on this benchmark using novel evaluation metrics that capture key aspects of ME-based reasoning. To facilitate controlled experimentation, we also present a flexible and scalable data generation pipeline that supports the construction of diverse annotated scenes.

@article{thai2025_2505.20122,
  title={MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models},
  author={Anh Thai and Stefan Stojanov and Zixuan Huang and Bikram Boote and James M. Rehg},
  journal={arXiv preprint arXiv:2505.20122},
  year={2025}
}