Visual Anomaly Detection under Complex View-Illumination Interplay: A Large-Scale Benchmark

The practical deployment of Visual Anomaly Detection (VAD) systems is hindered by their sensitivity to real-world imaging variations, particularly the complex interplay between viewpoint and illumination which drastically alters defect visibility. Current benchmarks largely overlook this critical challenge. We introduce Multi-View Multi-Illumination Anomaly Detection (M2AD), a new large-scale benchmark comprising 119,880 high-resolution images designed explicitly to probe VAD robustness under such interacting conditions. By systematically capturing 999 specimens across 10 categories using 12 synchronized views and 10 illumination settings (120 configurations total), M2AD enables rigorous evaluation. We establish two evaluation protocols: M2AD-Synergy tests the ability to fuse information across diverse configurations, and M2AD-Invariant measures single-image robustness against realistic view-illumination effects. Our extensive benchmarking shows that state-of-the-art VAD methods struggle significantly on M2AD, demonstrating the profound challenge posed by view-illumination interplay. This benchmark serves as an essential tool for developing and validating VAD methods capable of overcoming real-world complexities. Our full dataset and test suite will be released atthis https URLto facilitate the field.
View on arXiv@article{cao2025_2505.10996, title={ Visual Anomaly Detection under Complex View-Illumination Interplay: A Large-Scale Benchmark }, author={ Yunkang Cao and Yuqi Cheng and Xiaohao Xu and Yiheng Zhang and Yihan Sun and Yuxiang Tan and Yuxin Zhang and Xiaonan Huang and Weiming Shen }, journal={arXiv preprint arXiv:2505.10996}, year={ 2025 } }