Thoughts on Objectives of Sparse and Hierarchical Masked Image Model

Abstract
Masked image modeling is one of the most poplular objectives of training. Recently, the SparK model has been proposed with superior performance among self-supervised learning models. This paper proposes a new mask pattern for this SparK model, proposing it as the Mesh Mask-ed SparK model. We report the effect of the mask pattern used for image masking in pre-training on performance.
View on arXiv@article{miyazaki2025_2505.08819, title={ Thoughts on Objectives of Sparse and Hierarchical Masked Image Model }, author={ Asahi Miyazaki and Tsuyoshi Okita }, journal={arXiv preprint arXiv:2505.08819}, year={ 2025 } }
Comments on this paper