SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Jessica Yung
Rob Romijnders
Alexander Kolesnikov
Lucas Beyer
Josip Djolonga
N. Houlsby
Sylvain Gelly
Mario Lucic
Xiaohua Zhai

Abstract
Before deploying machine learning models, it is critical to assess their robustness. In the context of deep neural networks for image understanding, changing the object location, rotation and size may affect the predictions in non-trivial ways. In this work, we perform a fine-grained analysis of robustness with respect to these factors of variation using SI-Score, a synthetic dataset. In particular, we investigate ResNets, Vision Transformers and CLIP, and identify interesting qualitative differences between them.