With the recent rise of foundation models in computer vision and NLP, the pretrain-and-adapt strategy, in which a large-scale model is fine-tuned on downstream tasks, is gaining popularity. However, traditional fine-tuning approaches may still require significant resources and yield sub-optimal results when labeled data for the target task is scarce, as is often the case in clinical settings. To address this challenge, we formalize few-shot efficient fine-tuning (FSEFT), a novel and realistic setting for medical image segmentation. Furthermore, we introduce a parameter-efficient fine-tuning strategy tailored to medical image segmentation, with (a) spatial adapter modules that are better suited to dense prediction tasks; and (b) a constrained transductive inference that leverages task-specific prior knowledge. Our comprehensive experiments on a collection of public CT datasets for organ segmentation reveal the limitations of standard fine-tuning methods in few-shot scenarios, point to the potential of vision adapters and transductive inference, and confirm the suitability of foundation models.
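The abstract describes the spatial adapters only at a high level. The following is a minimal PyTorch sketch, assuming a 3D convolutional bottleneck with a residual connection, of what such an adapter could look like for volumetric features; the class name SpatialAdapter3D and the reduction parameter are illustrative assumptions, not the authors' exact architecture.

import torch
import torch.nn as nn

class SpatialAdapter3D(nn.Module):
    """Lightweight bottleneck adapter using 3D convolutions, so that spatial
    context is preserved, unlike the pointwise MLP adapters common in NLP.
    Illustrative sketch only; the paper's exact design may differ."""
    def __init__(self, channels: int, reduction: int = 4):
        super().__init__()
        hidden = max(channels // reduction, 1)
        self.adapter = nn.Sequential(
            nn.Conv3d(channels, hidden, kernel_size=1),           # channel down-projection
            nn.Conv3d(hidden, hidden, kernel_size=3, padding=1),  # spatial mixing
            nn.ReLU(inplace=True),
            nn.Conv3d(hidden, channels, kernel_size=1),           # channel up-projection
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual connection: frozen backbone features pass through unchanged,
        # plus a small learned spatial correction from the adapter branch.
        return x + self.adapter(x)

if __name__ == "__main__":
    feats = torch.randn(1, 64, 8, 32, 32)  # (batch, channels, depth, height, width)
    adapter = SpatialAdapter3D(channels=64)
    print(adapter(feats).shape)  # torch.Size([1, 64, 8, 32, 32])

In a parameter-efficient setup, the pretrained backbone would be frozen and only the adapter weights updated on the few labeled target volumes, which keeps the number of trainable parameters small in the few-shot regime.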
@article{silva-rodríguez2025_2303.17051,
  title   = {Towards Foundation Models and Few-Shot Parameter-Efficient Fine-Tuning for Volumetric Organ Segmentation},
  author  = {Julio Silva-Rodríguez and Jose Dolz and Ismail Ben Ayed},
  journal = {arXiv preprint arXiv:2303.17051},
  year    = {2025}
}