Overcoming Data Scarcity in Scanning Tunnelling Microscopy Image Segmentation

Scanning tunnelling microscopy (STM) is a powerful technique for imaging surfaces with atomic resolution, providing insight into physical and chemical processes at the level of single atoms and molecules. A regular task of STM image analysis is the identification and labelling of features of interest against a uniform background. Performing this manually is a labour-intensive task, requiring significant human effort. To reduce this burden, we propose an automated approach to the segmentation of STM images that uses both few-shot learning and unsupervised learning. Our technique offers greater flexibility compared to previous supervised methods; it removes the requirement for large manually annotated datasets and is thus easier to adapt to an unseen surface while still maintaining a high accuracy. We demonstrate the effectiveness of our approach by using it to recognise atomic features on three distinct surfaces: Si(001), Ge(001), and TiO(110), including adsorbed AsH molecules on the silicon and germanium surfaces. Our model exhibits strong generalisation capabilities, and following initial training, can be adapted to unseen surfaces with as few as one additional labelled data point. This work is a significant step towards efficient and material-agnostic, automatic segmentation of STM images.
View on arXiv@article{kolev2025_2506.01678, title={ Overcoming Data Scarcity in Scanning Tunnelling Microscopy Image Segmentation }, author={ Nikola L. Kolev and Max Trouton and Filippo Federici Canova and Geoff Thornton and David Z. Gao and Neil J. Curson and Taylor J. Z. Stock }, journal={arXiv preprint arXiv:2506.01678}, year={ 2025 } }