21
2

F2T2-HiT: A U-Shaped FFT Transformer and Hierarchical Transformer for Reflection Removal

Main:5 Pages
3 Figures
Bibliography:1 Pages
2 Tables
Abstract

Single Image Reflection Removal (SIRR) technique plays a crucial role in image processing by eliminating unwanted reflections from the background. These reflections, often caused by photographs taken through glass surfaces, can significantly degrade image quality. SIRR remains a challenging problem due to the complex and varied reflections encountered in real-world scenarios. These reflections vary significantly in intensity, shapes, light sources, sizes, and coverage areas across the image, posing challenges for most existing methods to effectively handle all cases. To address these challenges, this paper introduces a U-shaped Fast Fourier Transform Transformer and Hierarchical Transformer (F2T2-HiT) architecture, an innovative Transformer-based design for SIRR. Our approach uniquely combines Fast Fourier Transform (FFT) Transformer blocks and Hierarchical Transformer blocks within a UNet framework. The FFT Transformer blocks leverage the global frequency domain information to effectively capture and separate reflection patterns, while the Hierarchical Transformer blocks utilize multi-scale feature extraction to handle reflections of varying sizes and complexities. Extensive experiments conducted on three publicly available testing datasets demonstrate state-of-the-art performance, validating the effectiveness of our approach.

View on arXiv
@article{cai2025_2506.05489,
  title={ F2T2-HiT: A U-Shaped FFT Transformer and Hierarchical Transformer for Reflection Removal },
  author={ Jie Cai and Kangning Yang and Ling Ouyang and Lan Fu and Jiaming Ding and Huiming Sun and Chiu Man Ho and Zibo Meng },
  journal={arXiv preprint arXiv:2506.05489},
  year={ 2025 }
}
Comments on this paper