Unwarping Screen Content Images via Structure-texture Enhancement Network and Transformation Self-estimation

While existing implicit neural network-based image unwarping methods perform well on natural images, they struggle to handle screen content images (SCIs), which often contain large geometric distortions, text, symbols, and sharp edges. To address this, we propose a structure-texture enhancement network (STEN) with transformation self-estimation for SCI warping. STEN integrates a B-spline implicit neural representation module and a transformation error estimation and self-correction algorithm. It comprises two branches: the structure estimation branch (SEB), which enhances local aggregation and global dependency modeling, and the texture estimation branch (TEB), which improves texture detail synthesis using B-spline implicit neural representation. Additionally, the transformation self-estimation module autonomously estimates the transformation error and corrects the coordinate transformation matrix, effectively handling real-world image distortions. Extensive experiments on public SCI datasets demonstrate that our approach significantly outperforms state-of-the-art methods. Comparisons on well-known natural image datasets also show the potential of our approach for natural image distortion.
View on arXiv@article{xiao2025_2504.15108, title={ Unwarping Screen Content Images via Structure-texture Enhancement Network and Transformation Self-estimation }, author={ Zhenzhen Xiao and Heng Liu and Bingwen Hu }, journal={arXiv preprint arXiv:2504.15108}, year={ 2025 } }