STDR: Spatio-Temporal Decoupling for Real-Time Dynamic Scene Rendering

Although dynamic scene reconstruction has long been a fundamental challenge in 3D vision, the recent emergence of 3D Gaussian Splatting (3DGS) offers a promising direction by enabling high-quality, real-time rendering through explicit Gaussian primitives. However, existing 3DGS-based methods for dynamic reconstruction often suffer from \textit{spatio-temporal incoherence} during initialization, where canonical Gaussians are constructed by aggregating observations from multiple frames without temporal distinction. This results in spatio-temporally entangled representations, making it difficult to model dynamic motion accurately. To overcome this limitation, we propose \textbf{STDR} (Spatio-Temporal Decoupling for Real-time rendering), a plug-and-play module that learns spatio-temporal probability distributions for each Gaussian. STDR introduces a spatio-temporal mask, a separated deformation field, and a consistency regularization to jointly disentangle spatial and temporal patterns. Extensive experiments demonstrate that incorporating our module into existing 3DGS-based dynamic scene reconstruction frameworks leads to notable improvements in both reconstruction quality and spatio-temporal consistency across synthetic and real-world benchmarks.
View on arXiv@article{li2025_2505.22400, title={ STDR: Spatio-Temporal Decoupling for Real-Time Dynamic Scene Rendering }, author={ Zehao Li and Hao Jiang and Yujun Cai and Jianing Chen and Baolong Bi and Shuqin Gao and Honglong Zhao and Yiwei Wang and Tianlu Mao and Zhaoqi Wang }, journal={arXiv preprint arXiv:2505.22400}, year={ 2025 } }