Data Watermarking for Sequential Recommender Systems

In the era of large foundation models, data has become a crucial component in building high-performance AI systems. As the demand for high-quality and large-scale data continues to rise, data copyright protection is attracting increasing attention. In this work, we explore the problem of data watermarking for sequential recommender systems, where a watermark is embedded into the target dataset and can be detected in models trained on that dataset. We focus on two settings: dataset watermarking, which protects the ownership of the entire dataset, and user watermarking, which safeguards the data of individual users. We present a method named Dataset Watermarking for Recommender Systems (DWRS) to address them. We define the watermark as a sequence of consecutive items inserted into normal users' interaction sequences. We define a Receptive Field (RF) to guide the inserting process to facilitate the memorization of the watermark. Extensive experiments on five representative sequential recommendation models and three benchmark datasets demonstrate the effectiveness of DWRS in protecting data copyright while preserving model utility.
View on arXiv@article{zhang2025_2411.12989, title={ Data Watermarking for Sequential Recommender Systems }, author={ Sixiao Zhang and Cheng Long and Wei Yuan and Hongxu Chen and Hongzhi Yin }, journal={arXiv preprint arXiv:2411.12989}, year={ 2025 } }