Embedding Trust at Scale: Physics-Aware Neural Watermarking for Secure and Verifiable Data Pipelines

22 May 2025

Krti Tallam

ArXiv (abs)PDF HTML

Main:16 Pages

8 Figures

Bibliography:2 Pages

5 Tables

Appendix:3 Pages

Abstract

We present a robust neural watermarking framework for scientific data integrity, targeting high-dimensional fields common in climate modeling and fluid simulations. Using a convolutional autoencoder, binary messages are invisibly embedded into structured data such as temperature, vorticity, and geopotential. Our method ensures watermark persistence under lossy transformations - including noise injection, cropping, and compression - while maintaining near-original fidelity (sub-1\% MSE). Compared to classical singular value decomposition (SVD)-based watermarking, our approach achieves $>$ 98\% bit accuracy and visually indistinguishable reconstructions across ERA5 and Navier-Stokes datasets. This system offers a scalable, model-compatible tool for data provenance, auditability, and traceability in high-performance scientific workflows, and contributes to the broader goal of securing AI systems through verifiable, physics-aware watermarking. We evaluate on physically grounded scientific datasets as a representative stress-test; the framework extends naturally to other structured domains such as satellite imagery and autonomous-vehicle perception streams.

View on arXiv

@article{tallam2025_2506.12032,
  title={ Embedding Trust at Scale: Physics-Aware Neural Watermarking for Secure and Verifiable Data Pipelines },
  author={ Krti Tallam },
  journal={arXiv preprint arXiv:2506.12032},
  year={ 2025 }
}

Comments on this paper