On Traceability in $\ell_p$ Stochastic Convex Optimization

In this paper, we investigate the necessity of traceability for accurate learning in stochastic convex optimization (SCO) under $\ell_p$ geometries. Informally, we say a learning algorithm is $m$-traceable if, by analyzing its output, it is possible to identify at least $m$ of its training samples. Our main results uncover a fundamental tradeoff between traceability and excess risk in SCO. For every $p \in [1, \infty)$, we establish the existence of an excess risk threshold below which every sample-efficient learner is traceable, with the number of identified samples being a constant fraction of its training set. For $p \in [1, 2]$, this threshold coincides with the best excess risk achievable by differentially private (DP) algorithms: above it, there exist accurate learners that are not traceable, so the threshold marks a sharp phase transition. For $p \in (2, \infty)$, this threshold instead yields novel lower bounds for DP learning, partially resolving an open problem in this setting. En route to establishing these results, we prove a sparse variant of the fingerprinting lemma, which is of independent interest.
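To make the informal definition concrete, the following sketch gives one plausible formalization of $m$-traceability; the symbols $S$, $A$, $T$, and $\delta$ are our own illustrative notation and are not fixed by the abstract. A learner $A$ with training set $S = (z_1, \ldots, z_n) \sim \mathcal{D}^n$ is $m$-traceable if there is a tracer $T$ that, given only the output $A(S)$, recovers at least $m$ training points with high probability:
\[
\Pr_{S \sim \mathcal{D}^n}\bigl[\, \lvert T(A(S)) \cap S \rvert \ge m \,\bigr] \ge 1 - \delta,
\]
with the regime of interest above being $m = \Omega(n)$, i.e., a constant fraction of the training set. For context, one common form of the classical (dense) fingerprinting lemma states: for every $f \colon \{\pm 1\}^n \to [-1, 1]$, if $p$ is drawn uniformly from $[-1, 1]$ and $X_1, \ldots, X_n$ are i.i.d. $\pm 1$ bits with mean $p$, then
\[
\mathbb{E}\Bigl[\, f(X) \sum_{i=1}^{n} (X_i - p) \,\Bigr] + \mathbb{E}\bigl[\, \lvert f(X) - p \rvert \,\bigr] \ge \frac{1}{3},
\]
so any $f$ that accurately estimates the mean must correlate with individual samples. The sparse variant proved in the paper presumably adapts this style of statement to the $\ell_p$ setting; its exact form is given in the paper itself.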
Sasha Voitovych, Mahdi Haghifam, Idan Attias, Gintare Karolina Dziugaite, Roi Livni, and Daniel M. Roy. On Traceability in $\ell_p$ Stochastic Convex Optimization. arXiv preprint arXiv:2502.17384, 2025.