v1v2 (latest)

When is Memorization of Irrelevant Training Data Necessary for High-Accuracy Learning?

Symposium on the Theory of Computing (STOC), 2020

11 December 2020

Papers citing "When is Memorization of Irrelevant Training Data Necessary for High-Accuracy Learning?"

50 / 92 papers shown

Extracting alignment data in open models

Federico Barbero

Xiangming Gu

Christopher A. Choquette-Choo

314

21 Oct 2025

AI Agents as Universal Task Solvers

Alessandro Achille

Stefano Soatto

LRM

189

14 Oct 2025

A Law of Data Reconstruction for Random Features (and Beyond)

195

26 Sep 2025

Efficiently Attacking Memorization Scores

322

24 Sep 2025

Synth-MIA: A Testbed for Auditing Privacy Leakage in Tabular Data Synthesis

189

22 Sep 2025

Access Paths for Efficient Ordering with Large Language Models

Dimitris Tsirogiannis

239

30 Aug 2025

Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks

290

06 Aug 2025

A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset

452

20 Jun 2025

Black-Box Privacy Attacks on Shared Representations in Multitask Learning

290

19 Jun 2025

Memorization in Language Models through the Lens of Intrinsic Dimension

Stefan Arnold

PILM

390

11 Jun 2025

Trade-offs in Data Memorization via Strong Data Processing InequalitiesAnnual Conference Computational Learning Theory (COLT), 2025

497

02 Jun 2025

How much do language models memorize?

469

30 May 2025

Bayesian Perspective on Memorization and Reconstruction

338

29 May 2025

Querying Kernel Methods Suffices for Reconstructing their Training Data

252

25 May 2025

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

389

07 Apr 2025

Trustworthy Machine Learning via Memorization and the Granular Long-Tail: A Survey on Interactions, Tradeoffs, and Beyond

594

10 Mar 2025

Machine Learners Should Acknowledge the Legal Implications of Large Language Models as Personal Data

552

03 Mar 2025

The Pitfalls of Memorization: When Memorization Hurts GeneralizationInternational Conference on Learning Representations (ICLR), 2024

463

10 Dec 2024

Improved Localized Machine Unlearning Through the Lens of Memorization

Reihaneh Torkzadehmahani

Reza Nasirigerdeh

Georgios Kaissis

Daniel Rueckert

Gintare Karolina Dziugaite

Eleni Triantafillou

287

03 Dec 2024

Slowing Down Forgetting in Continual Learning

475

11 Nov 2024

Undesirable Memorization in Large Language Models: A Survey

710

03 Oct 2024

Range Membership Inference Attacks

Jiashu Tao

Reza Shokri

510

09 Aug 2024

Demystifying Verbatim Memorization in Large Language Models

Jing Huang

Diyi Yang

Christopher Potts

ELM PILM MU

382

25 Jul 2024

A Survey on Machine Unlearning: Techniques and New Emerged Privacy RisksJournal of Information Security and Applications (JISA), 2024

Hengzhu Liu

Ping Xiong

Tianqing Zhu

Philip S. Yu

272

10 Jun 2024

Data Reconstruction: When You See It and When You Don't

350

24 May 2024

Exploring prompts to elicit memorization in masked language model-based named entity recognitionPLoS ONE (PLoS ONE), 2024

Yuxi Xia

Anastasiia Sedova

Pedro Henrique Luz de Araujo

Vasiliki Kougia

Lisa Nussbaumer

Benjamin Roth

316

05 May 2024

Differentially Private Reinforcement Learning with Self-Play

Dan Qiao

Yu Wang

285

11 Apr 2024

Gradient Descent is Pareto-Optimal in the Oracle Complexity and Memory Tradeoff for Feasibility ProblemsIEEE Annual Symposium on Foundations of Computer Science (FOCS), 2024

Moise Blanchard

298

10 Apr 2024

Unveiling Privacy, Memorization, and Input Curvature Links

345

28 Feb 2024

Information Complexity of Stochastic Convex Optimization: Applications to Generalization and Memorization

Idan Attias

Gintare Karolina Dziugaite

Mahdi Haghifam

Roi Livni

Daniel M. Roy

376

14 Feb 2024

Do LLMs Dream of Ontologies?ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2024

Marco Bombieri

Paolo Fiorini

Simone Paolo Ponzetto

M. Rospocher

CLL

408

26 Jan 2024

Memorization in Self-Supervised Learning Improves Downstream Generalization

Wenhao Wang

Muhammad Ahmad Kaleem

453

19 Jan 2024

The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline

Qianli Shen

336

07 Jan 2024

SoK: Unintended Interactions among Machine Learning Defenses and Risks

422

07 Dec 2023

Differentially Private Non-Convex Optimization under the KL Condition with Optimal RatesInternational Conference on Algorithmic Learning Theory (ALT), 2023

370

22 Nov 2023

On Retrieval Augmentation and the Limitations of Language Model Training

264

16 Nov 2023

Privacy Threats in Stable Diffusion Models

Thomas Cilloni

Charles Fleming

Charles Walter

262

15 Nov 2023

SoK: Memorisation in machine learning

Dmitrii Usynin

Moritz Knolle

Georgios Kaissis

365

06 Nov 2023

Why Train More? Effective and Efficient Membership Inference via Memorization

Jihye Choi

291

12 Oct 2023

What do larger image classifiers memorise?

Sanjiv Kumar

292

09 Oct 2023

Anonymous Learning via Look-Alike Clustering: A Precise Analysis of Model GeneralizationNeural Information Processing Systems (NeurIPS), 2023

Adel Javanmard

Vahab Mirrokni

484

06 Oct 2023

Deconstructing Data Reconstruction: Multiclass, Weight Decay and General LossesNeural Information Processing Systems (NeurIPS), 2023

Gal Vardi

316

04 Jul 2023

Deconstructing Classifiers: Towards A Data Reconstruction Attack Against Text Classification Models

Adel M. Elmahdy

A. Salem

SILM

362

23 Jun 2023

Memory-Query Tradeoffs for Randomized Convex OptimizationIEEE Annual Symposium on Foundations of Computer Science (FOCS), 2023

Xinyu Chen

Binghui Peng

320

21 Jun 2023

Machine Unlearning: A SurveyACM Computing Surveys (ACM Comput. Surv.), 2023

Philip S. Yu

318

06 Jun 2023

TMI! Finetuned Models Leak Private Information from their Pretraining DataProceedings on Privacy Enhancing Technologies (PoPETs), 2023

348

01 Jun 2023

Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive TasksNeural Information Processing Systems (NeurIPS), 2023

339

102

28 May 2023

Private Everlasting PredictionNeural Information Processing Systems (NeurIPS), 2023

321

16 May 2023

AI Model Disgorgement: Methods and ChoicesProceedings of the National Academy of Sciences of the United States of America (PNAS), 2023

265

07 Apr 2023

Near Optimal Memory-Regret Tradeoff for Online LearningIEEE Annual Symposium on Foundations of Computer Science (FOCS), 2023

Binghui Peng

A. Rubinstein

CLL

413

03 Mar 2023