1906.05271
Does Learning Require Memorization? A Short Tale about a Long Tail
12 June 2019
Vitaly Feldman
TDI
Papers citing
"Does Learning Require Memorization? A Short Tale about a Long Tail"
50 / 336 papers shown
Title
DMRL: Data- and Model-aware Reward Learning for Data Extraction
Zhiqiang Wang
Ruoxi Cheng
31
0
0
07 May 2025
Measuring Déjà vu Memorization Efficiently
Narine Kokhlikyan
Bargav Jayaraman
Florian Bordes
Chuan Guo
Kamalika Chaudhuri
30
1
0
08 Apr 2025
Malicious and Unintentional Disclosure Risks in Large Language Models for Code Generation
Rafiqul Rabin
Sean McGregor
Nick Judd
AAML
PILM
57
0
0
27 Mar 2025
BLIA: Detect model memorization in binary classification model through passive Label Inference attack
Mohammad Wahiduzzaman Khan
Sheng Chen
Ilya Mironov
Leizhen Zhang
Rabib Noor
47
0
0
17 Mar 2025
PoisonedParrot: Subtle Data Poisoning Attacks to Elicit Copyright-Infringing Content from Large Language Models
Michael-Andrei Panaitescu-Liess
Pankayaraj Pathmanathan
Yigitcan Kaya
Zora Che
Bang An
Sicheng Zhu
Aakriti Agrawal
Furong Huang
AAML
71
0
0
10 Mar 2025
Trustworthy Machine Learning via Memorization and the Granular Long-Tail: A Survey on Interactions, Tradeoffs, and Beyond
Qiongxiu Li
Xiaoyu Luo
Yiyi Chen
Johannes Bjerva
48
0
0
10 Mar 2025
Adopt a PET! An Exploration of PETs, Policy, and Practicalities for Industry in Canada
Masoumeh Shafieinejad
Xi He
Bailey Kacsmar
OnRL
44
0
0
04 Mar 2025
Interrogating LLM design under a fair learning doctrine
Johnny Tian-Zheng Wei
Maggie Wang
Ameya Godbole
Jonathan H. Choi
Robin Jia
32
0
0
22 Feb 2025
On Memorization in Diffusion Models
Xiangming Gu
Chao Du
Tianyu Pang
Chongxuan Li
Min-Bin Lin
Ye Wang
DiffM
TDI
166
43
0
21 Feb 2025
Differentially Private Prototypes for Imbalanced Transfer Learning
Dariush Wahdany
Matthew Jagielski
Adam Dziedzic
Franziska Boenisch
88
0
0
17 Feb 2025
Captured by Captions: On Memorization and its Mitigation in CLIP Models
Wenhao Wang
Adam Dziedzic
Grace C. Kim
Michael Backes
Franziska Boenisch
93
0
0
11 Feb 2025
Hallucination, Monofacts, and Miscalibration: An Empirical Investigation
Miranda Muqing Miao
Michael Kearns
67
0
0
11 Feb 2025
FairDropout: Using Example-Tied Dropout to Enhance Generalization of Minority Groups
Géraldin Nanfack
Eugene Belilovsky
59
0
0
10 Feb 2025
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations
Kaixuan Huang
Jiacheng Guo
Zihao Li
X. Ji
Jiawei Ge
...
Yangsibo Huang
Chi Jin
Xinyun Chen
Chiyuan Zhang
Mengdi Wang
AAML
LRM
100
7
0
10 Feb 2025
The Best Instruction-Tuning Data are Those That Fit
Dylan Zhang
Qirun Dai
Hao Peng
ALM
117
3
0
06 Feb 2025
Memorization Inheritance in Sequence-Level Knowledge Distillation for Neural Machine Translation
Verna Dankers
Vikas Raunak
VLM
65
0
0
03 Feb 2025
The Silent Majority: Demystifying Memorization Effect in the Presence of Spurious Correlations
Chenyu You
Haocheng Dai
Yifei Min
Jasjeet Sekhon
S. Joshi
James S. Duncan
68
2
0
01 Jan 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
93
12
0
31 Dec 2024
Where Did Your Model Learn That? Label-free Influence for Self-supervised Learning
Nidhin Harilal
Amit Rege
Reza Akbarian Bafghi
M. Raissi
C. Monteleoni
TDI
43
0
0
22 Dec 2024
The Impact of Generalization Techniques on the Interplay Among Privacy, Utility, and Fairness in Image Classification
Ahmad Hassanpour
Amir Zarei
Khawla Mallat
Anderson Santana de Oliveira
Bian Yang
79
0
0
16 Dec 2024
The Pitfalls of Memorization: When Memorization Hurts Generalization
Reza Bayat
Mohammad Pezeshki
Elvis Dohmatob
David Lopez-Paz
Pascal Vincent
OOD
105
3
0
10 Dec 2024
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice
A. Feder Cooper
Christopher A. Choquette-Choo
Miranda Bogen
Matthew Jagielski
Katja Filippova
...
Abigail Z. Jacobs
Andreas Terzis
Hanna M. Wallach
Nicolas Papernot
Katherine Lee
AILaw
MU
93
10
0
09 Dec 2024
Robust Testing for Deep Learning using Human Label Noise
Gordon Lim
Stefan Larson
Kevin Leach
NoLa
65
0
0
29 Nov 2024
Continual Memorization of Factoids in Language Models
Howard Chen
Jiayi Geng
Adithya Bhaskar
Dan Friedman
Danqi Chen
KELM
56
1
0
11 Nov 2024
Unlearning in- vs. out-of-distribution data in LLMs under gradient-based method
Teodora Baluta
Pascal Lamblin
Daniel Tarlow
Fabian Pedregosa
Gintare Karolina Dziugaite
MU
32
1
0
07 Nov 2024
Generalizability of Memorization Neural Networks
Lijia Yu
Xiao-Shan Gao
Lijun Zhang
Yibo Miao
36
1
0
01 Nov 2024
Attribute-to-Delete: Machine Unlearning via Datamodel Matching
Kristian Georgiev
Roy Rinberg
Sung Min Park
Shivam Garg
Andrew Ilyas
Aleksander Madry
Seth Neel
MU
49
3
0
30 Oct 2024
Scalability of memorization-based machine unlearning
Kairan Zhao
Peter Triantafillou
MU
44
2
0
21 Oct 2024
Mislabeled examples detection viewed as probing machine learning models: concepts, survey and extensive benchmark
Thomas George
Pierre Nodet
A. Bondu
Vincent Lemaire
VLM
32
0
0
21 Oct 2024
A Theoretical Survey on Foundation Models
Shi Fu
Yuzhu Chen
Yingjie Wang
Dacheng Tao
28
0
0
15 Oct 2024
Fragile Giants: Understanding the Susceptibility of Models to Subpopulation Attacks
Isha Gupta
Hidde Lycklama
Emanuel Opel
Evan Rose
Anwar Hithnawi
AAML
34
0
0
11 Oct 2024
Decoding Secret Memorization in Code LLMs Through Token-Level Characterization
Yuqing Nie
Chong Wang
Kaixin Wang
Guoai Xu
Guosheng Xu
Haoyu Wang
OffRL
136
1
0
11 Oct 2024
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
Tianjian Li
Haoran Xu
Weiting Tan
Kenton Murray
Daniel Khashabi
35
1
0
06 Oct 2024
ConDa: Fast Federated Unlearning with Contribution Dampening
Vikram S Chundawat
Pushkar Niroula
Prasanna Dhungana
Stefan Schoepf
Murari Mandal
Alexandra Brintrup
FedML
26
3
0
05 Oct 2024
Undesirable Memorization in Large Language Models: A Survey
Ali Satvaty
Suzan Verberne
Fatih Turkmen
ELM
PILM
74
7
0
03 Oct 2024
Localizing Memorization in SSL Vision Encoders
Wenhao Wang
Adam Dziedzic
Michael Backes
Franziska Boenisch
34
2
0
27 Sep 2024
Predicting and analyzing memorization within fine-tuned Large Language Models
Jérémie Dentan
Davide Buscaldi
A. Shabou
Sonia Vanier
37
0
0
27 Sep 2024
Data-Centric AI Governance: Addressing the Limitations of Model-Focused Policies
Ritwik Gupta
Leah Walker
Rodolfo Corona
Stephanie Fu
Suzanne Petryk
Janet Napolitano
Trevor Darrell
Andrew W. Reddie
ELM
40
3
0
25 Sep 2024
Synthetic continued pretraining
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLL
SyDa
38
11
0
11 Sep 2024
A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models
Geonhee Kim
Marco Valentino
André Freitas
LRM
AI4CE
30
7
0
16 Aug 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
37
2
0
15 Aug 2024
Investigating Characteristics of Media Recommendation Solicitation in r/ifyoulikeblank
M. Bhuiyan
Donghan Hu
Andrew Jelson
Tanushree Mitra
Sang Won Lee
20
0
0
12 Aug 2024
Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
Verna Dankers
Ivan Titov
45
5
0
09 Aug 2024
Range Membership Inference Attacks
Jiashu Tao
Reza Shokri
42
1
0
09 Aug 2024
The Data Addition Dilemma
Judy Hanwen Shen
Inioluwa Deborah Raji
Irene Y. Chen
37
5
0
08 Aug 2024
Demystifying Verbatim Memorization in Large Language Models
Jing Huang
Diyi Yang
Christopher Potts
ELM
PILM
MU
55
19
0
25 Jul 2024
Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundational Models
Simha Sankar Baradwaj
Destiny Gilliland
Jack Rincon
Henning Hermjakob
Yu Yan
...
Dean Wang
Karol Watson
Alex Bui
Wei Wang
Peipei Ping
48
5
0
18 Jul 2024
Extracting Training Data from Document-Based VQA Models
Francesco Pinto
N. Rauschmayr
F. Tramèr
Philip Torr
Federico Tombari
37
3
0
11 Jul 2024
A Method to Facilitate Membership Inference Attacks in Deep Learning Models
Zitao Chen
Karthik Pattabiraman
MIACV
MLAU
AAML
MIALM
75
1
0
02 Jul 2024
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
Akshara Prabhakar
Thomas L. Griffiths
R. Thomas McCoy
LRM
42
16
0
01 Jul 2024