Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.07805
Cited By
Extracting Training Data from Large Language Models
14 December 2020
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
Katherine Lee
Adam Roberts
Tom B. Brown
D. Song
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Extracting Training Data from Large Language Models"
50 / 359 papers shown
Title
Deduplicating Training Data Mitigates Privacy Risks in Language Models
Nikhil Kandpal
Eric Wallace
Colin Raffel
PILM
MU
26
274
0
14 Feb 2022
Understanding Rare Spurious Correlations in Neural Networks
Yao-Yuan Yang
Chi-Ning Chou
Kamalika Chaudhuri
AAML
16
25
0
10 Feb 2022
Competition-Level Code Generation with AlphaCode
Yujia Li
David Choi
Junyoung Chung
Nate Kushman
Julian Schrittwieser
...
Esme Sutherland Robson
Pushmeet Kohli
Nando de
Koray Kavukcuoglu
Oriol Vinyals
19
1,292
0
08 Feb 2022
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models
Boxin Wang
Wei Ping
Chaowei Xiao
P. Xu
M. Patwary
M. Shoeybi
Bo-wen Li
Anima Anandkumar
Bryan Catanzaro
9
64
0
08 Feb 2022
Survey of Hallucination in Natural Language Generation
Ziwei Ji
Nayeon Lee
Rita Frieske
Tiezheng Yu
D. Su
...
Delong Chen
Wenliang Dai
Ho Shu Chan
Andrea Madotto
Pascale Fung
HILM
LRM
40
2,232
0
08 Feb 2022
Datamodels: Predicting Predictions from Training Data
Andrew Ilyas
Sung Min Park
Logan Engstrom
Guillaume Leclerc
A. Madry
TDI
33
130
0
01 Feb 2022
Toward Training at ImageNet Scale with Differential Privacy
Alexey Kurakin
Shuang Song
Steve Chien
Roxana Geambasu
Andreas Terzis
Abhradeep Thakurta
25
99
0
28 Jan 2022
Recommendation Unlearning
C. L. Philip Chen
Fei Sun
M. Zhang
Bolin Ding
MU
25
85
0
18 Jan 2022
Reconstructing Training Data with Informed Adversaries
Borja Balle
Giovanni Cherubin
Jamie Hayes
MIACV
AAML
30
158
0
13 Jan 2022
MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound
Rowan Zellers
Jiasen Lu
Ximing Lu
Youngjae Yu
Yanpeng Zhao
Mohammadreza Salehi
Aditya Kusupati
Jack Hessel
Ali Farhadi
Yejin Choi
26
207
0
07 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
129
14,615
0
20 Dec 2021
Model Stealing Attacks Against Inductive Graph Neural Networks
Yun Shen
Xinlei He
Yufei Han
Yang Zhang
16
60
0
15 Dec 2021
Survey of Generative Methods for Social Media Analysis
Stan Matwin
Aristides Milios
P. Prałat
Amílcar Soares
Franccois Théberge
13
3
0
13 Dec 2021
Improving language models by retrieving from trillions of tokens
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
...
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM
RALM
60
1,013
0
08 Dec 2021
Membership Inference Attacks From First Principles
Nicholas Carlini
Steve Chien
Milad Nasr
Shuang Song
Andreas Terzis
Florian Tramèr
MIACV
MIALM
22
639
0
07 Dec 2021
How much do language models copy from their training data? Evaluating linguistic novelty in text generation using RAVEN
R. Thomas McCoy
P. Smolensky
Tal Linzen
Jianfeng Gao
Asli Celikyilmaz
SyDa
17
119
0
18 Nov 2021
LMdiff: A Visual Diff Tool to Compare Language Models
Hendrik Strobelt
Benjamin Hoover
Arvind Satyanarayan
Sebastian Gehrmann
VLM
29
19
0
02 Nov 2021
Availability Attacks Create Shortcuts
Da Yu
Huishuai Zhang
Wei Chen
Jian Yin
Tie-Yan Liu
AAML
20
57
0
01 Nov 2021
MetaICL: Learning to Learn In Context
Sewon Min
M. Lewis
Luke Zettlemoyer
Hannaneh Hajishirzi
LRM
52
466
0
29 Oct 2021
Adaptive Differentially Private Empirical Risk Minimization
Xiaoxia Wu
Lingxiao Wang
Irina Cristali
Quanquan Gu
Rebecca Willett
24
6
0
14 Oct 2021
Differentially Private Fine-tuning of Language Models
Da Yu
Saurabh Naik
A. Backurs
Sivakanth Gopi
Huseyin A. Inan
...
Y. Lee
Andre Manoel
Lukas Wutschitz
Sergey Yekhanin
Huishuai Zhang
134
346
0
13 Oct 2021
Towards a Unified View of Parameter-Efficient Transfer Learning
Junxian He
Chunting Zhou
Xuezhe Ma
Taylor Berg-Kirkpatrick
Graham Neubig
AAML
21
892
0
08 Oct 2021
Capturing Structural Locality in Non-parametric Language Models
Frank F. Xu
Junxian He
Graham Neubig
Vincent J. Hellendoorn
19
14
0
06 Oct 2021
Trustworthy AI: From Principles to Practices
Bo-wen Li
Peng Qi
Bo Liu
Shuai Di
Jingen Liu
Jiquan Pei
Jinfeng Yi
Bowen Zhou
117
355
0
04 Oct 2021
Towards Zero-Label Language Learning
Zirui Wang
Adams Wei Yu
Orhan Firat
Yuan Cao
SyDa
180
102
0
19 Sep 2021
Membership Inference Attacks Against Recommender Systems
Minxing Zhang
Z. Ren
Zihan Wang
Pengjie Ren
Zhumin Chen
Pengfei Hu
Yang Zhang
MIACV
AAML
10
83
0
16 Sep 2021
Just What do You Think You're Doing, Dave?' A Checklist for Responsible Data Use in NLP
Anna Rogers
Timothy Baldwin
Kobi Leins
102
64
0
14 Sep 2021
The five Is: Key principles for interpretable and safe conversational AI
M. Wahde
M. Virgolin
34
5
0
31 Aug 2021
Selective Differential Privacy for Language Modeling
Weiyan Shi
Aiqi Cui
Evan Li
R. Jia
Zhou Yu
13
68
0
30 Aug 2021
Large-Scale Differentially Private BERT
Rohan Anil
Badih Ghazi
Vineet Gupta
Ravi Kumar
Pasin Manurangsi
24
131
0
03 Aug 2021
Who's Afraid of Thomas Bayes?
Erick Galinkin
AAML
20
0
0
30 Jul 2021
Pointer Value Retrieval: A new benchmark for understanding the limits of neural network generalization
Chiyuan Zhang
M. Raghu
Jon M. Kleinberg
Samy Bengio
OOD
19
30
0
27 Jul 2021
Private Alternating Least Squares: Practical Private Matrix Completion with Tighter Rates
Steve Chien
Prateek Jain
Walid Krichene
Steffen Rendle
Shuang Song
Abhradeep Thakurta
Li Zhang
20
19
0
20 Jul 2021
Wordcraft: a Human-AI Collaborative Editor for Story Writing
Andy Coenen
Luke Davis
Daphne Ippolito
Emily Reif
Ann Yuan
LLMAG
21
71
0
15 Jul 2021
Deduplicating Training Data Makes Language Models Better
Katherine Lee
Daphne Ippolito
A. Nystrom
Chiyuan Zhang
Douglas Eck
Chris Callison-Burch
Nicholas Carlini
SyDa
242
591
0
14 Jul 2021
Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling
Emily Dinan
Gavin Abercrombie
A. S. Bergman
Shannon L. Spruit
Dirk Hovy
Y-Lan Boureau
Verena Rieser
32
105
0
07 Jul 2021
A Survey on Data Augmentation for Text Classification
Markus Bayer
M. Kaufhold
Christian A. Reuter
28
334
0
07 Jul 2021
Is GPT-3 Text Indistinguishable from Human Text? Scarecrow: A Framework for Scrutinizing Machine Text
Yao Dou
Maxwell Forbes
Rik Koncel-Kedziorski
Noah A. Smith
Yejin Choi
DeLMO
6
126
0
02 Jul 2021
Covariance-Aware Private Mean Estimation Without Private Covariance Estimation
Gavin Brown
Marco Gaboardi
Adam D. Smith
Jonathan R. Ullman
Lydia Zakynthinou
FedML
23
48
0
24 Jun 2021
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data
Paul Pu Liang
Terrance Liu
Anna Cai
Michal Muszynski
Ryo Ishii
Nicholas B. Allen
Randy P. Auerbach
David Brent
Ruslan Salakhutdinov
Louis-Philippe Morency
30
16
0
24 Jun 2021
Membership Inference on Word Embedding and Beyond
Saeed Mahloujifar
Huseyin A. Inan
Melissa Chase
Esha Ghosh
Marcello Hasegawa
MIACV
SILM
17
46
0
21 Jun 2021
Federated Learning with Buffered Asynchronous Aggregation
John Nguyen
Kshitiz Malik
Hongyuan Zhan
Ashkan Yousefpour
Michael G. Rabbat
Mani Malek
Dzmitry Huba
FedML
13
288
0
11 Jun 2021
Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning
Liangqiong Qu
Yuyin Zhou
Paul Pu Liang
Yingda Xia
Feifei Wang
Ehsan Adeli
L. Fei-Fei
D. Rubin
FedML
AI4CE
19
173
0
10 Jun 2021
Membership Inference Attack Susceptibility of Clinical Language Models
Abhyuday N. Jagannatha
Bhanu Pratap Singh Rawat
Hong-ye Yu
MIACV
22
60
0
16 Apr 2021
Does BERT Pretrained on Clinical Notes Reveal Sensitive Data?
Eric P. Lehman
Sarthak Jain
Karl Pichotta
Yoav Goldberg
Byron C. Wallace
OOD
MIACV
22
117
0
15 Apr 2021
Nine Potential Pitfalls when Designing Human-AI Co-Creative Systems
Daniel Buschek
Lukas Mecke
Florian Lehmann
Hai Dang
13
42
0
01 Apr 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures
Sushant Singh
A. Mahmood
AI4TS
55
92
0
23 Mar 2021
Membership Inference Attacks on Machine Learning: A Survey
Hongsheng Hu
Z. Salcic
Lichao Sun
Gillian Dobbie
Philip S. Yu
Xuyun Zhang
MIACV
30
412
0
14 Mar 2021
Privacy Regularization: Joint Privacy-Utility Optimization in Language Models
Fatemehsadat Mireshghallah
Huseyin A. Inan
Marcello Hasegawa
Victor Rühle
Taylor Berg-Kirkpatrick
Robert Sim
11
39
0
12 Mar 2021
A Study of Face Obfuscation in ImageNet
Kaiyu Yang
Jacqueline Yau
Li Fei-Fei
Jia Deng
Olga Russakovsky
PICV
CVBM
28
144
0
10 Mar 2021
Previous
1
2
3
4
5
6
7
8
Next