2402.17012
Cited By
Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models
26 February 2024
Jeffrey G. Wang
Jason Wang
Marvin Li
Seth Neel
MIALM
Papers citing "Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models"
2 / 2 papers shown
Title
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, ..., Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy
AIMat
253
1,986
0
31 Dec 2020
Extracting Training Data from Large Language Models
Nicholas Carlini, Florian Tramèr, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, ..., Tom B. Brown, D. Song, Ulfar Erlingsson, Alina Oprea, Colin Raffel
MLAU
SILM
290
1,814
0
14 Dec 2020