Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.01684
Cited By
v1
v2 (latest)
Harnessing large-language models to generate private synthetic text
2 June 2023
Alexey Kurakin
Natalia Ponomareva
Umar Syed
Liam MacDermed
Andreas Terzis
SILM
SyDa
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Harnessing large-language models to generate private synthetic text"
14 / 14 papers shown
Title
Private Training & Data Generation by Clustering Embeddings
Felix Y. Zhou
Samson Zhou
Vahab Mirrokni
Alessandro Epasto
Vincent Cohen-Addad
27
0
0
20 Jun 2025
Synthesize Privacy-Preserving High-Resolution Images via Private Textual Intermediaries
Haoxiang Wang
Zinan Lin
Da Yu
Huishuai Zhang
40
0
0
09 Jun 2025
Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications
Yanxiang Zhang
Zheng Xu
Shanshan Wu
Yuanbo Zhang
Daniel Ramage
KELM
48
0
0
24 May 2025
Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation
Vincent Koc
LM&MA
84
0
0
17 May 2025
A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage
Rui Xin
Niloofar Mireshghallah
Shuyue Stella Li
Michael Duan
Hyunwoo Kim
Yejin Choi
Yulia Tsvetkov
Sewoong Oh
Pang Wei Koh
155
7
0
28 Apr 2025
Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs
Bowen Tan
Zheng Xu
Eric P. Xing
Zhiting Hu
Shanshan Wu
SyDa
165
3
0
16 Mar 2025
PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation
Tao Fan
Guoqiang Ma
Yuanfeng Song
Lixin Fan
Kai Chen
Qiang Yang
90
1
0
21 Feb 2025
The Canary's Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text
Matthieu Meeus
Lukas Wutschitz
Santiago Zanella Béguelin
Shruti Tople
Reza Shokri
195
1
0
19 Feb 2025
Is API Access to LLMs Useful for Generating Private Synthetic Tabular Data?
Marika Swanberg
Ryan McKenna
Edo Roth
Albert Cheu
Peter Kairouz
143
2
0
10 Feb 2025
ShieldGemma: Generative AI Content Moderation Based on Gemma
Wenjun Zeng
Yuchi Liu
Ryan Mullins
Ludovic Peran
Joe Fernandez
...
Drew Proud
Piyush Kumar
Bhaktipriya Radharapu
Olivia Sturman
O. Wahltinez
AI4MH
115
49
0
31 Jul 2024
Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models
Ding Huang
Ting Li
Jian Huang
DiffM
95
1
0
06 Jun 2024
Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data
Haoran Li
Xinyuan Zhao
Dadi Guo
Hanlin Gu
Huiping Zhuang
Yuxing Han
Yangqiu Song
Lixin Fan
Qiang Yang
101
1
0
23 May 2024
Hot PATE: Private Aggregation of Distributions for Diverse Task
Edith Cohen
Benjamin Cohen-Wang
Xin Lyu
Jelani Nelson
Tamas Sarlos
Uri Stemmer
122
4
0
04 Dec 2023
Differentially Private Fine-tuning of Language Models
Da Yu
Saurabh Naik
A. Backurs
Sivakanth Gopi
Huseyin A. Inan
...
Y. Lee
Andre Manoel
Lukas Wutschitz
Sergey Yekhanin
Huishuai Zhang
262
373
0
13 Oct 2021
1