2302.04062
Cited By
Machine Learning for Synthetic Data Generation: A Review
8 February 2023
Yingzhou Lu
Minjie Shen
Huazheng Wang
Xiao Wang
Capucine Van Rechem
Tianfan Fu
Wenqi Wei
SyDa
Papers citing
"Machine Learning for Synthetic Data Generation: A Review"
26 / 76 papers shown
Title
Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance
Wenqi Wei
Ling Liu
31
16
0
02 Feb 2024
Socially Aware Synthetic Data Generation for Suicidal Ideation Detection Using Large Language Models
Hamideh Ghanadian
I. Nejadgholi
Hussein Al Osman
SyDa
40
18
0
25 Jan 2024
FinLLMs: A Framework for Financial Reasoning Dataset Generation with Large Language Models
Ziqiang Yuan
Kaiyuan Wang
Shoutai Zhu
Ye Yuan
Jingya Zhou
Yanlin Zhu
Wenqi Wei
42
5
0
19 Jan 2024
Uncertainty Quantification on Clinical Trial Outcome Prediction
Tianyi Chen
Yingzhou Lu
Nan Hao
Capucine Van Rechem
Jintai Chen
Tianfan Fu
25
21
0
07 Jan 2024
Generative Model-Driven Synthetic Training Image Generation: An Approach to Cognition in Rail Defect Detection
Rahatara Ferdousi
Chunsheng Yang
M. Anwar Hossain
Fedwa Laamarti
M. S. Hossain
Abdulmotaleb El Saddik
19
0
0
31 Dec 2023
GenoCraft: A Comprehensive, User-Friendly Web-Based Platform for High-Throughput Omics Data Analysis and Visualization
Yingzhou Lu
Minjie Shen
Yue Zhao
Chenhao Li
Fan Meng
Xiao Wang
David M. Herrington
Yue Wang
Tim Fu
Capucine Van Rechem
22
3
0
21 Dec 2023
SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples
Phillip Howard
Avinash Madasu
Tiep Le
Gustavo Lujan Moreno
Anahita Bhiwandiwalla
Vasudev Lal
55
16
0
30 Nov 2023
SurvTimeSurvival: Survival Analysis On The Patient With Multiple Visits/Records
Le Hung
Eng-Jon Ong
Miroslaw Bober
29
1
0
16 Nov 2023
SynDiffix: More accurate synthetic structured data
Paul Francis
Cristian Berneanu
Edon Gashi
30
1
0
16 Nov 2023
TarGEN: Targeted Data Generation with Large Language Models
Himanshu Gupta
Kevin Scaria
Ujjwala Anantheswaran
Shreyas Verma
Mihir Parmar
Saurabh Arjun Sawant
Chitta Baral
Swaroop Mishra
SyDa
32
8
0
27 Oct 2023
Reimagining Synthetic Tabular Data Generation through Data-Centric AI: A Comprehensive Benchmark
Lasse Hansen
Nabeel Seedat
M. Schaar
Andrija Petrović
44
19
0
25 Oct 2023
AI-Generated Images as Data Source: The Dawn of Synthetic Era
Zuhao Yang
Fangneng Zhan
Kunhao Liu
Muyu Xu
Shijian Lu
EGVM
31
18
0
03 Oct 2023
Security and Privacy on Generative Data in AIGC: A Survey
Tao Wang
Yushu Zhang
Shuren Qi
Ruoyu Zhao
Zhihua Xia
Jian Weng
56
44
0
18 Sep 2023
Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation
Charles O'Neill
Y. Ting 丁
I. Ciucă
Jack Miller
Thang Bui
SyDa
37
1
0
15 Aug 2023
ViT2EEG: Leveraging Hybrid Pretrained Vision Transformers for EEG Data
Ruiqi Yang
Eric Modesitt
ViT
31
12
0
01 Aug 2023
Trends in Machine Learning and Electroencephalogram (EEG): A Review for Undergraduate Researchers
Nathaniel Murungi
Michael Vinh Pham
Xu-feng Dai
Xiaodong Qu
23
14
0
06 Jul 2023
Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets
Brandon Smith
Miguel Farinha
S. Hall
Hannah Rose Kirk
Aleksandar Shtedritski
Max Bain
41
19
0
24 May 2023
PrivTrace: Differentially Private Trajectory Synthesis by Adaptive Markov Model
Haiming Wang
Zhikun Zhang
Tianhao Wang
Shibo He
Michael Backes
Jiming Chen
Yang Zhang
38
35
0
02 Oct 2022
Interpretable Molecular Graph Generation via Monotonic Constraints
Yuanqi Du
Xiaojie Guo
Amarda Shehu
Liang Zhao
63
19
0
28 Feb 2022
When do GANs replicate? On the choice of dataset size
Qianli Feng
Chen Guo
Fabian Benitez-Quiroz
Aleix M. Martinez
142
54
0
23 Feb 2022
Differentially Private Fine-tuning of Language Models
Da Yu
Saurabh Naik
A. Backurs
Sivakanth Gopi
Huseyin A. Inan
...
Y. Lee
Andre Manoel
Lukas Wutschitz
Sergey Yekhanin
Huishuai Zhang
134
347
0
13 Oct 2021
Robin Hood and Matthew Effects: Differential Privacy Has Disparate Impact on Synthetic Data
Georgi Ganev
Bristena Oprisanu
Emiliano De Cristofaro
37
57
0
23 Sep 2021
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
D. Song
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
290
1,815
0
14 Dec 2020
A Survey on Bias and Fairness in Machine Learning
Ninareh Mehrabi
Fred Morstatter
N. Saxena
Kristina Lerman
Aram Galstyan
SyDa
FaML
323
4,212
0
23 Aug 2019
Junction Tree Variational Autoencoder for Molecular Graph Generation
Wengong Jin
Regina Barzilay
Tommi Jaakkola
224
1,340
0
12 Feb 2018
Generating Multi-label Discrete Patient Records using Generative Adversarial Networks
E. Choi
Siddharth Biswal
B. Malin
J. Duke
Walter F. Stewart
Jimeng Sun
SyDa
GAN
156
569
0
19 Mar 2017