ResearchTrend.AI
How Faithful is your Synthetic Data? Sample-level Metrics for Evaluating and Auditing Generative Models

17 February 2021
Ahmed Alaa
B. V. Breugel
Evgeny S. Saveliev
M. Schaar

Papers citing "How Faithful is your Synthetic Data? Sample-level Metrics for Evaluating and Auditing Generative Models"

50 / 108 papers shown
Metrics that matter: Evaluating image quality metrics for medical image generation
Yash Deo
Yan Jia
T. Lassila
William A. P. Smith
T. Lawton
Siyuan Kang
Alejandro F. Frangi
Ibrahim Habli
39
0
0
12 May 2025
Boosting Statistic Learning with Synthetic Data from Pretrained Large Models
Jialong Jiang
Wenkang Hu
Jian Huang
Yuling Jiao
Xu Liu
DiffM
50
0
0
08 May 2025
What's Wrong with Your Synthetic Tabular Data? Using Explainable AI to Evaluate Generative Models
Jan Kapar
Niklas Koenen
Martin Jullum
64
0
0
29 Apr 2025
Sparks of Science: Hypothesis Generation Using Structured Paper Data
Charles O'Neill
Tirthankar Ghosal
Roberta Răileanu
Mike Walmsley
Thang Bui
Kevin Schawinski
I. Ciucă
LRM
51
0
0
17 Apr 2025
Critical Challenges and Guidelines in Evaluating Synthetic Tabular Data: A Systematic Review
Nazia Nafis
Inaki Esnaola
Alvaro Martinez-Perez
Maria-Cruz Villa-Uriol
Venet Osmani
LMTD
52
0
0
10 Apr 2025
Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation Framework
Andrey Sidorenko
Michael Platzer
Mario Scriminaci
P. Tiwald
44
0
0
02 Apr 2025
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models
Tadeusz Dziarmaga
Marcin Kądziołka
Artur Kasymov
Marcin Mazur
EGVM
100
0
0
24 Mar 2025
How Well Does Your Tabular Generator Learn the Structure of Tabular Data?
Xiangjian Jiang
Nikola Simidjievski
M. Jamnik
LMTD
80
0
0
13 Mar 2025
Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets
H. Kniesel
Pedro Hermosilla
Timo Ropinski
60
0
0
12 Mar 2025
Privacy-Preserving Fair Synthetic Tabular Data
Fatima Jahan Sarmin
Atiquer R. Rahman
Christopher J. Henry
Noman Mohammed
45
0
0
04 Mar 2025
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions
Zhong Li
Qi Huang
Lincen Yang
Jiayang Shi
Zhao Yang
N. V. Stein
Thomas Bäck
M. Leeuwen
DiffM
44
0
0
24 Feb 2025
Evaluating Inter-Column Logical Relationships in Synthetic Tabular Data Generation
Yunbo Long
Liming Xu
Alexandra Brintrup
80
1
0
06 Feb 2025
Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models
Ran Xu
Hejie Cui
Yue Yu
Xuan Kan
Wenqi Shi
Yuchen Zhuang
Wei Jin
Joyce C. Ho
Carl Yang
66
13
0
28 Jan 2025
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data
P. Tiwald
Ivona Krchova
Andrey Sidorenko
Mariana Vargas-Vieyra
Mario Scriminaci
Michael Platzer
47
1
0
21 Jan 2025
Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation
Mohammad Khalil
Farhad Vadiee
Ronas Shakya
Qinyi Liu
SyDa
36
7
0
03 Jan 2025
Understanding and Mitigating Memorization in Diffusion Models for Tabular Data
Zhengyu Fang
Zhimeng Jiang
Huiyuan Chen
Xiao Li
Jing Li
76
2
0
15 Dec 2024
Exploring the Impact of Synthetic Data on Human Gesture Recognition Tasks Using GANs
George Kontogiannis
Pantelis Tzamalis
Sotiris Nikoletseas
66
0
0
09 Dec 2024
A Review on Generative AI Models for Synthetic Medical Text, Time Series, and Longitudinal Data
Mohammad Loni
Fatemeh Poursalim
Mehdi Asadi
Arash Gharehbaghi
SyDa
81
0
0
19 Nov 2024
Conditional Vendi Score: An Information-Theoretic Approach to Diversity Evaluation of Prompt-based Generative Models
Mohammad Jalali
Azim Ospanov
Amin Gohari
Farzan Farnia
EGVM
37
2
0
05 Nov 2024
GRADE: Quantifying Sample Diversity in Text-to-Image Models
Royi Rassin
Aviv Slobodkin
Shauli Ravfogel
Yanai Elazar
Yoav Goldberg
85
1
0
29 Oct 2024
Diffusion-nested Auto-Regressive Synthesis of Heterogeneous Tabular Data
Hengrui Zhang
Liancheng Fang
Qitian Wu
Philip S. Yu
DiffM
LMTD
31
1
0
28 Oct 2024
Fair4Free: Generating High-fidelity Fair Synthetic Samples using Data Free Distillation
Md Fahim Sikder
Daniel de Leng
Fredrik Heintz
29
1
0
02 Oct 2024
Forte : Finding Outliers with Representation Typicality Estimation
Debargha Ganguly
Warren Morningstar
A. Yu
Vipin Chaudhary
OODD
41
0
0
02 Oct 2024
Introducing SDICE: An Index for Assessing Diversity of Synthetic Medical Datasets
Mohammed Talha Alam
Raza Imam
Mohammad Areeb Qazi
Asim Ukaye
Karthik Nandakumar
MedIm
18
0
0
28 Sep 2024
Synthetic Data Generation and Automated Multidimensional Data Labeling for AI/ML in General and Circular Coordinates
Alice Williams
Boris Kovalerchuk
28
0
0
03 Sep 2024
DIAGen: Diverse Image Augmentation with Generative Models
Tobias Lingenberg
Markus Reuter
Gopika Sudhakaran
Dominik Gojny
Stefan Roth
Simone Schaub-Meyer
DiffM
25
3
0
26 Aug 2024
Towards Realistic Synthetic User-Generated Content: A Scaffolding Approach to Generating Online Discussions
K. Balog
John Palowitch
Barbara Ikica
Filip Radlinski
Hamidreza Alvari
Mehdi Manshadi
SyDa
39
1
0
15 Aug 2024
Towards a Scalable Reference-Free Evaluation of Generative Models
Azim Ospanov
Jingwei Zhang
Mohammad Jalali
Xuenan Cao
Andrej Bogdanov
Farzan Farnia
EGVM
32
1
0
03 Jul 2024
FairX: A comprehensive benchmarking tool for model analysis using fairness, utility, and explainability
Md Fahim Sikder
R. Ramachandranpillai
Daniel de Leng
Fredrik Heintz
31
2
0
20 Jun 2024
Advancing Retail Data Science: Comprehensive Evaluation of Synthetic Data
Yu Xia
Chi-Hua Wang
Joshua Mabry
Guang Cheng
ELM
32
4
0
19 Jun 2024
Data Plagiarism Index: Characterizing the Privacy Risk of Data-Copying in Tabular Generative Models
Joshua Ward
Chi-Hua Wang
Guang Cheng
34
3
0
18 Jun 2024
Deep Temporal Deaggregation: Large-Scale Spatio-Temporal Generative Models
David Bergstrom
Mattias Tiger
Fredrik Heintz
DiffM
AI4TS
30
0
0
18 Jun 2024
Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework
Ruibo Tu
Zineb Senane
Lele Cao
Cheng Zhang
Hedvig Kjellström
G. Henter
CML
45
4
0
12 Jun 2024
ClavaDDPM: Multi-relational Data Synthesis with Cluster-guided Diffusion Models
Wei Pang
Masoumeh Shafieinejad
Lucy Liu
Xi He
38
8
0
28 May 2024
Permissioned Blockchain-based Framework for Ranking Synthetic Data Generators
Narasimha Raghavan
Mohammad Hossein Tabatabaei
Severin Elvatun
V. Vallevik
S. Larønningen
J. F. Nygård
40
2
0
12 May 2024
Synthetic Data in Radiological Imaging: Current State and Future Outlook
E. Sizikova
A. Badal
Jana G. Delfino
Miguel Lago
Brandon Nelson
Niloufar Saharkhiz
B. Sahiner
Ghada Zamzmi
Aldo Badano
MedIm
40
5
0
08 May 2024
Identification of Novel Modes in Generative Models via Fourier-based Differential Clustering
Jingwei Zhang
Mohammad Jalali
Cheuk Ting Li
Farzan Farnia
31
3
0
04 May 2024
Why Tabular Foundation Models Should Be a Research Priority
B. V. Breugel
M. Schaar
LMTD
VLM
AI4CE
39
33
0
02 May 2024
Bt-GAN: Generating Fair Synthetic Healthdata via Bias-transforming Generative Adversarial Networks
R. Ramachandranpillai
Md Fahim Sikder
David Bergstrom
Fredrik Heintz
SyDa
30
6
0
21 Apr 2024
Synthetic Medical Imaging Generation with Generative Adversarial Networks For Plain Radiographs
John R. McNulty
Lee Kho
Alexandria L. Case
Charlie Fornaca
Drew Johnston
David Slater
J. Abzug
Sybil A. Russell
MedIm
33
5
0
28 Mar 2024
An Interpretable Evaluation of Entropy-based Novelty of Generative Models
Jingwei Zhang
Cheuk Ting Li
Farzan Farnia
EGVM
30
6
0
27 Feb 2024
Exploring Precision and Recall to assess the quality and diversity of LLMs
Florian Le Bronnec
Alexandre Verine
Benjamin Négrevergne
Y. Chevaleyre
Alexandre Allauzen
40
14
0
16 Feb 2024
Systematic Assessment of Tabular Data Synthesis Algorithms
Yuntao Du
Ninghui Li
27
4
0
09 Feb 2024
PQMass: Probabilistic Assessment of the Quality of Generative Models using Probability Mass Estimation
Pablo Lemos
Sammy N. Sharief
Nikolay Malkin
Laurence Perreault-Levasseur
Yashar Hezaveh
21
3
0
06 Feb 2024
A Bias-Variance Decomposition for Ensembles over Multiple Synthetic Datasets
Ossi Raisa
Antti Honkela
67
0
0
06 Feb 2024
A primer on synthetic health data
Jennifer Anne Bartell
Sander Boisen Valentin
Anders Krogh
Henning Langberg
Martin Bøgsted
24
1
0
31 Jan 2024
Can I trust my fake data -- A comprehensive quality assessment framework for synthetic tabular data in healthcare
V. Vallevik
Aleksandar Babic
S. Marshall
Severin Elvatun
Helga Brogger
S. Alagaratnam
B. Edwin
Narasimha Raghavan
Anne Kjersti Befring
J. F. Nygård
39
19
0
24 Jan 2024
Downstream Task-Oriented Generative Model Selections on Synthetic Data Training for Fraud Detection Models
Yinan Cheng
ChiHua Wang
Vamsi K. Potluru
T. Balch
Guang Cheng
15
7
0
01 Jan 2024
The Challenges of Image Generation Models in Generating Multi-Component Images
Tham Yik Foong
Shashank Kotyan
Poyuan Mao
Danilo Vasconcellos Vargas
EGVM
41
1
0
22 Nov 2023
Time-series Generation by Contrastive Imitation
Daniel Jarrett
Ioana Bica
M. Schaar
AI4TS
13
24
0
02 Nov 2023