ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.05233
  4. Cited By
Diffusion Models Beat GANs on Image Synthesis
v1v2v3v4 (latest)

Diffusion Models Beat GANs on Image Synthesis

11 May 2021
Prafulla Dhariwal
Alex Nichol
ArXiv (abs)PDFHTMLGithub (6795★)

Papers citing "Diffusion Models Beat GANs on Image Synthesis"

50 / 5,158 papers shown
Title
Facial Expression-Enhanced TTS: Combining Face Representation and
  Emotion Intensity for Adaptive Speech
Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech
Yunji Chu
Yunseob Shim
Unsang Park
100
0
0
24 Sep 2024
Unleashing the Potential of Synthetic Images: A Study on Histopathology
  Image Classification
Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification
Leire Benito-Del-Valle
Aitor Alvarez-Gila
Itziar Eguskiza
C. L. Saratxaga
DiffMMedIm
87
0
0
24 Sep 2024
ASD-Diffusion: Anomalous Sound Detection with Diffusion Models
ASD-Diffusion: Anomalous Sound Detection with Diffusion Models
Fengrun Zhang
Xiang Xie
Kai Guo
DiffM
151
0
0
24 Sep 2024
Zero-Shot Detection of AI-Generated Images
Zero-Shot Detection of AI-Generated Images
D. Cozzolino
Giovanni Poggi
Matthias Nießner
L. Verdoliva
166
15
0
24 Sep 2024
TFG: Unified Training-Free Guidance for Diffusion Models
TFG: Unified Training-Free Guidance for Diffusion Models
Haotian Ye
Haowei Lin
Jiaqi Han
Minkai Xu
Sheng Liu
Yitao Liang
Jianzhu Ma
James Zou
Stefano Ermon
74
31
0
24 Sep 2024
VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient
  Speaker-Adaptive Text-to-Speech via Autoguidance
VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance
Jiheum Yeom
Heeseung Kim
Jooyoung Choi
Che Hyun Lee
Nohil Park
Sungroh Yoon
67
1
0
24 Sep 2024
ImPoster: Text and Frequency Guidance for Subject Driven Action
  Personalization using Diffusion Models
ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models
D. Kothandaraman
Kuldeep Kulkarni
Sumit Shekhar
Balaji Vasan Srinivasan
Dinesh Manocha
DiffM
97
2
0
24 Sep 2024
PRESTO: Fast Motion Planning Using Diffusion Models Based on Key-Configuration Environment Representation
PRESTO: Fast Motion Planning Using Diffusion Models Based on Key-Configuration Environment Representation
Mingyo Seo
Yoonyoung Cho
Yoonchang Sung
Peter Stone
Yuke Zhu
Beomjoon Kim
DiffM
161
0
0
24 Sep 2024
Mixture of Efficient Diffusion Experts Through Automatic Interval and
  Sub-Network Selection
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
Alireza Ganjdanesh
Yan Kang
Yuchen Liu
Richard Y. Zhang
Zhe Lin
Heng Huang
DiffM
118
5
0
23 Sep 2024
TextToon: Real-Time Text Toonify Head Avatar from Single Video
TextToon: Real-Time Text Toonify Head Avatar from Single Video
Luchuan Song
Lele Chen
Celong Liu
Pinxin Liu
Chenliang Xu
DiffM
98
12
0
23 Sep 2024
Neural Differential Appearance Equations
Neural Differential Appearance Equations
Chen Liu
Tobias Ritschel
106
0
0
23 Sep 2024
DH-FaceVid-1K: A Large-Scale High-Quality Dataset for Face Video Generation
DH-FaceVid-1K: A Large-Scale High-Quality Dataset for Face Video Generation
Donglin Di
Hao Feng
Wenzhang Sun
Yongjia Ma
Hao Li
Wei Chen
Xiaofei Gou
Tonghua Su
Xun Yang
CVBM
153
2
0
23 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLMDiffM
204
16
0
23 Sep 2024
Multi-modal Generative AI: Multi-modal LLMs, Diffusions and the Unification
Multi-modal Generative AI: Multi-modal LLMs, Diffusions and the Unification
X. Wang
Yuwei Zhou
Bin Huang
Hong Chen
Wenwu Zhu
DiffM
177
1
0
23 Sep 2024
D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic
  Robotic Manipulation
D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation
Songlin Wei
Haoran Geng
Jiayi Chen
Congyue Deng
Wenbo Cui
Chengyang Zhao
Xiaomeng Fang
Leonidas Guibas
He Wang
MDE
99
9
0
22 Sep 2024
Implicit Dynamical Flow Fusion (IDFF) for Generative Modeling
Implicit Dynamical Flow Fusion (IDFF) for Generative Modeling
Mohammad R. Rezaei
Rahul G. Krishnan
Milos R. Popovic
M. Lankarany
DiffM
149
0
0
22 Sep 2024
Content-aware Tile Generation using Exterior Boundary Inpainting
Content-aware Tile Generation using Exterior Boundary Inpainting
Sam Sartor
Pieter Peers
DiffM
73
1
0
21 Sep 2024
JVID: Joint Video-Image Diffusion for Visual-Quality and
  Temporal-Consistency in Video Generation
JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation
Hadrien Reynaud
Matthew Baugh
Mischa Dombrowski
Sarah Cechnicka
Qingjie Meng
Bernhard Kainz
VLM
72
0
0
21 Sep 2024
Present and Future Generalization of Synthetic Image Detectors
Present and Future Generalization of Synthetic Image Detectors
Pablo Bernabeu Perez
Enrique Lopez-Cuena
Dario Garcia-Gasulla
57
0
0
21 Sep 2024
Adversarial Attacks on Parts of Speech: An Empirical Study in
  Text-to-Image Generation
Adversarial Attacks on Parts of Speech: An Empirical Study in Text-to-Image Generation
G M Shahariar
Jia Chen
Jiachen Li
Yue Dong
90
2
0
21 Sep 2024
Physics-Informed Latent Diffusion for Multimodal Brain MRI Synthesis
Physics-Informed Latent Diffusion for Multimodal Brain MRI Synthesis
Sven Lüpke
Yousef Yeganeh
Ehsan Adeli
Nassir Navab
Azade Farshad
MedImDiffM
72
5
0
20 Sep 2024
Towards the Discovery of Down Syndrome Brain Biomarkers Using Generative
  Models
Towards the Discovery of Down Syndrome Brain Biomarkers Using Generative Models
Jordi Malé
Juan Fortea
Mateus Rozalem Aranha
Yann Heuzé
Neus Martínez-Abadías
Xavier Sevillano
DiffM
69
1
0
20 Sep 2024
Generative Aerodynamic Design with Diffusion Probabilistic Models
Generative Aerodynamic Design with Diffusion Probabilistic Models
Thomas Wagenaar
Simone Mancini
Andrés Mateo-Gabín
DiffMAI4CE
66
0
0
20 Sep 2024
Invisible Servoing: a Visual Servoing Approach with Return-Conditioned Latent Diffusion
Invisible Servoing: a Visual Servoing Approach with Return-Conditioned Latent Diffusion
Bishoy Gerges
Barbara Bazzana
Nicolò Botteghi
Youssef Aboudorra
Antonio Franchi
DiffM
185
1
0
20 Sep 2024
What does guidance do? A fine-grained analysis in a simple setting
What does guidance do? A fine-grained analysis in a simple setting
Muthu Chidambaram
Khashayar Gatmiry
Sitan Chen
Holden Lee
Jianfeng Lu
63
15
0
19 Sep 2024
DNI: Dilutional Noise Initialization for Diffusion Video Editing
DNI: Dilutional Noise Initialization for Diffusion Video Editing
Sunjae Yoon
Gwanhyeong Koo
Ji Woo Hong
Chang D. Yoo
DiffM
91
4
0
19 Sep 2024
LVCD: Reference-based Lineart Video Colorization with Diffusion Models
LVCD: Reference-based Lineart Video Colorization with Diffusion Models
Zhitong Huang
Mohan Zhang
Jing Liao
DiffMVGen
116
14
0
19 Sep 2024
AudioEditor: A Training-Free Diffusion-Based Audio Editing Framework
AudioEditor: A Training-Free Diffusion-Based Audio Editing Framework
Yuhang Jia
Yang Chen
Jinghua Zhao
Shiwan Zhao
Wenjia Zeng
Yong Chen
Yong Qin
DiffM
71
2
0
19 Sep 2024
Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation
  for Real-Time 3D Human Motion Prediction
Bayesian-Optimized One-Step Diffusion Model with Knowledge Distillation for Real-Time 3D Human Motion Prediction
Sibo Tian
Minghui Zheng
Xiao Liang
DiffM
79
0
0
19 Sep 2024
Fundus image enhancement through direct diffusion bridges
Fundus image enhancement through direct diffusion bridges
Sehui Kim
Hyungjin Chung
Se Hie Park
Eui-Sang Chung
Kayoung Yi
Jong Chul Ye
DiffMMedIm
78
0
0
19 Sep 2024
Denoising diffusion models for high-resolution microscopy image
  restoration
Denoising diffusion models for high-resolution microscopy image restoration
Pamela Osuna-Vargas
Maren H. Wehrheim
Lucas Zinz
Johanna Rahm
Ashwin Balakrishnan
Alexandra Kaminer
Mike Heilemann
Matthias Kaschube
DiffMMedIm
79
1
0
18 Sep 2024
Generation of Complex 3D Human Motion by Temporal and Spatial
  Composition of Diffusion Models
Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models
Lorenzo Mandelli
Stefano Berretti
DiffM
105
3
0
18 Sep 2024
InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation
  Inversion in Guided Diffusion Models
InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models
Yan Zheng
Lemeng Wu
DiffMMDE
43
0
0
18 Sep 2024
GUNet: A Graph Convolutional Network United Diffusion Model for Stable
  and Diversity Pose Generation
GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation
Shuowen Liang
Sisi Li
Qingyun Wang
Cen Zhang
Kaiquan Zhu
Tian Yang
DiffM
59
0
0
18 Sep 2024
Robust Symmetry Detection via Riemannian Langevin Dynamics
Robust Symmetry Detection via Riemannian Langevin Dynamics
Jihyeon Je
Jiayi Liu
Guandao Yang
Boyang Deng
Shengqu Cai
Gordon Wetzstein
Or Litany
Leonidas Guibas
52
7
0
18 Sep 2024
DiffESM: Conditional Emulation of Temperature and Precipitation in Earth
  System Models with 3D Diffusion Models
DiffESM: Conditional Emulation of Temperature and Precipitation in Earth System Models with 3D Diffusion Models
Seth Bassetti
Brian Hutchinson
Claudia Tebaldi
Ben Kravitz
87
6
0
17 Sep 2024
Ultrasound Image Enhancement with the Variance of Diffusion Models
Ultrasound Image Enhancement with the Variance of Diffusion Models
Yuxin Zhang
Clément Huneau
Jérôme Idier
Diana Mateus
MedIm
105
1
0
17 Sep 2024
SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable
  Channel-Wise Membrane Thresholds
SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds
Zhixing Hou
Maoxu Gao
Hang Yu
Mengyu Yang
Chio-in Ieong
95
1
0
17 Sep 2024
MM2Latent: Text-to-facial image generation and editing in GANs with
  multimodal assistance
MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Debin Meng
Christos Tzelepis
Ioannis Patras
Georgios Tzimiropoulos
DiffM
98
0
0
17 Sep 2024
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Tianqi Chen
Shujian Zhang
Mingyuan Zhou
DiffM
209
6
0
17 Sep 2024
DroneDiffusion: Robust Quadrotor Dynamics Learning with Diffusion Models
DroneDiffusion: Robust Quadrotor Dynamics Learning with Diffusion Models
Avirup Das
Rishabh Dev Yadav
Sihao Sun
Mingfei Sun
Samuel Kaski
Wei Pan
92
3
0
17 Sep 2024
Optimizing Resource Consumption in Diffusion Models through
  Hallucination Early Detection
Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection
Federico Betti
Lorenzo Baraldi
Lorenzo Baraldi
Rita Cucchiara
N. Sebe
DiffM
88
0
0
16 Sep 2024
Incorporating Classifier-Free Guidance in Diffusion Model-Based
  Recommendation
Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation
Noah Buchanan
Susan Gauch
Quan Mai
DiffMVLM
91
1
0
16 Sep 2024
MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion
MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion
Lehong Wu
Lilang Lin
Jiahang Zhang
Yi Ma
Jiaying Liu
DiffM
116
2
0
16 Sep 2024
Taming Diffusion Models for Image Restoration: A Review
Taming Diffusion Models for Image Restoration: A Review
Ziwei Luo
Fredrik K. Gustafsson
Zheng Zhao
Jens Sjölund
Thomas B. Schön
115
7
0
16 Sep 2024
PixelBytes: Catching Unified Representation for Multimodal Generation
PixelBytes: Catching Unified Representation for Multimodal Generation
Fabien Furfaro
53
0
0
16 Sep 2024
Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based
  models
Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models
Alexander Koch
O. U. Aydin
A. Hilbert
Jana Rieger
Satoru Tanioka
F. Ishida
Dietmar Frey
DiffMMedIm
77
1
0
16 Sep 2024
LASERS: LAtent Space Encoding for Representations with Sparsity for
  Generative Modeling
LASERS: LAtent Space Encoding for Representations with Sparsity for Generative Modeling
Xin Li
Anand Sarwate
58
0
0
16 Sep 2024
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval
Yifei Xin
Xuxin Cheng
Zhihong Zhu
Xusheng Yang
Yuexian Zou
DiffM
103
5
0
16 Sep 2024
InteractPro: A Unified Framework for Motion-Aware Image Composition
InteractPro: A Unified Framework for Motion-Aware Image Composition
Weijing Tao
Xiaofeng Yang
Miaomiao Cui
Guosheng Lin
DiffM
90
2
0
16 Sep 2024
Previous
123...282930...102103104
Next