Cascaded Diffusion Models for High Fidelity Image Generation

30 May 2021
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans

Papers citing "Cascaded Diffusion Models for High Fidelity Image Generation"

50 / 874 papers shown
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Oğuzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
99
74
0
11 Dec 2023
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models
Denis Zavadski
Johann-Friedrich Feiden
Carsten Rother
DiffM
81
10
0
11 Dec 2023
A Note on the Convergence of Denoising Diffusion Probabilistic Models
S. Mbacke
Omar Rivasplata
DiffM
85
6
0
10 Dec 2023
Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting
Xiaofeng Yang
Yiwen Chen
Cheng Chen
Chi Zhang
Yi Tian Xu
Xulei Yang
Fayao Liu
Guosheng Lin
3DGS, DiffM
70
18
0
08 Dec 2023
RL Dreams: Policy Gradient Optimization for Score Distillation based 3D Generation
Aradhya Neeraj Mathur
Phu-Cuong Pham
Aniket Bera
Ojaswa Sharma
69
0
0
08 Dec 2023
GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen
Mengmeng Xu
Jiawei Ren
Yuren Cong
Sen He
Yanping Xie
Animesh Sinha
Ping Luo
Tao Xiang
Juan-Manuel Perez-Rua
VGen
99
41
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen, DiffM
64
43
0
07 Dec 2023
DemoCaricature: Democratising Caricature Generation with a Rough Sketch
Dar-Yen Chen
A. Bhunia
Subhadeep Koley
Aneeshan Sain
Pinaki Nath Chowdhury
Yi-Zhe Song
94
8
0
07 Dec 2023
Diffusing Colors: Image Colorization with Text Guided Diffusion
Nir Zabari
Aharon Azulay
Alexey Gorkor
Tavi Halperin
Ohad Fried
DiffM
141
20
0
07 Dec 2023
Resolution Chromatography of Diffusion Models
Juno Hwang
Yong-Hyun Park
Junghyo Jo
DiffM
55
1
0
07 Dec 2023
Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion
Kira Prabhu
Jane Wu
Lynn Tsai
Peter Hedman
Dan B. Goldman
Ben Poole
Michael Broxton
DiffM
66
8
0
06 Dec 2023
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Xuanchi Ren
Jiahui Huang
Fangyin Wei
Ken Museth
Sanja Fidler
Francis Williams
86
66
0
06 Dec 2023
Alchemist: Parametric Control of Material Properties with Diffusion Models
Prafull Sharma
Varun Jampani
Yuanzhen Li
Xuhui Jia
Dmitry Lagun
Frédo Durand
William T. Freeman
Mark J. Matthews
DiffM
130
26
0
05 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
153
203
0
05 Dec 2023
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Zhuoran Yu
Chenchen Zhu
Sean Culatana
Raghuraman Krishnamoorthi
Fanyi Xiao
Yong Jae Lee
177
15
0
04 Dec 2023
DPHMs: Diffusion Parametric Head Models for Depth-based Tracking
Jiapeng Tang
Angela Dai
Yinyu Nie
Lev Markhasin
Justus Thies
Matthias Niessner
DiffM
120
10
0
02 Dec 2023
Consistent Mesh Diffusion
Julian Knodt
Xifeng Gao
76
3
0
01 Dec 2023
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
Pengxiang Li
Kai Chen
Zhili Liu
Ruiyuan Gao
Lanqing Hong
Guo Zhou
Hua Yao
Dit-Yan Yeung
Huchuan Lu
Xu Jia
VGen, DiffM
66
0
0
01 Dec 2023
DFU: scale-robust diffusion model for zero-shot super-resolution image generation
Alex Havrilla
Kevin Rojas
Wenjing Liao
Molei Tao
94
2
0
30 Nov 2023
DREAM: Diffusion Rectification and Estimation-Adaptive Models
Jinxin Zhou
Tianyu Ding
Tianyi Chen
Jiachen Jiang
Ilya Zharkov
Zhihui Zhu
Luming Liang
93
7
0
30 Nov 2023
MotionEditor: Editing Video Motion via Content-Aware Diffusion
Shuyuan Tu
Qi Dai
Zhi-Qi Cheng
Hang-Rui Hu
Xintong Han
Zuxuan Wu
Yu-Gang Jiang
DiffM, VGen
102
31
0
30 Nov 2023
Detailed Human-Centric Text Description-Driven Large Scene Synthesis
Gwanghyun Kim
Dong un Kang
H. Seo
Hayeon Kim
Se Young Chun
3DV, DiffM
61
2
0
30 Nov 2023
Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning
Ruxiao Duan
Yaoyao Liu
Jieneng Chen
Adam Kortylewski
Alan Yuille
DiffM, VLM
102
1
0
30 Nov 2023
Diffusion Models Without Attention
Jing Nathan Yan
Jiatao Gu
Alexander M. Rush
111
69
0
30 Nov 2023
ElasticDiffusion: Training-free Arbitrary Size Image Generation through Global-Local Content Separation
Moayed Haji-Ali
Guha Balakrishnan
Vicente Ordonez
179
27
0
30 Nov 2023
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
Sherwin Bahmani
Ivan Skorokhodov
Victor Rong
Gordon Wetzstein
Leonidas Guibas
Peter Wonka
Sergey Tulyakov
Jeong Joon Park
Andrea Tagliasacchi
David B. Lindell
DiffM
143
112
0
29 Nov 2023
SODA: Bottleneck Diffusion Models for Representation Learning
Drew A. Hudson
Daniel Zoran
Mateusz Malinowski
Andrew Kyle Lampinen
Andrew Jaegle
James L. McClelland
Loic Matthey
Felix Hill
Alexander Lerchner
DiffM
106
56
0
29 Nov 2023
Leveraging Graph Diffusion Models for Network Refinement Tasks
Puja Trivedi
Ryan Rossi
David Arbour
Tong Yu
Franck Dernoncourt
Sungchul Kim
Nedim Lipka
Namyong Park
Nesreen K. Ahmed
Danai Koutra
DiffM
88
0
0
29 Nov 2023
SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors
Dave Zhenyu Chen
Haoxuan Li
Hsin-Ying Lee
Sergey Tulyakov
Matthias Nießner
DiffM
76
29
0
28 Nov 2023
Unlocking Spatial Comprehension in Text-to-Image Diffusion Models
Mohammad Mahdi Derakhshani
Menglin Xia
Harkirat Singh Behl
Cees G. M. Snoek
Victor Rühle
86
2
0
28 Nov 2023
ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions
Anindita Ghosh
Rishabh Dabral
Vladislav Golyanik
Christian Theobalt
Philipp Slusallek
99
23
0
28 Nov 2023
Improving Denoising Diffusion Probabilistic Models via Exploiting Shared Representations
Delaram Pirhayatifard
Taha Toghani
Guha Balakrishnan
César A. Uribe
DiffM
93
1
0
27 Nov 2023
GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions
Jiemin Fang
Junjie Wang
Xiaopeng Zhang
Lingxi Xie
Qi Tian
3DGS, DiffM
130
117
0
27 Nov 2023
Tell2Design: A Dataset for Language-Guided Floor Plan Generation
Sicong Leng
Yangqiaoyu Zhou
Mohammed Haroon Dupty
W. Lee
Sam Joyce
Wei Lu
3DV
67
15
0
27 Nov 2023
Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models
C. Rota
M. Buzzelli
Joost van de Weijer
DiffM
104
3
0
27 Nov 2023
LLMGA: Multimodal Large Language Model based Generation Assistant
Bin Xia
Shiyin Wang
Yingfan Tao
Yitong Wang
Jiaya Jia
MLLM
95
12
0
27 Nov 2023
LFSRDiff: Light Field Image Super-Resolution via Diffusion Models
Wentao Chao
Fuqing Duan
Xuechun Wang
Yingqian Wang
Guanghui Wang
DiffM
113
6
0
27 Nov 2023
Functional Diffusion
Biao Zhang
Peter Wonka
DiffM
193
9
0
26 Nov 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
330
1,190
0
25 Nov 2023
FreePIH: Training-Free Painterly Image Harmonization with Diffusion Model
Ruibin Li
Jingcai Guo
Song Guo
Qihua Zhou
Jiewei Zhang
DiffM
100
9
0
25 Nov 2023
Synthetic Shifts to Initial Seed Vector Exposes the Brittle Nature of Latent-Based Diffusion Models
Poyuan Mao
Shashank Kotyan
Tham Yik Foong
Danilo Vasconcellos Vargas
81
6
0
24 Nov 2023
ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model
Eslam Mohamed Bakr
Liangbing Zhao
Vincent Tao Hu
Matthieu Cord
Patrick Pérez
Mohamed Elhoseiny
74
0
0
24 Nov 2023
DemoFusion: Democratising High-Resolution Image Generation With No $$$
Ruoyi Du
Dongliang Chang
Timothy M. Hospedales
Yi-Zhe Song
Zhanyu Ma
127
56
0
24 Nov 2023
A Somewhat Robust Image Watermark against Diffusion-based Editing Models
Mingtian Tan
Tianhao Wang
Somesh Jha
WIGM
81
3
0
22 Nov 2023
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz
Seung Wook Kim
Jun Gao
Sanja Fidler
Andreas Geiger
Karsten Kreis
100
6
0
22 Nov 2023
FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation
Chenliang Zhou
Fangcheng Zhong
Param Hanji
Zhilin Guo
Kyle Fogarty
Alejandro Sztrajman
Hongyun Gao
Cengiz Öztireli
82
3
0
20 Nov 2023
Pyramid Diffusion for Fine 3D Large Scene Generation
Yuheng Liu
Xinke Li
Xueting Li
Lu Qi
Chongshou Li
Ming-Hsuan Yang
145
19
0
20 Nov 2023
MoVideo: Motion-Aware Video Generation with Diffusion Models
Christos Sakaridis
Yuchen Fan
Kai Zhang
Radu Timofte
Luc Van Gool
Rakesh Ranjan
DiffM, VGen
85
10
0
19 Nov 2023
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
Rohit Girdhar
Mannat Singh
Andrew Brown
Quentin Duval
S. Azadi
Sai Saketh Rambhatla
Akbar Shah
Xi Yin
Devi Parikh
Ishan Misra
DiffM, VGen
129
209
0
17 Nov 2023
A Study on Altering the Latent Space of Pretrained Text to Speech Models for Improved Expressiveness
Mathias Vogel
DiffM
47
0
0
17 Nov 2023