ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.10741
  4. Cited By
GLIDE: Towards Photorealistic Image Generation and Editing with
  Text-Guided Diffusion Models

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

20 December 2021
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
ArXivPDFHTML

Papers citing "GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models"

50 / 2,597 papers shown
Title
Prompt-to-Prompt Image Editing with Cross Attention Control
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
92
1,692
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using
  Textual Inversion
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Y. Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
36
1,782
0
02 Aug 2022
Exploring the GLIDE model for Human Action-effect Prediction
Exploring the GLIDE model for Human Action-effect Prediction
Fangjun Li
David C. Hogg
Anthony G. Cohn
37
0
0
01 Aug 2022
Testing Relational Understanding in Text-Guided Image Generation
Testing Relational Understanding in Text-Guided Image Generation
C. Conwell
T. Ullman
EGVM
152
64
0
29 Jul 2022
GAUDI: A Neural Architect for Immersive 3D Scene Generation
GAUDI: A Neural Architect for Immersive 3D Scene Generation
Miguel Angel Bautista
Pengsheng Guo
Samira Abnar
Walter A. Talbott
Alexander Toshev
...
Shuangfei Zhai
Hanlin Goh
Daniel Ulbricht
Afshin Dehghan
J. Susskind
SyDa
3DGS
34
135
0
27 Jul 2022
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented
  Diffusion Models
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
18
70
0
26 Jul 2022
What is Healthy? Generative Counterfactual Diffusion for Lesion
  Localization
What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
Pedro Sanchez
Antanas Kascenas
Xiao Liu
Alison Q. OÑeil
Sotirios A. Tsaftaris
MedIm
DiffM
26
63
0
25 Jul 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
27
296
0
20 Jul 2022
Controllable Data Generation by Deep Learning: A Review
Controllable Data Generation by Deep Learning: A Review
Shiyu Wang
Yuanqi Du
Xiaojie Guo
Bo Pan
Zhaohui Qin
Liang Zhao
31
28
0
19 Jul 2022
Towards Diverse and Faithful One-shot Adaption of Generative Adversarial
  Networks
Towards Diverse and Faithful One-shot Adaption of Generative Adversarial Networks
Yabo Zhang
Mingshuai Yao
Yuxiang Wei
Zhilong Ji
Jinfeng Bai
W. Zuo
13
23
0
18 Jul 2022
Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image
  Synthesis
Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image Synthesis
Sangyun Lee
Hyungjin Chung
Jaehyeon Kim
Jong Chul Ye
DiffM
29
45
0
16 Jul 2022
EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic
  Differential Equations
EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
Min Zhao
Fan Bao
Chongxuan Li
Jun Zhu
DiffM
38
189
0
14 Jul 2022
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval
Jinbin Bai
Chunhui Liu
Feiyue Ni
Haofan Wang
Mengying Hu
Xiaofeng Guo
Lele Cheng
45
11
0
11 Jul 2022
Continuous Methods : Hamiltonian Domain Translation
Continuous Methods : Hamiltonian Domain Translation
Emmanuel Menier
M. Bucci
Mouadh Yagoubi
L. Mathelin
Marc Schoenauer
19
1
0
08 Jul 2022
Jointly Harnessing Prior Structures and Temporal Consistency for Sign
  Language Video Generation
Jointly Harnessing Prior Structures and Temporal Consistency for Sign Language Video Generation
Yuchen Suo
Zhedong Zheng
Xiaohan Wang
Bang Zhang
Yi Yang
SLR
24
16
0
08 Jul 2022
Back to the Source: Diffusion-Driven Test-Time Adaptation
Back to the Source: Diffusion-Driven Test-Time Adaptation
Jin Gao
Jialing Zhang
Xihui Liu
Trevor Darrell
Evan Shelhamer
Dequan Wang
TTA
11
51
0
07 Jul 2022
Accelerating Score-based Generative Models with Preconditioned Diffusion
  Sampling
Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling
He Ma
Li Zhang
Xiatian Zhu
Jianfeng Feng
DiffM
92
25
0
05 Jul 2022
American == White in Multimodal Language-and-Image AI
American == White in Multimodal Language-and-Image AI
Robert Wolfe
Aylin Caliskan
VLM
27
46
0
01 Jul 2022
Semantic Image Synthesis via Diffusion Models
Semantic Image Synthesis via Diffusion Models
Weilun Wang
Weilun Wang
Wen-gang Zhou
Dongdong Chen
Dong Chen
Lu Yuan
Houqiang Li
DiffM
228
177
0
30 Jun 2022
Text-Driven Stylization of Video Objects
Text-Driven Stylization of Video Objects
Sebastian Loeschcke
Serge J. Belongie
Sagie Benaim
VGen
DiffM
25
16
0
24 Jun 2022
The ArtBench Dataset: Benchmarking Generative Models with Artworks
The ArtBench Dataset: Benchmarking Generative Models with Artworks
Peiyuan Liao
Xiuyu Li
Xihui Liu
Kurt Keutzer
19
47
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
107
1,062
0
22 Jun 2022
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
Minguk Kang
Joonghyuk Shin
Jaesik Park
EGVM
16
67
0
19 Jun 2022
Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for
  Inverse Problems
Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems
Giannis Daras
Y. Dagan
A. Dimakis
C. Daskalakis
BDL
31
15
0
18 Jun 2022
Lossy Compression with Gaussian Diffusion
Lossy Compression with Gaussian Diffusion
Lucas Theis
Tim Salimans
Matthew D. Hoffman
Fabian Mentzer
DiffM
28
77
0
17 Jun 2022
Estimating the Optimal Covariance with Imperfect Mean in Diffusion
  Probabilistic Models
Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models
Fan Bao
Chongxuan Li
Jiacheng Sun
Jun Zhu
Bo Zhang
DiffM
30
72
0
15 Jun 2022
CARD: Classification and Regression Diffusion Models
CARD: Classification and Regression Diffusion Models
Xizewen Han
Huangjie Zheng
Mingyuan Zhou
DiffM
49
109
0
15 Jun 2022
Bootstrapping Multi-view Representations for Fake News Detection
Bootstrapping Multi-view Representations for Fake News Detection
Qichao Ying
Xiaoxiao Hu
Yangming Zhou
Zhenxing Qian
Dan Zeng
Shiming Ge
24
45
0
12 Jun 2022
gDDIM: Generalized denoising diffusion implicit models
gDDIM: Generalized denoising diffusion implicit models
Qinsheng Zhang
Molei Tao
Yongxin Chen
DiffM
34
112
0
11 Jun 2022
CLIP-Actor: Text-Driven Recommendation and Stylization for Animating
  Human Meshes
CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes
Youwang Kim
Ji-Yeon Kim
Tae-Hyun Oh
3DH
CLIP
30
48
0
09 Jun 2022
Intra-agent speech permits zero-shot task acquisition
Intra-agent speech permits zero-shot task acquisition
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
36
9
0
07 Jun 2022
Blended Latent Diffusion
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
59
373
0
06 Jun 2022
Volumetric Disentanglement for 3D Scene Manipulation
Volumetric Disentanglement for 3D Scene Manipulation
Sagie Benaim
Frederik Warburg
Peter Ebert Christensen
Serge J. Belongie
25
15
0
06 Jun 2022
Learning with Capsules: A Survey
Learning with Capsules: A Survey
Fabio De Sousa Ribeiro
Kevin Duarte
Miles Everett
Georgios Leontidis
M. Shah
3DPC
MedIm
29
19
0
06 Jun 2022
Diffusion-GAN: Training GANs with Diffusion
Diffusion-GAN: Training GANs with Diffusion
Zhendong Wang
Huangjie Zheng
Pengcheng He
Weizhu Chen
Mingyuan Zhou
DiffM
24
221
0
05 Jun 2022
Priors in Deep Image Restoration and Enhancement: A Survey
Priors in Deep Image Restoration and Enhancement: A Survey
Yunfan Lu
Yi Lin
Hao Wu
Yuan Luo
Xueye Zheng
Hui Xiong
Lin Wang
38
3
0
04 Jun 2022
Compositional Visual Generation with Composable Diffusion Models
Compositional Visual Generation with Composable Diffusion Models
Nan Liu
Shuang Li
Yilun Du
Antonio Torralba
J. Tenenbaum
DiffM
CoGe
37
496
0
03 Jun 2022
Style-Content Disentanglement in Language-Image Pretraining
  Representations for Zero-Shot Sketch-to-Image Synthesis
Style-Content Disentanglement in Language-Image Pretraining Representations for Zero-Shot Sketch-to-Image Synthesis
Jan Zuiderveld
DRL
15
1
0
03 Jun 2022
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
Jie Shi
Chenfei Wu
Jian Liang
Xiang Liu
Nan Duan
DiffM
14
25
0
01 Jun 2022
Elucidating the Design Space of Diffusion-Based Generative Models
Elucidating the Design Space of Diffusion-Based Generative Models
Tero Karras
M. Aittala
Timo Aila
S. Laine
DiffM
50
1,832
0
01 Jun 2022
Improved Vector Quantized Diffusion Models
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
178
63
0
31 May 2022
Few-Shot Diffusion Models
Few-Shot Diffusion Models
Giorgio Giannone
Didrik Nielsen
Ole Winther
DiffM
183
49
0
30 May 2022
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech
  with Untranscribed Data
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
Sungwon Kim
Heeseung Kim
Sung-Hoon Yoon
DiffM
196
52
0
30 May 2022
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for
  Binaural Audio Synthesis
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
Yichong Leng
Zehua Chen
Junliang Guo
Haohe Liu
Jiawei Chen
...
Lei He
Xiang-Yang Li
Tao Qin
Sheng Zhao
Tie-Yan Liu
DiffM
53
58
0
30 May 2022
CyCLIP: Cyclic Contrastive Language-Image Pretraining
CyCLIP: Cyclic Contrastive Language-Image Pretraining
Shashank Goel
Hritik Bansal
S. Bhatia
Ryan A. Rossi
Vishwa Vinay
Aditya Grover
CLIP
VLM
179
132
0
28 May 2022
Multimodal Fake News Detection via CLIP-Guided Learning
Multimodal Fake News Detection via CLIP-Guided Learning
Yangming Zhou
Qichao Ying
Zhenxing Qian
Sheng Li
Xinpeng Zhang
10
53
0
28 May 2022
Pretraining is All You Need for Image-to-Image Translation
Pretraining is All You Need for Image-to-Image Translation
Tengfei Wang
Ting Zhang
Bo Zhang
Hao Ouyang
Dong Chen
Qifeng Chen
Fang Wen
DiffM
189
178
0
25 May 2022
Mutual Information Divergence: A Unified Metric for Multimodal
  Generative Models
Mutual Information Divergence: A Unified Metric for Multimodal Generative Models
Jin-Hwa Kim
Yunji Kim
Jiyoung Lee
Kang Min Yoo
Sang-Woo Lee
EGVM
36
32
0
25 May 2022
Accelerating Diffusion Models via Early Stop of the Diffusion Process
Accelerating Diffusion Models via Early Stop of the Diffusion Process
Zhaoyang Lyu
Xu Xudong
Ceyuan Yang
Dahua Lin
Bo Dai
DiffM
195
92
0
25 May 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
60
5,778
0
23 May 2022
Previous
123...505152
Next