ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.13335
  4. Cited By
Latent Space Disentanglement in Diffusion Transformers Enables Zero-shot
  Fine-grained Semantic Editing

Latent Space Disentanglement in Diffusion Transformers Enables Zero-shot Fine-grained Semantic Editing

23 August 2024
Zitao Shuai
Chenwei Wu
Zhengxu Tang
Bowen Song
Liyue Shen
ArXiv (abs)PDFHTML

Papers citing "Latent Space Disentanglement in Diffusion Transformers Enables Zero-shot Fine-grained Semantic Editing"

24 / 24 papers shown
Title
Contrastive Denoising Score for Text-guided Latent Diffusion Image
  Editing
Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing
Hyelin Nam
Gihyun Kwon
Geon Yeong Park
Jong Chul Ye
DiffM
60
29
0
30 Nov 2023
GPT-4 Technical Report
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAGMLLM
1.5K
14,631
0
15 Mar 2023
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image
  Diffusion Models
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
DiffM
111
514
0
31 Jan 2023
Scalable Diffusion Models with Transformers
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
100
2,386
0
19 Dec 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
207
1,830
0
17 Nov 2022
DiffEdit: Diffusion-based semantic image editing with mask guidance
DiffEdit: Diffusion-based semantic image editing with mask guidance
Guillaume Couairon
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
143
507
0
20 Oct 2022
DreamFusion: Text-to-3D using 2D Diffusion
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
161
2,433
0
29 Sep 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
200
1,789
0
02 Aug 2022
Compositional Visual Generation with Composable Diffusion Models
Compositional Visual Generation with Composable Diffusion Models
Nan Liu
Shuang Li
Yilun Du
Antonio Torralba
J. Tenenbaum
DiffMCoGe
174
525
0
03 Jun 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLMDiffM
413
6,897
0
13 Apr 2022
Generative Adversarial Networks
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
283
30,149
0
01 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
466
15,665
0
20 Dec 2021
Self-Supervised Learning Disentangled Group Representation as Feature
Self-Supervised Learning Disentangled Group Representation as Feature
Tan Wang
Zhongqi Yue
Jianqiang Huang
Qianru Sun
Hanwang Zhang
OOD
75
69
0
28 Oct 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
166
1,235
0
30 May 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
244
7,933
0
11 May 2021
Closed-Form Factorization of Latent Semantics in GANs
Closed-Form Factorization of Latent Semantics in GANs
Yujun Shen
Bolei Zhou
GAN
114
591
0
13 Jul 2020
GANSpace: Discovering Interpretable GAN Controls
GANSpace: Discovering Interpretable GAN Controls
Erik Härkönen
Aaron Hertzmann
J. Lehtinen
Sylvain Paris
123
902
0
06 Apr 2020
Interpreting the Latent Space of GANs for Semantic Face Editing
Interpreting the Latent Space of GANs for Semantic Face Editing
Yujun Shen
Jinjin Gu
Xiaoou Tang
Bolei Zhou
CVBMGAN
117
1,123
0
25 Jul 2019
What Does BERT Look At? An Analysis of BERT's Attention
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
218
1,601
0
11 Jun 2019
Linguistic Knowledge and Transferability of Contextual Representations
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
135
735
0
21 Mar 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
599
10,590
0
12 Dec 2018
Towards a Definition of Disentangled Representations
Towards a Definition of Disentangled Representations
I. Higgins
David Amos
David Pfau
S. Racanière
Loic Matthey
Danilo Jimenez Rezende
Alexander Lerchner
OCLDRL
103
480
0
05 Dec 2018
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg3DV
1.9K
77,341
0
18 May 2015
Deep Learning Face Attributes in the Wild
Deep Learning Face Attributes in the Wild
Ziwei Liu
Ping Luo
Xiaogang Wang
Xiaoou Tang
CVBM
244
8,424
0
28 Nov 2014
1