Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.12572
Cited By
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation
22 November 2022
Narek Tumanyan
Michal Geyer
Shai Bagon
Tali Dekel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation"
50 / 512 papers shown
Title
TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models
Teng-Fang Hsiao
Bo-Kai Ruan
Yi-Lun Wu
Tzu-Ling Lin
Hong-Han Shuai
VLM
48
0
0
19 Mar 2025
SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models
Subhadeep Koley
Tapas Kumar Dutta
Aneeshan Sain
Pinaki Nath Chowdhury
A. Bhunia
Yi-Zhe Song
VLM
66
0
0
18 Mar 2025
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
Seokhyeon Hong
Chaelin Kim
Serin Yoon
Junghyun Nam
Sihun Cha
Junyong Noh
DiffM
VGen
68
1
0
18 Mar 2025
DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode
Junjia Huang
Pengxiang Yan
Jinhang Cai
Jiyang Liu
Zhao Wang
Yitong Wang
Xinglong Wu
Guanbin Li
DiffM
70
0
0
17 Mar 2025
A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models
Ziqiang Li
Jun Li
Lizhi Xiong
Zhangjie Fu
Zechao Li
VLM
54
0
0
17 Mar 2025
Adams Bashforth Moulton Solver for Inversion and Editing in Rectified Flow
Yongjia Ma
Donglin Di
Xuan Liu
Xiaokai Chen
Lei Fan
Wei Chen
Tonghua Su
45
0
0
17 Mar 2025
FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
Minghan Li
C. Xie
Y. Wu
Lei Zhang
M. Wang
DiffM
VGen
57
0
0
17 Mar 2025
RASA: Replace Anyone, Say Anything -- A Training-Free Framework for Audio-Driven and Universal Portrait Video Editing
Tianrui Pan
Lin Liu
Jie Liu
X. Zhang
J. Tang
Gangshan Wu
Q. Tian
DiffM
VGen
48
0
0
14 Mar 2025
LUSD: Localized Update Score Distillation for Text-Guided Image Editing
Worameth Chinchuthakun
Tossaporn Saengja
Nontawat Tritrong
Pitchaporn Rewatbowornwong
Pramook Khungurn
Supasorn Suwajanakorn
DiffM
46
0
0
14 Mar 2025
PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing
H. Iqbal
Nazmul Karim
Umar Khalid
Azib Farooq
Z. Zhong
Jing Hua
Chen Chen
DiffM
3DGS
VGen
47
0
0
14 Mar 2025
DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation
Hongbin Lin
Zilu Guo
Y. Zhang
Shuaicheng Niu
Yafeng Li
R. Zhang
Shuguang Cui
Zhen Li
DiffM
53
0
0
14 Mar 2025
EmoAgent: Multi-Agent Collaboration of Plan, Edit, and Critic, for Affective Image Manipulation
Qi Mao
Haobo Hu
Yujie He
Difei Gao
Haokun Chen
Libiao Jin
DiffM
45
0
0
14 Mar 2025
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models
Runze He
Bo Cheng
Yuhang Ma
Qingxiang Jia
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Liebucha Wu
Dawei Leng
Yuhui Yin
DiffM
VLM
49
0
0
13 Mar 2025
MoEdit: On Learning Quantity Perception for Multi-object Image Editing
Yanfeng Li
Kahou Chan
Yue Sun
C. Lam
Tong Tong
Zitong Yu
Keren Fu
Xiaohong Liu
Tao Tan
DiffM
38
0
0
13 Mar 2025
DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image
Qi Zhao
Zhan Ma
Pan Zhou
VGen
67
0
0
13 Mar 2025
AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption
Joonsung Jeon
Woo Jae Kim
Suhyeon Ha
Sooel Son
Sung-eui Yoon
DiffM
AAML
54
0
0
13 Mar 2025
I2V3D: Controllable image-to-video generation with 3D guidance
Zhiyuan Zhang
Dongdong Chen
J. Liao
VGen
53
0
0
12 Mar 2025
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
Yucheng Suo
Fan Ma
Kaixin Shen
Linchao Zhu
Yi Yang
VLM
49
0
0
12 Mar 2025
Zero-Shot Subject-Centric Generation for Creative Application Using Entropy Fusion
Kaifeng Zou
Xiaoyi Feng
Peng Wang
Tao Huang
Zizhou Huang
Zhang Haihang
Yuntao Zou
Dagang Li
DiffM
49
0
0
12 Mar 2025
MGHanD: Multi-modal Guidance for authentic Hand Diffusion
Taehyeon Eum
Jieun Choi
Tae-Kyun Kim
40
0
0
11 Mar 2025
NullFace: Training-Free Localized Face Anonymization
Han-Wei Kung
Tuomas Varanka
Terence Sim
N. Sebe
DiffM
PICV
66
0
0
11 Mar 2025
AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models
Bo Huang
Wenlun Xu
Qizhuo Han
Haodong Jing
Ying Li
DiffM
36
0
0
10 Mar 2025
What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Xavier Thomas
Deepti Ghadiyaram
DiffM
89
0
0
09 Mar 2025
Consistent Image Layout Editing with Diffusion Models
Tao Xia
Yudi Zhang
Ting Liu Lei Zhang
DiffM
59
1
0
09 Mar 2025
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
Xiang Gao
Shuai Yang
Jiaying Liu
DiffM
46
0
0
08 Mar 2025
RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification
Zhen Yang
Guibao Shen
Liang Hou
Mushui Liu
Luozhou Wang
Xin Tao
Pengfei Wan
Di Zhang
Ying-cong Chen
DiffM
74
0
0
04 Mar 2025
CacheQuant: Comprehensively Accelerated Diffusion Models
Xuewen Liu
Zhikai Li
Qingyi Gu
DiffM
35
0
0
03 Mar 2025
Morpheus: Text-Driven 3D Gaussian Splat Shape and Color Stylization
Jamie Wynn
Z. Qureshi
Jakub Powierza
Jamie Watson
Mohamed Sayed
3DGS
DiffM
71
0
0
03 Mar 2025
Phantom: Training Robots Without Robots Using Only Human Videos
Marion Lepert
Jiaying Fang
Jeannette Bohg
OffRL
40
8
0
02 Mar 2025
Attention Distillation: A Unified Approach to Visual Characteristics Transfer
Yang Zhou
Xu Gao
Zichong Chen
Hui Huang
DiffM
65
5
0
27 Feb 2025
Tight Inversion: Image-Conditioned Inversion for Real Image Editing
Edo Kadosh
Nir Goren
Or Patashnik
Daniel Garibi
Daniel Cohen-Or
DiffM
65
0
0
27 Feb 2025
Identity-preserving Distillation Sampling by Fixed-Point Iterator
SeonHwa Kim
Jiwon Kim
S. Park
Donghoon Ahn
Jiwon Kang
Seungryong Kim
Kyong Hwan Jin
Eunju Cha
43
0
0
27 Feb 2025
Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training
Botao Ye
Sifei Liu
Xueting Li
Marc Pollefeys
Ming Yang
64
0
0
25 Feb 2025
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing
Xiangpeng Yang
Linchao Zhu
Hehe Fan
Yi Yang
DiffM
VGen
41
5
0
24 Feb 2025
Control-CLIP: Decoupling Category and Style Guidance in CLIP for Specific-Domain Generation
Zexi Jia
Chuanwei Huang
Hongyan Fei
Yeshuang Zhu
Zhiqiang Yuan
Jinchao Zhang
Jie Zhou
DiffM
VLM
69
0
0
17 Feb 2025
Precise Parameter Localization for Textual Generation in Diffusion Models
Łukasz Staniszewski
Bartosz Cywiñski
Franziska Boenisch
Kamil Deja
Adam Dziedzic
DiffM
157
0
0
17 Feb 2025
MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models
Kamil Garifullin
Maxim Nikolaev
Andrey Kuznetsov
Aibek Alanov
58
0
0
10 Feb 2025
AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection
Shuheng Zhang
Y. Liu
Hongbo Zhou
Jun Peng
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
VGen
38
0
0
08 Feb 2025
CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes
Tim Broedermann
Christos Sakaridis
Yuqian Fu
Luc Van Gool
57
23
0
28 Jan 2025
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control
Aosong Feng
Weikang Qiu
Jinbin Bai
Xiao Zhang
Zhen Dong
Kaicheng Zhou
Rex Ying
Leandros Tassiulas
DiffM
58
6
0
28 Jan 2025
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
Andrey Palaev
Adil Mehmood Khan
S. M. Ahsan Kazmi
DiffM
48
0
0
23 Jan 2025
3D Object Manipulation in a Single Image using Generative Models
Ruisi Zhao
Zechuan Zhang
Zongxin Yang
Yi Yang
38
1
0
22 Jan 2025
IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion
Tharun Anand
Aryan Garg
Kaushik Mitra
VGen
DiffM
47
0
0
13 Jan 2025
Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning
Maomao Li
Lijian Lin
Yunfei Liu
Ye Zhu
Yu Li
DiffM
VGen
39
0
0
11 Jan 2025
EditAR: Unified Conditional Generation with Autoregressive Models
Jiteng Mu
Nuno Vasconcelos
X. Wang
DiffM
38
4
0
08 Jan 2025
Nested Attention: Semantic-aware Attention Values for Concept Personalization
Or Patashnik
Rinon Gal
Daniil Ostashev
Sergey Tulyakov
Kfir Aberman
Daniel Cohen-Or
DiffM
38
5
0
03 Jan 2025
MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation
Haoyu Zheng
Wenqiao Zhang
Zheqi Lv
Yu Zhong
Yang Dai
...
Yongliang Shen
Juncheng Billy Li
Dongping Zhang
Siliang Tang
Yueting Zhuang
DiffM
VGen
57
0
0
31 Dec 2024
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Jiehui Huang
Xiao Dong
Wenhui Song
Zheng Chong
Jun Zhou
...
Long Chen
Hanhui Li
Yiqiang Yan
Shengcai Liao
Xiaodan Liang
DiffM
50
19
0
31 Dec 2024
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Hao Fei
Shengqiong Wu
H. Zhang
Tat-Seng Chua
Shuicheng Yan
64
37
0
31 Dec 2024
Edicho: Consistent Image Editing in the Wild
Qingyan Bai
Hao Ouyang
Yinghao Xu
Qiuyu Wang
Ceyuan Yang
Ka Leong Cheng
Yujun Shen
Qifeng Chen
DiffM
74
1
0
30 Dec 2024
Previous
1
2
3
4
5
...
9
10
11
Next