Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
v1
v2 (latest)
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Title
SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution
Runyi Hu
Jing Zhang
Yiming Li
Jiwei Li
Qing Guo
Han Qiu
Tianwei Zhang
WIGM
AAML
165
2
0
13 Dec 2024
OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Yuanzhi Zhu
R. Wang
Shilin Lu
Junnan Li
Hanshu Yan
Peng Sun
SupR
184
5
0
12 Dec 2024
Olympus: A Universal Task Router for Computer Vision Tasks
Yuanze Lin
Yunsheng Li
Dongdong Chen
Weijian Xu
Ronald Clark
Philip Torr
VLM
ObjD
548
1
0
12 Dec 2024
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Xi Chen
Zhifei Zhang
He Zhang
Yuqian Zhou
Seunggeun Kim
...
Nanxuan Zhao
Yilin Wang
Hui Ding
Zhe Lin
Hengshuang Zhao
VGen
DiffM
187
29
0
10 Dec 2024
FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing
Yingying Deng
Xiangyu He
Changwang Mei
Peisong Wang
Fan Tang
124
9
0
10 Dec 2024
PrEditor3D: Fast and Precise 3D Shape Editing
Ziya Erkoç
Can Gümeli
Chaoyang Wang
Matthias Nießner
Angela Dai
Peter Wonka
Hsin-Ying Lee
Peiye Zhuang
146
3
0
09 Dec 2024
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing
Jinbin Bai
Wei Chow
L. Yang
Hefei Ling
Juncheng Billy Li
Hao Zhang
Shuicheng Yan
187
10
0
05 Dec 2024
DIVE: Taming DINO for Subject-Driven Video Editing
Yi Huang
Wei Xiong
He Zhang
Chaoqi Chen
Jianzhuang Liu
Mingfu Yan
Shifeng Chen
VGen
DiffM
119
1
0
04 Dec 2024
Composed Image Retrieval for Training-Free Domain Conversion
Nikos Efthymiadis
Bill Psomas
Zakaria Laskar
Konstantinos Karantzalos
Yannis Avrithis
Ondřej Chum
Giorgos Tolias
119
0
0
04 Dec 2024
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Qu He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Yang Liu
Yun Wang
Chengjie Wang
Xuelong Li
Jing Zhang
DiffM
212
1
0
04 Dec 2024
Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation
Yiftach Edelstein
Or Patashnik
Dana Cohen-Bar
Lihi Zelnik-Manor
138
0
0
03 Dec 2024
GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing
Khawar Islam
M. Zaheer
Arif Mahmood
Karthik Nandakumar
Naveed Akhtar
DiffM
213
2
0
03 Dec 2024
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion
Kai He
Chin-Hsuan Wu
Igor Gilitschenski
DiffM
3DGS
129
0
0
02 Dec 2024
3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting
Ziyang Yan
Lei Li
Yihua Shao
Siyu Chen
Wuzong Kai
Lei Li
Hao Zhao
Fabio Remondino
3DGS
168
3
0
02 Dec 2024
PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control
Ruichen Wang
Junliang Zhang
Qingsong Xie
Chen Chen
H. Lu
DiffM
127
1
0
02 Dec 2024
InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences
Chenyang Zhu
Kai Li
Yue Ma
Longxiang Tang
Chengyu Fang
Chubin Chen
Qifeng Chen
Xiu Li
163
15
0
02 Dec 2024
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Bolin Lai
F. Xu
Miao Liu
Xiaoliang Dai
Nikhil Mehta
...
Zeyi Huang
James M. Rehg
Sangmin Lee
Ning Zhang
Tong Xiao
138
3
0
02 Dec 2024
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Ziqi Pang
Tianyuan Zhang
Fujun Luan
Yunze Man
Hao Tan
Kai Zhang
William T. Freeman
Yu-Xiong Wang
VGen
135
20
0
02 Dec 2024
OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking
Xinyu Zhang
Zecheng Tang
Zhipei Xu
Runyi Li
Youmin Xu
Bin Chen
Feng Gao
Jian Zhang
WIGM
199
5
0
02 Dec 2024
Lightweight Contenders: Navigating Semi-Supervised Text Mining through Peer Collaboration and Self Transcendence
Qianren Mao
Weifeng Jiang
Qingbin Liu
Chenghua Lin
Qian Li
Xianqing Wen
Jianxin Li
Jinhu Lu
108
0
0
01 Dec 2024
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
Yatian Pang
Bin Zhu
Bin Lin
Mingzhe Zheng
Francis E. H. Tay
Ser-Nam Lim
Harry Yang
Li Yuan
VGen
3DH
124
7
0
30 Nov 2024
Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing
Wenyi Mo
Tianyu Zhang
Yalong Bai
Fuchun Sun
Ji-Rong Wen
DiffM
120
0
0
29 Nov 2024
LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair
Xue Song
Jiequan Cui
Haiqi Zhang
Jiaxin Shi
Jingjing Chen
Chi Zhang
Yu-Gang Jiang
172
1
0
28 Nov 2024
3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes
Tejaswini Medi
Arianna Rampini
Pradyumna Reddy
P. Jayaraman
Margret Keuper
DiffM
166
0
0
28 Nov 2024
Steering Rectified Flow Models in the Vector Field for Controlled Image Generation
Maitreya Patel
Song Wen
Dimitris N. Metaxas
Yezhou Yang
DiffM
196
6
0
27 Nov 2024
Diffusion Self-Distillation for Zero-Shot Customized Image Generation
Shengqu Cai
Eric Ryan Chan
Yunzhi Zhang
Leonidas Guibas
Jiajun Wu
Gordon Wetzstein
132
13
0
27 Nov 2024
Training Data Synthesis with Difficulty Controlled Diffusion Model
Zerun Wang
Jiafeng Mao
Xueting Wang
Toshihiko Yamasaki
DiffM
119
0
0
27 Nov 2024
Generative Image Layer Decomposition with Visual Effects
Jinrui Yang
Qing Liu
Yuezun Li
Seunggeun Kim
D. Pakhomov
Mengwei Ren
Jianming Zhang
Zhe Lin
Cihang Xie
Yuyin Zhou
DiffM
131
3
0
26 Nov 2024
InsightEdit: Towards Better Instruction Following for Image Editing
Yingjing Xu
Jie Kong
Jiazhi Wang
Xiao Pan
Bo Lin
Qiang Liu
DiffM
128
1
0
26 Nov 2024
Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis
Xinyu Hou
Zongsheng Yue
Xiaoming Li
Chen Change Loy
VGen
DiffM
140
0
0
26 Nov 2024
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
Sudarshan Rajagopalan
Nithin Gopalakrishnan Nair
Jay N. Paranjape
Vishal M. Patel
DiffM
167
1
0
26 Nov 2024
Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Eric Hanchen Jiang
Yasi Zhang
Zhi Zhang
Yixin Wan
Andrew Lizarraga
Shufan Li
Ying Nian Wu
DiffM
130
3
0
25 Nov 2024
UVCG: Leveraging Temporal Consistency for Universal Video Protection
KaiZhou Li
Jindong Gu
Xinchun Yu
Junjie Cao
Yansong Tang
Xiao-Ping Zhang
AAML
121
0
0
25 Nov 2024
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
164
9
0
25 Nov 2024
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Hanhui Wang
Yihua Zhang
Ruizheng Bai
Yue Zhao
Sijia Liu
Zhuowen Tu
AAML
PICV
165
2
0
25 Nov 2024
DynamicAvatars: Accurate Dynamic Facial Avatars Reconstruction and Precise Editing with Diffusion Models
Yangyang Qian
Yuan Sun
Yu-Xiao Guo
DiffM
487
0
0
24 Nov 2024
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
P. Xu
Boyuan Jiang
Xiaobin Hu
Donghao Luo
Qu He
Jing Zhang
Chengjie Wang
Yunsheng Wu
Charles Ling
Boyu Wang
227
3
0
24 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
Hao Zhang
Yueting Zhuang
DiffM
236
29
0
24 Nov 2024
FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
Jiawei Zhang
Zijian Wu
Zhiyang Liang
Yicheng Gong
Dongfang Hu
Yao Yao
Xun Cao
Hao Zhu
3DGS
196
2
0
23 Nov 2024
GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
Éloi Zablocki
Valentin Gerard
Amaia Cardiel
Eric Gaussier
Matthieu Cord
Eduardo Valle
164
0
0
23 Nov 2024
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
Ryugo Morita
Stanislav Frolov
Brian B. Moser
Takahiro Shirakawa
Ko Watanabe
Andreas Dengel
Jinjia Zhou
DiffM
156
0
0
23 Nov 2024
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Jiahao Hu
Tianxiong Zhong
Xuebo Wang
Boyuan Jiang
Xingye Tian
Fei Yang
Pengfei Wan
Di Zhang
VGen
117
3
0
22 Nov 2024
HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation
Abdul Basit Anees
A. Baykal
Muhammed Burak Kizil
Duygu Ceylan
Erkut Erdem
Aykut Erdem
CLIP
169
1
0
19 Nov 2024
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Chang-Shu Liu
Rui Li
Kaidong Zhang
Yunwei Lan
Dong Liu
DiffM
VGen
87
7
0
17 Nov 2024
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Alessandro Fontanella
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Sarah Parisot
102
2
0
16 Nov 2024
MaskMedPaint: Masked Medical Image Inpainting with Diffusion Models for Mitigation of Spurious Correlations
Qixuan Jin
Walter Gerych
Marzyeh Ghassemi
DiffM
MedIm
77
0
0
16 Nov 2024
ColorEdit: Training-free Image-Guided Color editing with diffusion model
Xingxi Yin
Zhi Li
Jingfeng Zhang
Chenglin Li
Yin Zhang
DiffM
157
0
0
15 Nov 2024
Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing
Zitao Shuai
Chenwei Wu
Zhengxu Tang
Bowen Song
Liyue Shen
DiffM
96
0
0
12 Nov 2024
Material Transforms from Disentangled NeRF Representations
Ivan Lopes
Jean-François Lalonde
Raoul de Charette
64
0
0
12 Nov 2024
Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors
Anisha Pal
Julia Kruk
Mansi Phute
Manognya Bhattaram
Diyi Yang
Duen Horng Chau
Judy Hoffman
AAML
81
3
0
12 Nov 2024
Previous
1
2
3
...
7
8
9
...
27
28
29
Next