Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
v1
v2 (latest)
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Title
LatentEditor: Text Driven Local Editing of 3D Scenes
Umar Khalid
Hasan Iqbal
Nazmul Karim
Jingyi Hua
Chong Chen
DiffM
3DGS
77
15
0
14 Dec 2023
LIME: Localized Image Editing via Attention Regularization in Diffusion Models
Enis Simsar
A. Tonioni
Yongqin Xian
Thomas Hofmann
Federico Tombari
DiffM
68
9
0
14 Dec 2023
VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
Jinguo Zhu
Xiaohan Ding
Yixiao Ge
Yuying Ge
Sijie Zhao
Hengshuang Zhao
Xiaohua Wang
Ying Shan
ViT
VLM
82
37
0
14 Dec 2023
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
Ruoxi Shi
Xinyue Wei
Cheng Wang
Hao Su
81
17
0
14 Dec 2023
SHAP-EDITOR: Instruction-guided Latent 3D Editing in Seconds
Minghao Chen
Junyu Xie
Iro Laina
Andrea Vedaldi
KELM
84
10
0
14 Dec 2023
Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models
Liangchen Song
Liangliang Cao
Jiatao Gu
Yifan Jiang
Junsong Yuan
Hao Tang
DiffM
84
15
0
13 Dec 2023
EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment
M. Lavrenyuk
Shariq Farooq Bhat
Matthias Müller
Peter Wonka
ObjD
MDE
86
9
0
13 Dec 2023
ToViLaG: Your Visual-Language Generative Model is Also An Evildoer
Xinpeng Wang
Xiaoyuan Yi
Han Jiang
Shanlin Zhou
Zhihua Wei
Xing Xie
73
15
0
13 Dec 2023
DiffuseRAW: End-to-End Generative RAW Image Processing for Low-Light Images
Rishit Dagli
56
2
0
13 Dec 2023
The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization
Jiafeng Mao
Xueting Wang
Kiyoharu Aizawa
DiffM
105
5
0
13 Dec 2023
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Sicheng Mo
Fangzhou Mu
Kuan Heng Lin
Yanli Liu
Bochen Guan
Yin Li
Bolei Zhou
DiffM
105
67
0
12 Dec 2023
GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
Tomávs Souvcek
Dima Damen
Michael Wray
Ivan Laptev
Josef Sivic
VGen
88
21
0
12 Dec 2023
MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing
Kangneng Zhou
Daiheng Gao
Xuan Wang
Jie Zhang
Peng Zhang
...
Shiqi Yang
Bang Zhang
Liefeng Bo
Yaxing Wang
Ming-Ming Cheng
DiffM
112
4
0
12 Dec 2023
Relightful Harmonization: Lighting-aware Portrait Background Replacement
Mengwei Ren
Wei Xiong
Jae Shin Yoon
Zhixin Shu
Jianming Zhang
HyunJoon Jung
Guido Gerig
He Zhang
DiffM
102
24
0
11 Dec 2023
CAD: Photorealistic 3D Generation via Adversarial Distillation
Bo Liu
Despoina Paschalidou
Ian Huang
Hongyu Liu
Bokui Shen
Xiaoyu Xiang
Jing Liao
Leonidas Guibas
DiffM
146
11
0
11 Dec 2023
UpFusion: Novel View Diffusion from Unposed Sparse View Observations
Bharath Raj Nagoor Kani
Hsin-Ying Lee
Sergey Tulyakov
Shubham Tulsiani
84
6
0
11 Dec 2023
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
Yuzhou Huang
Liangbin Xie
Xintao Wang
Ziyang Yuan
Xiaodong Cun
...
Jiantao Zhou
Chao Dong
Rui Huang
Ruimao Zhang
Ying Shan
DiffM
74
77
0
11 Dec 2023
InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following
Shufan Li
Harkanwar Singh
Aditya Grover
DiffM
93
10
0
11 Dec 2023
Stellar: Systematic Evaluation of Human-Centric Personalized Text-to-Image Methods
Panos Achlioptas
Alexandros Benetatos
Iordanis Fostiropoulos
Dimitris Skourtis
122
9
0
11 Dec 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Ka Leong Cheng
Qiuyu Wang
Zifan Shi
Kecheng Zheng
Yinghao Xu
Ouyang Hao
Qifeng Chen
Yujun Shen
3DH
137
4
0
11 Dec 2023
Neutral Editing Framework for Diffusion-based Video Editing
Sunjae Yoon
Gwanhyeong Koo
Jiajing Hong
Changdong Yoo
VGen
DiffM
47
1
0
10 Dec 2023
A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Maomao Li
Yu Li
Tianyu Yang
Yunfei Liu
Dongxu Yue
Zhihui Lin
Dong Xu
VGen
36
9
0
10 Dec 2023
Exploring the Naturalness of AI-Generated Images
Zijian Chen
Wei Sun
Haoning Wu
Zicheng Zhang
Jun Jia
...
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
109
22
0
09 Dec 2023
SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation
Thuan Hoang Nguyen
Anh Tran
DiffM
78
66
0
08 Dec 2023
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control
Jaskirat Singh
Jianming Zhang
Qing Liu
Cameron Smith
Zhe Lin
Liang Zheng
DiffM
80
11
0
08 Dec 2023
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao
Zhouhui Lian
118
30
0
08 Dec 2023
RS-Corrector: Correcting the Racial Stereotypes in Latent Diffusion Models
Yue Jiang
Yueming Lyu
Tianxiang Ma
Bo Peng
Jing Dong
119
4
0
08 Dec 2023
Reality's Canvas, Language's Brush: Crafting 3D Avatars from Monocular Video
Yuchen Rao
Eduardo Pérez-Pellitero
Benjamin Busam
Yiren Zhou
Jifei Song
86
0
0
08 Dec 2023
Gen2Det: Generate to Detect
Saksham Suri
Fanyi Xiao
Animesh Sinha
Sean Culatana
Raghuraman Krishnamoorthi
Chenchen Zhu
Abhinav Shrivastava
VLM
DiffM
91
10
0
07 Dec 2023
NeRFiller: Completing Scenes via Generative 3D Inpainting
Ethan Weber
Aleksander Holyñski
Varun Jampani
Saurabh Saxena
Noah Snavely
Abhishek Kar
Angjoo Kanazawa
106
32
0
07 Dec 2023
GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen
Mengmeng Xu
Jiawei Ren
Yuren Cong
Sen He
Yanping Xie
Animesh Sinha
Ping Luo
Tao Xiang
Juan-Manuel Perez-Rua
VGen
99
41
0
07 Dec 2023
Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images
Yiqun Zhang
Zhen Qin
Yang Liu
Dylan Campbell
65
2
0
07 Dec 2023
Style Transfer to Calvin and Hobbes comics using Stable Diffusion
Sloke Shrestha
Sundar Sripada
Asvin Venkataramanan
DiffM
52
1
0
07 Dec 2023
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Bolin Lai
Xiaoliang Dai
Lawrence Chen
Guan Pang
James M. Rehg
Miao Liu
109
17
0
06 Dec 2023
AVID: Any-Length Video Inpainting with Diffusion Model
Zhixing Zhang
Bichen Wu
Xiaoyan Wang
Yaqiao Luo
Luxin Zhang
Yinan Zhao
Peter Vajda
Dimitris N. Metaxas
Licheng Yu
VGen
DiffM
128
42
0
06 Dec 2023
DiffusionSat: A Generative Foundation Model for Satellite Imagery
Samar Khanna
Patrick Liu
Linqi Zhou
Chenlin Meng
Robin Rombach
Marshall Burke
David B. Lobell
Stefano Ermon
87
65
0
06 Dec 2023
Language-Informed Visual Concept Learning
Sharon Lee
Yunzhi Zhang
Shangzhe Wu
Jiajun Wu
CoGe
72
9
0
06 Dec 2023
Context Diffusion: In-Context Aware Image Generation
Ivona Najdenkoska
Animesh Sinha
Abhimanyu Dubey
Dhruv Mahajan
Vignesh Ramanathan
Filip Radenovic
DiffM
53
14
0
06 Dec 2023
FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models
Junhyuk So
Jungwon Lee
Eunhyeok Park
DiffM
85
11
0
06 Dec 2023
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Felix Wimbauer
Bichen Wu
Edgar Schoenfeld
Xiaoliang Dai
Ji Hou
...
Jonas Kohler
Christian Rupprecht
Zorah Lähner
Peter Vajda
Jialiang Wang
DiffM
95
78
0
06 Dec 2023
DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing
Shao-Yu Chang
Hwann-Tzong Chen
Tyng-Luh Liu
DiffM
VGen
102
3
0
05 Dec 2023
ReconFusion: 3D Reconstruction with Diffusion Priors
Rundi Wu
B. Mildenhall
Philipp Henzler
Keunhong Park
Ruiqi Gao
...
Pratul P. Srinivasan
Dor Verbin
Jonathan T. Barron
Ben Poole
Aleksander Holynski
179
191
0
05 Dec 2023
Alchemist: Parametric Control of Material Properties with Diffusion Models
Prafull Sharma
Varun Jampani
Yuanzhen Li
Xuhui Jia
Dmitry Lagun
Frédo Durand
William T. Freeman
Mark J. Matthews
DiffM
130
26
0
05 Dec 2023
MagicStick: Controllable Video Editing via Control Handle Transformations
Yue Ma
Xiaodong Cun
Yin-Yin He
Chenyang Qi
Xintao Wang
Ying Shan
Xiu Li
Qifeng Chen
VGen
120
26
0
05 Dec 2023
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
Fengyuan Shi
Jiaxi Gu
Hang Xu
Songcen Xu
Wei Zhang
Limin Wang
VGen
DiffM
70
14
0
05 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
153
203
0
05 Dec 2023
Readout Guidance: Learning Control from Diffusion Features
Grace Luo
Trevor Darrell
Oliver Wang
Dan B. Goldman
Aleksander Holynski
101
27
0
04 Dec 2023
Generative Powers of Ten
Xiaojuan Wang
Janne Kontkanen
Brian L. Curless
Steven M. Seitz
Ira Kemelmacher-Shlizerman
B. Mildenhall
Pratul P. Srinivasan
Dor Verbin
Aleksander Holynski
77
10
0
04 Dec 2023
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Zhuoran Yu
Chenchen Zhu
Sean Culatana
Raghuraman Krishnamoorthi
Fanyi Xiao
Yong Jae Lee
177
15
0
04 Dec 2023
Collaborative Neural Painting
Nicola Dall’Asen
Willi Menapace
E. Peruzzo
E. Sangineto
Yiming Wang
Elisa Ricci
95
0
0
04 Dec 2023
Previous
1
2
3
...
20
21
22
...
27
28
29
Next