Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
v1
v2 (latest)
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Title
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
Kai Zhang
Lingbo Mo
Wenhu Chen
Huan Sun
Yu-Chuan Su
EGVM
226
277
0
16 Jun 2023
Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model
Lu Yu
Weikang Xiang
Kang Han
DiffM
79
16
0
15 Jun 2023
UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video
Zhi-Hao Lin
Bohan Liu
Yi-Ting Chen
David A. Forsyth
Jia-Bin Huang
Jia-Bin Huang
Anand Bhattad
Shenlong Wang
VGen
140
10
0
15 Jun 2023
Evaluating Data Attribution for Text-to-Image Models
Sheng-Yu Wang
Alexei A. Efros
Jun-Yan Zhu
Richard Y. Zhang
TDI
94
33
0
15 Jun 2023
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data
Stephanie Fu
Netanel Y. Tamir
Shobhita Sundaram
Lucy Chai
Richard Y. Zhang
Tali Dekel
Phillip Isola
EGVM
100
123
0
15 Jun 2023
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Seoyeon Kim
Minguk Kang
Dongwon Kim
Jaesik Park
Suha Kwak
VLM
96
11
0
14 Jun 2023
On the Robustness of Latent Diffusion Models
Jianping Zhang
Zhuoer Xu
Shiwen Cui
Changhua Meng
Weibin Wu
Michael R. Lyu
AAML
82
20
0
14 Jun 2023
GeneCIS: A Benchmark for General Conditional Image Similarity
S. Vaze
Nicolas Carion
Ishan Misra
VLM
DiffM
100
30
0
13 Jun 2023
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Shuai Yang
Yifan Zhou
Ziwei Liu
Chen Change Loy
VGen
DiffM
102
221
0
13 Jun 2023
InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions
Jiale Xu
Xintao Wang
Yannan Cao
Weihao Cheng
Ying Shan
Shenghua Gao
DiffM
86
11
0
12 Jun 2023
Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
Nan Liu
Yilun Du
Shuang Li
J. Tenenbaum
Antonio Torralba
DiffM
CoGe
100
27
0
08 Jun 2023
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
Yuseung Lee
Kunho Kim
Hyunjin Kim
Minhyuk Sung
DiffM
127
67
0
08 Jun 2023
Designing a Better Asymmetric VQGAN for StableDiffusion
Zixin Zhu
Xuelu Feng
DongDong Chen
Jianmin Bao
Le Wang
Yinpeng Chen
Lu Yuan
Gang Hua
DiffM
95
35
0
07 Jun 2023
Fine-Grained Visual Prompting
Lingfeng Yang
Yueze Wang
Xiang Li
Xinlong Wang
Jian Yang
ObjD
VLM
115
68
0
07 Jun 2023
Emergent Correspondence from Image Diffusion
Luming Tang
Menglin Jia
Qianqian Wang
Cheng Perng Phoo
Bharath Hariharan
118
270
0
06 Jun 2023
HeadSculpt: Crafting 3D Head Avatars with Text
Xiaoping Han
Yukang Cao
Kai Han
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
Kwan-Yee K. Wong
DiffM
73
47
0
05 Jun 2023
Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions
Shaoxu Li
DiffM
60
5
0
05 Jun 2023
Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark
Shuyu Yang
Yinan Zhou
Yaxiong Wang
Yujiao Wu
Li Zhu
Zhedong Zheng
VLM
DiffM
144
92
0
05 Jun 2023
User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques
Sunwoo Kim
Wooseok Jang
Hyunsung Kim
Junho Kim
Yunjey Choi
Seung Wook Kim
Gayeong Lee
DiffM
65
6
0
05 Jun 2023
Video Colorization with Pre-trained Text-to-Image Diffusion Models
Hanyuan Liu
M. Xie
Jinbo Xing
Chengze Li
T. Wong
VLM
DiffM
104
13
0
02 Jun 2023
Adjustable Visual Appearance for Generalizable Novel View Synthesis
Josef Bengtson
David Nilsson
Che-Tsung Lin
Marcel Büsching
Fredrik Kahl
98
0
0
02 Jun 2023
Diffusion Self-Guidance for Controllable Image Generation
Dave Epstein
Allan Jabri
Ben Poole
Alexei A. Efros
Aleksander Holynski
111
266
0
01 Jun 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffM
VLM
154
44
0
01 Jun 2023
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation
Shaozhe Hao
Kai Han
Shihao Zhao
Kwan-Yee K. Wong
88
10
0
01 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
88
21
0
01 Jun 2023
Versatile Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers
Ke Xu
Hongrui Chen
Zihao Zhu
Li Liu
Baoyuan Wu
DiffM
126
11
0
01 Jun 2023
AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars
Mohit Mendiratta
Xingang Pan
Mohamed A. Elgharib
Kartik Teotia
Mallikarjun B R
A. Tewari
Vladislav Golyanik
Adam Kortylewski
Christian Theobalt
DiffM
75
30
0
01 Jun 2023
Diffusion Brush: A Latent Diffusion Model-based Editing Tool for AI-generated Images
Peyman Gholami
R. Xiao
DiffM
116
3
0
31 May 2023
Control4D: Efficient 4D Portrait Editing with Text
Ruizhi Shao
Jingxiang Sun
Cheng Peng
Zerong Zheng
Boyao Zhou
Hongwen Zhang
Yebin Liu
DiffM
116
25
0
31 May 2023
A Unified Conditional Framework for Diffusion-based Image Restoration
Yuanhang Zhang
Xiaoyu Shi
Dasong Li
Xiaogang Wang
Jian Wang
Hongsheng Li
DiffM
94
26
0
31 May 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Joey Tianyi Zhou
DiffM
101
55
0
30 May 2023
Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video Translation Using Conditional Image Diffusion Models
Ernie Chu
Shuohao Lin
Jun-Cheng Chen
DiffM
64
21
0
30 May 2023
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images
Viraj Prabhu
Sriram Yenamandra
Prithvijit Chattopadhyay
Judy Hoffman
104
42
0
30 May 2023
Diffusion Model for Dense Matching
Jisu Nam
Gyuseong Lee
Sunwoo Kim
Ines Hyeonsu Kim
Hyoungwon Cho
Seyeong Kim
Seung Wook Kim
DiffM
84
10
0
30 May 2023
Nested Diffusion Processes for Anytime Image Generation
Noam Elata
Bahjat Kawar
T. Michaeli
Michael Elad
DiffM
62
4
0
30 May 2023
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
Rui Yang
Lin Song
Yanwei Li
Sijie Zhao
Yixiao Ge
Xiu Li
Ying Shan
SyDa
MLLM
88
227
0
30 May 2023
Real-World Image Variation by Aligning Diffusion Inversion Chain
Yuechen Zhang
Jinbo Xing
Eric Lo
Jiaya Jia
99
35
0
30 May 2023
Controllable Text-to-Image Generation with GPT-4
Tianjun Zhang
Yi Zhang
Vibhav Vineet
Neel Joshi
Xin Eric Wang
DiffM
150
44
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
69
37
0
29 May 2023
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
Noam Rotstein
David Bensaid
Shaked Brody
Roy Ganz
Ron Kimmel
VLM
81
31
0
28 May 2023
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference
Zihao Yu
Haoyang Li
Fangcheng Fu
Xupeng Miao
Tengjiao Wang
DiffM
93
8
0
27 May 2023
CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography
Jiwen Yu
Xuanyu Zhang
You-song Xu
Jian Zhang
DiffM
99
53
0
26 May 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee K. Wong
115
268
0
25 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLM
DiffM
106
178
0
25 May 2023
Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation
Lisa Dunlap
Alyssa Umino
Han Zhang
Jiezhi Yang
Joseph E. Gonzalez
Trevor Darrell
DiffM
91
79
0
25 May 2023
CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion
Guangyao Zhai
Evin Pınar Örnek
Shun-cheng Wu
Yan Di
F. Tombari
Nassir Navab
Benjamin Busam
DiffM
139
14
0
25 May 2023
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models
Yuxin Zhang
Weiming Dong
Fan Tang
Nisha Huang
Haibin Huang
Chongyang Ma
Tong-Yee Lee
Oliver Deussen
Changsheng Xu
DiffM
99
81
0
25 May 2023
Towards Language-guided Interactive 3D Generation: LLMs as Layout Interpreter with Generative Feedback
Yiqi Lin
Hao Wu
Ruichen Wang
H. Lu
Xiaodong Lin
Hui Xiong
Lin Wang
3DV
63
13
0
25 May 2023
Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets
Brandon Smith
Miguel Farinha
Elizaveta Semenova
Hannah Rose Kirk
Aleksandar Shtedritski
Max Bain
90
19
0
24 May 2023
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Xinze Wang
William Yang Wang
MLLM
125
180
0
24 May 2023
Previous
1
2
3
...
25
26
27
28
29
Next