Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
v1
v2 (latest)
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Title
Localized Gaussian Splatting Editing with Contextual Awareness
Hanyuan Xiao
Yingshu Chen
Huajian Huang
Haolin Xiong
Jing Yang
P. Prasad
Yajie Zhao
3DGS
DiffM
93
4
0
31 Jul 2024
Fine-gained Zero-shot Video Sampling
Dengsheng Chen
Jie Hu
Javier Segovia-Aguas
Enhua Wu
VGen
DiffM
53
0
0
31 Jul 2024
Matting by Generation
Zhixiang Wang
Baiang Li
Jian Wang
Yu-Lun Liu
Jinwei Gu
Yung-Yu Chuang
Shiníchi Satoh
DiffM
85
1
0
30 Jul 2024
Add-SD: Rational Generation without Manual Reference
Lingfeng Yang
Xinyu Zhang
Xiang Li
Jinwen Chen
Kun Yao
Gang Zhang
Errui Ding
Ling-Ling Liu
Jingdong Wang
Jian Yang
62
0
0
30 Jul 2024
SceneTeller: Language-to-3D Scene Generation
Basak Melis Öcal
Maxim Tatarchenko
Sezer Karaoglu
Theo Gevers
92
9
0
30 Jul 2024
Learning Feature-Preserving Portrait Editing from Generated Pairs
Bowei Chen
Tiancheng Zhi
Peihao Zhu
Shen Sang
Jing Liu
Linjie Luo
DiffM
94
0
0
29 Jul 2024
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Ekaterina Iakovleva
Fabio Pizzati
Philip Torr
Stéphane Lathuiliere
DiffM
96
0
0
29 Jul 2024
Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory
Xiaoyan Xing
Vincent Tao Hu
J. H. Metzen
Konrad Groh
Sezer Karaoglu
Theo Gevers
89
4
0
29 Jul 2024
Auto DragGAN: Editing the Generative Image Manifold in an Autoregressive Manner
Pengxiang Cai
Zhiwei Liu
Guibo Zhu
Yunfang Niu
Jinqiao Wang
DiffM
82
1
0
26 Jul 2024
Answerability Fields: Answerable Location Estimation via Diffusion Models
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
M. Kawanabe
DiffM
81
0
0
26 Jul 2024
RegionDrag: Fast Region-Based Image Editing with Diffusion Models
Jingyi Lu
Xinghui Li
Kai Han
DiffM
89
14
0
25 Jul 2024
ReCorD: Reasoning and Correcting Diffusion for HOI Generation
Jian-Yu Jiang-Lin
Kang-Yang Huang
Ling Lo
Yi-Ning Huang
Terence Lin
Jhih-Ciang Wu
Hong-Han Shuai
Wen-Huang Cheng
DiffM
62
5
0
25 Jul 2024
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar
VS Sachidanand
Sabariswaran Mani
Tejan Karmali
R. V. Babu
DiffM
109
16
0
24 Jul 2024
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Lirui Zhao
Tianshuo Yang
Wenqi Shao
Yuxin Zhang
Yu Qiao
Ping Luo
Kaipeng Zhang
Rongrong Ji
DiffM
81
3
0
24 Jul 2024
MLLM-CompBench: A Comparative Reasoning Benchmark for Multimodal LLMs
Jihyung Kil
Zheda Mai
Justin Lee
Zihe Wang
Kerrie Cheng
Lemeng Wang
Ye Liu
A. Chowdhury
Wei-Lun Chao
CoGe
VLM
145
19
0
23 Jul 2024
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions
Fabio Tosi
Pierluigi Zama Ramirez
Matteo Poggi
DiffM
MQ
MDE
91
13
0
23 Jul 2024
StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation
Nauman Riaz
S. Saifullah
S. Agne
Andreas Dengel
Sheraz Ahmed
DiffM
65
0
0
22 Jul 2024
DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer
Jinfeng Wei
Xiaofeng Zhang
84
14
0
21 Jul 2024
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
Bowen Zhang
Cheng Yang
Xuanhui Liu
DiffM
72
0
0
21 Jul 2024
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
Zheng Chong
Xiao Dong
Haoxiang Li
Shiyue Zhang
Wenqing Zhang
Xujie Zhang
Hanqing Zhao
D. Jiang
Xiaodan Liang
DiffM
136
24
0
21 Jul 2024
Diffusion Models as Data Mining Tools
Ioannis Siglidis
Aleksander Holynski
Alexei A. Efros
Mathieu Aubry
Shiry Ginosar
DiffM
MedIm
95
3
0
20 Jul 2024
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
100
11
0
19 Jul 2024
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control
Huadai Liu
Jialei Wang
Rongjie Huang
Yang Liu
Jiayang Xu
Zhou Zhao
71
4
0
18 Jul 2024
Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking
Zhiyuan Ma
Guoli Jia
Biqing Qi
Bowen Zhou
WIGM
109
13
0
18 Jul 2024
Training-Free Large Model Priors for Multiple-in-One Image Restoration
Xuanhua He
Lang Li
Yingying Wang
Hui Zheng
Ke Cao
K. Yan
Rui Li
Chengjun Xie
Jie Zhang
Man Zhou
DiffM
142
0
0
18 Jul 2024
Image Inpainting Models are Effective Tools for Instruction-guided Image Editing
Xu Ju
Junhao Zhuang
Zhaoyang Zhang
Hao Wang
Qiang Xu
Ying Shan
DiffM
60
2
0
18 Jul 2024
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Seokhun Choi
H. Song
Jaechul Kim
Taehyeong Kim
Hoseok Do
3DGS
105
23
0
16 Jul 2024
Model Inversion Attacks Through Target-Specific Conditional Diffusion Models
Ouxiang Li
Yanbin Hao
Zhicai Wang
Bin Zhu
Shuo Wang
Zaixi Zhang
Fuli Feng
DiffM
60
3
0
16 Jul 2024
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
Jiwook Kim
Seonho Lee
Jaeyo Shin
Jiho Choi
Hyunjung Shim
DiffM
126
0
0
16 Jul 2024
InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models
Nirat Saini
Navaneeth Bodla
Ashish Shrivastava
Avinash Ravichandran
Xiao Zhang
Abhinav Shrivastava
Bharat Singh
DiffM
59
3
0
15 Jul 2024
Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval
Youngsun Lim
Hyunjung Shim
DiffM
HILM
MQ
71
4
0
15 Jul 2024
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
Qinyu Yang
Haoxin Chen
Yong Zhang
Menghan Xia
Xiaodong Cun
Zhixun Su
Ying Shan
DiffM
77
3
0
14 Jul 2024
3DEgo: 3D Editing on the Go!
Umar Khalid
Hasan Iqbal
Azib Farooq
Michael J. Hua
Chong Chen
VGen
86
7
0
14 Jul 2024
PSC: Posterior Sampling-Based Compression
Noam Elata
T. Michaeli
Michael Elad
DiffM
100
0
0
13 Jul 2024
PersonificationNet: Making customized subject act like a person
Tianchu Guo
Pengyu Li
Biao Wang
Xiansheng Hua
46
0
0
12 Jul 2024
The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective
Zhen Qin
Daoyuan Chen
Wenhao Zhang
Liuyi Yao
Yilun Huang
Bolin Ding
Yaliang Li
Shuiguang Deng
145
7
0
11 Jul 2024
Coherent and Multi-modality Image Inpainting via Latent Space Optimization
Lingzhi Pan
Tong Zhang
Bingyuan Chen
Qi Zhou
Wei Ke
Sabine Süsstrunk
Mathieu Salzmann
DiffM
59
2
0
10 Jul 2024
Generative Image as Action Models
Mohit Shridhar
Yat Long Lo
Stephen James
117
12
0
10 Jul 2024
Adversarial Attacks and Defenses on Text-to-Image Diffusion Models: A Survey
Chenyu Zhang
Mingwang Hu
Wenhui Li
Lanjun Wang
81
20
0
10 Jul 2024
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
163
32
0
10 Jul 2024
ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Shaozhe Hao
Kai Han
Zhengyao Lv
Shihao Zhao
Kwan-Yee K. Wong
DiffM
CoGe
127
7
0
09 Jul 2024
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding
Wenhao Xu
Wenming Weng
Yueyi Zhang
Zhiwei Xiong
VLM
71
0
0
09 Jul 2024
VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving
Yibo Liu
Zheyuan Yang
Guile Wu
Y. Ren
Kejian Lin
Bingbing Liu
Yang Liu
Jinjun Shan
76
6
0
09 Jul 2024
Sketch-Guided Scene Image Generation
Tianyu Zhang
Xiaoxuan Xie
Xusheng Du
H. Xie
DiffM
82
2
0
09 Jul 2024
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
Yu Zeng
Vishal M. Patel
Haochen Wang
Xun Huang
Ting-Chun Wang
Xuan Li
Yogesh Balaji
DiffM
73
23
0
08 Jul 2024
The Tug-of-War Between Deepfake Generation and Detection
Hannah Lee
Changyeon Lee
Kevin Farhat
Lin Qiu
Steve Geluso
Aerin Kim
O. Etzioni
70
2
0
08 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLM
DiffM
132
40
0
08 Jul 2024
AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Misha Sra
Pradeep Sen
55
0
0
08 Jul 2024
OneDiff: A Generalist Model for Image Difference Captioning
Erdong Hu
Longteng Guo
Tongtian Yue
Zijia Zhao
Shuning Xue
Jing Liu
VLM
121
2
0
08 Jul 2024
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Haozhe Zhao
Xiaojian Ma
Liang Chen
Shuzheng Si
Rujie Wu
Kaikai An
Peiyu Yu
Minjia Zhang
Qing Li
Baobao Chang
108
63
0
07 Jul 2024
Previous
1
2
3
...
11
12
13
...
27
28
29
Next