2211.09800
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,353 papers shown
Title
Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking
Zhiyuan Ma
Guoli Jia
Biqing Qi
Bowen Zhou
WIGM
76
10
0
18 Jul 2024
Training-Free Large Model Priors for Multiple-in-One Image Restoration
Xuanhua He
Lang Li
Yingying Wang
Hui Zheng
Ke Cao
K. Yan
Rui Li
Chengjun Xie
Jie Zhang
Man Zhou
DiffM
56
0
0
18 Jul 2024
Image Inpainting Models are Effective Tools for Instruction-guided Image Editing
Xu Ju
Junhao Zhuang
Zhaoyang Zhang
Hao Wang
Qiang Xu
Ying Shan
DiffM
51
1
0
18 Jul 2024
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Seokhun Choi
H. Song
Jaechul Kim
Taehyeong Kim
Hoseok Do
3DGS
40
19
0
16 Jul 2024
Model Inversion Attacks Through Target-Specific Conditional Diffusion Models
Ouxiang Li
Yanbin Hao
Zhicai Wang
Bin Zhu
Shuo Wang
Zaixi Zhang
Fuli Feng
DiffM
25
3
0
16 Jul 2024
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
Jiwook Kim
Seonho Lee
Jaeyo Shin
Jiho Choi
Hyunjung Shim
DiffM
52
0
0
16 Jul 2024
InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models
Nirat Saini
Navaneeth Bodla
Ashish Shrivastava
Avinash Ravichandran
Xiao Zhang
Abhinav Shrivastava
Bharat Singh
DiffM
29
2
0
15 Jul 2024
Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval
Youngsun Lim
Hyunjung Shim
DiffM
HILM
MQ
48
3
0
15 Jul 2024
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
Qinyu Yang
Haoxin Chen
Yong Zhang
Menghan Xia
Xiaodong Cun
Zhixun Su
Ying Shan
DiffM
37
1
0
14 Jul 2024
3DEgo: 3D Editing on the Go!
Umar Khalid
Hasan Iqbal
Azib Farooq
Michael J. Hua
Chong Chen
VGen
29
6
0
14 Jul 2024
PSC: Posterior Sampling-Based Compression
Noam Elata
T. Michaeli
Michael Elad
DiffM
60
1
0
13 Jul 2024
PersonificationNet: Making customized subject act like a person
Tianchu Guo
Pengyu Li
Biao Wang
Xiansheng Hua
39
0
0
12 Jul 2024
The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective
Zhen Qin
Daoyuan Chen
Wenhao Zhang
Liuyi Yao
Yilun Huang
Bolin Ding
Yaliang Li
Shuiguang Deng
62
5
0
11 Jul 2024
Coherent and Multi-modality Image Inpainting via Latent Space Optimization
Lingzhi Pan
Tong Zhang
Bingyuan Chen
Qi Zhou
Wei Ke
Sabine Süsstrunk
Mathieu Salzmann
DiffM
40
2
0
10 Jul 2024
Generative Image as Action Models
Mohit Shridhar
Yat Long Lo
Stephen James
47
9
0
10 Jul 2024
Adversarial Attacks and Defenses on Text-to-Image Diffusion Models: A Survey
Chenyu Zhang
Mingwang Hu
Wenhui Li
Lanjun Wang
41
15
0
10 Jul 2024
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
71
27
0
10 Jul 2024
ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Shaozhe Hao
Kai Han
Zhengyao Lv
Shihao Zhao
Kwan-Yee K. Wong
DiffM
CoGe
36
6
0
09 Jul 2024
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding
Wenhao Xu
Wenming Weng
Yueyi Zhang
Zhiwei Xiong
VLM
49
0
0
09 Jul 2024
VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving
Yibo Liu
Zheyuan Yang
Guile Wu
Y. Ren
Kejian Lin
Bingbing Liu
Yang Liu
Jinjun Shan
44
5
0
09 Jul 2024
Sketch-Guided Scene Image Generation
Tianyu Zhang
Xiaoxuan Xie
Xusheng Du
H. Xie
DiffM
43
2
0
09 Jul 2024
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
Yu Zeng
Vishal M. Patel
Haochen Wang
Xun Huang
Ting-Chun Wang
Xuan Li
Yogesh Balaji
DiffM
32
18
0
08 Jul 2024
The Tug-of-War Between Deepfake Generation and Detection
Hannah Lee
Changyeon Lee
Kevin Farhat
Lin Qiu
Steve Geluso
Aerin Kim
O. Etzioni
34
1
0
08 Jul 2024
OneDiff: A Generalist Model for Image Difference Captioning
Erdong Hu
Longteng Guo
Tongtian Yue
Zijia Zhao
Shuning Xue
Jing Liu
VLM
39
2
0
08 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLM
DiffM
61
26
0
08 Jul 2024
AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Misha Sra
Pradeep Sen
37
0
0
08 Jul 2024
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Haozhe Zhao
Xiaojian Ma
Liang Chen
Shuzheng Si
Rujie Wu
Kaikai An
Peiyu Yu
Minjia Zhang
Qing Li
Baobao Chang
42
44
0
07 Jul 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
89
7
0
07 Jul 2024
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
45
6
0
05 Jul 2024
A Survey of Data Synthesis Approaches
Hsin-Yu Chang
Pei-Yu Chen
Tun-Hsiang Chou
Chang-Sheng Kao
Hsuan-Yun Yu
Yen-Ting Lin
Yun-Nung Chen
49
6
0
04 Jul 2024
Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration
Yuhong Zhang
Hengsheng Zhang
Xinning Chai
Zhengxue Cheng
Rong Xie
Li-Na Song
Wenjun Zhang
DiffM
51
4
0
04 Jul 2024
Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes
Yusuke Hirota
Jerone T. A. Andrews
Dora Zhao
Orestis Papakyriakopoulos
Apostolos Modas
Yuta Nakashima
Alice Xiang
49
4
0
04 Jul 2024
Lateralization LoRA: Interleaved Instruction Tuning with Modality-Specialized Adaptations
Zhiyang Xu
Minqian Liu
Ying Shen
Joy Rimchala
Jiaxin Zhang
Qifan Wang
Yu Cheng
Lifu Huang
VLM
39
2
0
04 Jul 2024
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations
Benno Krojer
Dheeraj Vattikonda
Luis Lara
Varun Jampani
Eva Portelance
Christopher Pal
Siva Reddy
EGVM
VGen
49
3
0
03 Jul 2024
TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation
Chaofan Luo
Donglin Di
Xun Yang
Yongjia Ma
Zhou Xue
Chen Wei
Yebin Liu
3DGS
49
9
0
02 Jul 2024
Diffusion Models and Representation Learning: A Survey
Michael Fuest
Pingchuan Ma
Ming Gui
Johannes S. Fischer
Vincent Tao Hu
Bjorn Ommer
DiffM
43
20
0
30 Jun 2024
Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation
Yuchuan Tian
Jianhong Han
Hanting Chen
Yuanyuan Xi
Guoyang Zhang
Jie Hu
Chao Xu
Yunhe Wang
ViT
VLM
36
7
0
30 Jun 2024
GenderBias-VL: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing
Yisong Xiao
Aishan Liu
QianJia Cheng
Zhenfei Yin
Siyuan Liang
Jiapeng Li
Jing Shao
Xianglong Liu
Dacheng Tao
53
4
0
30 Jun 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
59
3
0
28 Jun 2024
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
Ivan Villa-Renteria
Mason L. Wang
Zachary Shah
Zhe Li
Soohyun Kim
Neelesh Ramachandran
Mert Pilanci
42
0
0
27 Jun 2024
From Efficient Multimodal Models to World Models: A Survey
Xinji Mai
Zeng Tao
Junxiong Lin
Haoran Wang
Yang Chang
Yanlan Kang
Yan Wang
Wenqiang Zhang
37
5
0
27 Jun 2024
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance
Younghyun Kim
Geunmin Hwang
Junyu Zhang
Eunbyung Park
62
6
0
26 Jun 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
68
22
0
26 Jun 2024
SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing
Ruihuang Li
Liyi Chen
Zhengqiang Zhang
Varun Jampani
Vishal M. Patel
Lei Zhang
DiffM
46
0
0
25 Jun 2024
LIPE: Learning Personalized Identity Prior for Non-rigid Image Editing
Aoyang Liu
Qingnan Fan
Shuai Qin
Hong Gu
Yansong Tang
DiffM
58
1
0
25 Jun 2024
ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Shuwei Shi
Wenbo Li
Yuechen Zhang
Jingwen He
Biao Gong
Yinqiang Zheng
57
10
0
24 Jun 2024
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng
Yuxin Cui
Haomiao Tang
Zekun Qi
Runpei Dong
Jing Bai
Chunrui Han
Zheng Ge
Xiangyu Zhang
Shu-Tao Xia
EGVM
83
31
0
24 Jun 2024
A3D: Does Diffusion Dream about 3D Alignment?
Savva Ignatyev
Nina Konovalova
Daniil Selikhanovych
Nikolay Patakin
...
Anton Konushin
Peter Wonka
Alexander Filippov
Evgeny Burnaev
DiffM
71
0
0
21 Jun 2024
Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps
Nikita Starodubcev
Mikhail Khoroshikh
Artem Babenko
Dmitry Baranchuk
DiffM
41
4
0
20 Jun 2024
Adversaries Can Misuse Combinations of Safe Models
Erik Jones
Anca Dragan
Jacob Steinhardt
50
7
0
20 Jun 2024