Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
v1
v2 (latest)
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Title
EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
Ruoxi Chen
Haibo Jin
Yixin Liu
Jinyin Chen
Haohan Wang
Lichao Sun
93
11
0
19 Nov 2023
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
Rohit Girdhar
Mannat Singh
Andrew Brown
Quentin Duval
S. Azadi
Sai Saketh Rambhatla
Akbar Shah
Xi Yin
Devi Parikh
Ishan Misra
DiffM
VGen
129
209
0
17 Nov 2023
SelfEval: Leveraging the discriminative nature of generative models for evaluation
Sai Saketh Rambhatla
Ishan Misra
EGVM
90
5
0
17 Nov 2023
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Animesh Sinha
Bo Sun
Anmol Kalia
Arantxa Casanova
Elliot Blanchard
...
Ankit Ramchandani
Maziar Sanjabi
Sonal Gupta
Amy Bearman
Dhruv Mahajan
DiffM
69
4
0
17 Nov 2023
Emu Edit: Precise Image Editing via Recognition and Generation Tasks
Shelly Sheynin
Adam Polyak
Uriel Singer
Yuval Kirstain
Amit Zohar
Oron Ashual
Devi Parikh
Yaniv Taigman
87
153
0
16 Nov 2023
3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation
Dale Decatur
Itai Lang
Kfir Aberman
Rana Hanocka
91
17
0
16 Nov 2023
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu
Yang Zhao
Zhisheng Xiao
Tingbo Hou
217
121
0
14 Nov 2023
Unlock the Power: Competitive Distillation for Multi-Modal Large Language Models
Xinwei Li
Li Lin
Shuai Wang
Chen Qian
39
4
0
14 Nov 2023
FIRST: A Million-Entry Dataset for Text-Driven Fashion Synthesis and Design
Zhen Huang
Yihao Li
Dong Pei
Jiapeng Zhou
Xuliang Ning
Jianlin Han
Xiaoguang Han
Xuejun Chen
93
3
0
13 Nov 2023
MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model
Shuwei Shao
Zhongcai Pei
Weihai Chen
Dingchi Sun
Peter C. Y. Chen
Zhengguo Li
MDE
DiffM
77
8
0
13 Nov 2023
IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models
Zhaoyuan Yang
Zhengyang Yu
Zhiwei Xu
Jaskirat Singh
Jing Zhang
Dylan Campbell
Peter Tu
Richard Hartley
99
11
0
12 Nov 2023
Finetuning Text-to-Image Diffusion Models for Fairness
Xudong Shen
Chao Du
Tianyu Pang
Min Lin
Yongkang Wong
Mohan S. Kankanhalli
117
57
0
11 Nov 2023
3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models
Haibo Yang
Yang Chen
Yingwei Pan
Ting Yao
Zhineng Chen
Tao Mei
76
20
0
09 Nov 2023
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
105
42
0
09 Nov 2023
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Shilong Liu
Hao Cheng
Haotian Liu
Hao Zhang
Feng Li
...
Hang Su
Jun Zhu
Lei Zhang
Jianfeng Gao
Chun-yue Li
MLLM
VLM
113
126
0
09 Nov 2023
ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image
Senthil Purushwalkam
Nikhil Naik
80
5
0
09 Nov 2023
A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization
Xingzhe He
Zhiwen Cao
Nicholas I. Kolkin
Lantao Yu
Kun Wan
Helge Rhodin
Ratheesh Kalarot
93
14
0
07 Nov 2023
Cross-Image Attention for Zero-Shot Appearance Transfer
Yuval Alaluf
Daniel Garibi
Or Patashnik
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
96
80
0
06 Nov 2023
AnyText: Multilingual Visual Text Generation And Editing
Yuxiang Tuo
Wangmeng Xiang
Jun-Yan He
Yifeng Geng
Xuansong Xie
DiffM
166
83
0
06 Nov 2023
InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image
Jianhui Li
Shilong Liu
Zidong Liu
Yikai Wang
Kaiwen Zheng
Jinghui Xu
Jianmin Li
Jun Zhu
88
10
0
06 Nov 2023
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing
Wei-Ge Chen
Irina Spiridonova
Jianwei Yang
Jianfeng Gao
Chun-yue Li
MLLM
VLM
93
37
0
01 Nov 2023
Consistent Video-to-Video Transfer Using Synthetic Dataset
Jiaxin Cheng
Tianjun Xiao
Tong He
VGen
DiffM
116
17
0
01 Nov 2023
SeamlessNeRF: Stitching Part NeRFs with Gradient Propagation
Bingchen Gong
Yuehao Wang
Xiaoguang Han
Qi Dou
84
3
0
30 Oct 2023
IterInv: Iterative Inversion for Pixel-Level T2I Models
Chuanming Tang
Kai Wang
Joost van de Weijer
100
4
0
30 Oct 2023
Learning to Follow Object-Centric Image Editing Instructions Faithfully
Tuhin Chakrabarty
Kanishk Singh
Arkadiy Saakyan
Smaranda Muresan
DiffM
77
7
0
29 Oct 2023
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Xinghui Li
Jingyi Lu
Kai Han
V. Prisacariu
DiffM
108
21
0
26 Oct 2023
PERF: Panoramic Neural Radiance Field from a Single Panorama
Guangcong Wang
Peng Wang
Zhaoxi Chen
Wenping Wang
Chen Change Loy
Ziwei Liu
MDE
116
35
0
25 Oct 2023
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models
Tianyi Lu
Xing Zhang
Jiaxi Gu
Hang Xu
Renjing Pei
Songcen Xu
Zuxuan Wu
DiffM
VGen
71
5
0
25 Oct 2023
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
153
8
0
25 Oct 2023
CVPR 2023 Text Guided Video Editing Competition
Jay Zhangjie Wu
Xiuyu Li
Difei Gao
Zhen Dong
Jinbin Bai
...
Xu Cheng
Jie Tang
Mike Zheng Shou
Kurt Keutzer
Forrest N. Iandola
100
35
0
24 Oct 2023
WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
Jun-Yan He
Zhi-Qi Cheng
Chenyang Li
Jingdong Sun
Wangmeng Xiang
...
Yusen Hu
Bin Luo
Yifeng Geng
Xuansong Xie
Jingren Zhou
64
13
0
20 Oct 2023
EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model
Zheyu Zhang
Lanhong Yao
Bin Wang
Debesh Jha
Elif Keles
Alpay Medetalibeyoglu
Ulas Bagci
MedIm
90
15
0
19 Oct 2023
Audio Editing with Non-Rigid Text Prompts
Francesco Paissan
Luca Della Libera
Zhepei Wang
Mirco Ravanelli
Paris Smaragdis
Cem Subakan
DiffM
85
5
0
19 Oct 2023
Object-aware Inversion and Reassembly for Image Editing
Zhen Yang
Dinggang Gui
Wen Wang
Hao Chen
Bohan Zhuang
Chunhua Shen
DiffM
102
19
0
18 Oct 2023
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
Xinhua Cheng
Tianyu Yang
Jianan Wang
Yu Li
Lei Zhang
Jian Zhang
Li-ming Yuan
DiffM
90
43
0
18 Oct 2023
Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors
Pengchong Hu
Zhizhong Han
125
13
0
17 Oct 2023
BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys
Yu Gu
Jianwei Yang
Naoto Usuyama
Chun-yue Li
Sheng Zhang
M. Lungren
Jianfeng Gao
Hoifung Poon
MedIm
111
24
0
16 Oct 2023
A Survey on Video Diffusion Models
Zhen Xing
Qijun Feng
Haoran Chen
Qi Dai
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVM
VGen
179
139
0
16 Oct 2023
TOSS:High-quality Text-guided Novel View Synthesis from a Single Image
Yukai Shi
Jianan Wang
He Cao
Boshi Tang
Xianbiao Qi
Tianyu Yang
Yukun Huang
Shilong Liu
Lei Zhang
H. Shum
DiffM
66
20
0
16 Oct 2023
Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models
Kevin Black
Mitsuhiko Nakamoto
P. Atreya
Homer Walke
Chelsea Finn
Aviral Kumar
Sergey Levine
DiffM
LM&Ro
133
143
0
16 Oct 2023
ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion
Jiayu Yang
Ziang Cheng
Yunfei Duan
Pan Ji
Hongdong Li
DiffM
88
58
0
16 Oct 2023
AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion
Yitong Jiang
Zhaoyang Zhang
Tianfan Xue
Liang Feng
DiffM
159
46
0
16 Oct 2023
ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context
Binglun Wang
Niladri Shekhar Dutt
Niloy J. Mitra
88
11
0
15 Oct 2023
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task
Maya Okawa
Ekdeep Singh Lubana
Robert P. Dick
Hidenori Tanaka
CoGe
DiffM
119
65
0
13 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
35
25
0
12 Oct 2023
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Zeqiang Lai
Xizhou Zhu
Jifeng Dai
Yu Qiao
Wenhai Wang
MLLM
DiffM
105
24
0
11 Oct 2023
KwaiYiiMath: Technical Report
Jia-Yi Fu
Lei Lin
Xiaoyang Gao
Pengli Liu
Zhengzong Chen
...
Zijia Lin
Fuzheng Zhang
Zhongyuan Wang
Di Zhang
Kun Gai
LRM
ReLM
RALM
133
2
0
11 Oct 2023
Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else
Hazarapet Tunanyan
Dejia Xu
Shant Navasardyan
Zhangyang Wang
Humphrey Shi
DiffM
130
9
0
11 Oct 2023
Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model
Shiyuan Yang
Xiaodong Chen
Jing Liao
DiffM
86
66
0
11 Oct 2023
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
130
111
0
11 Oct 2023
Previous
1
2
3
...
22
23
24
...
27
28
29
Next