Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
v1
v2 (latest)
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,418 papers shown
Title
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Yuancheng Wang
Zeqian Ju
Xuejiao Tan
Lei He
Zhizheng Wu
Jiang Bian
Sheng Zhao
DiffM
150
55
0
03 Apr 2023
Subject-driven Text-to-Image Generation via Apprenticeship Learning
Wenhu Chen
Hexiang Hu
Yandong Li
Nataniel Rui
Xuhui Jia
Ming-Wei Chang
William W. Cohen
DiffM
156
194
0
01 Apr 2023
Going Beyond Nouns With Vision & Language Models Using Synthetic Data
Paola Cascante-Bonilla
Khaled Shehada
James Smith
Sivan Doveh
Donghyun Kim
...
Gül Varol
A. Oliva
Vicente Ordonez
Rogerio Feris
Leonid Karlinsky
VLM
SyDa
125
42
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Vidit Goel
E. Peruzzo
Yi Ding
Dejia Xu
Xingqian Xu
N. Sebe
Trevor Darrell
Zhangyang Wang
Humphrey Shi
DiffM
69
8
0
30 Mar 2023
MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
49
19
0
29 Mar 2023
Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion
Hiromichi Kamata
Yuiko Sakuma
Akio Hayakawa
Masato Ishii
T. Narihira
DiffM
88
40
0
28 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez
Guillaume Couairon
Hervé Jégou
Matthijs Douze
Teddy Furon
WIGM
131
198
0
27 Mar 2023
Training-free Content Injection using h-space in Diffusion Models
Jaeseok Jeong
Mingi Kwon
Youngjung Uh
DiffM
103
28
0
27 Mar 2023
Guiding AI-Generated Digital Content with Wireless Perception
Jiacheng Wang
Hongyang Du
Dusit Niyato
Zehui Xiong
Jiawen Kang
Shiwen Mao
Xuemin
X. Shen
61
13
0
26 Mar 2023
Human Preference Score: Better Aligning Text-to-Image Models with Human Preference
Xiaoshi Wu
Keqiang Sun
Feng Zhu
Rui Zhao
Hongsheng Li
128
164
0
25 Mar 2023
DreamBooth3D: Subject-Driven Text-to-3D Generation
Amit Raj
S. Kaza
Ben Poole
Michael Niemeyer
Nataniel Ruiz
...
Kfir Aberman
Michael Rubinstein
Jonathan T. Barron
Yuanzhen Li
Varun Jampani
DiffM
115
228
0
23 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
88
581
0
23 Mar 2023
Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions
Ayaan Haque
Matthew Tancik
Alexei A. Efros
Aleksander Holynski
Angjoo Kanazawa
VGen
DiffM
118
377
0
22 Mar 2023
Pix2Video: Video Editing using Image Diffusion
Duygu Ceylan
C. Huang
Niloy J. Mitra
DiffM
VGen
143
262
0
22 Mar 2023
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
K. Pnvr
Bharat Singh
P. Ghosh
Behjat Siddiquie
David Jacobs
DiffM
86
29
0
22 Mar 2023
Vox-E: Text-guided Voxel Editing of 3D Objects
Etai Sella
Gal Fiebelman
Peter Hedman
Hadar Averbuch-Elor
DiffM
109
75
0
21 Mar 2023
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Lukas Höllein
Ang Cao
Andrew Owens
Justin Johnson
Matthias Nießner
DiffM
130
182
0
21 Mar 2023
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
Geonmo Gu
Sanghyuk Chun
Wonjae Kim
HeeJae Jun
Yoohoon Kang
Sangdoo Yun
DiffM
138
59
0
21 Mar 2023
Zero-1-to-3: Zero-shot One Image to 3D Object
Ruoshi Liu
Rundi Wu
Basile Van Hoorick
P. Tokmakov
Sergey Zakharov
Carl Vondrick
DiffM
153
1,113
0
20 Mar 2023
Localizing Object-level Shape Variations with Text-to-Image Diffusion Models
Or Patashnik
Daniel Garibi
Idan Azuri
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
95
120
0
20 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
162
286
0
20 Mar 2023
DialogPaint: A Dialog-based Image Editing Model
Jingxuan Wei
Shiyu Wu
Xin Jiang
Yequan Wang
KELM
DiffM
82
5
0
17 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
129
21
0
17 Mar 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
Shu Zhen Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
...
Haiquan Wang
Silvio Savarese
Stefano Ermon
Caiming Xiong
Ran Xu
93
116
0
16 Mar 2023
Efficient Diffusion Training via Min-SNR Weighting Strategy
Tiankai Hang
Shuyang Gu
Chen Li
Jianmin Bao
Dong Chen
Han Hu
Xin Geng
B. Guo
108
163
0
16 Mar 2023
P+: Extended Textual Conditioning in Text-to-Image Generation
A. Voynov
Qinghao Chu
Daniel Cohen-Or
Kfir Aberman
VLM
DiffM
119
186
0
16 Mar 2023
Automatic Geo-alignment of Artwork in Children's Story Books
Jakub J Dylag
V. Suarez
James Wald
Aneesha Amodini Uvara
DiffM
59
0
0
16 Mar 2023
Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models
D. Kothandaraman
Dinesh Manocha
Ming Lin
Dinesh Manocha
72
5
0
15 Mar 2023
Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels
J. Cross-Zamirski
P. Anand
Guy B. Williams
E. Mouchet
Yinhai Wang
Carola-Bibiane Schönlieb
VLM
DiffM
MedIm
98
8
0
15 Mar 2023
Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer
Serin Yang
Hyunmin Hwang
Jong Chul Ye
DiffM
162
62
0
15 Mar 2023
Text-to-image Diffusion Models in Generative AI: A Survey
Chenshuang Zhang
Chaoning Zhang
Mengchun Zhang
In So Kweon
VLM
120
280
0
14 Mar 2023
Accountable Textual-Visual Chat Learns to Reject Human Instructions in Image Re-creation
Zhiwei Zhang
Yuliang Liu
MLLM
80
0
0
10 Mar 2023
Video-P2P: Video Editing with Cross-attention Control
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe Lin
Jiaya Jia
DiffM
VGen
207
221
0
08 Mar 2023
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
Chenfei Wu
Sheng-Kai Yin
Weizhen Qi
Xiaodong Wang
Zecheng Tang
Nan Duan
MLLM
LRM
144
649
0
08 Mar 2023
ELODIN: Naming Concepts in Embedding Spaces
Rodrigo Mello
Filipe Calegario
Geber Ramalho
DiffM
132
1
0
07 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
249
233
0
03 Mar 2023
Collage Diffusion
Vishnu Sarukkai
Linden Li
Arden Ma
Christopher Ré
Kayvon Fatahalian
DiffM
82
27
0
01 Mar 2023
Monocular Depth Estimation using Diffusion Models
Saurabh Saxena
Abhishek Kar
Mohammad Norouzi
David J. Fleet
DiffM
VLM
MDE
109
86
0
28 Feb 2023
Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods
Wonwoong Cho
Hareesh Ravi
Midhun Harikumar
V. Khuc
Krishna Kumar Singh
Jingwan Lu
David I. Inouye
Ajinkya Kale
DiffM
161
7
0
28 Feb 2023
Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models
Rinon Gal
Moab Arar
Yuval Atzmon
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
DiffM
139
200
0
23 Feb 2023
Controlled and Conditional Text to Image Generation with Diffusion Prior
Pranav Aggarwal
Hareesh Ravi
Naveen Marri
Sachin Kelkar
F. Chen
...
Alvin Ghouas
Sarah Saber
Malavika Ramprasad
Baldo Faieta
Ajinkya Kale
DiffM
100
7
0
23 Feb 2023
Scaling Robot Learning with Semantically Imagined Experience
Tianhe Yu
Ted Xiao
Austin Stone
Jonathan Tompson
Anthony Brohan
...
M. Dee
Jodilyn Peralta
Brian Ichter
Karol Hausman
F. Xia
LM&Ro
DiffM
96
155
0
22 Feb 2023
Cross-domain Compositing with Pretrained Diffusion Models
Roy Hachnochi
Mingrui Zhao
Nadav Orzech
Rinon Gal
Ali Mahdavi-Amiri
Daniel Cohen-Or
Amit H. Bermano
DiffM
127
17
0
20 Feb 2023
Prompt Stealing Attacks Against Text-to-Image Generation Models
Xinyue Shen
Y. Qu
Michael Backes
Yang Zhang
83
38
0
20 Feb 2023
Text-driven Visual Synthesis with Latent Diffusion Prior
Tingbo Liao
Songwei Ge
Yiran Xu
Yao-Chih Lee
Badour Albahar
Jia-Bin Huang
DiffM
78
6
0
16 Feb 2023
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation
Omer Bar-Tal
Lior Yariv
Y. Lipman
Tali Dekel
91
395
1
16 Feb 2023
PRedItOR: Text Guided Image Editing with Diffusion Prior
Hareesh Ravi
Sachin Kelkar
Midhun Harikumar
Ajinkya Kale
DiffM
107
12
0
15 Feb 2023
From paintbrush to pixel: A review of deep neural networks in AI-generated art
Anne-Sofie Maerten
Derya Soydaner
80
25
0
14 Feb 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
270
4,192
1
10 Feb 2023
Auditing Gender Presentation Differences in Text-to-Image Models
Yanzhe Zhang
Lu Jiang
Greg Turk
Diyi Yang
EGVM
90
24
0
07 Feb 2023
Previous
1
2
3
...
27
28
29
Next