arXiv:2211.09800
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 1,355 papers shown
Exploring the Naturalness of AI-Generated Images
Zijian Chen
Wei Sun
Haoning Wu
Zicheng Zhang
Jun Jia
...
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
25
19
0
09 Dec 2023
SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation
Thuan Hoang Nguyen
Anh Tran
DiffM
26
58
0
08 Dec 2023
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control
Jaskirat Singh
Jianming Zhang
Qing Liu
Cameron Smith
Zhe Lin
Liang Zheng
DiffM
38
11
0
08 Dec 2023
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao
Zhouhui Lian
79
27
0
08 Dec 2023
RS-Corrector: Correcting the Racial Stereotypes in Latent Diffusion Models
Yue Jiang
Yueming Lyu
Tianxiang Ma
Bo Peng
Jing Dong
51
3
0
08 Dec 2023
Reality's Canvas, Language's Brush: Crafting 3D Avatars from Monocular Video
Yuchen Rao
Eduardo Pérez-Pellitero
Benjamin Busam
Yiren Zhou
Jifei Song
48
0
0
08 Dec 2023
Gen2Det: Generate to Detect
Saksham Suri
Fanyi Xiao
Animesh Sinha
Sean Culatana
Raghuraman Krishnamoorthi
Chenchen Zhu
Abhinav Shrivastava
VLM
DiffM
29
9
0
07 Dec 2023
NeRFiller: Completing Scenes via Generative 3D Inpainting
Ethan Weber
Aleksander Holynski
Varun Jampani
Saurabh Saxena
Noah Snavely
Abhishek Kar
Angjoo Kanazawa
70
31
0
07 Dec 2023
GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen
Mengmeng Xu
Jiawei Ren
Yuren Cong
Sen He
Yanping Xie
Animesh Sinha
Ping Luo
Tao Xiang
Juan-Manuel Perez-Rua
VGen
44
38
0
07 Dec 2023
Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images
Yiqun Zhang
Zhen Qin
Yang Liu
Dylan Campbell
32
2
0
07 Dec 2023
Style Transfer to Calvin and Hobbes comics using Stable Diffusion
Sloke Shrestha
Sundar Sripada
Asvin Venkataramanan
DiffM
18
1
0
07 Dec 2023
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Bolin Lai
Xiaoliang Dai
Lawrence Chen
Guan Pang
James M. Rehg
Miao Liu
49
15
0
06 Dec 2023
AVID: Any-Length Video Inpainting with Diffusion Model
Zhixing Zhang
Bichen Wu
Xiaoyan Wang
Yaqiao Luo
Luxin Zhang
Yinan Zhao
Peter Vajda
Dimitris N. Metaxas
Licheng Yu
VGen
DiffM
47
33
0
06 Dec 2023
DiffusionSat: A Generative Foundation Model for Satellite Imagery
Samar Khanna
Patrick Liu
Linqi Zhou
Chenlin Meng
Robin Rombach
Marshall Burke
David B. Lobell
Stefano Ermon
32
58
0
06 Dec 2023
Language-Informed Visual Concept Learning
Sharon Lee
Yunzhi Zhang
Shangzhe Wu
Jiajun Wu
CoGe
29
9
0
06 Dec 2023
Context Diffusion: In-Context Aware Image Generation
Ivona Najdenkoska
Animesh Sinha
Abhimanyu Dubey
Dhruv Mahajan
Vignesh Ramanathan
Filip Radenovic
DiffM
21
10
0
06 Dec 2023
FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models
Junhyuk So
Jungwon Lee
Eunhyeok Park
DiffM
41
9
0
06 Dec 2023
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Felix Wimbauer
Bichen Wu
Edgar Schoenfeld
Xiaoliang Dai
Ji Hou
...
Jonas Kohler
Christian Rupprecht
Daniel Cremers
Peter Vajda
Jialiang Wang
DiffM
43
59
0
06 Dec 2023
DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing
Shao-Yu Chang
Hwann-Tzong Chen
Tyng-Luh Liu
DiffM
VGen
44
3
0
05 Dec 2023
ReconFusion: 3D Reconstruction with Diffusion Priors
Rundi Wu
B. Mildenhall
Philipp Henzler
Keunhong Park
Ruiqi Gao
...
Pratul P. Srinivasan
Dor Verbin
Jonathan T. Barron
Ben Poole
Aleksander Holynski
81
173
0
05 Dec 2023
Alchemist: Parametric Control of Material Properties with Diffusion Models
Prafull Sharma
Varun Jampani
Yuanzhen Li
Xuhui Jia
Dmitry Lagun
Frédo Durand
William T. Freeman
Mark J. Matthews
DiffM
47
24
0
05 Dec 2023
MagicStick: Controllable Video Editing via Control Handle Transformations
Yue Ma
Xiaodong Cun
Yin-Yin He
Chenyang Qi
Xintao Wang
Ying Shan
Xiu Li
Qifeng Chen
VGen
29
24
0
05 Dec 2023
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
Fengyuan Shi
Jiaxi Gu
Hang Xu
Songcen Xu
Wei Zhang
Limin Wang
VGen
DiffM
36
12
0
05 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
61
158
0
05 Dec 2023
Readout Guidance: Learning Control from Diffusion Features
Grace Luo
Trevor Darrell
Oliver Wang
Dan B. Goldman
Aleksander Holynski
21
21
0
04 Dec 2023
Generative Powers of Ten
Xiaojuan Wang
Janne Kontkanen
Brian L. Curless
Steven M. Seitz
Ira Kemelmacher-Shlizerman
B. Mildenhall
Pratul P. Srinivasan
Dor Verbin
Aleksander Holynski
29
8
0
04 Dec 2023
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Zhuoran Yu
Chenchen Zhu
Sean Culatana
Raghuraman Krishnamoorthi
Fanyi Xiao
Yong Jae Lee
117
15
0
04 Dec 2023
Collaborative Neural Painting
Nicola Dall’Asen
Willi Menapace
E. Peruzzo
E. Sangineto
Yiming Wang
Elisa Ricci
29
0
0
04 Dec 2023
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Jiarui Xu
Yossi Gandelsman
Amir Bar
Jianwei Yang
Jianfeng Gao
Trevor Darrell
Xiaolong Wang
VLM
28
3
0
04 Dec 2023
Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion
Hanyu Wang
Pengxiang Wu
Kevin Dela Rosa
Chen Wang
Abhinav Shrivastava
35
9
0
04 Dec 2023
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
Runze He
Shaofei Huang
Xuecheng Nie
Tianrui Hui
Luoqi Liu
Jiao Dai
Jizhong Han
Guanbin Li
Si Liu
DiffM
35
7
0
04 Dec 2023
The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language Model
Yilin Ye
Qian Zhu
Shishi Xiao
Kang Zhang
Wei Zeng
49
4
0
04 Dec 2023
Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D
Karran Pandey
Paul Guerrero
Matheus Gadelha
Yannick Hold-Geoffroy
Karan Singh
Niloy Mitra
DiffM
34
31
0
02 Dec 2023
Sequential Modeling Enables Scalable Learning for Large Vision Models
Yutong Bai
Xinyang Geng
K. Mangalam
Amir Bar
Alan Yuille
Trevor Darrell
Jitendra Malik
Alexei A. Efros
MLLM
VLM
38
158
0
01 Dec 2023
Gaussian Grouping: Segment and Edit Anything in 3D Scenes
Mingqiao Ye
Martin Danelljan
Fisher Yu
Lei Ke
3DGS
DiffM
39
167
0
01 Dec 2023
Text-Guided 3D Face Synthesis -- From Generation to Editing
Yunjie Wu
Yapeng Meng
Zhipeng Hu
Lincheng Li
Haoqian Wu
Kun Zhou
Weiwei Xu
Xin Yu
DiffM
61
9
0
01 Dec 2023
Lasagna: Layered Score Distillation for Disentangled Object Relighting
D. Bashkirova
Arijit Ray
Rupayan Mallick
Sarah Adel Bargal
Jianming Zhang
Ranjay Krishna
Kate Saenko
35
3
0
30 Nov 2023
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Zhen Xing
Qi Dai
Zihao Zhang
Hui Zhang
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
63
17
0
30 Nov 2023
S2ST: Image-to-Image Translation in the Seed Space of Latent Diffusion
V. Kolmogorov
Rustem Takhanov
Dani Lischinski
DiffM
55
3
0
30 Nov 2023
Exploiting Diffusion Prior for Generalizable Dense Prediction
Hsin-Ying Lee
Hung-Yu Tseng
Hsin-Ying Lee
Ming-Hsuan Yang
DiffM
MDE
49
18
0
30 Nov 2023
Motion-Conditioned Image Animation for Video Editing
Wilson Yan
Andrew Brown
Pieter Abbeel
Rohit Girdhar
S. Azadi
DiffM
VGen
73
12
0
30 Nov 2023
SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples
Phillip Howard
Avinash Madasu
Tiep Le
Gustavo Lujan Moreno
Anahita Bhiwandiwalla
Vasudev Lal
57
16
0
30 Nov 2023
CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation
Zineng Tang
Ziyi Yang
Mahmoud Khademi
Yang Liu
Chenguang Zhu
Mohit Bansal
LRM
MLLM
AuLLM
58
45
0
30 Nov 2023
Detailed Human-Centric Text Description-Driven Large Scene Synthesis
Gwanghyun Kim
Dong un Kang
H. Seo
Hayeon Kim
Se Young Chun
3DV
DiffM
31
2
0
30 Nov 2023
Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing
Hyelin Nam
Gihyun Kwon
Geon Yeong Park
Jong Chul Ye
DiffM
29
27
0
30 Nov 2023
CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt
Haiyao Xiao
Chenglai Zhong
Xuan Gao
Yudong Guo
Juyong Zhang
38
0
0
30 Nov 2023
Non-Cross Diffusion for Semantic Consistency
Ziyang Zheng
Ruiyuan Gao
Qiang Xu
DiffM
37
2
0
30 Nov 2023
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models
Zhonghao Wang
Wei Wei
Yang Zhao
Zhisheng Xiao
M. Hasegawa-Johnson
Humphrey Shi
Tingbo Hou
DiffM
41
11
0
30 Nov 2023
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
Qidong Huang
Xiao-wen Dong
Pan Zhang
Bin Wang
Conghui He
Jiaqi Wang
Dahua Lin
Weiming Zhang
Neng H. Yu
MLLM
47
171
0
29 Nov 2023
SODA: Bottleneck Diffusion Models for Representation Learning
Drew A. Hudson
Daniel Zoran
Mateusz Malinowski
Andrew Kyle Lampinen
Andrew Jaegle
James L. McClelland
Loic Matthey
Felix Hill
Alexander Lerchner
DiffM
38
49
0
29 Nov 2023